[go: up one dir, main page]

Follow
Sebastien Bubeck
Title
Cited by
Cited by
Year
Sparks of artificial general intelligence: Early experiments with gpt-4
S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ...
arXiv preprint arXiv:2303.12712, 2023
54602023
Regret analysis of stochastic and nonstochastic multi-armed bandit problems
S Bubeck, N Cesa-Bianchi
Foundations and Trends in Machine Learning 5, 1-122, 2012
35562012
Convex optimization: Algorithms and complexity
S Bubeck
Foundations and Trends in Machine Learning 8, 231-357, 2014
30302014
Phi-4 technical report
M Abdin, J Aneja, H Behl, S Bubeck, R Eldan, S Gunasekar, M Harrison, ...
arXiv preprint arXiv:2412.08905, 2024
27612024
Benefits, limits, and risks of GPT-4 as an AI chatbot for medicine
P Lee, S Bubeck, J Petro
New England Journal of Medicine 388 (13), 1233-1239, 2023
18052023
Is Q-learning provably efficient?
C Jin, Z Allen-Zhu, S Bubeck, MI Jordan
Advances in neural information processing systems 31, 2018
11502018
Best arm identification in multi-armed bandits
JY Audibert, S Bubeck, R Munos
COLT 2010, 2010
11162010
Textbooks are all you need
S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ...
arXiv preprint arXiv:2306.11644, 2023
8832023
Provably robust deep learning via adversarially trained smoothed classifiers
H Salman, J Li, I Razenshteyn, P Zhang, H Zhang, S Bubeck, G Yang
Advances in neural information processing systems 32, 2019
7202019
Pure exploration in multi-armed bandits problems
S Bubeck, R Munos, G Stoltz
Algorithmic Learning Theory, 23-37, 2009
6912009
Textbooks are all you need ii: phi-1.5 technical report
Y Li, S Bubeck, R Eldan, A Del Giorno, S Gunasekar, YT Lee
arXiv preprint arXiv:2309.05463, 2023
6442023
X-armed bandits
S Bubeck, R Munos, G Stoltz, C Szepesvári
Journal of Machine Learning Research 12, 1587-1627, 2011
5652011
Minimax policies for adversarial and stochastic bandits
JY Audibert, S Bubeck
COLT 2009, 2009
5622009
lil'UCB: An Optimal Exploration Algorithm for Multi-Armed Bandits
K Jamieson, M Malloy, R Nowak, S Bubeck
COLT 2014, 2013
5412013
Optimal algorithms for smooth and strongly convex distributed optimization in networks
K Scaman, F Bach, S Bubeck, YT Lee, L Massoulié
international conference on machine learning, 3027-3036, 2017
4242017
Phi-2: The surprising power of small language models
M Javaheripi, S Bubeck, M Abdin, J Aneja, S Bubeck, CCT Mendes, ...
Microsoft Research Blog 1 (3), 3, 2023
3932023
Bandits with heavy tail
S Bubeck, N Cesa-Bianchi, G Lugosi
IEEE Transactions on Information Theory 59 (11), 7711-7717, 2013
3902013
Sparks of artificial general intelligence: early experiments with GPT-4 (2023)
S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ...
arXiv preprint arXiv:2303.12712 1, 2023
3892023
Pure exploration in finitely-armed and continuous-armed bandits
S Bubeck, R Munos, G Stoltz
Theoretical Computer Science 412, 1832-1852, 2010
3642010
The best of both worlds: Stochastic and adversarial bandits
S Bubeck, A Slivkins
COLT 2012, 2012
3102012
The system can't perform the operation now. Try again later.
Articles 1–20