Sainbayar Sukhbaatar

Cited by

	All	Since 2021
Citations	11692	7868
h-index	32	32
i10-index	52	51

Citations per year

Year	Citations
2015	63
2016	276
2017	521
2018	778
2019	971
2020	1099
2021	1093
2022	1155
2023	1273
2024	1690
2025	2570
2026	82

Public access (based on funding mandates)

Available	3 articles
Not available	0 articles

Co-authors

Jason Weston	Meta	Verified email at fb.com
Rob Fergus	Professor of Computer Science, New York University	Verified email at cs.nyu.edu
Arthur Szlam	DeepMind	Verified email at google.com
Weizhe Yuan	New York University	Verified email at nyu.edu
Jing Xu	Meta AI Research (FAIR)	Verified email at meta.com
Yuandong Tian	Co-founder, Stealth Startup	Verified email at recursive.com
Piotr Bojanowski	Meta FAIR	Verified email at fb.com
Armand Joulin	Google DeepMind	Verified email at google.com
Kyunghyun Cho	New York University, Genentech	Verified email at nyu.edu
Edouard Grave	Research Scientist, Kyutai	Verified email at fb.com
Gabriel Synnaeve	Research scientist at Facebook AI Research	Verified email at fb.com
Stephen Roller	Thinking Machines	Verified email at thinkingmachines.ai
Lubomir Bourdev	WaveOne, Inc.	Verified email at wave.one
Joan Bruna	Professor of Computer Science, Data Science & Mathematics (aff), Courant Institute and CDS, NYU	Verified email at cims.nyu.edu
Lina Mezghani	Inria, Meta AI	Verified email at inria.fr
Karteek Alahari	Inria	Verified email at inria.fr
Manohar Paluri	Meta	Verified email at fb.com
Ilya Kostrikov	OpenAI	Verified email at openai.com
Bolei Zhou	Associate Professor at UCLA	Verified email at ucla.edu
Adam Lerer	Facebook AI Research	Verified email at fb.com

Sainbayar Sukhbaatar

FAIR team, Meta AI

Verified email at meta.com

deep learning, machine learning


Title	Authors	Venue	Cited by	Year
End-To-End Memory Networks	S Sukhbaatar, A Szlam, J Weston, R Fergus		3559	2015
Learning multiagent communication with backpropagation	S Sukhbaatar, A Szlam, R Fergus	Advances in Neural Information Processing Systems, 2244-2252, 2016	1686	2016
Training Convolutional Networks with Noisy Labels	S Sukhbaatar, J Bruna, M Paluri, L Bourdev, R Fergus	Accepted as a workshop contribution at ICLR 2015, 2014	1071*	2014
Self-rewarding language models	W Yuan, RY Pang, K Cho, X Li, S Sukhbaatar, J Xu, JE Weston	Forty-first International Conference on Machine Learning, 2024	630	2024
Intrinsic motivation and automatic curricula via asymmetric self-play	S Sukhbaatar, Z Lin, I Kostrikov, G Synnaeve, A Szlam, R Fergus	arXiv preprint arXiv:1703.05407, 2017	486	2017
Learning when to communicate at scale in multiagent cooperative and competitive tasks	A Singh, T Jain, S Sukhbaatar	arXiv preprint arXiv:1812.09755, 2018	459	2018
Simple baseline for visual question answering	B Zhou, Y Tian, S Sukhbaatar, A Szlam, R Fergus	arXiv preprint arXiv:1512.02167, 2015	446	2015
Adaptive attention span in transformers	S Sukhbaatar, E Grave, P Bojanowski, A Joulin	arXiv preprint arXiv:1905.07799, 2019	386	2019
Training large language models to reason in a continuous latent space	S Hao, S Sukhbaatar, DJ Su, X Li, Z Hu, J Weston, Y Tian	arXiv preprint arXiv:2412.06769, 2024	301	2024
Iterative reasoning preference optimization	RY Pang, W Yuan, H He, K Cho, S Sukhbaatar, J Weston	Advances in Neural Information Processing Systems 37, 116617-116637, 2024	280	2024
Hash layers for large sparse models	S Roller, S Sukhbaatar, J Weston	Advances in Neural Information Processing Systems 34, 17555-17566, 2021	264	2021
Augmenting self-attention with persistent memory	S Sukhbaatar, E Grave, G Lample, H Jegou, A Joulin	arXiv preprint arXiv:1907.01470, 2019	156	2019
Meta-rewarding language models: Self-improving alignment with llm-as-a-meta-judge	T Wu, W Yuan, O Golovneva, J Xu, Y Tian, J Jiao, JE Weston, ...	Proceedings of the 2025 Conference on Empirical Methods in Natural Language …, 2025	153	2025
Teaching large language models to reason with reinforcement learning	A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...	arXiv preprint arXiv:2403.04642, 2024	151	2024
Some things are more cringe than others: Preference optimization with the pairwise cringe loss	J Xu, A Lee, S Sukhbaatar, J Weston	CoRR, 2023	117	2023
Memory-augmented reinforcement learning for image-goal navigation	L Mezghani, S Sukhbaatar, T Lavril, O Maksymets, D Batra, P Bojanowski, ...	2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022	115	2022
System 2 attention (is something you might need too)	J Weston, S Sukhbaatar	arXiv preprint arXiv:2311.11829, 2023	98	2023
Composable planning with attributes	A Zhang, S Sukhbaatar, A Lerer, A Szlam, R Fergus	International Conference on Machine Learning, 5842-5851, 2018	97	2018
Mazebase: A sandbox for learning from games	S Sukhbaatar, A Szlam, G Synnaeve, S Chintala, R Fergus	arXiv preprint arXiv:1511.07401, 2015	91	2015
Addressing Some Limitations of Transformers with Feedback Memory	A Fan, T Lavril, E Grave, A Joulin, S Sukhbaatar	arXiv preprint arXiv:2002.09402, 2020	85*	2020
