Alex Gu

Cited by

	All	Since 2021
Citations	5266	5257
h-index	12	12
i10-index	14	14

3200

1600

800

2400

2022202320242025202632 383 1608 3109 115

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Armando Solar-LezamaMITVerified email at csail.mit.edu
Koushik SenProfessor of Computer Science, University of California, BerkeleyVerified email at cs.berkeley.edu
Naman JainUC BerkeleyVerified email at berkeley.edu
Wen-Ding LiCornell UniversityVerified email at cornell.edu
Sida I. WangFacebook AIVerified email at fb.com
Kaiyu YangMeta FAIRVerified email at meta.com
Satyapriya KrishnaHarvard UniversityVerified email at g.harvard.edu
Tessa HanHarvard UniversityVerified email at g.harvard.edu
Anima AnandkumarCalifornia Institute of Technology and NVIDIAVerified email at caltech.edu
Saad GodilChief Technology Officer, Hippocratic AIVerified email at nvidia.com
Shixing YuCornell UniversityVerified email at cornell.edu
Shahin JabbariDrexel UniversityVerified email at drexel.edu
Zhiwei Steven WuCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Himabindu LakkarajuAssistant Professor, Harvard University; Senior Staff Research Scientist, Google.Verified email at seas.harvard.edu
Parikshit RamIBM ResearchVerified email at ibm.com
Songtao LuAssistant Professor, CSE, The Chinese University of Hong KongVerified email at cse.cuhk.edu.hk
Tsui-Wei WengUCSDVerified email at ucsd.edu
Joel Joseph JacquilinIndian Institute of Technology (BHU) VaranasiVerified email at iitbhu.ac.in
Suvrit SraTUMVerified email at tum.de

Alex Gu

MIT

Verified email at mit.edu - Homepage

program synthesis machine learning large language models code generation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Starcoder: may the source be with you! R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ... arXiv preprint arXiv:2305.06161, 2023	1675*	2023
Livecodebench: Holistic and contamination free evaluation of large language models for code N Jain, K Han, A Gu, WD Li, F Yan, T Zhang, S Wang, A Solar-Lezama, ... arXiv preprint arXiv:2403.07974, 2024	945	2024
Starcoder 2 and the stack v2: The next generation A Lozhkov, R Li, LB Allal, F Cassano, J Lamy-Poirier, N Tazi, A Tang, ... arXiv preprint arXiv:2402.19173, 2024	517	2024
Leandojo: Theorem proving with retrieval-augmented language models K Yang, A Swope, A Gu, R Chalamala, P Song, S Yu, S Godil, RJ Prenger, ... Advances in Neural Information Processing Systems 36, 21573-21612, 2023	442	2023
The disagreement problem in explainable machine learning: A practitioner's perspective S Krishna, T Han, A Gu, S Wu, S Jabbari, H Lakkaraju arXiv preprint arXiv:2202.01602, 2022	385	2022
Bigcodebench: Benchmarking code generation with diverse function calls and complex instructions TY Zhuo, MC Vu, J Chim, H Hu, W Yu, R Widyasari, INB Yusuf, H Zhan, ... arXiv preprint arXiv:2406.15877, 2024	377	2024
Santacoder: don't reach for the stars! LB Allal, R Li, D Kocetkov, C Mou, C Akiki, CM Ferrandis, N Muennighoff, ... arXiv preprint arXiv:2301.03988, 2023	338*	2023
LINC: A neurosymbolic approach for logical reasoning by combining language models with first-order logic provers T Olausson, A Gu, B Lipkin, C Zhang, A Solar-Lezama, J Tenenbaum, ... Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023	212	2023
Cruxeval: A benchmark for code reasoning, understanding and execution A Gu, B Rozière, H Leather, A Solar-Lezama, G Synnaeve, SI Wang arXiv preprint arXiv:2401.03065, 2024	211	2024
StarCoder: May the source be with you! arXiv 2023 R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ... arXiv preprint arXiv:2305.06161, 0	44*
Min-max bilevel multi-objective optimization with applications in machine learning A Gu, S Lu, P Ram, L Weng arXiv preprint arXiv:2203.01924, 2022	26*	2022
The counterfeit conundrum: Can code language models grasp the nuances of their incorrect generations? A Gu, WD Li, N Jain, T Olausson, C Lee, K Sen, A Solar-Lezama Findings of the Association for Computational Linguistics: ACL 2024, 74-117, 2024	25	2024
Mixture of parrots: Experts improve memorization more than reasoning S Jelassi, C Mohri, D Brandfonbrener, A Gu, N Vyas, N Anand, ... arXiv preprint arXiv:2410.19034, 2024	12	2024
Solving Inequality Proofs with Large Language Models J Sheng, L Lyu, J Jin, T Xia, A Gu, J Zou, P Lu arXiv preprint arXiv:2506.07927, 2025	11	2025
Challenges and paths towards ai for software engineering A Gu, N Jain, WD Li, M Shetty, Y Shao, Z Li, D Yang, K Ellis, K Sen, ... arXiv preprint arXiv:2503.22625, 2025	9	2025
Cwm: An open-weights llm for research on code generation with world models Q Carbonneaux, G Cohen, J Gehring, J Kahn, J Kossen, F Kreuk, ... arXiv e-prints, arXiv: 2510.02387, 2025	8	2025
Three operator splitting with subgradients, stochastic gradients, and adaptive learning rates A Yurtsever, A Gu, S Sra Advances in Neural Information Processing Systems 34, 19743-19756, 2021	8	2021
Reproducibility report: La-maml: Look-ahead meta learning for continual learning J Joseph, A Gu arXiv preprint arXiv:2102.05824, 2021	6*	2021
Language agnostic code embeddings S Utpala, A Gu, PY Chen Proceedings of the 2024 Conference of the North American Chapter of the …, 2024	4	2024
Certified interpretability robustness for class activation mapping A Gu, TW Weng, PY Chen, S Liu, L Daniel arXiv preprint arXiv:2301.11324, 2023	4	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors