[go: up one dir, main page]

Follow
Alex Gu
Title
Cited by
Cited by
Year
Starcoder: may the source be with you!
R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ...
arXiv preprint arXiv:2305.06161, 2023
1675*2023
Livecodebench: Holistic and contamination free evaluation of large language models for code
N Jain, K Han, A Gu, WD Li, F Yan, T Zhang, S Wang, A Solar-Lezama, ...
arXiv preprint arXiv:2403.07974, 2024
9452024
Starcoder 2 and the stack v2: The next generation
A Lozhkov, R Li, LB Allal, F Cassano, J Lamy-Poirier, N Tazi, A Tang, ...
arXiv preprint arXiv:2402.19173, 2024
5172024
Leandojo: Theorem proving with retrieval-augmented language models
K Yang, A Swope, A Gu, R Chalamala, P Song, S Yu, S Godil, RJ Prenger, ...
Advances in Neural Information Processing Systems 36, 21573-21612, 2023
4422023
The disagreement problem in explainable machine learning: A practitioner's perspective
S Krishna, T Han, A Gu, S Wu, S Jabbari, H Lakkaraju
arXiv preprint arXiv:2202.01602, 2022
3852022
Bigcodebench: Benchmarking code generation with diverse function calls and complex instructions
TY Zhuo, MC Vu, J Chim, H Hu, W Yu, R Widyasari, INB Yusuf, H Zhan, ...
arXiv preprint arXiv:2406.15877, 2024
3772024
Santacoder: don't reach for the stars!
LB Allal, R Li, D Kocetkov, C Mou, C Akiki, CM Ferrandis, N Muennighoff, ...
arXiv preprint arXiv:2301.03988, 2023
338*2023
LINC: A neurosymbolic approach for logical reasoning by combining language models with first-order logic provers
T Olausson, A Gu, B Lipkin, C Zhang, A Solar-Lezama, J Tenenbaum, ...
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
2122023
Cruxeval: A benchmark for code reasoning, understanding and execution
A Gu, B Rozière, H Leather, A Solar-Lezama, G Synnaeve, SI Wang
arXiv preprint arXiv:2401.03065, 2024
2112024
StarCoder: May the source be with you! arXiv 2023
R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ...
arXiv preprint arXiv:2305.06161, 0
44*
Min-max bilevel multi-objective optimization with applications in machine learning
A Gu, S Lu, P Ram, L Weng
arXiv preprint arXiv:2203.01924, 2022
26*2022
The counterfeit conundrum: Can code language models grasp the nuances of their incorrect generations?
A Gu, WD Li, N Jain, T Olausson, C Lee, K Sen, A Solar-Lezama
Findings of the Association for Computational Linguistics: ACL 2024, 74-117, 2024
252024
Mixture of parrots: Experts improve memorization more than reasoning
S Jelassi, C Mohri, D Brandfonbrener, A Gu, N Vyas, N Anand, ...
arXiv preprint arXiv:2410.19034, 2024
122024
Solving Inequality Proofs with Large Language Models
J Sheng, L Lyu, J Jin, T Xia, A Gu, J Zou, P Lu
arXiv preprint arXiv:2506.07927, 2025
112025
Challenges and paths towards ai for software engineering
A Gu, N Jain, WD Li, M Shetty, Y Shao, Z Li, D Yang, K Ellis, K Sen, ...
arXiv preprint arXiv:2503.22625, 2025
92025
Cwm: An open-weights llm for research on code generation with world models
Q Carbonneaux, G Cohen, J Gehring, J Kahn, J Kossen, F Kreuk, ...
arXiv e-prints, arXiv: 2510.02387, 2025
82025
Three operator splitting with subgradients, stochastic gradients, and adaptive learning rates
A Yurtsever, A Gu, S Sra
Advances in Neural Information Processing Systems 34, 19743-19756, 2021
82021
Reproducibility report: La-maml: Look-ahead meta learning for continual learning
J Joseph, A Gu
arXiv preprint arXiv:2102.05824, 2021
6*2021
Language agnostic code embeddings
S Utpala, A Gu, PY Chen
Proceedings of the 2024 Conference of the North American Chapter of the …, 2024
42024
Certified interpretability robustness for class activation mapping
A Gu, TW Weng, PY Chen, S Liu, L Daniel
arXiv preprint arXiv:2301.11324, 2023
42023
The system can't perform the operation now. Try again later.
Articles 1–20