| TÜLU 3: Pushing Frontiers in Open Language Model Post-Training N Lambert, J Morrison, V Pyatkin, S Huang, H Ivison, F Brahman, ... COLM 2025, 2024 | 689* | 2024 |
| OLMo: Accelerating the Science of Language Models D Groeneveld, I Beltagy, P Walsh, A Bhagia, R Kinney, O Tafjord, AH Jha, ... ACL 2024 🏆 Best Theme Paper Award, 2024 | 598* | 2024 |
| Can AI language models replace human participants? D Dillion, N Tandon, Y Gu, K Gray Trends in Cognitive Sciences 27 (7), 597-600, 2023 | 585 | 2023 |
| 2 OLMo 2 Furious T OLMo, P Walsh, L Soldaini, D Groeneveld, K Lo, S Arora, A Bhagia, ... COLM 2025, 2024 | 328* | 2024 |
| OLMoE: Open Mixture-of-Experts Language Models N Muennighoff, L Soldaini, D Groeneveld, K Lo, J Morrison, S Min, W Shi, ... ICLR 2025, 2024 | 160* | 2024 |
| WorldValuesBench: A Large-Scale Benchmark Dataset for Multi-Cultural Value Awareness of Language Models W Zhao, D Mondal, N Tandon, D Dillion, K Gray, Y Gu LREC-COLING 2024, 2024 | 50 | 2024 |
| OLMES: A Standard for Language Model Evaluations Y Gu, O Tafjord, B Kuehl, D Haddad, J Dodge, H Hajishirzi NAACL 2025 Findings, 2024 | 44 | 2024 |
| DREAM: Improving Situational QA by First Elaborating the Situation Y Gu, BD Mishra, P Clark NAACL 2022, 2021 | 42* | 2021 |
| SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs Y Gu, O Tafjord, H Kim, J Moore, RL Bras, P Clark, Y Choi arXiv preprint arXiv:2410.13648, 2024 | 29 | 2024 |
| What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations K Rao, L Jiang, V Pyatkin, Y Gu, N Tandon, N Dziri, F Brahman, Y Choi EMNLP 2023 Findings, 2023 | 20* | 2023 |
| Do language models have coherent mental models of everyday things? Y Gu, BD Mishra, P Clark ACL 2023, 2022 | 16 | 2022 |
| Just-DREAM-about-it: Figurative Language Understanding with DREAM-FLUTE Y Gu, Y Fu, V Pyatkin, I Magnusson, BD Mishra, P Clark The Third Workshop on Figurative Language Processing at EMNLP 2022, 2022 | 16 | 2022 |
| PROC2PDDL: Open-Domain Planning Representations from Texts T Zhang, L Zhang, Z Hou, Z Wang, Y Gu, P Clark, C Callison-Burch, ... The 2nd Workshop on Natural Language Reasoning and Structured Explanations …, 2024 | 11 | 2024 |
| Characterization of Singaporean Children's English: Comparisons to American and British Counterparts Using Archetypal Analysis Y Gu, NF Chen INTERSPEECH 2020, 4123-4127, 2020 | 6 | 2020 |
| Digital Socrates: Evaluating LLMs through Explanation Critiques Y Gu, O Tafjord, P Clark ACL 2024, 2023 | 5 | 2023 |
| Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation D Heineman, V Hofmann, I Magnusson, Y Gu, NA Smith, H Hajishirzi, ... arXiv preprint arXiv:2508.13144, 2025 | 4 | 2025 |
| One Venue, Two Conferences: The Separation of Chinese and American Citation Networks B Zhao, Y Gu, JZ Forde, N Saphra AI Cultures Workshop at NeurIPS 2022, 2022 | 4 | 2022 |
| Large-Scale Acoustic Characterization of Singaporean Children’s English Pronunciation Y Gu, NF Chen arXiv preprint arXiv:2202.09108, 2022 | 4 | 2022 |
| Measure More, Question More: Experimental Studies on Transformer-based Language Models and Complement Coercion Y Gu arXiv preprint arXiv:2212.10536, 2022 | 3 | 2022 |
| Olmo 3 T Olmo, A Ettinger, A Bertsch, B Kuehl, D Graham, D Heineman, ... arXiv preprint arXiv:2512.13961, 2025 | 1 | 2025 |