| How far can camels go? exploring the state of instruction tuning on open resources Y Wang, H Ivison, P Dasigi, J Hessel, T Khot, K Chandu, D Wadden, ... Advances in Neural Information Processing Systems 36, 74764-74786, 2023 | 441 | 2023 |
| Rewardbench: Evaluating reward models for language modeling N Lambert, V Pyatkin, J Morrison, LJV Miranda, BY Lin, K Chandu, N Dziri, ... Findings of the Association for Computational Linguistics: NAACL 2025, 1755-1797, 2025 | 438 | 2025 |
| OLMo: Accelerating the science of language models D Groeneveld, I Beltagy, E Walsh, A Bhagia, R Kinney, O Tafjord, A Jha, ... Proceedings of the 62nd annual meeting of the association for computational …, 2024 | 403 | 2024 |
| Dolma: An open corpus of three trillion tokens for language model pretraining research L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ... Proceedings of the 62nd annual meeting of the association for computational …, 2024 | 320 | 2024 |
| The unlocking spell on base llms: Rethinking alignment via in-context learning BY Lin, A Ravichander, X Lu, N Dziri, M Sclar, K Chandu, C Bhagavatula, ... arXiv preprint arXiv:2312.01552, 2023 | 266 | 2023 |
| The gem benchmark: Natural language generation, its evaluation and metrics S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, A Aremu, ... Proceedings of the 1st Workshop on Natural Language Generation, Evaluation …, 2021 | 209 | 2021 |
| Datacomp-lm: In search of the next generation of training sets for language models J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, SY Gadre, H Bansal, E Guha, ... Advances in Neural Information Processing Systems 37, 14200-14282, 2024 | 200 | 2024 |
| A survey of code-switched speech and language processing S Sitaram, KR Chandu, SK Rallabandi, AW Black arXiv preprint arXiv:1904.00784, 2019 | 166 | 2019 |
| Grounding 'grounding' in NLP KR Chandu, Y Bisk, AW Black arXiv preprint arXiv:2106.02192, 2021 | 90 | 2021 |
| Dolma: An open corpus of three trillion tokens for language model pretraining research L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ... …, 2024 | 88 | 2024 |
| Datacomp-lm: In search of the next generation of training sets for language models J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ... …, 2024 | 85 | 2024 |
| "Answer ka type kya he?" Learning to Classify Questions in Code-Mixed Language KC Raghavi, MK Chinnakotla, M Shrivastava Proceedings of the 24th International Conference on World Wide Web, 853-858, 2015 | 74 | 2015 |
| The art of saying no: Contextual noncompliance in language models F Brahman, S Kumar, V Balachandran, P Dasigi, V Pyatkin, ... Advances in Neural Information Processing Systems 37, 49706-49748, 2024 | 70 | 2024 |
| Agent lumos: Unified and modular training for open-source language agents D Yin, F Brahman, A Ravichander, K Chandu, KW Chang, Y Choi, BY Lin Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024 | 66 | 2024 |
| The Generative AI paradox: "What it can create, it may not understand" P West, X Lu, N Dziri, F Brahman, L Li, JD Hwang, L Jiang, J Fisher, ... arXiv preprint arXiv:2311.00059, 2023 | 57 | 2023 |
| Wildbench: Benchmarking llms with challenging tasks from real users in the wild BY Lin, Y Deng, K Chandu, F Brahman, A Ravichander, V Pyatkin, N Dziri, ... arXiv preprint arXiv:2406.04770, 2024 | 55 | 2024 |
| Lumos: Learning agents with unified data, modular design, and open-source llms D Yin, F Brahman, A Ravichander, K Chandu, KW Chang, Y Choi, BY Lin ICLR 2024 Workshop on Large Language Model (LLM) Agents, 2023 | 55 | 2023 |
| Wildbench: Benchmarking llms with challenging tasks from real users in the wild BY Lin, Y Deng, K Chandu arXiv preprint arXiv:2406.04770, 2024 | 53 | 2024 |
| Wildbench: Benchmarking llms with challenging tasks from real users … BY Lin, Y Deng, K Chandu arXiv preprint arXiv:2406.04770, 2024 | 52 | 2024 |
| Code-mixed question answering challenge: Crowd-sourcing data and techniques K Chandu, E Loginova, V Gupta, J van Genabith, G Neumann, ... Proceedings of the Third Workshop on Computational Approaches to Linguistic …, 2018 | 49 | 2018 |