| Shepherd: A critic for language model generation T Wang, P Yu, XE Tan, S O'Brien, R Pasunuru, J Dwivedi-Yu, ... arXiv preprint arXiv:2308.04592, 2023 | 106 | 2023 |
| Contrastive decoding improves reasoning in large language models S O'Brien, M Lewis arXiv preprint arXiv:2309.09117, 2023 | 79 | 2023 |
| Improving LLM abilities in idiomatic translation S Donthi, M Spencer, OB Patel, JY Doh, E Rodan, K Zhu, S O’Brien Proceedings of the First Workshop on Language Models for Low-Resource …, 2025 | 29 | 2025 |
| Question-analysis prompting improves LLM performance in reasoning tasks D Yugeswardeenoo, K Zhu, S O’Brien Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024 | 23 | 2024 |
| AAVENUE: Detecting LLM Biases on NLU Tasks in AAVE via a Novel Benchmark A Gupta, P Meng, E Yurtseven, S O'Brien, K Zhu arXiv preprint arXiv:2408.14845, 2024 | 14 | 2024 |
| Are you really listening? boosting perceptual awareness in music-qa benchmarks Y Zang, S O'Brien, T Berg-Kirkpatrick, J McAuley, Z Novack arXiv preprint arXiv:2504.00369, 2025 | 13 | 2025 |
| DiversityMedQA: Assessing Demographic Biases in Medical Diagnosis using Large Language Models R Rawat, H McBride, D Nirmal, R Ghosh, J Moon, D Alamuri, S O'Brien, ... arXiv preprint arXiv:2409.01497, 2024 | 11 | 2024 |
| Pathfinder: Guided search over multi-step reasoning paths O Golovneva, S O'Brien, R Pasunuru, T Wang, L Zettlemoyer, ... arXiv preprint arXiv:2312.05180, 2023 | 10 | 2023 |
| A Model-independent Radio telescope dark matter search A Keller, S O’Brien, A Kamdar, NM Rapidis, AF Leder, K van Bibber The Astrophysical Journal 927 (1), 71, 2022 | 9 | 2022 |
| DiversityMedQA: A benchmark for assessing demographic biases in medical diagnosis using large language models R Rawat, H McBride, R Ghosh, D Nirmal, J Moon, D Alamuri, SO Brien, ... Proceedings of the Third Workshop on NLP for Positive Impact, 334-348, 2024 | 8 | 2024 |
| Self-updatable large language models with parameter integration Y Wang, X Liu, X Chen, S O'Brien, J Wu, J McAuley arXiv e-prints, arXiv: 2410.00487, 2024 | 7 | 2024 |
| Disentangling likes and dislikes in personalized generative explainable recommendation R Shimizu, T Wada, Y Wang, J Kruse, S O'Brien, S HtaungKham, L Song, ... Proceedings of the ACM on Web Conference 2025, 4793-4809, 2025 | 6 | 2025 |
| Introducing mapo: Momentum-aided gradient descent prompt optimization A Cui, P Nandyalam, A Rufail, E Cheung, A Lei, K Zhu, S O'Brien arXiv preprint arXiv:2410.19499, 2024 | 6 | 2024 |
| From Bias to Balance: Detecting Facial Expression Recognition Biases in Large Multimodal Foundation Models K Chhua, Z Wen, V Hathalia, K Zhu, S O'Brien arXiv preprint arXiv:2408.14842, 2024 | 5 | 2024 |
| Causal language control in multilingual transformers via sparse feature steering CT Chou, G Liu, J Sun, C Blondin, K Zhu, V Sharma, S O'Brien arXiv preprint arXiv:2507.13410, 2025 | 4 | 2025 |
| Endive: A cross-dialect benchmark for fairness and performance in large language models A Gupta, J Cheung, P Meng, S Sayyed, A Liao, K Zhu, S O'Brien arXiv preprint arXiv:2504.07100, 2025 | 4 | 2025 |
| Self-updatable large language models by integrating context into model parameters Y Wang, X Liu, X Chen, S O'Brien, J Wu, J McAuley arXiv preprint arXiv:2410.00487, 2024 | 4 | 2024 |
| Enhancing Language Model Reasoning via Weighted Reasoning in Self-Consistency T Knappe, R Li, A Chauhan, K Chhua, K Zhu, S O'Brien arXiv e-prints, arXiv: 2410.07839, 2024 | 4 | 2024 |
| Distill clip (dclip): Enhancing image-text retrieval via cross-modal transformer distillation D Csizmadia, A Codreanu, V Sim, V Prabhu, M Lu, K Zhu, S O'Brien, ... arXiv preprint arXiv:2505.21549, 2025 | 3 | 2025 |
| Advancing Uto-Aztecan Language Technologies: A Case Study on the Endangered Comanche Language J Alvarez, D Karajeanes, A Prado, J Ruttan, I Yang, S O’Brien, V Sharma, ... Proceedings of the Fifth Workshop on NLP for Indigenous Languages of the …, 2025 | 3 | 2025 |