| Title | Cited by | Year |
| --- | --- | --- |
| Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models Y Fan, O Watkins, Y Du, H Liu, M Ryu, C Boutilier, P Abbeel, ... Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS …, 2023 | 509* | 2023 |
| Aligning text-to-image models using human feedback K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ... arXiv preprint arXiv:2302.12192, 2023 | 449 | 2023 |
| Guiding Pretraining in Reinforcement Learning with Large Language Models Y Du*, O Watkins*, Z Wang, C Colas, T Darrell, P Abbeel, A Gupta, ... International Conference on Machine Learning (ICML) 2023, 2023 | 341 | 2023 |
| Teaching large language models to reason with reinforcement learning A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ... arXiv preprint arXiv:2403.04642, 2024 | 151 | 2024 |
| Vision-Language Models as Success Detectors Y Du, K Konyushkova, M Denil, A Raju, J Landon, F Hill, N de Freitas, ... Conference on Lifelong Learning Agents (CoLLAs) 2023, 2023 | 122 | 2023 |
| Robust Reinforcement Learning using Adversarial Populations E Vinitsky*, Y Du*, K Parvate*, K Jang, P Abbeel, A Bayen arXiv preprint arXiv:2008.01825, 2020 | 114 | 2020 |
| Auto-tuned sim-to-real transfer Y Du*, O Watkins*, T Darrell, P Abbeel, D Pathak 2021 IEEE International Conference on Robotics and Automation (ICRA), 1290-1296, 2021 | 109 | 2021 |
| Imagen 3 J Baldridge, J Bauer, M Bhutani, N Brichtova, A Bunner, L Castrejon, ... arXiv preprint arXiv:2408.07009, 2024 | 90 | 2024 |
| Learning to model the world with language J Lin, Y Du, O Watkins, D Hafner, P Abbeel, D Klein, A Dragan arXiv preprint arXiv:2308.01399, 2023 | 76 | 2023 |
| AvE: Assistance via Empowerment Y Du, S Tiomkin, E Kiciman, D Polani, P Abbeel, A Dragan Advances in Neural Information Processing Systems 33, 4560-4571, 2020 | 63 | 2020 |
| Group surfing: A pedestrian-based approach to sidewalk robot navigation Y Du, NJ Hetherington, CL Oon, WP Chan, CP Quintero, E Croft, ... 2019 International Conference on Robotics and Automation (ICRA), 6518-6524, 2019 | 52 | 2019 |
| It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation Y Du, P Abbeel, A Grover International Conference on Learning Representations (ICLR) 2022, 2022 | 25 | 2022 |
| What can AI learn from human exploration? Intrinsically-motivated humans and agents in open-world exploration Y Du, E Kosoy, A Dayan, M Rufova, P Abbeel, A Gopnik NeurIPS 2023 Workshop: Information-Theoretic Principles in Cognitive Systems, 2023 | 17 | 2023 |
| Bayesian Imitation Learning for End-to-End Mobile Manipulation Y Du, D Ho, AA Alemi, E Jang, M Khansari International Conference on Machine Learning (ICML) 2022, 2022 | 14 | 2022 |
| Intrinsically-motivated humans and agents in open-world exploration A Lidayan, Y Du, E Kosoy, M Rufova, P Abbeel, A Gopnik arXiv preprint arXiv:2503.23631, 2025 | 11 | 2025 |
| Practical visual deep imitation learning via task-level domain consistency M Khansari, D Ho, Y Du, A Fuentes, M Bennice, N Sievers, S Kirmani, ... 2023 IEEE International Conference on Robotics and Automation (ICRA), 1837-1844, 2023 | 11* | 2023 |
| Semi-Supervised One-Shot Imitation Learning P Wu, K Hakhamaneshi, Y Du, I Mordatch, A Rajeswaran, P Abbeel arXiv preprint arXiv:2408.05285, 2024 | 4 | 2024 |
| A Study on Improving Reasoning in Language Models Y Du, A Havrilla, S Sukhbaatar, P Abbeel, R Raileanu I Can’t Believe It’s Not Better! (ICBINB) Workshop @ NeurIPS 2023, 2023 | 3 | 2023 |
| Sidewalk delivery robot navigation: a pedestrian-based approach Y Du, NJ Hetherington, CL Oon, WP Chan, CP Quintero, E Croft, ... Human-Aiding Robotics: Open Issues and Future Direction 2018, 2018 | 2 | 2018 |
| Using embeddings, generated using robot action models, in controlling robot to perform robotic task D Ho, E Jang, M Khansari, YQ DU, AA Alemi US Patent App. 18/102,053, 2024 | 1 | 2024 |