[go: up one dir, main page]

Follow
Yuqing Du
Title
Cited by
Cited by
Year
Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Y Fan, O Watkins, Y Du, H Liu, M Ryu, C Boutilier, P Abbeel, ...
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS …, 2023
509*2023
Aligning text-to-image models using human feedback
K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ...
arXiv preprint arXiv:2302.12192, 2023
4492023
Guiding Pretraining in Reinforcement Learning with Large Language Models
Y Du*, O Watkins*, Z Wang, C Colas, T Darrell, P Abbeel, A Gupta, ...
International Conference on Machine Learning (ICML) 2023, 2023
3412023
Teaching large language models to reason with reinforcement learning
A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...
arXiv preprint arXiv:2403.04642, 2024
1512024
Vision-Language Models as Success Detectors
Y Du, K Konyushkova, M Denil, A Raju, J Landon, F Hill, N de Freitas, ...
Conference on Lifelong Learning Agents (CoLLAs) 2023, 2023
1222023
Robust Reinforcement Learning using Adversarial Populations
E Vinitsky*, Y Du*, K Parvate*, K Jang, P Abbeel, A Bayen
arXiv preprint arXiv:2008.01825, 2020
1142020
Auto-tuned sim-to-real transfer
Y Du*, O Watkins*, T Darrell, P Abbeel, D Pathak
2021 IEEE International Conference on Robotics and Automation (ICRA), 1290-1296, 2021
1092021
Imagen 3
J Baldridge, J Bauer, M Bhutani, N Brichtova, A Bunner, L Castrejon, ...
arXiv preprint arXiv:2408.07009, 2024
902024
Learning to model the world with language
J Lin, Y Du, O Watkins, D Hafner, P Abbeel, D Klein, A Dragan
arXiv preprint arXiv:2308.01399, 2023
762023
Ave: Assistance via empowerment
Y Du, S Tiomkin, E Kiciman, D Polani, P Abbeel, A Dragan
Advances in Neural Information Processing Systems 33, 4560-4571, 2020
632020
Group surfing: A pedestrian-based approach to sidewalk robot navigation
Y Du, NJ Hetherington, CL Oon, WP Chan, CP Quintero, E Croft, ...
2019 international conference on robotics and automation (ICRA), 6518-6524, 2019
522019
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation
Y Du, P Abbeel, A Grover
International Conference on Learning Representations (ICLR) 2022, 2022
252022
What can ai learn from human exploration? intrinsically-motivated humans and agents in open-world exploration
Y Du, E Kosoy, A Dayan, M Rufova, P Abbeel, A Gopnik
Neurips 2023 workshop: Information-theoretic principles in cognitive systems, 2023
172023
Bayesian Imitation Learning for End-to-End Mobile Manipulation
Y Du, D Ho, AA Alemi, E Jang, M Khansari
International Conference on Machine Learning (ICML) 2022, 2022
142022
Intrinsically-motivated humans and agents in open-world exploration
A Lidayan, Y Du, E Kosoy, M Rufova, P Abbeel, A Gopnik
arXiv preprint arXiv:2503.23631, 2025
112025
Practical visual deep imitation learning via task-level domain consistency
M Khansari, D Ho, Y Du, A Fuentes, M Bennice, N Sievers, S Kirmani, ...
2023 IEEE International Conference on Robotics and Automation (ICRA), 1837-1844, 2023
11*2023
Semi-Supervised One-Shot Imitation Learning
P Wu, K Hakhamaneshi, Y Du, I Mordatch, A Rajeswaran, P Abbeel
arXiv preprint arXiv:2408.05285, 2024
42024
A Study on Improving Reasoning in Language Models
Y Du, A Havrilla, S Sukhbaatar, P Abbeel, R Raileanu
I Can’t Believe It’s Not Better! (ICBINB) Workshop @ NeurIPS 2023, 2023
32023
Sidewalk delivery robot navigation: a pedestrian-based approach
Y Du, NJ Hetherington, CL Oon, WP Chan, CP Quintero, E Croft, ...
Human-Aiding Robotics: Open Issues and Future Direction 2018, 2018
22018
Using embeddings, generated using robot action models, in controlling robot to perform robotic task
D Ho, E Jang, M Khansari, YQ DU, AA Alemi
US Patent App. 18/102,053, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–20