| Implicit regularization in deep matrix factorization S Arora, N Cohen, W Hu, Y Luo Advances in neural information processing systems 32, 2019 | 724 | 2019 |
| Algorithmic framework for model-based deep reinforcement learning with theoretical guarantees Y Luo, H Xu, Y Li, Y Tian, T Darrell, T Ma arXiv preprint arXiv:1807.03858, 2018 | 315 | 2018 |
| Towards resolving the implicit bias of gradient descent for matrix factorization: Greedy low-rank learning Z Li, Y Luo, K Lyu arXiv preprint arXiv:2012.09839, 2020 | 187 | 2020 |
| Safe reinforcement learning by imagining the near future G Thomas, Y Luo, T Ma Advances in Neural Information Processing Systems 34, 13859-13869, 2021 | 124 | 2021 |
| Provably efficient Q-learning with function approximation via distribution shift error checking oracle SS Du, Y Luo, R Wang, H Zhang Advances in Neural Information Processing Systems 32, 2019 | 113 | 2019 |
| Provable representation learning for imitation learning via bi-level optimization S Arora, S Du, S Kakade, Y Luo, N Saunshi International Conference on Machine Learning, 367-376, 2020 | 88 | 2020 |
| Learning barrier certificates: Towards safe reinforcement learning with zero training-time violations Y Luo, T Ma Advances in Neural Information Processing Systems 34, 25621-25632, 2021 | 59 | 2021 |
| Learning online alignments with continuous rewards policy gradient Y Luo, CC Chiu, N Jaitly, I Sutskever 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 54 | 2017 |
| Towards learning to play piano with dexterous hands and touch H Xu, Y Luo, S Wang, T Darrell, R Calandra 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022 | 47 | 2022 |
| On the expressivity of neural networks for deep reinforcement learning K Dong, Y Luo, T Yu, C Finn, T Ma International conference on machine learning, 2627-2637, 2020 | 38 | 2020 |
| Learning self-correctable policies and value functions from demonstrations with negative sampling Y Luo, H Xu, T Ma arXiv preprint arXiv:1907.05634, 2019 | 22 | 2019 |
| An online sequence-to-sequence model for noisy speech recognition CC Chiu, D Lawson, Y Luo, G Tucker, K Swersky, I Sutskever, N Jaitly arXiv preprint arXiv:1706.06428, 2017 | 10 | 2017 |
| Recurrent neural networks for online sequence generation CC Chiu, N Jaitly, I Sutskever, Y Luo US Patent 10,281,885, 2019 | 9 | 2019 |
| MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings H Chen, H Liu, Y Luo, L Wang, N Yang, F Wei, Z Dou arXiv preprint arXiv:2506.23115, 2025 | 7 | 2025 |
| Bootstrapping the expressivity with model-based planning K Dong, Y Luo, T Ma | 2 | 2019 |
| Towards Efficient and Effective Deep Model-Based Reinforcement Learning Y Luo Princeton University, 2022 | | 2022 |
| Recurrent neural networks for online sequence generation CC Chiu, N Jaitly, I Sutskever, Y Luo US Patent 10,656,605, 2020 | | 2020 |