| Humanity's last exam L Phan, A Gatti, Z Han, N Li, J Hu, H Zhang, CBC Zhang, M Shaaban, ... arXiv preprint arXiv:2501.14249, 2025 | 304 | 2025 |
| Tracevla: Visual trace prompting enhances spatial-temporal awareness for generalist robotic policies R Zheng*, Y Liang*, S Huang, J Gao, H Daumé III, A Kolobov, F Huang, ... ICLR 2025, 2024 | 127 | 2024 |
| Magma: A foundation model for multimodal ai agents J Yang, R Tan, Q Wu, R Zheng, B Peng, Y Liang, Y Gu, M Cai, S Ye, ... Proceedings of the Computer Vision and Pattern Recognition Conference, 14203 …, 2025 | 109 | 2025 |
| Who is the strongest enemy? towards optimal and efficient evasion attacks in deep rl Y Sun, R Zheng, Y Liang, F Huang ICLR 2022, 2021 | 99 | 2021 |
| Efficient adversarial training without attacking: Worst-case-aware robust reinforcement learning Y Liang, Y Sun, R Zheng, F Huang Advances in neural information processing systems 35, 22547-22561, 2022 | 70 | 2022 |
| Drm: Mastering visual reinforcement learning through dormant ratio minimization G Xu*, R Zheng*, Y Liang*, X Wang, Z Yuan, T Ji, Y Luo, X Liu, J Yuan, ... ICLR 2024 (Spotlight), 2023 | 48 | 2023 |
| Is poisoning a real threat to LLM alignment? Maybe more so than you think P Pathmanathan, S Chakraborty, X Liu, Y Liang, F Huang arXiv preprint arXiv:2406.12091, 2024 | 37* | 2024 |
| Certifiably robust policy learning against adversarial multi-agent communication Y Sun, R Zheng, P Hassanzadeh, Y Liang, S Feizi, S Ganesh, F Huang The Eleventh International Conference on Learning Representations, 2023 | 28 | 2023 |
| Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets G Jiang, Y Sun, T Huang, H Li, Y Liang, H Xu ICLR 2025, 2024 | 22 | 2024 |
| Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations Y Liang, Y Sun, R Zheng, X Liu, B Eysenbach, T Sandholm, F Huang, ... ICLR 2024, 2023 | 17 | 2023 |
| Premier-TACO is a few-shot policy learner: Pretraining multitask representation via temporal action-driven contrastive loss R Zheng, Y Liang, X Wang, S Ma, H Daumé III, H Xu, J Langford, ... arXiv preprint arXiv:2402.06187, 2024 | 15 | 2024 |
| Certifiably robust policy learning against adversarial communication in multi-agent systems Y Sun, R Zheng, P Hassanzadeh, Y Liang, S Feizi, S Ganesh, F Huang arXiv preprint arXiv:2206.10158, 2022 | 15 | 2022 |
| Parallel knowledge transfer in multi-agent reinforcement learning Y Liang, B Li arXiv preprint arXiv:2003.13085, 2020 | 14 | 2020 |
| ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization T Ji*, Y Liang*, Y Zeng, Y Luo, G Xu, J Guo, R Zheng, F Huang, F Sun, ... ICML 2024 (Oral), 2024 | 12 | 2024 |
| Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies X Liu, C Deng, Y Sun, Y Liang, F Huang ICLR 2024 (Spotlight), 2024 | 12 | 2024 |
| ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs X Wang, Z Yang, C Feng, Y Liang, Y Zhou, X Liu, Z Zang, M Li, CC Lin, ... arXiv preprint arXiv:2506.10128, 2025 | 9 | 2025 |
| Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion Y Liang, T Xu, K Hu, G Jiang, F Huang, H Xu Annual Conference on Neural Information Processing Systems 38, 2024 | 9 | 2024 |
| Fdnas: Improving data privacy and model diversity in automl C Zhang, Y Liang, X Yuan, L Cheng arXiv preprint arXiv:2011.03372, 2020 | 5 | 2020 |
| InstantNet: Automated generation and deployment of instantaneously switchable-precision networks Y Fu, Z Yu, Y Zhang, Y Jiang, C Li, Y Liang, M Jiang, Z Wang, Y Lin 2021 58th ACM/IEEE Design Automation Conference (DAC), 757-762, 2021 | 4 | 2021 |
| MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning Z Cai, A Wang, A Satheesh, A Nakhawa, H Jae, K Powell, M Liu, N Jay, ... arXiv preprint arXiv:2506.05523, 2025 | 2 | 2025 |