Yongyuan Liang

Cited by

	All	Since 2021
Citations	958	955
h-index	13	12
i10-index	15	15

700

350

175

525

20202021202220232024202520263 7 27 62 133 687 37

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Furong HuangAssociate Professor of Computer Science, University of MarylandVerified email at umd.edu
Huazhe XuTsinghua UniversityVerified email at berkeley.edu
Hal Daumé IIIAssociate Professor of Computer Science, University of MarylandVerified email at umiacs.umd.edu
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Jianwei YangMember of Technical Staff, xAIVerified email at x.ai
John LangfordMicrosoft Research New YorkVerified email at hunch.net
Stephen McAleerAnthropicVerified email at openai.com
Tuomas SandholmAngel Jordan University Professor of Computer Science, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Benjamin EysenbachPrinceton UniversityVerified email at princeton.edu
Ziqiao MaUniversity of MichiganVerified email at umich.edu
Yue WangUSCVerified email at csail.mit.edu

Yongyuan Liang

University of Maryland, College Park

Verified email at umd.edu - Homepage

Large Language Models Large Multimodal Models Reinforcement Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Humanity's last exam L Phan, A Gatti, Z Han, N Li, J Hu, H Zhang, CBC Zhang, M Shaaban, ... arXiv preprint arXiv:2501.14249, 2025	304	2025
Tracevla: Visual trace prompting enhances spatial-temporal awareness for generalist robotic policies R Zheng, Y Liang, S Huang, J Gao, H Daumé III, A Kolobov, F Huang, ... ICLR 2025, 2024	127	2024
Magma: A foundation model for multimodal ai agents J Yang, R Tan, Q Wu, R Zheng, B Peng, Y Liang, Y Gu, M Cai, S Ye, ... Proceedings of the Computer Vision and Pattern Recognition Conference, 14203 …, 2025	109	2025
Who is the strongest enemy? towards optimal and efficient evasion attacks in deep rl Y Sun, R Zheng, Y Liang, F Huang ICLR 2022, 2021	99	2021
Efficient adversarial training without attacking: Worst-case-aware robust reinforcement learning Y Liang, Y Sun, R Zheng, F Huang Advances in neural information processing systems 35, 22547-22561, 2022	70	2022
Drm: Mastering visual reinforcement learning through dormant ratio minimization G Xu, R Zheng, Y Liang*, X Wang, Z Yuan, T Ji, Y Luo, X Liu, J Yuan, ... ICLR 2024 (Spotlight), 2023	48	2023
Is poisoning a real threat to LLM alignment? Maybe more so than you think P Pathmanathan, S Chakraborty, X Liu, Y Liang, F Huang arXiv preprint arXiv:2406.12091, 2024	37*	2024
Certifiably robust policy learning against adversarial multi-agent communication Y Sun, R Zheng, P Hassanzadeh, Y Liang, S Feizi, S Ganesh, F Huang The Eleventh International Conference on Learning Representations, 2023	28	2023
Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets G Jiang, Y Sun, T Huang, H Li, Y Liang, H Xu ICLR 2025, 2024	22	2024
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations Y Liang, Y Sun, R Zheng, X Liu, B Eysenbach, T Sandholm, F Huang, ... ICLR 2024, 2023	17	2023
Premier-TACO is a few-shot policy learner: Pretraining multitask representation via temporal action-driven contrastive loss R Zheng, Y Liang, X Wang, S Ma, H Daumé III, H Xu, J Langford, ... arXiv preprint arXiv:2402.06187, 2024	15	2024
Certifiably robust policy learning against adversarial communication in multi-agent systems Y Sun, R Zheng, P Hassanzadeh, Y Liang, S Feizi, S Ganesh, F Huang arXiv preprint arXiv:2206.10158, 2022	15	2022
Parallel knowledge transfer in multi-agent reinforcement learning Y Liang, B Li arXiv preprint arXiv:2003.13085, 2020	14	2020
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization T Ji, Y Liang, Y Zeng, Y Luo, G Xu, J Guo, R Zheng, F Huang, F Sun, ... ICML 2024 (Oral), 2024	12	2024
Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies X Liu, C Deng, Y Sun, Y Liang, F Huang ICLR 2024 (Spotlight), 2024	12	2024
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs X Wang, Z Yang, C Feng, Y Liang, Y Zhou, X Liu, Z Zang, M Li, CC Lin, ... arXiv preprint arXiv:2506.10128, 2025	9	2025
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion Y Liang, T Xu, K Hu, G Jiang, F Huang, H Xu Annual Conference on Neural Information Processing Systems 38, 2024	9	2024
Fdnas: Improving data privacy and model diversity in automl C Zhang, Y Liang, X Yuan, L Cheng arXiv preprint arXiv:2011.03372, 2020	5	2020
InstantNet: Automated generation and deployment of instantaneously switchable-precision networks Y Fu, Z Yu, Y Zhang, Y Jiang, C Li, Y Liang, M Jiang, Z Wang, Y Lin 2021 58th ACM/IEEE Design Automation Conference (DAC), 757-762, 2021	4	2021
MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning Z Cai, A Wang, A Satheesh, A Nakhawa, H Jae, K Powell, M Liu, N Jay, ... arXiv preprint arXiv:2506.05523, 2025	2	2025

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors