[go: up one dir, main page]

Follow
Yongyuan Liang
Title
Cited by
Cited by
Year
Humanity's last exam
L Phan, A Gatti, Z Han, N Li, J Hu, H Zhang, CBC Zhang, M Shaaban, ...
arXiv preprint arXiv:2501.14249, 2025
3042025
Tracevla: Visual trace prompting enhances spatial-temporal awareness for generalist robotic policies
R Zheng*, Y Liang*, S Huang, J Gao, H Daumé III, A Kolobov, F Huang, ...
ICLR 2025, 2024
1272024
Magma: A foundation model for multimodal ai agents
J Yang, R Tan, Q Wu, R Zheng, B Peng, Y Liang, Y Gu, M Cai, S Ye, ...
Proceedings of the Computer Vision and Pattern Recognition Conference, 14203 …, 2025
1092025
Who is the strongest enemy? towards optimal and efficient evasion attacks in deep rl
Y Sun, R Zheng, Y Liang, F Huang
ICLR 2022, 2021
992021
Efficient adversarial training without attacking: Worst-case-aware robust reinforcement learning
Y Liang, Y Sun, R Zheng, F Huang
Advances in neural information processing systems 35, 22547-22561, 2022
702022
Drm: Mastering visual reinforcement learning through dormant ratio minimization
G Xu*, R Zheng*, Y Liang*, X Wang, Z Yuan, T Ji, Y Luo, X Liu, J Yuan, ...
ICLR 2024 (Spotlight), 2023
482023
Is poisoning a real threat to LLM alignment? Maybe more so than you think
P Pathmanathan, S Chakraborty, X Liu, Y Liang, F Huang
arXiv preprint arXiv:2406.12091, 2024
37*2024
Certifiably robust policy learning against adversarial multi-agent communication
Y Sun, R Zheng, P Hassanzadeh, Y Liang, S Feizi, S Ganesh, F Huang
The Eleventh International Conference on Learning Representations, 2023
282023
Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets
G Jiang, Y Sun, T Huang, H Li, Y Liang, H Xu
ICLR 2025, 2024
222024
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Y Liang, Y Sun, R Zheng, X Liu, B Eysenbach, T Sandholm, F Huang, ...
ICLR 2024, 2023
172023
Premier-TACO is a few-shot policy learner: Pretraining multitask representation via temporal action-driven contrastive loss
R Zheng, Y Liang, X Wang, S Ma, H Daumé III, H Xu, J Langford, ...
arXiv preprint arXiv:2402.06187, 2024
152024
Certifiably robust policy learning against adversarial communication in multi-agent systems
Y Sun, R Zheng, P Hassanzadeh, Y Liang, S Feizi, S Ganesh, F Huang
arXiv preprint arXiv:2206.10158, 2022
152022
Parallel knowledge transfer in multi-agent reinforcement learning
Y Liang, B Li
arXiv preprint arXiv:2003.13085, 2020
142020
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
T Ji*, Y Liang*, Y Zeng, Y Luo, G Xu, J Guo, R Zheng, F Huang, F Sun, ...
ICML 2024 (Oral), 2024
122024
Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies
X Liu, C Deng, Y Sun, Y Liang, F Huang
ICLR 2024 (Spotlight), 2024
122024
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs
X Wang, Z Yang, C Feng, Y Liang, Y Zhou, X Liu, Z Zang, M Li, CC Lin, ...
arXiv preprint arXiv:2506.10128, 2025
92025
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion
Y Liang, T Xu, K Hu, G Jiang, F Huang, H Xu
Annual Conference on Neural Information Processing Systems 38, 2024
92024
Fdnas: Improving data privacy and model diversity in automl
C Zhang, Y Liang, X Yuan, L Cheng
arXiv preprint arXiv:2011.03372, 2020
52020
InstantNet: Automated generation and deployment of instantaneously switchable-precision networks
Y Fu, Z Yu, Y Zhang, Y Jiang, C Li, Y Liang, M Jiang, Z Wang, Y Lin
2021 58th ACM/IEEE Design Automation Conference (DAC), 757-762, 2021
42021
MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning
Z Cai, A Wang, A Satheesh, A Nakhawa, H Jae, K Powell, M Liu, N Jay, ...
arXiv preprint arXiv:2506.05523, 2025
22025
The system can't perform the operation now. Try again later.
Articles 1–20