[go: up one dir, main page]

Follow
Jinliang Zheng
Jinliang Zheng
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Cited by
Year
MixMAE: Mixed and masked autoencoder for efficient pretraining of hierarchical vision transformers
J Liu, X Huang, J Zheng, Y Liu, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
912023
Diffusion-Based Planning for Autonomous Driving with Flexible Guidance
Y Zheng, R Liang, K Zheng, J Zheng, L Mao, J Li, W Gu, R Ai, SE Li, ...
ICLR 2025 (Oral), 2025
772025
Mixmim: Mixed and masked image modeling for efficient visual representation learning
J Liu, X Huang, J Zheng, Y Liu, H Li
622022
Universal Actions for Enhanced Embodied Foundation Models
J Zheng, J Li, D Liu, Y Zheng, Z Wang, Z Ou, Y Liu, J Liu, YQ Zhang, ...
CVPR 2025, 2025
46*2025
Gobigger: A scalable platform for cooperative-competitive multi-agent interactive simulation
M Zhang, S Zhang, Z Yang, L Chen, J Zheng, C Yang, C Li, H Zhou, Y Niu, ...
ICLR 2023, 2023
162023
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
J Li*, J Zheng*, Y Zheng*, L Mao, X Hu, S Cheng, H Niu, J Liu, Y Liu, J Liu, ...
ICML 2024, 2024
152024
Instruction-Guided Visual Masking
J Zheng*, J Li*, S Cheng, Y Zheng, J Li, J Liu, Y Liu, J Liu, X Zhan
NeurIPS 2024, 2024
142024
X-vla: Soft-prompted transformer as scalable cross-embodiment vision-language-action model
J Zheng, J Li, Z Wang, D Liu, X Kang, Y Feng, Y Zheng, J Zou, Y Chen, ...
🏅Champion @ AgiBot World Challenge @ IROS 2025, 2025
112025
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
J Liu*, J Zheng*, Y Liu, H Li
CVPR 2024, 2024
92024
MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment
X Huang*, J Liu*, J Zheng*, B Liu, J Wang, Y Liu, H Li, O Yoshie
Neurocomputing, 131164, 2025
6*2025
Flow matching-based autonomous driving planning with advanced interactive behavior modeling
T Tan, Y Zheng, R Liang, Z Wang, K Zheng, J Zheng, J Li, X Zhan, J Liu
NeurIPS 2025, 2025
32025
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning
J Li*, Z Wang*, J Zheng*, X Zhou, G Wang, G Song, Y Liu, J Liu, ...
ICRA 2025, 2024
32024
PhysiAgent: An Embodied Agent Framework in Physical World
Z Wang, J Li, J Zheng, W Zhang, D Liu, Y Zheng, H Niu, J Yu, X Zhan
arXiv preprint arXiv:2509.24524, 2025
12025
Enhancing Vision-Language Model with Unmasked Token Alignment
J Liu, J Zheng, B Liu, Y Liu, H Li
TMLR 2024, 2024
12024
LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding
Y Hu, J Liu, K Wang, J Zheng, W Shi, M Zhang, Q Dou, R Liu, A Zhou, H Li
EMNLP 2025, 2025
2025
MixMAE: Mixed and masked autoencoder for efficient pretraining of hierarchical vision transformers
J Liu, X Huang, J Zheng, Y Liu, H Li
CVPR 2023, 6252-6261, 2023
2023
Efficient Robotic Policy Learning via Latent Space Backward Planning
D Liu, H Niu, Z Wang, J Zheng, Y Zheng, Z Ou, J HU, J Li, X Zhan
ICML 2025, 0
The system can't perform the operation now. Try again later.
Articles 1–17