[go: up one dir, main page]

Follow
Xiaofeng Wang
Xiaofeng Wang
Verified email at mail.tsinghua.edu.cn - Homepage
Title
Cited by
Cited by
Year
Drivedreamer: Towards real-world-driven world models for autonomous driving
X Wang, Z Zhu, G Huang, X Chen, J Zhu, J Lu
ECCV-2024, 2023
3842023
Openoccupancy: A large scale benchmark for surrounding semantic occupancy perception
X Wang, Z Zhu, W Xu, Y Zhang, Y Wei, X Chi, Y Ye, D Du, J Lu, X Wang
ICCV-2023, 2023
2672023
Mvster: Epipolar transformer for efficient multi-view stereo
X Wang, Z Zhu, G Huang, F Qin, Y Ye, Y He, X Chi, X Wang
ECCV-2022, 2022
1702022
Drivedreamer-2: Llm-enhanced world models for diverse driving video generation
G Zhao*, X Wang*, Z Zhu*, X Chen, G Huang, X Bao, X Wang
AAAI-2025 (Equal Contribution), 2024
1662024
On the Road with GPT-4V (ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent
L Wen*, X Yang*, D Fu*, X Wang*, P Cai, X Li, MA Tao, Y Li, XU Linran, ...
ICLR-2024 (Equal Contribution) Workshop on Large Language Model (LLM) Agents, 2024
140*2024
Is sora a world simulator? a comprehensive survey on general world models and beyond
Z Zhu*, X Wang*, W Zhao*, C Min*, N Deng*, M Dou*, Y Wang*, B Shi, ...
(Equal Contribution) arXiv preprint arXiv:2405.03520, 2024
992024
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
F Liu, S Zhang, X Wang, Y Wei, H Qiu, Y Zhao, Y Zhang, Q Ye, F Wan
CVPR-2025, 2024
852024
Drivedreamer4d: World models are effective data machines for 4d driving scene representation
G Zhao*, C Ni*, X Wang*, Z Zhu*, G Huang, X Chen, B Wang, Y Zhang, ...
CVPR-2025 (Equal Contribution), 2024
672024
Worlddreamer: Towards general world models for video generation via predicting masked tokens
X Wang, Z Zhu, G Huang, B Wang, X Chen, J Lu
arXiv preprint arXiv:2401.09985, 2024
632024
ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
C Ni*, G Zhao*, X Wang*, Z Zhu*, W Qin, G Huang, C Liu, Y Chen, ...
CVPR-2025 (Equal Contribution), 2024
562024
Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion
WZ B Li, Y Sun, Z Liang, D Du, Z Zhang, X Wang, Y Wang, X Jin
IJCAI-2024, 2024
43*2024
Crafting monocular cues and velocity guidance for self-supervised multi-frame depth learning
X Wang, Z Zhu, G Huang, X Chi, Y Ye, Z Chen, X Wang
AAAI-2023, 2023
352023
Are we ready for vision-centric driving streaming perception? the asap benchmark
X Wang, Z Zhu, Y Zhang, G Huang, Y Ye, W Xu, Z Chen, X Wang
CVPR-2023, 2023
292023
Recondreamer++: Harmonizing generative and reconstructive models for driving scene representation
G Zhao, X Wang, C Ni, Z Zhu, W Qin, G Huang, X Wang
arXiv preprint arXiv:2503.18438, 2025
242025
Egovid-5m: A large-scale video-action dataset for egocentric video generation
X Wang, K Zhao, F Liu, J Wang, G Zhao, X Bao, Z Zhu, Y Zhang, X Wang
arXiv preprint arXiv:2411.08380, 2024
212024
Wonderturbo: Generating interactive 3d world in 0.72 seconds
C Ni, X Wang, Z Zhu, W Wang, H Li, G Zhao, J Li, W Qin, G Huang, W Mei
arXiv preprint arXiv:2504.02261, 2025
162025
Embodiedreamer: Advancing real2sim2real transfer for policy training via embodied world modeling
B Wang, X Meng, X Wang, Z Zhu, A Ye, Y Wang, Z Yang, C Ni, G Huang, ...
arXiv preprint arXiv:2507.05198, 2025
132025
Vla-r1: Enhancing reasoning in vision-language-action models
A Ye, Z Zhang, B Wang, X Wang, D Zhang, Z Zhu
arXiv preprint arXiv:2510.01623, 2025
112025
RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer
L Liu, X Wang, G Zhao, K Li, W Qin, J Qiu, Z Zhu, G Huang, Z Su
arXiv preprint arXiv:2505.23171, 2025
102025
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
B Wang, X Wang, C Ni, G Zhao, Z Yang, Z Zhu, M Zhang, Y Zhou, X Chen, ...
CVPR-2025 (Equal Contribution), 2025
102025
The system can't perform the operation now. Try again later.
Articles 1–20