Haiyang Sun

Cited by

	All	Since 2021
Citations	591	591
h-index	9	9
i10-index	9	9

480

240

120

360

202420252026103 469 17

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Kun ZhanAI Researcher, LiAutoVerified email at lixiang.com
Lijun ZhouXiaomi CorporationVerified email at mails.ucas.edu.cn
Peng JiaCEO, Simplexity RoboticsVerified email at s-robots.com
Haotong LinZhejiang universityVerified email at zju.edu.cn
Sida PengZhejiang UniversityVerified email at zju.edu.cn
Tao TangSun Yat-sen UniversityVerified email at mail2.sysu.edu.cn
Xiaowei ZhouProfessor of Computer Science, Zhejiang UniversityVerified email at zju.edu.cn
Yunzhi YanZhejiang UniversityVerified email at zju.edu.cn
Xin YuAdelaide UniversityVerified email at adelaide.edu.au
Kaicheng YuAssistant Professor, Westlake University, PI of Autonomous Intelligence LabVerified email at westlake.edu.cn
Chenxu ZhouZhejiang UniversityVerified email at zju.edu.cn
Enhui MaWestlake UniversityVerified email at westlake.edu.cn
Gangwei XuHuazhong University of Science and TechnologyVerified email at hust.edu.cn
Hao ZhaoTsinghua UniversityVerified email at air.tsinghua.edu.cn
Xiaodan LiangProfessor of Computer Science, Sun Yat-sen University, MBZUAI, CMU, NUSVerified email at mail2.sysu.edu.cn
Bu JinHKUSTVerified email at connect.ust.hk
Shuyun WangThe University of QueenslandVerified email at uq.edu.au
Xiao-Xiao LongAssociate Professor at Nanjing University; AnySyn3DVerified email at nju.edu.cn
Pengfei LiInstitute for AI Industry Research, Tsinghua UniversityVerified email at mails.tsinghua.edu.cn

Haiyang Sun

Xiaomi EV

Verified email at xiaomi.com

World Model Autonomous Driving 3D Vision


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Street gaussians: Modeling dynamic urban scenes with gaussian splatting Y Yan, H Lin, C Zhou, W Wang, H Sun, K Zhan, X Lang, X Zhou, S Peng European Conference on Computer Vision, 156-173, 2024	321	2024
Unleashing generalization of end-to-end autonomous driving with controllable long video generation E Ma, L Zhou, T Tang, Z Zhang, D Han, J Jiang, K Zhan, P Jia, X Lang, ... arXiv preprint arXiv:2406.01349, 2024	49*	2024
Tod3cap: Towards 3d dense captioning in outdoor scenes B Jin, Y Zheng, P Li, W Li, Y Zheng, S Hu, X Liu, J Zhu, Z Yan, H Sun, ... European Conference on Computer Vision, 367-384, 2024	36*	2024
Recogdrive: A reinforced cognitive framework for end-to-end autonomous driving Y Li, K Xiong, X Guo, F Li, S Yan, G Xu, L Zhou, L Chen, H Sun, B Wang, ... arXiv preprint arXiv:2506.08052, 2025	35	2025
Bev-tsr: Text-scene retrieval in bev space for autonomous driving T Tang, D Wei, Z Jia, T Gao, C Cai, C Hou, P Jia, K Zhan, H Sun, ... Proceedings of the AAAI Conference on Artificial Intelligence 39 (7), 7275-7283, 2025	30*	2025
OpenSight: A simple open-vocabulary framework for LiDAR-based object detection H Zhang, J Xu, T Tang, H Sun, X Yu, Z Huang, K Yu European Conference on Computer Vision, 1-19, 2024	28	2024
Dive: Dit-based video generation with enhanced control J Jiang, G Hong, L Zhou, E Ma, H Hu, X Zhou, J Xiang, F Liu, K Yu, H Sun, ... arXiv preprint arXiv:2409.01595, 2024	24	2024
3drealcar: An in-the-wild rgb-d car dataset with 360-degree views X Du, Y Wang, H Sun, Z Wu, H Sheng, S Wang, J Ying, M Lu, T Zhu, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2025	16	2025
S2-Track: A Simple yet Strong Approach for End-to-End 3D Multi-Object Tracking T Tang, L Zhou, P Hao, Z He, K Ho, S Gu, Z Hao, H Sun, K Zhan, P Jia, ... arXiv preprint arXiv:2406.02147, 2024	11*	2024
Diverse sign language translation X Shen, L Shen, S Yuan, H Du, H Sun, X Yu arXiv preprint arXiv:2410.19586, 2024	9	2024
Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency X Guo, Z Wu, K Xiong, Z Xu, L Zhou, G Xu, S Xu, H Sun, B Wang, G Chen, ... arXiv preprint arXiv:2506.07497, 2025	8	2025
Pixel-perfect depth with semantics-prompted diffusion transformers G Xu, H Lin, H Luo, X Wang, J Yao, L Zhu, Y Pu, C Chi, H Sun, B Wang, ... arXiv preprint arXiv:2510.07316, 2025	6	2025
Cogen: 3d consistent video generation via adaptive conditioning for autonomous driving Y Ji, Z Zhu, Z Zhu, K Xiong, M Lu, Z Li, L Zhou, H Sun, B Wang, T Lu arXiv preprint arXiv:2503.22231, 2025	5	2025
DreamCar: Leveraging Car-Specific Prior for In-the-Wild 3D Car Reconstruction X Du, H Sun, M Lu, T Zhu, X Yu IEEE Robotics and Automation Letters, 2024	5	2024
Uni-gaussians: Unifying camera and lidar simulation with gaussians for dynamic driving scenarios Z Yuan, Y Pu, H Luo, F Lang, C Chi, T Li, Y Shen, H Sun, B Wang, X Yang arXiv preprint arXiv:2503.08317, 2025	3	2025
PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth B Jin, W Li, B Yang, Z Zhu, J Jiang, H Gao, H Sun, K Zhan, H Hu, X Zhang, ... arXiv preprint arXiv:2505.01729, 2025	2	2025
MiMo-Embodied: X-Embodied Foundation Model Technical Report X Hao, L Zhou, Z Huang, Z Hou, Y Tang, L Zhang, G Li, Z Lu, S Ren, ... arXiv preprint arXiv:2511.16518, 2025	1	2025
Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks K Zeng, Z Wu, K Xiong, X Wei, X Guo, Z Zhu, K Ho, L Zhou, B Zeng, M Lu, ... arXiv preprint arXiv:2510.19195, 2025	1	2025
CMamba: Learned Image Compression with State Space Models Z Wu, H Du, S Wang, M Lu, H Sun, Y Guo, X Yu arXiv preprint arXiv:2502.04988, 2025	1	2025
Pixel-Perfect Visual Geometry Estimation G Xu, H Lin, H Luo, H Sun, B Wang, G Chen, S Peng, H Ye, X Yang arXiv preprint arXiv:2601.05246, 2026		2026

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors