‪Haoran Tang‬ - ‪Google Scholar‬

Cited by

	All	Since 2021
Citations	239	239
h-index	6	6
i10-index	6	6

Citations per year: 2023: 2; 2024: 43; 2025: 181; 2026: 9

Public access

2 articles available, 0 articles not available (based on funding mandates)

Co-authors

Ruyang Liu; Peking University; Verified email at stu.pku.edu.cn
Xiaodan Liang; Professor of Computer Science, Sun Yat-sen University, MBZUAI, CMU, NUS; Verified email at mail2.sysu.edu.cn
Meng Cao; Mohamed bin Zayed University of Artificial Intelligence; Verified email at mbzuai.ac.ae
Li Yuan, 袁粒; Peking University, Shenzhen Graduate School, School of AI4S & ECE; Verified email at pku.edu.cn

Haoran Tang

Peking University

Verified email at stu.pku.edu.cn

Text-video retrieval, Video-LLM


Title	Cited by	Year
St-llm: Large language models are effective temporal learners; R Liu, C Li, H Tang, Y Ge, Y Shan, G Li; European Conference on Computer Vision, 1-18, 2024	122	2024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter; M Cao, H Tang, J Huang, P Jin, C Zhang, R Liu, L Chen, X Liang, ...; Findings of the Association for Computational Linguistics (ACL 2024), 2024	35	2024
Muse: Mamba is efficient multi-scale learner for text-video retrieval; H Tang, M Cao, J Huang, R Liu, P Jin, G Li, X Liang; Proceedings of the AAAI Conference on Artificial Intelligence 39, 2024	24	2024
Physgame: Uncovering physical commonsense violations in gameplay videos; M Cao, H Tang, H Zhao*, H Guo, J Liu, G Zhang, R Liu, Q Sun, I Reid, ...; arXiv preprint arXiv:2412.01800, 2024	22	2024
Ppllava: Varied video sequence understanding with prompt guidance; R Liu, H Tang, H Liu, Y Ge, Y Shan, C Li, J Yang; arXiv preprint arXiv:2411.02327, 2024	21	2024
Video simpleqa: Towards factuality evaluation in large video language models; M Cao, P Hu, Y Wang, J Gu, H Tang, H Zhao, C Wang, J Dong, W Yu, ...; arXiv preprint arXiv:2503.18923, 2025	12	2025
Flow4agent: Long-form video understanding via motion prior from optical flow; R Liu, S Sun, H Tang, W Gao, G Li; Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2025	3	2025
Open-Vocabulary 3D Instruction Ambiguity Detection; J Ding, H Tang, G Li; arXiv preprint arXiv:2601.05991, 2026		2026
Seeing through Imagination: Learning Scene Geometry via Implicit Spatial World Modeling; M Cao, H Lin, H Li, H Tang, R Xu, D An, X Liu, I Reid, X Liang; arXiv preprint arXiv:2512.01821, 2025		2025
Video Spatial Reasoning with Object-Centric 3D Rollout; H Tang, M Cao, R Liu, X Liang, L Li, G Li, X Liang; Proceedings of the AAAI Conference on Artificial Intelligence 40, 2025		2025
Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos; M Cao, H Tang, H Zhao, M Han, R Liu, Q Sun, X Chang, I Reid, X Liang

Articles 1–11