| ST-LLM: Large language models are effective temporal learners R Liu, C Li, H Tang, Y Ge, Y Shan, G Li European Conference on Computer Vision, 1-18, 2024 | 122 | 2024 |
| RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter M Cao*, H Tang*, J Huang, P Jin, C Zhang, R Liu, L Chen, X Liang, ... Findings of the Association for Computational Linguistics (ACL 2024), 2024 | 35 | 2024 |
| MUSE: Mamba is efficient multi-scale learner for text-video retrieval H Tang, M Cao, J Huang, R Liu, P Jin, G Li, X Liang Proceedings of the AAAI Conference on Artificial Intelligence 39, 2024 | 24 | 2024 |
| PhysGame: Uncovering physical commonsense violations in gameplay videos M Cao*, H Tang*, H Zhao*, H Guo, J Liu, G Zhang, R Liu, Q Sun, I Reid, ... arXiv preprint arXiv:2412.01800, 2024 | 22 | 2024 |
| PPLLaVA: Varied video sequence understanding with prompt guidance R Liu, H Tang, H Liu, Y Ge, Y Shan, C Li, J Yang arXiv preprint arXiv:2411.02327, 2024 | 21 | 2024 |
| Video SimpleQA: Towards factuality evaluation in large video language models M Cao, P Hu, Y Wang, J Gu, H Tang, H Zhao, C Wang, J Dong, W Yu, ... arXiv preprint arXiv:2503.18923, 2025 | 12 | 2025 |
| Flow4Agent: Long-form video understanding via motion prior from optical flow R Liu, S Sun, H Tang, W Gao, G Li Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2025 | 3 | 2025 |
| Open-Vocabulary 3D Instruction Ambiguity Detection J Ding, H Tang, G Li arXiv preprint arXiv:2601.05991, 2026 | | 2026 |
| Seeing through Imagination: Learning Scene Geometry via Implicit Spatial World Modeling M Cao, H Lin, H Li, H Tang, R Xu, D An, X Liu, I Reid, X Liang arXiv preprint arXiv:2512.01821, 2025 | | 2025 |
| Video Spatial Reasoning with Object-Centric 3D Rollout H Tang, M Cao, R Liu, X Liang, L Li, G Li, X Liang Proceedings of the AAAI Conference on Artificial Intelligence 40, 2025 | | 2025 |
| Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos M Cao, H Tang, H Zhao, M Han, R Liu, Q Sun, X Chang, I Reid, X Liang | | |