| V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning. Z Cheng, J Hu, Z Liu, C Si, W Li, S Gong. arXiv preprint arXiv:2503.11495, 2025 | 27 | 2025 |
| CoS: Chain-of-Shot Prompting for Long Video Understanding. J Hu, Z Cheng, C Si, W Li, S Gong. arXiv preprint arXiv:2502.06428, 2025 | 17 | 2025 |
| SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding. Z Cheng, Y Pu, S Gong, P Kordjamshidi, Y Kong. European Conference on Computer Vision 2024, 398-416, 2024 | 4 | 2024 |
| INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation. J Hu, Z Cheng, S Gong. arXiv preprint arXiv:2501.18753, 2025 | 3 | 2025 |
| Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Video Temporal Grounding. J Hu, Z Cheng, S Gong, I Guan, J Hao, J Wang, K Shao. The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025 | | 2025 |