Zhi Gao (高志)

Cited by

	All	Since 2021
Citations	987	952
h-index	16	16
i10-index	18	18

Citations per year
Year	2019	2020	2021	2022	2023	2024	2025	2026
Citations	5	29	49	76	96	226	493	12

Public access (based on funding mandates)

Available	13 articles
Not available	4 articles

Co-authors

Yunde Jia, Professor of Computer Science, Beijing Institute of Technology. Verified email at bit.edu.cn
Yuwei Wu (武玉伟), Beijing Institute of Technology. Verified email at bit.edu.cn
Mehrtash Harandi, Department of Electrical and Computer Systems Engineering, Monash University. Verified email at monash.edu
Xiaomeng Fan, Beijing Institute of Technology. Verified email at bit.edu.cn

Zhi Gao (高志)

Beijing Institute of Technology

Verified email at bit.edu.cn

Computer Vision, Machine Learning, Multi-Modal Learning, Riemannian Geometry


Title	Cited by	Year
VideoAgent: A Memory-Augmented Multimodal Agent for Video Understanding. Y Fan, X Ma, R Wu, Y Du, J Li, Z Gao, Q Li. European Conference on Computer Vision, 75-92, 2024	154	2024
A hyperbolic-to-hyperbolic graph convolutional network. J Dai, Y Wu, Z Gao, Y Jia. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	119	2021
Curvature generation in curved spaces for few-shot learning. Z Gao, Y Wu, Y Jia, M Harandi. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	93	2021
Meta-causal learning for single domain generalization. J Chen, Z Gao, X Wu, J Luo. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	88	2023
Deep convolutional network with locality and sparsity constraints for texture classification. X Bu, Y Wu, Z Gao, Y Jia. Pattern Recognition 91, 34-46, 2019	72	2019
CLOVA: A closed-loop visual assistant with tool usage and update. Z Gao, Y Du, X Zhang, X Ma, W Han, SC Zhu, Q Li. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	53	2024
Revisiting bilinear pooling: A coding perspective. Z Gao, Y Wu, X Zhang, J Dai, Y Jia, M Harandi. Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3954-3961, 2020	50	2020
A robust distance measure for similarity-based classification on the SPD manifold. Z Gao, Y Wu, M Harandi, Y Jia. IEEE Transactions on Neural Networks and Learning Systems 31 (9), 3230-3244, 2019	48	2019
Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL. X Zhang, Z Gao, B Zhang, P Li, X Zhang, Y Liu, T Yuan, Y Wu, Y Jia, ... arXiv preprint arXiv:2505.15436, 2025	43	2025
Learning a robust representation via a deep network on symmetric positive definite manifolds. Z Gao, Y Wu, X Bu, T Yu, J Yuan, Y Jia. Pattern Recognition 92, 1-12, 2019	43	2019
Multi-modal agent tuning: Building a VLM-driven agent for efficient tool usage. Z Gao, B Zhang, P Li, X Ma, T Yuan, Y Fan, Y Wu, Y Jia, SC Zhu, Q Li. arXiv preprint arXiv:2412.15606, 2024	42	2024
Learning to optimize on SPD manifolds. Z Gao, Y Wu, Y Jia, M Harandi. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020	28	2020
Hyperbolic feature augmentation via distribution estimation and infinite sampling on manifolds. Z Gao, Y Wu, Y Jia, M Harandi. Advances in Neural Information Processing Systems 35, 34421-34435, 2022	22	2022
Learning to optimize on Riemannian manifolds. Z Gao, Y Wu, X Fan, M Harandi, Y Jia. IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (5), 5935-5952, 2022	21	2022
Curvature-adaptive meta-learning for fast adaptation to manifold data. Z Gao, Y Wu, M Harandi, Y Jia. IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (2), 1545-1562, 2022	21	2022
Exploring data geometry for continual learning. Z Gao, C Xu, F Li, Y Jia, M Harandi, Y Wu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	19	2023
TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials. B Zhang, Z Shang, Z Gao, W Zhang, R Xie, X Ma, T Yuan, X Wu, SC Zhu, ... arXiv preprint arXiv:2504.12679, 2025	16	2025
Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning. P Li, Z Gao, B Zhang, Y Mi, X Ma, C Shi, T Yuan, Y Wu, Y Jia, SC Zhu, Q Li. arXiv preprint arXiv:2504.21561, 2025	10	2025
MMKE-Bench: A multimodal editing benchmark for diverse visual knowledge. Y Du, K Jiang, Z Gao, C Shi, Z Zheng, S Qi, Q Li. arXiv preprint arXiv:2502.19870, 2025	7	2025
FIRE: A dataset for feedback integration and refinement evaluation of multimodal models. P Li, Z Gao, B Zhang, T Yuan, Y Wu, M Harandi, Y Jia, SC Zhu, Q Li. Advances in Neural Information Processing Systems 37, 101618-101640, 2024	7	2024
