
Zhi Gao (高志)
Title
Cited by
Year
VideoAgent: A Memory-Augmented Multimodal Agent for Video Understanding
Y Fan, X Ma, R Wu, Y Du, J Li, Z Gao, Q Li
European Conference on Computer Vision, 75-92, 2024
Cited by: 154, Year: 2024
A hyperbolic-to-hyperbolic graph convolutional network
J Dai, Y Wu, Z Gao, Y Jia
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
Cited by: 119, Year: 2021
Curvature generation in curved spaces for few-shot learning
Z Gao, Y Wu, Y Jia, M Harandi
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
Cited by: 93, Year: 2021
Meta-causal learning for single domain generalization
J Chen, Z Gao, X Wu, J Luo
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Cited by: 88, Year: 2023
Deep convolutional network with locality and sparsity constraints for texture classification
X Bu, Y Wu, Z Gao, Y Jia
Pattern Recognition 91, 34-46, 2019
Cited by: 72, Year: 2019
CLOVA: A closed-loop visual assistant with tool usage and update
Z Gao, Y Du, X Zhang, X Ma, W Han, SC Zhu, Q Li
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2024
Cited by: 53, Year: 2024
Revisiting bilinear pooling: A coding perspective
Z Gao, Y Wu, X Zhang, J Dai, Y Jia, M Harandi
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3954-3961, 2020
Cited by: 50, Year: 2020
A robust distance measure for similarity-based classification on the SPD manifold
Z Gao, Y Wu, M Harandi, Y Jia
IEEE transactions on neural networks and learning systems 31 (9), 3230-3244, 2019
Cited by: 48, Year: 2019
Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL
X Zhang, Z Gao, B Zhang, P Li, X Zhang, Y Liu, T Yuan, Y Wu, Y Jia, ...
arXiv preprint arXiv:2505.15436, 2025
Cited by: 43, Year: 2025
Learning a robust representation via a deep network on symmetric positive definite manifolds
Z Gao, Y Wu, X Bu, T Yu, J Yuan, Y Jia
Pattern Recognition 92, 1-12, 2019
Cited by: 43, Year: 2019
Multi-modal agent tuning: Building a VLM-driven agent for efficient tool usage
Z Gao, B Zhang, P Li, X Ma, T Yuan, Y Fan, Y Wu, Y Jia, SC Zhu, Q Li
arXiv preprint arXiv:2412.15606, 2024
Cited by: 42, Year: 2024
Learning to optimize on SPD manifolds
Z Gao, Y Wu, Y Jia, M Harandi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by: 28, Year: 2020
Hyperbolic feature augmentation via distribution estimation and infinite sampling on manifolds
Z Gao, Y Wu, Y Jia, M Harandi
Advances in neural information processing systems 35, 34421-34435, 2022
Cited by: 22, Year: 2022
Learning to optimize on Riemannian manifolds
Z Gao, Y Wu, X Fan, M Harandi, Y Jia
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (5), 5935-5952, 2022
Cited by: 21, Year: 2022
Curvature-adaptive meta-learning for fast adaptation to manifold data
Z Gao, Y Wu, M Harandi, Y Jia
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (2), 1545-1562, 2022
Cited by: 21, Year: 2022
Exploring data geometry for continual learning
Z Gao, C Xu, F Li, Y Jia, M Harandi, Y Wu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Cited by: 19, Year: 2023
TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials
B Zhang, Z Shang, Z Gao, W Zhang, R Xie, X Ma, T Yuan, X Wu, SC Zhu, ...
arXiv preprint arXiv:2504.12679, 2025
Cited by: 16, Year: 2025
Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning
P Li, Z Gao, B Zhang, Y Mi, X Ma, C Shi, T Yuan, Y Wu, Y Jia, SC Zhu, Q Li
arXiv preprint arXiv:2504.21561, 2025
Cited by: 10, Year: 2025
MMKE-Bench: A multimodal editing benchmark for diverse visual knowledge
Y Du, K Jiang, Z Gao, C Shi, Z Zheng, S Qi, Q Li
arXiv preprint arXiv:2502.19870, 2025
Cited by: 7, Year: 2025
FIRE: A dataset for feedback integration and refinement evaluation of multimodal models
P Li, Z Gao, B Zhang, T Yuan, Y Wu, M Harandi, Y Jia, SC Zhu, Q Li
Advances in Neural Information Processing Systems 37, 101618-101640, 2024
Cited by: 7, Year: 2024