[go: up one dir, main page]

Follow
Zhuowan Li
Zhuowan Li
Google Deepmind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities
G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...
arXiv preprint arXiv:2507.06261, 2025
12662025
Fd-gan: Pose-guided feature distilling gan for robust person re-identification
Y Ge, Z Li, H Zhao, G Yin, S Yi, X Wang, H Li
Proceedings of 32nd Conference on Neural Information Processing Systems …, 2018
4592018
Retrieval augmented generation or long-context llms? a comprehensive study and hybrid approach
Z Li, C Li, M Zhang, Q Mei, M Bendersky
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
123*2024
Super-clevr: A virtual benchmark to diagnose domain robustness in visual reasoning
Z Li, X Wang, E Stengel-Eskin, A Kortylewski, W Ma, B Van Durme, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
1182023
Swapmix: Diagnosing and regularizing the over-reliance on visual context in visual question answering
V Gupta, Z Li, A Kortylewski, C Zhang, Y Li, A Yuille
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
90*2022
Visual commonsense in pretrained unimodal and multimodal models
C Zhang, B Van Durme, Z Li, E Stengel-Eskin
Proceedings of the 2022 Conference of the North American Chapter of the …, 2022
542022
Context-aware group captioning via self-attention and contrastive features
Z Li, Q Tran, L Mai, Z Lin, AL Yuille
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
542020
Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA
Z Li, B Jasani, P Tang, S Ghadar
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
322024
Calibrating concepts and operations: Towards symbolic reasoning on real images
Z Li, E Stengel-Eskin, Y Zhang, C Xie, QH Tran, B Van Durme, A Yuille
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
212021
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
X Wang, W Ma, Z Li, A Kortylewski, A Yuille
Thirty-seventh Conference on Neural Information Processing Systems, 2023
202023
Causal-cog: A causal-effect look at context generation for boosting multi-modal language models
S Zhao, Z Li, Y Lu, A Yuille, Y Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
172024
Effective training data synthesis for improving mllm chart understanding
Y Yang, Z Zhang, Y Hou, Z Li, G Liu, A Payani, YS Ting, L Zheng
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2025
52025
Exovip: Step-by-step verification and exploration with exoskeleton modules for compositional visual reasoning
Y Wang, A Yuille, Z Li, Z Zheng
arXiv preprint arXiv:2408.02210, 2024
52024
Localization vs. semantics: Visual representations in unimodal and multimodal models
Z Li, C Xie, B Van Durme, A Yuille
Proceedings of the 18th Conference of the European Chapter of the …, 2024
5*2024
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation
A Salemi, C Li, M Zhang, Q Mei, W Kong, T Chen, Z Li, M Bendersky, ...
arXiv preprint arXiv:2501.04167, 2025
12025
Pathways of Thoughts: Multi-Directional Thinking for Long-form Personalized Question Answering
A Salemi, C Li, M Zhang, Q Mei, Z Li, SA Hombaiah, W Kong, T Chen, ...
arXiv preprint arXiv:2509.19094, 2025
2025
Contrastive captioning for image groups
T Quan, M Long, L Zhe, L Zhuowan
US Patent US20240037939A1, 2024
2024
On the Diagnosis and Generalization of Compositional Visual Reasoning
Z Li
Johns Hopkins University, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–18