Zhuowan Li

Cited by

	All	Since 2021
Citations	2270	2159
h-index	11	11
i10-index	11	11

1600

800

400

1200

2019202020212022202320242025202634 74 110 117 151 191 1510 76

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Alan YuilleProfessor of Cognitive Science and Computer Science, Johns Hopkins UniversityVerified email at jhu.edu
Benjamin Van DurmeJohns Hopkins University / MicrosoftVerified email at jhu.edu
Cheng LiGoogle DeepMindVerified email at google.com
Qiaozhu MeiProfessor, University of MichiganVerified email at umich.edu
Adam KortylewskiCISPA Helmholtz CenterVerified email at cispa.de
Elias Stengel-EskinAssistant Professor, University of Texas at AustinVerified email at cs.unc.edu
Mingyang (Michael) ZhangMeta Superintelligence Labs, ex-Google DeepMindVerified email at google.com
Michael BenderskyGoogle DeepMindVerified email at google.com
Quan Hung TranResearch Scientist - Adobe ResearchVerified email at adobe.com
Cihang XieAssistant Professor, University of California, Santa CruzVerified email at ucsc.edu
Peng TangAmazon Web ServicesVerified email at amazon.com
Zhe L. LinSenior Principal Scientist, Adobe ResearchVerified email at adobe.com
Long MaiResearch ScientistVerified email at adobe.com

Zhuowan Li

Google Deepmind

Verified email at google.com - Homepage

Computer vision Natural Language Processing Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 2025	1266	2025
Fd-gan: Pose-guided feature distilling gan for robust person re-identification Y Ge, Z Li, H Zhao, G Yin, S Yi, X Wang, H Li Proceedings of 32nd Conference on Neural Information Processing Systems …, 2018	459	2018
Retrieval augmented generation or long-context llms? a comprehensive study and hybrid approach Z Li, C Li, M Zhang, Q Mei, M Bendersky Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024	123*	2024
Super-clevr: A virtual benchmark to diagnose domain robustness in visual reasoning Z Li, X Wang, E Stengel-Eskin, A Kortylewski, W Ma, B Van Durme, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	118	2023
Swapmix: Diagnosing and regularizing the over-reliance on visual context in visual question answering V Gupta, Z Li, A Kortylewski, C Zhang, Y Li, A Yuille Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	90*	2022
Visual commonsense in pretrained unimodal and multimodal models C Zhang, B Van Durme, Z Li, E Stengel-Eskin Proceedings of the 2022 Conference of the North American Chapter of the …, 2022	54	2022
Context-aware group captioning via self-attention and contrastive features Z Li, Q Tran, L Mai, Z Lin, AL Yuille Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	54	2020
Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA Z Li, B Jasani, P Tang, S Ghadar Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	32	2024
Calibrating concepts and operations: Towards symbolic reasoning on real images Z Li, E Stengel-Eskin, Y Zhang, C Xie, QH Tran, B Van Durme, A Yuille Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	21	2021
3D-Aware Visual Question Answering about Parts, Poses and Occlusions X Wang, W Ma, Z Li, A Kortylewski, A Yuille Thirty-seventh Conference on Neural Information Processing Systems, 2023	20	2023
Causal-cog: A causal-effect look at context generation for boosting multi-modal language models S Zhao, Z Li, Y Lu, A Yuille, Y Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	17	2024
Effective training data synthesis for improving mllm chart understanding Y Yang, Z Zhang, Y Hou, Z Li, G Liu, A Payani, YS Ting, L Zheng Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2025	5	2025
Exovip: Step-by-step verification and exploration with exoskeleton modules for compositional visual reasoning Y Wang, A Yuille, Z Li, Z Zheng arXiv preprint arXiv:2408.02210, 2024	5	2024
Localization vs. semantics: Visual representations in unimodal and multimodal models Z Li, C Xie, B Van Durme, A Yuille Proceedings of the 18th Conference of the European Chapter of the …, 2024	5*	2024
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation A Salemi, C Li, M Zhang, Q Mei, W Kong, T Chen, Z Li, M Bendersky, ... arXiv preprint arXiv:2501.04167, 2025	1	2025
Pathways of Thoughts: Multi-Directional Thinking for Long-form Personalized Question Answering A Salemi, C Li, M Zhang, Q Mei, Z Li, SA Hombaiah, W Kong, T Chen, ... arXiv preprint arXiv:2509.19094, 2025		2025
Contrastive captioning for image groups T Quan, M Long, L Zhe, L Zhuowan US Patent US20240037939A1, 2024		2024
On the Diagnosis and Generalization of Compositional Visual Reasoning Z Li Johns Hopkins University, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–18

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors