Chunyuan Li

Cited by

	All	Since 2021
Citations	58348	55018
h-index	82	71
i10-index	144	136

29000

14500

7250

21750

20162017201820192020202120222023202420252026237 303 660 815 1040 1691 2676 5356 15809 28441 710

Public access

View all

44 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Jianwei YangMember of Technical Staff, xAIVerified email at x.ai
Lawrence CarinDuke UniversityVerified email at duke.edu
Haotian LiuxAIVerified email at x.ai
Yong Jae LeeProfessor, UW-Madison and Research Scientist, Adobe ResearchVerified email at wisc.edu
Feng LiResearch Scientist, Google DeepMindVerified email at google.com
Chen ChangyouAssociate Professor at University at BuffaloVerified email at buffalo.edu
Yuanhan ZhangPhD Candidate, MMLab@NTUVerified email at e.ntu.edu.sg
Ziwei LiuAssociate Professor, Nanyang Technological UniversityVerified email at ntu.edu.sg
Pengchuan ZhangMeta AIVerified email at fb.com
Ricardo HenaoDuke UniversityVerified email at duke.edu
Zhe GanResearch Scientist, AppleVerified email at apple.com
Yunchen PuResearch Scientist, FacebookVerified email at fb.com
Renrui ZhangSeed & MMLab & PKUVerified email at pku.edu.cn
Brian (Bo) LiPhD Student@NTU, SingaporeVerified email at e.ntu.edu.sg
Baolin PengMicrosoft Research, RedmondVerified email at microsoft.com
Xiujun LiUniversity of Washington / AppleVerified email at cs.washington.edu
Yizhe ZhangAppleVerified email at apple.com
Shilong LiuPostdoc Fellow, Princeton UniversityVerified email at princeton.edu
Guoyin WangQwen PilotVerified email at alibaba-inc.com

Chunyuan Li

xAI

Verified email at x.ai - Homepage

Deep Learning Vision Language Multimodal


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Visual instruction tuning H Liu, C Li, Q Wu, YJ Lee NeurIPS, 2023	11580	2023
Improved baselines with visual instruction tuning H Liu, C Li, Y Li, YJ Lee Computer Vision and Pattern Recognition (CVPR), 2024	4251	2024
Grounding dino: Marrying dino with grounded pre-training for open-set object detection S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, Q Jiang, C Li, J Yang, H Su, ... European Conference on Computer Vision (ECCV), 2024	3913	2024
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks X Li, X Yin, C Li, X Hu, P Zhang, L Zhang, L Wang, H Hu, L Dong, F Wei, ... European Conference on Computer Vision (ECCV), 2020	2656	2020
Grounded Language-Image Pre-training LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ... CVPR, 2022	1855	2022
Llava-onevision: Easy visual task transfer B Li, Y Zhang, D Guo, R Zhang, F Li, H Zhang, K Zhang, P Zhang, Y Li, ... Transactions on Machine Learning Research, 2024	1815	2024
Llava-med: Training a large language-and-vision assistant for biomedicine in one day C Li, C Wong, S Zhang*, N Usuyama, H Liu, J Yang, T Naumann, ... NeurIPS, 2023	1574	2023
Llava-next: Improved reasoning, ocr, and world knowledge H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee https://llava-vl.github.io/blog/2024-01-30-llava-next, 2024	1537*	2024
Instruction tuning with gpt-4 B Peng, C Li, P He*, M Galley, J Gao arXiv preprint arXiv:2304.03277, 2023	1329	2023
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts P Lu, H Bansal, T Xia, J Liu, C Li, H Hajishirzi, H Cheng, KW Chang, ... ICLR, 2023	1292	2023
Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021	1251	2021
Variational Autoencoder for Deep Learning of Images, Labels and Captions Y Pu, Z Gan, R Henao, X Yuan, C Li, A Stevens, L Carin Neural Information Processing Systems (NIPS), 2016	1179	2016
Gligen: Open-set grounded text-to-image generation Y Li, H Liu, Q Wu, F Mu, J Yang, J Gao, C Li, YJ Lee Computer Vision and Pattern Recognition (CVPR), 2023	1102	2023
RegionCLIP: Region-based Language-Image Pretraining Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ... CVPR, 2022	899	2022
Otter: A multi-modal model with in-context instruction tuning B Li, Y Zhang, L Chen, J Wang, F Pu, JA Cahyono, J Yang, C Li, Z Liu IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025	856	2025
Focal self-attention for local-global interactions in vision transformers J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao arXiv preprint arXiv:2107.00641, 2021	846*	2021
A whole-slide foundation model for digital pathology from real-world data H Xu, N Usuyama, J Bagga, S Zhang, R Rao, T Naumann, C Wong, ... Nature 630 (8015), 181-188, 2024	748	2024
Trustllm: Trustworthiness in large language models Y Huang, L Sun, H Wang, S Wu, Q Zhang, Y Li, C Gao, Y Huang, W Lyu, ... arXiv preprint arXiv:2401.05561, 2024	643*	2024
Measuring the intrinsic dimension of objective landscapes C Li, H Farkhoor, R Liu, J Yosinski ICLR, 2018	593	2018
Joint Embedding of Words and Labels for Text Classification G Wang, C Li, W Wang, Y Zhang, D Shen, X Zhang, R Henao, L Carin Annual Meeting of the Association for Computational Linguistics (ACL), 2018	585	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors