[go: up one dir, main page]

Follow
Chunyuan Li
Chunyuan Li
xAI
Verified email at x.ai - Homepage
Title
Cited by
Cited by
Year
Visual instruction tuning
H Liu*, C Li*, Q Wu, YJ Lee
NeurIPS, 2023
115802023
Improved baselines with visual instruction tuning
H Liu, C Li, Y Li, YJ Lee
Computer Vision and Pattern Recognition (CVPR), 2024
42512024
Grounding dino: Marrying dino with grounded pre-training for open-set object detection
S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, Q Jiang, C Li, J Yang, H Su, ...
European Conference on Computer Vision (ECCV), 2024
39132024
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
X Li, X Yin, C Li, X Hu, P Zhang, L Zhang, L Wang, H Hu, L Dong, F Wei, ...
European Conference on Computer Vision (ECCV), 2020
26562020
Grounded Language-Image Pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
CVPR, 2022
18552022
Llava-onevision: Easy visual task transfer
B Li, Y Zhang, D Guo, R Zhang, F Li, H Zhang, K Zhang, P Zhang, Y Li, ...
Transactions on Machine Learning Research, 2024
18152024
Llava-med: Training a large language-and-vision assistant for biomedicine in one day
C Li*, C Wong*, S Zhang*, N Usuyama, H Liu, J Yang, T Naumann, ...
NeurIPS, 2023
15742023
Llava-next: Improved reasoning, ocr, and world knowledge
H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee
https://llava-vl.github.io/blog/2024-01-30-llava-next, 2024
1537*2024
Instruction tuning with gpt-4
B Peng*, C Li*, P He*, M Galley, J Gao
arXiv preprint arXiv:2304.03277, 2023
13292023
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
P Lu, H Bansal, T Xia, J Liu, C Li, H Hajishirzi, H Cheng, KW Chang, ...
ICLR, 2023
12922023
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
12512021
Variational Autoencoder for Deep Learning of Images, Labels and Captions
Y Pu, Z Gan, R Henao, X Yuan, C Li, A Stevens, L Carin
Neural Information Processing Systems (NIPS), 2016
11792016
Gligen: Open-set grounded text-to-image generation
Y Li, H Liu, Q Wu, F Mu, J Yang, J Gao, C Li, YJ Lee
Computer Vision and Pattern Recognition (CVPR), 2023
11022023
RegionCLIP: Region-based Language-Image Pretraining
Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ...
CVPR, 2022
8992022
Otter: A multi-modal model with in-context instruction tuning
B Li, Y Zhang, L Chen, J Wang, F Pu, JA Cahyono, J Yang, C Li, Z Liu
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025
8562025
Focal self-attention for local-global interactions in vision transformers
J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao
arXiv preprint arXiv:2107.00641, 2021
846*2021
A whole-slide foundation model for digital pathology from real-world data
H Xu, N Usuyama, J Bagga, S Zhang, R Rao, T Naumann, C Wong, ...
Nature 630 (8015), 181-188, 2024
7482024
Trustllm: Trustworthiness in large language models
Y Huang, L Sun, H Wang, S Wu, Q Zhang, Y Li, C Gao, Y Huang, W Lyu, ...
arXiv preprint arXiv:2401.05561, 2024
643*2024
Measuring the intrinsic dimension of objective landscapes
C Li, H Farkhoor, R Liu, J Yosinski
ICLR, 2018
5932018
Joint Embedding of Words and Labels for Text Classification
G Wang, C Li, W Wang, Y Zhang, D Shen, X Zhang, R Henao, L Carin
Annual Meeting of the Association for Computational Linguistics (ACL), 2018
5852018
The system can't perform the operation now. Try again later.
Articles 1–20