Jianwei Yang

Cited by

	All	Since 2021
Citations	31949	28969
h-index	54	52
i10-index	80	76

14000

7000

3500

10500

20162017201820192020202120222023202420252026129 287 514 823 1050 1290 2078 3917 8092 13175 368

Public access

View all

21 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Chunyuan LixAIVerified email at x.ai
Pengchuan ZhangMeta AIVerified email at fb.com
Lei ZhangInternational Digital Economy Academy (IDEA)Verified email at idea.edu.cn
Xiyang DaiMicrosoftVerified email at microsoft.com
Devi ParikhPreviously: FAIR and GenAI @ Meta. Georgia TechVerified email at gatech.edu
Lijuan WangMicrosoft GenAIVerified email at microsoft.com
Feng LiResearch Scientist, Google DeepMindVerified email at google.com
Dhruv BatraPrev: FAIR (Meta AI), Georgia TechVerified email at dhruvbatra.com
Hao ZhangNVIDIA ResearchVerified email at nvidia.com
Shilong LiuPostdoc Fellow, Princeton UniversityVerified email at princeton.edu
Bin XiaoMicrosoftVerified email at microsoft.com
Jiasen LuResearch Scientist, AppleVerified email at apple.com
Yong Jae LeeProfessor, UW-Madison and Research Scientist, Adobe ResearchVerified email at wisc.edu
Tianhe RenPhD student of Electrical and Electronic Engineering, The University of Hong KongVerified email at idea.edu.cn
Xueyan ZouPostDoc at UC San DiegoVerified email at wisc.edu
Yiwu ZhongAssistant Professor, Peking UniversityVerified email at wisc.edu
Haotian LiuxAIVerified email at x.ai
Stan Z. Li (李子青)Westlake University & CAS Institute of AutomationVerified email at westlake.edu.cn
Linjie LiMicrosoftVerified email at microsoft.com

Jianwei Yang

Member of Technical Staff, xAI

Verified email at x.ai - Homepage

Multimodal Omni Models


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Grounding dino: Marrying dino with grounded pre-training for open-set object detection S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, Q Jiang, C Li, J Yang, H Su, ... European conference on computer vision, 38-55, 2024	3913	2024
Phi-3 technical report M Abdin, J Aneja, H Behl, S Bubeck, R Eldan, S Gunasekar, M Harrison, ... arXiv preprint arXiv:2412.08905, 2024	2768	2024
Hierarchical question-image co-attention for visual question answering J Lu, J Yang, D Batra, D Parikh Advances in neural information processing systems 29, 2016	2245	2016
Grounded language-image pre-training LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	1851	2022
Llava-med: Training a large language-and-vision assistant for biomedicine in one day C Li, C Wong, S Zhang, N Usuyama, H Liu, J Yang, T Naumann, H Poon, ... Advances in Neural Information Processing Systems 36, 28541-28564, 2023	1578	2023
Vinvl: Revisiting visual representations in vision-language models P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	1570*	2021
Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021	1241	2021
Graph R-CNN for Scene Graph Generation J Yang, J Lu, S Lee, D Batra, D Parikh arXiv preprint arXiv:1808.00191, 2018	1160	2018
Joint unsupervised learning of deep representations and image clusters J Yang, D Parikh, D Batra Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016	1146	2016
Gligen: Open-set grounded text-to-image generation Y Li, H Liu, Q Wu, F Mu, J Yang, J Gao, C Li, YJ Lee Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	1102	2023
Regionclip: Region-based language-image pretraining Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	900	2022
Segment everything everywhere all at once X Zou, J Yang, H Zhang, F Li, L Li, J Wang, L Wang, J Gao, YJ Lee Advances in Neural Information Processing Systems 36, 2024	885	2024
Focal attention for long-range interactions in vision transformers J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao Advances in Neural Information Processing Systems 34, 30008-30022, 2021	842*	2021
A whole-slide foundation model for digital pathology from real-world data H Xu, N Usuyama, J Bagga, S Zhang, R Rao, T Naumann, C Wong, ... Nature 630 (8015), 181-188, 2024	751	2024
Neural Baby Talk J Lu, J Yang, D Batra, D Parikh arXiv preprint arXiv:1803.09845, 2018	638	2018
Learn convolutional neural network for face anti-spoofing J Yang, Z Lei, SZ Li arXiv preprint arXiv:1408.5601, 2014	609	2014
Dynamic detr: End-to-end object detection with dynamic attention X Dai, Y Chen, J Yang, P Zhang, L Yuan, L Zhang Proceedings of the IEEE/CVF international conference on computer vision …, 2021	538	2021
Set-of-mark prompting unleashes extraordinary visual grounding in gpt-4v J Yang, H Zhang, F Li, X Zou, C Li, J Gao arXiv preprint arXiv:2310.11441, 2023	526	2023
Focal Modulation Networks J Yang, C Li, X Dai, L Yuan, J Gao arXiv preprint arXiv:2203.11926v3, 2022	518	2022
Multi-scale vision longformer: A new vision transformer for high-resolution image encoding P Zhang, X Dai, J Yang, B Xiao, L Yuan, L Zhang, J Gao Proceedings of the IEEE/CVF international conference on computer vision …, 2021	487	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors