[go: up one dir, main page]

Follow
Jianwei Yang
Jianwei Yang
Member of Technical Staff, xAI
Verified email at x.ai - Homepage
Title
Cited by
Cited by
Year
Grounding dino: Marrying dino with grounded pre-training for open-set object detection
S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, Q Jiang, C Li, J Yang, H Su, ...
European conference on computer vision, 38-55, 2024
39132024
Phi-3 technical report
M Abdin, J Aneja, H Behl, S Bubeck, R Eldan, S Gunasekar, M Harrison, ...
arXiv preprint arXiv:2412.08905, 2024
27682024
Hierarchical question-image co-attention for visual question answering
J Lu, J Yang, D Batra, D Parikh
Advances in neural information processing systems 29, 2016
22452016
Grounded language-image pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
18512022
Llava-med: Training a large language-and-vision assistant for biomedicine in one day
C Li, C Wong, S Zhang, N Usuyama, H Liu, J Yang, T Naumann, H Poon, ...
Advances in Neural Information Processing Systems 36, 28541-28564, 2023
15782023
Vinvl: Revisiting visual representations in vision-language models
P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
1570*2021
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
12412021
Graph R-CNN for Scene Graph Generation
J Yang*, J Lu*, S Lee, D Batra, D Parikh
arXiv preprint arXiv:1808.00191, 2018
11602018
Joint unsupervised learning of deep representations and image clusters
J Yang, D Parikh, D Batra
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016
11462016
Gligen: Open-set grounded text-to-image generation
Y Li, H Liu, Q Wu, F Mu, J Yang, J Gao, C Li, YJ Lee
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
11022023
Regionclip: Region-based language-image pretraining
Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
9002022
Segment everything everywhere all at once
X Zou*, J Yang*, H Zhang*, F Li*, L Li, J Wang, L Wang, J Gao, YJ Lee
Advances in Neural Information Processing Systems 36, 2024
8852024
Focal attention for long-range interactions in vision transformers
J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao
Advances in Neural Information Processing Systems 34, 30008-30022, 2021
842*2021
A whole-slide foundation model for digital pathology from real-world data
H Xu, N Usuyama, J Bagga, S Zhang, R Rao, T Naumann, C Wong, ...
Nature 630 (8015), 181-188, 2024
7512024
Neural Baby Talk
J Lu*, J Yang*, D Batra, D Parikh
arXiv preprint arXiv:1803.09845, 2018
6382018
Learn convolutional neural network for face anti-spoofing
J Yang, Z Lei, SZ Li
arXiv preprint arXiv:1408.5601, 2014
6092014
Dynamic detr: End-to-end object detection with dynamic attention
X Dai, Y Chen, J Yang, P Zhang, L Yuan, L Zhang
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
5382021
Set-of-mark prompting unleashes extraordinary visual grounding in gpt-4v
J Yang*, H Zhang*, F Li*, X Zou*, C Li, J Gao
arXiv preprint arXiv:2310.11441, 2023
5262023
Focal Modulation Networks
J Yang, C Li, X Dai, L Yuan, J Gao
arXiv preprint arXiv:2203.11926v3, 2022
5182022
Multi-scale vision longformer: A new vision transformer for high-resolution image encoding
P Zhang, X Dai, J Yang, B Xiao, L Yuan, L Zhang, J Gao
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
4872021
The system can't perform the operation now. Try again later.
Articles 1–20