Songyang Zhang

Cited by

	All	Since 2021
Citations	8093	8007
h-index	39	38
i10-index	53	53

4400

2200

1100

3300

202020212022202320242025202658 143 373 691 2314 4368 114

Public access

View all

16 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Kai ChenShanghai AI LaboratoryVerified email at pjlab.org.cn
Dahua LinThe Chinese University of Hong KongVerified email at ie.cuhk.edu.hk
Xuming HeShanghaiTech UniversityVerified email at shanghaitech.edu.cn
Haodong DuanCUHK, PKUVerified email at pjlab.org.cn
Conghui HeShanghai AI LaboratoryVerified email at pjlab.org.cn
Jiaqi WangShanghai AI LaboratoryVerified email at pjlab.org.cn
Hang YanComputer Science, Fudan UniversityVerified email at fudan.edu.cn
Shipeng YanBytedanceVerified email at shanghaitech.edu.cn
Yuan LIU 柳源WeChat AIVerified email at tencent.com
Maosong CaoShanghai AI LabVerified email at shanghaitech.edu.cn
Wangbo ZhaoNational University of SingaporeVerified email at u.nus.edu
Yu QiaoProfessor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CASVerified email at siat.ac.cn
Jian SunChief Scientist of Megvii, Managing Director of Megvii ResearchVerified email at megvii.com
Zeming LiHong Kong University of Science and Technology (HKUST)Verified email at connect.ust.hk
Yining LiShanghai AI LaboratoryVerified email at pjlab.org.cn
Rongjie LiPhD graduate, ShanghaiTech UniversityVerified email at shanghaitech.edu.cn
Ziwei LiuAssociate Professor, Nanyang Technological UniversityVerified email at ntu.edu.sg
Yongfei LiuBytedanceVerified email at bytedance.com
Shuaiyi HuangMETA FAIRVerified email at umd.edu
Ping Luo (羅平)Associate Professor, The University of Hong Kong; MMLAB@HKUVerified email at hku.hk

Songyang Zhang

Other names张松阳

Tencent Hunyuan

Verified email at tencent.com - Homepage

Deep Learning Large Language Model Vision-Language Model Agent


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Mmbench: Is your multi-modal model an all-around player? Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ... European conference on computer vision, 216-233, 2024	1809	2024
Internlm2 technical report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024	555	2024
Part-aware prototype network for few-shot semantic segmentation Y Liu, X Zhang, S Zhang, X He European conference on computer vision, 142-158, 2020	468	2020
Distribution Alignment: A Unified Framework for Long-tail Visual Recognition S Zhang, Z Li, S Yan, X He, J Sun Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021	432	2021
OpenCompass: A universal evaluation platform for foundation models. OC Contributors https://github.com/open-compass/opencompass, 2023	410	2023
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024	387	2024
Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation R Li, S Zhang, B Wan, X He Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021	330	2021
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ... arXiv preprint arXiv:2309.15112, 2023	317	2023
Internlm: A multilingual language model with progressively enhanced capabilities ILM Team	271	2023
Internvl3. 5: Advancing open-source multimodal models in versatility, reasoning, and efficiency W Wang, Z Gao, L Gu, H Pu, L Cui, X Wei, Z Liu, L Jing, S Ye, J Shao, ... arXiv preprint arXiv:2508.18265, 2025	226	2025
Lawbench: Benchmarking legal knowledge of large language models Z Fei, X Shen, D Zhu, F Zhou, Z Han, A Huang, S Zhang, K Chen, Z Yin, ... Proceedings of the 2024 conference on empirical methods in natural language …, 2024	223	2024
Internlm-xcomposer-2.5: A versatile large vision language model supporting long-contextual input and output P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ... arXiv preprint arXiv:2407.03320, 2024	206	2024
SGTR: End-to-end Scene Graph Generation with Transformer R Li, S Zhang, X He Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2022	186	2022
Internlm-xcomposer2-4khd: A pioneering large vision-language model handling resolutions from 336 pixels to 4k hd X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ... Advances in Neural Information Processing Systems 37, 42566-42592, 2024	180	2024
ProSA: Assessing and understanding the prompt sensitivity of LLMs J Zhuo, S Zhang, X Fang, H Duan, D Lin, K Chen arXiv preprint arXiv:2410.12405, 2024	122	2024
Internlm-math: Open math large language models toward verifiable reasoning H Ying, S Zhang, L Li, Z Zhou, Y Shao, Z Fei, Y Ma, J Hong, K Liu, Z Wang, ... arXiv preprint arXiv:2402.06332, 2024	121	2024
Dynamic context correspondence network for semantic alignment S Huang, Q Wang, S Zhang, S Yan, X He Proceedings of the IEEE/CVF international conference on computer vision …, 2019	116	2019
Mathbench: Evaluating the theory and application proficiency of llms with a hierarchical mathematics benchmark H Liu, Z Zheng, Y Qiao, H Duan, Z Fei, F Zhou, W Zhang, S Zhang, D Lin, ... arXiv preprint arXiv:2405.12209, 2024	113	2024
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition S Zhang, S Yan, X He Proceedings of the 36th International Conference on Machine Learning, 2019	97	2019
Action Quality Assessment with Temporal Parsing Transformer Y Bai, D Zhou, S Zhang, J Wang, E Ding, Y Guan, Y Long, J Wang European Conference on Computer Vision, 2022	87	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors