[go: up one dir, main page]

Follow
Ge Zhang
Ge Zhang
M-A-P, Bytedance Seed, University of Waterloo
Verified email at bytedance.com
Title
Cited by
Cited by
Year
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ...
CVPR Best Paper Nomination, 2023
17622023
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
Y Wang, X Ma, G Zhang, Y Ni, A Chandra, S Guo, W Ren, A Arulraj, X He, ...
Neurips D&B 2024, 2024
11412024
Yi: Open Foundation Models by 01. AI
A Young, B Chen, C Li, C Huang, G Zhang, G Zhang, H Li, J Zhu, J Chen, ...
Yi Tech Report, 2024
7532024
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
X Yue, X Qu, G Zhang, Y Fu, W Huang, H Sun, Y Su, W Chen
ICLR, 2023
5132023
Long-context LLMs Struggle with Long In-context Learning
T Li, G Zhang, QD Do, X Yue, W Chen
TMLR, 2024
3592024
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Y Li*, R Yuan*, G Zhang*, Y Ma*, X Chen, H Yin, C Lin, A Ragni, ...
ICLR, 2023
260*2023
Mmmu-pro: A more robust multi-discipline multimodal understanding benchmark
X Yue, T Zheng, Y Ni, Y Wang, K Zhang, S Tong, Y Sun, B Yu, G Zhang, ...
ACL, 2024
2472024
Training Socially Aligned Language Models in Simulated Human Society
R Liu, R Yang, C Jia, G Zhang, D Zhou, AM Dai, D Yang, S Vosoughi
ICLR, 2023
246*2023
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
J Zhan, J Dai, J Ye, Y Zhou, D Zhang, Z Liu, X Zhang, R Yuan, G Zhang, ...
ACL, 2024
2452024
AutoAgents: A Framework for Automatic Agent Generation
G Chen, S Dong, Y Shu, G Zhang, J Sesay, BF Karlsson, J Fu, Y Shi
IJCAI, 2023
2452023
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
T Zheng*, G Zhang*, T Shen*, X Liu*, BY Lin, J Fu, W Chen, X Yue
ACL Findings, 2024
2322024
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
B Gao, F Song, Z Yang, Z Cai, Y Miao, Q Dong, L Li, C Ma, L Chen, R Xu, ...
arXiv preprint arXiv:2410.07985, 2024
2232024
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
J Feng, S Huang, X Qu, G Zhang, Y Qin, B Zhong, C Jiang, J Chi, ...
arXiv preprint arXiv:2504.11536, 2025
1772025
MAmmoTH2: Scaling Instructions from the Web
X Yue, T Zheng, G Zhang, W Chen
Neurips 2024, 2024
1462024
Massive Editing for Large Language Models via Meta Learning
C Tan, G Zhang, J Fu
ICLR, 2023
1352023
Seed-thinking-v1. 5: Advancing superb reasoning models with reinforcement learning
BD Seed, Y Yuan, Y Yue, M Wang, X Zuo, J Chen, L Yan, W Xu, C Zhang, ...
arXiv preprint arXiv:2504.13914, 2025
1302025
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
S Huang, T Cheng, JK Liu, J Hao, L Song, Y Xu, J Yang, JH Liu, C Zhang, ...
ACL, 2024
130*2024
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
C Wei, Y Chen, H Chen, H Hu, G Zhang, J Fu, A Ritter, W Chen
ECCV 2024, 2023
1302023
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
W Ren, H Yang, G Zhang, C Wei, X Du, S Huang, W Chen
TMLR, 2024
120*2024
Supergpqa: Scaling llm evaluation across 285 graduate disciplines
X Du, Y Yao, K Ma, B Wang, T Zheng, K Zhu, M Liu, Y Liang, X Jin, Z Wei, ...
Neurips 2025, 2025
113*2025
The system can't perform the operation now. Try again later.
Articles 1–20