[go: up one dir, main page]

Follow
Kai Zhang
Title
Cited by
Cited by
Year
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
X Yue*, Y Ni*, K Zhang*, T Zheng*, R Liu, G Zhang, S Stevens, D Jiang, ...
CVPR 2024 (𝐁𝐞𝐬𝐭 𝐏𝐚𝐩𝐞𝐫 𝐅𝐢𝐧𝐚𝐥𝐢𝐬𝐭), 2024
17572024
ChatDoctor: A Medical Chat Model Fine-Tuned on A Large Language Model Using Medical Domain Knowledge
Y Li, Z Li, K Zhang, R Dan, S Jiang, Y Zhang
Cureus 15 (6), 2023
992*2023
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
K Zhang, L Mo, W Chen, H Sun, Y Su
NeurIPS 2023, 2023
4602023
TravelPlanner: A Benchmark for Real-world Planning with Language Agents
J Xie*, K Zhang*, J Chen, T Zhu, R Lou, Y Tian, Y Xiao, Y Su
ICML 2024 (𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭), 2024
3092024
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
J Xie*, K Zhang*, J Chen, R Lou, Y Su
ICLR 2024 (𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭), 2024
263*2024
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
X Yue, T Zheng, Y Ni, Y Wang, K Zhang, S Tong, Y Sun, B Yu, G Zhang, ...
ACL 2025, 2025
2442025
RpBERT: A Text-Image Relation Propagation-based BERT Model for Multimodal NER
L Sun, J Wang, K Zhang, Y Su, F Weng
AAAI 2021, 2021
2052021
Automatic Evaluation of Attribution by Large Language Models
X Yue, B Wang, Z Chen, K Zhang, Y Su, H Sun
Findings of EMNLP 2023, 2023
1422023
Large Language Model Instruction Following: A Survey of Progresses and Challenges
R Lou, K Zhang, W Yin
Computational Linguistics, 2024
135*2024
Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors
K Zhang, BJ Gutiérrez, Y Su
Findings of ACL 2023, 2023
1312023
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
F Wang, X Fu, JY Huang, Z Li, Q Liu, X Liu, MD Ma, N Xu, W Zhou, ...
ICLR 2025, 2025
1212025
PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of Pathology
Y Sun, C Zhu, S Zheng, K Zhang, Z Shui, X Yu, Y Zhao, H Li, Y Zhang, ...
AAAI 2024 (𝐎𝐫𝐚𝐥), 2024
121*2024
How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?
S Wu, J Xie, J Chen, T Zhu, K Zhang, Y Xiao
COLM 2024; KnowledgeNLP@ACL 2024 (𝐎𝐫𝐚𝐥), 2024
872024
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
K Zhang, Y Luan, H Hu, K Lee, S Qiao, W Chen, Y Su, MW Chang
ICML 2024 (𝐎𝐫𝐚𝐥), 2024
792024
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Y Gu*, K Zhang*, Y Ning*, B Zheng*, B Gou, T Xue, C Chang, ...
TMLR 2025, 2025
67*2025
ImagenHub: Standardizing the Evaluation of Conditional Image Generation Models
M Ku, T Li, K Zhang, Y Lu, X Fu, W Zhuang, W Chen
ICLR 2024, 2023
602023
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
M Cai, R Tan, J Zhang, B Zou, K Zhang, F Yao, F Zhu, J Gu, Y Zhong, ...
arXiv preprint arXiv:2410.10818, 2024
442024
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology
Y Sun, H Wu, C Zhu, S Zheng, Q Chen, K Zhang, Y Zhang, X Lan, ...
ECCV 2024 (𝐁𝐞𝐬𝐭 𝐏𝐚𝐩𝐞𝐫 𝐅𝐢𝐧𝐚𝐥𝐢𝐬𝐭), 2024
44*2024
Open Hierarchical Relation Extraction
K Zhang, Y Yao, R Xie, X Han, Z Liu, F Lin, L Lin, M Sun
NAACL 2021, 2021
422021
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
R Lou, K Zhang, J Xie, Y Sun, J Ahn, H Xu, Y Su, W Yin
ICLR 2024, 2024
37*2024
The system can't perform the operation now. Try again later.
Articles 1–20