Cunxiang Wang

Cited by

	All	Since 2021
Citations	7113	7026
h-index	22	22
i10-index	26	26

4000

2000

1000

3000

202020212022202320242025202671 29 65 317 2432 3979 162

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yue ZhangWestlake UniversityVerified email at wias.org.cn
Xing Xie 谢幸Assistant Managing Director, Microsoft Research Asia, ACM Fellow, IEEE Fellow, CCF FellowVerified email at microsoft.com
Jindong WangAssistant Professor, William & Mary; Ex Senior Researcher, Microsoft ResearchVerified email at wm.edu
Philip S. YuProfessor of Computer Science, University of Illinons at ChicagoVerified email at cs.uic.edu
Yidong Wang (王一栋)Ph.D. candidate @ PKU | M.Eng. @ TokyoTech | B.S. @ NJUVerified email at stu.pku.edu.cn
Zehan QiTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Yuanhao YueFudan UniversityVerified email at m.fudan.edu.cn
Xiaoze LiuPhD Student, ECE at Purdue UniversityVerified email at purdue.edu
Qipeng GuoFudan UniversityVerified email at fudan.edu.cn
Hongru (Merlin) WANGPostdoc@Edinburgh, Ph.D @CUHK, Prev. @UIUC @EdinburghNLPVerified email at se.cuhk.edu.hk
Yunzhi YaoZhejiang UniversityVerified email at g.ucla.edu
Minlie HuangTsinghua UniversityVerified email at tsinghua.edu.cn
Guangsheng BaoPh.D. Candidate, Westlake University & Zhejiang University.Verified email at westlake.edu.cn
Liang ShuailongTiktokVerified email at bytedance.com
Hongbo ZhangWestlake University; Zhejiang UniversityVerified email at westlake.edu.cn
Xiaodan ZhuECE & Ingenuity Labs Research Institute, Queen's University, CanadaVerified email at queensu.ca
Pai LiuUniversity of RochesterVerified email at ur.rochester.edu
Ruoxi NingUniversity of WaterlooVerified email at uwaterloo.ca
Feiliang RenNortheastern UniversityVerified email at ise.neu.edu.cn
Tang JieWeBank Chair Professor, Tsinghua UniversityVerified email at tsinghua.edu.cn

Cunxiang Wang

Tsinghua University; ZhipuAI

Verified email at tsinghua.edu.cn - Homepage

Large Language Models LLM Evaluation LLM Post-training


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A survey on evaluation of large language models Y Chang, X Wang, J Wang, Y Wu, L Yang, K Zhu, H Chen, X Yi, C Wang, ... ACM transactions on intelligent systems and technology 15 (3), 1-45, 2024	5023	2024
Pandalm: An automatic evaluation benchmark for llm instruction tuning optimization Y Wang, Z Yu, Z Zeng, L Yang, C Wang, H Chen, C Jiang, R Xie, J Wang, ... ICLR 2024, 2023	351	2023
Survey on factuality in large language models C Wang, X Liu, Y Yue, Q Guo, X Hu, X Tang, T Zhang, C Jiayang, Y Yao, ... ACM Computing Surveys 58 (1), 1-37, 2025	343*	2025
Knowledge conflicts for llms: A survey R Xu, Z Qi, Z Guo, C Wang, H Wang, Y Zhang, W Xu EMNLP2024, 2024	221	2024
Does It Make Sense? And Why? A Pilot Study for Sense Making and Explanation C Wang, S Liang, Y Zhang, X Li, T Gao ACL 2019, 4020–4026, 2019	122	2019
SemEval-2020 task 4: Commonsense validation and explanation C Wang, S Liang, Y Jin, Y Wang, X Zhu, Y Zhang SemEval-2020 Task track, 2020	121	2020
Evaluating Open-QA Evaluation C Wang, S Cheng, Q Guo, Y Yue, B Ding, Z Xu, Y Wang, X Hu, Z Zhang, ... Advances in Neural Information Processing Systems 36, 2023	113	2023
Glm-4.5: Agentic, reasoning, and coding (arc) foundation models A Zeng, X Lv, Q Zheng, Z Hou, B Chen, C Xie, C Wang, D Yin, H Zeng, ... arXiv preprint arXiv:2508.06471, 2025	112	2025
Can generative pre-trained language models serve as knowledge bases for closed-book qa? C Wang, P Liu, Y Zhang ACL 2021, 2021	100	2021
Ragchecker: A fine-grained framework for diagnosing retrieval-augmented generation D Ru, L Qiu, X Hu, T Zhang, P Shi, S Chang, C Jiayang, C Wang, S Sun, ... Advances in Neural Information Processing Systems 37, 21999-22027, 2024	97	2024
Llms with chain-of-thought are non-causal reasoners G Bao, H Zhang, L Yang, C Wang, Y Zhang CoRR, 2024	47	2024
A survey on evaluation of large language models. arXiv Y Chang, X Wang, J Wang, Y Wu, L Yang, K Zhu, H Chen, X Yi, C Wang, ... Preprint posted online on Dec 29, 2023	44	2023
NovelQA: Benchmarking question answering on documents exceeding 200k tokens C Wang, R Ning, B Pan, T Wu, Q Guo, C Deng, G Bao, X Hu, Z Zhang, ... ICLR2025, 2024	42*	2024
A survey on evaluation of large language models (2023) Y Chang, X Wang, J Wang, Y Wu, L Yang, K Zhu, H Chen, X Yi, C Wang, ...	42*
Shield: Evaluation and defense strategies for copyright compliance in llm text generation X Liu, T Sun, T Xu, F Wu, C Wang, X Wang, J Gao EMNLP2024, 2024	37	2024
Self-dc: When to retrieve and when to generate? self divide-and-conquer for compositional unknown questions H Wang, B Xue, B Zhou, T Zhang, C Wang, G Chen, H Wang, K Wong CoRR, 2024	35*	2024
Spar: Self-play with tree-search refinement to improve instruction-following in large language models J Cheng, X Liu, C Wang, X Gu, Y Lu, D Zhang, Y Dong, J Tang, H Wang, ... ICLR, 2025	27	2025
LongRAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall Z Qi, R Xu, Z Guo, C Wang, H Zhang, W Xu arXiv preprint arXiv:2410.23000, 2024	26	2024
Exploring generalization ability of pretrained language models on arithmetic and logical reasoning C Wang, B Zheng, Y Niu, Y Zhang CCF International Conference on Natural Language Processing and Chinese …, 2021	26	2021
RFiD: Towards Rational Fusion-in-Decoder for Open-Domain Question Answering C Wang, H Yu, Y Zhang Findings of the Association for Computational Linguistics: ACL 2023, 2023	24	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors