| GLM-130B: An open bilingual pre-trained model. Z Aohan, L Xiao, Z Du, W Zihan, L Hanyu, D Ming, Y Zhuoyi, Y Xu, ... arXiv preprint arXiv:2210.02414, 2022 | 1488* | 2022 |
| ChatGLM: A family of large language models from GLM-130B to GLM-4 All Tools. T GLM, A Zeng, B Xu, B Wang, C Zhang, D Yin, D Zhang, D Rojas, G Feng, ... arXiv preprint arXiv:2406.12793, 2024 | 1449* | 2024 |
| AgentBench: Evaluating LLMs as agents. X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ... arXiv preprint arXiv:2308.03688, 2023 | 963* | 2023 |
| WebGLM: Towards an efficient web-enhanced question answering system with human preferences. X Liu, H Lai, H Yu, Y Xu, A Zeng, ... arXiv preprint arXiv:2306.07906, 2023 | 122* | 2023 |
| AlignBench: Benchmarking Chinese alignment of large language models. X Liu, X Lei, S Wang, Y Huang, A Feng, B Wen, J Cheng, P Ke, Y Xu, ... Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024 | 116 | 2024 |
| GLM-4.5: Agentic, reasoning, and coding (ARC) foundation models. A Zeng, X Lv, Q Zheng, Z Hou, B Chen, C Xie, C Wang, D Yin, H Zeng, ... arXiv preprint arXiv:2508.06471, 2025 | 112 | 2025 |
| AutoGLM: Autonomous foundation agents for GUIs. X Liu, B Qin, D Liang, G Dong, H Lai, H Zhang, H Zhao, IL Iong, J Sun, ... arXiv preprint arXiv:2411.00820, 2024 | 57 | 2024 |
| AndroidLab: Training and systematic benchmarking of Android autonomous agents. Y Xu, X Liu, X Sun, S Cheng, H Yu, H Lai, S Zhang, D Zhang, J Tang, ... Proceedings of the 63rd Annual Meeting of the Association for Computational …, 2025 | 50 | 2025 |
| ChatGLM-Math: Improving math problem-solving in large language models with a self-critique pipeline. Y Xu, X Liu, X Liu, Z Hou, Y Li, X Zhang, Z Wang, A Zeng, Z Du, Z Wenyi, ... Findings of the Association for Computational Linguistics: EMNLP 2024, 9733-9760, 2024 | 44* | 2024 |
| GOAL: A challenging knowledge-grounded video captioning benchmark for real-time soccer commentary generation. J Qi, J Yu, T Tu, K Gao, Y Xu, X Guan, X Wang, B Xu, L Hou, J Li, J Tang. Proceedings of the 32nd ACM international conference on information and …, 2023 | 29 | 2023 |
| VisualAgentBench: Towards large multimodal models as visual foundation agents. X Liu, T Zhang, Y Gu, IL Iong, Y Xu, X Song, S Zhang, H Lai, X Liu, H Zhao, ... arXiv preprint arXiv:2408.06327, 2024 | 28 | 2024 |
| XDAI: A tuning-free framework for exploiting pre-trained language models in knowledge grounded dialogue generation. J Yu, X Zhang, Y Xu, X Lei, X Guan, J Zhang, L Hou, J Li, J Tang. Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022 | 27 | 2022 |
| A survey of post-training scaling in large language models. H Lai, X Liu, J Gao, J Cheng, Z Qi, Y Xu, S Yao, D Zhang, J Du, Z Hou, ... Proceedings of the 63rd Annual Meeting of the Association for Computational …, 2025 | 16 | 2025 |
| A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation. J Yu, X Zhang, Y Xu, X Lei, Z Yao, J Zhang, L Hou, J Li. arXiv preprint arXiv:2404.03491, 2024 | 7 | 2024 |
| MobileRL: Online agentic reinforcement learning for mobile GUI agents. Y Xu, X Liu, X Liu, J Fu, H Zhang, B Jing, S Zhang, Y Wang, W Zhao, ... arXiv preprint arXiv:2509.18119, 2025 | 5 | 2025 |
| AndroidGen: Building an Android Language Agent under Data Scarcity. H Lai, J Gao, X Liu, Y Xu, S Zhang, Y Dong, J Tang. arXiv preprint arXiv:2504.19298, 2025 | 3 | 2025 |
| AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework. H Zhang, X Liu, B Lv, X Sun, B Jing, IL Iong, Z Hou, Z Qi, H Lai, Y Xu, R Lu, ... arXiv preprint arXiv:2510.04206, 2025 | 1 | 2025 |