| Chatglm: A family of large language models from glm-130b to glm-4 all tools T GLM, A Zeng, B Xu, B Wang, C Zhang, D Yin, D Zhang, D Rojas, G Feng, ... arXiv preprint arXiv:2406.12793, 2024 | 1193 | 2024 |
| Agentbench: Evaluating llms as agents X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ... arXiv preprint arXiv:2308.03688, 2023 | 966* | 2023 |
| WebGLM: towards an efficient web-enhanced question answering system with human preferences X Liu, H Lai, H Yu, Y Xu, A Zeng, Z Du, P Zhang, Y Dong, J Tang Proceedings of the 29th ACM SIGKDD conference on knowledge discovery and …, 2023 | 122* | 2023 |
| Middleware for llms: Tools are instrumental for language agents in complex environments Y Gu, Y Shu, H Yu, X Liu, Y Dong, J Tang, J Srinivasa, H Latapie, Y Su arXiv preprint arXiv:2402.14672, 2024 | 61 | 2024 |
| Androidlab: Training and systematic benchmarking of android autonomous agents Y Xu, X Liu, X Sun, S Cheng, H Yu, H Lai, S Zhang, D Zhang, J Tang, ... Proceedings of the 63rd Annual Meeting of the Association for Computational …, 2025 | 50 | 2025 |
| Autowebglm: A large language model-based web navigating agent H Lai, X Liu, IL Iong, S Yao, Y Chen, P Shen, H Yu, H Zhang, X Zhang, ... Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and …, 2024 | 39 | 2024 |
| Ui-tars-2 technical report: Advancing gui agent with multi-turn reinforcement learning H Wang, H Zou, H Song, J Feng, J Fang, J Lu, L Liu, Q Luo, S Liang, ... arXiv preprint arXiv:2509.02544, 2025 | 33 | 2025 |
| VisualAgentBench: Towards Large Multimodal Models as Visual Agents X Liu, T Zhang, Y Gu, IL Iong, S XiXuan, Y Xu, S Zhang, H Lai, J Sun, ... The Thirteenth International Conference on Learning Representations, 0 | 28* | |
| Openwebagent: An open toolkit to enable web agents on large language models IL Iong, X Liu, Y Chen, H Lai, S Yao, P Shen, H Yu, Y Dong, J Tang Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024 | 27 | 2024 |
| Autowebglm: Bootstrap and reinforce a large language model-based web navigating agent H Lai, X Liu, IL Iong, S Yao, Y Chen, P Shen, H Yu, H Zhang, X Zhang, ... CoRR, 2024 | 19 | 2024 |
| Alphavae: Unified end-to-end RGBA image reconstruction and generation with alpha-aware representation learning Z Wang, H Yu, J Zhan, C Yuan arXiv preprint arXiv:2507.09308, 2025 | 3 | 2025 |
| Editthinker: Unlocking iterative reasoning for any image editor H Li, M Zhang, D Zheng, Z Guo, Y Jia, K Feng, H Yu, Y Liu, Y Feng, P Pei, ... arXiv preprint arXiv:2512.05965, 2025 | 2 | 2025 |
| SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought G Li, W Jiang, M Chen, Y Li, H Yu, S Dong, T Ren, M Tang, C Yuan arXiv preprint arXiv:2505.24181, 2025 | 2 | 2025 |
| ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices H Yu, T Jiang, S Jia, S Yan, S Liu, H Qian, G Li, S Dong, C Yuan Proceedings of the Computer Vision and Pattern Recognition Conference, 4508-4517, 2025 | 2 | 2025 |
| WebGLM: Towards an Efficient and Reliable Web-Enhanced Question Answering System H Lai, X Liu, H Yu, Y Xu, IL Iong, S Yao, A Zeng, Z Du, Y Dong, J Tang ACM Transactions on Information Systems, 2025 | 1 | 2025 |
| OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation H Yu, J Zhan, Z Wang, J Wang, H Zhang, H Li, X Chen, Y Wei, C Yuan arXiv preprint arXiv:2511.20211, 2025 | | 2025 |