Hao Yu
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
ChatGLM: A family of large language models from GLM-130B to GLM-4 All Tools
T GLM, A Zeng, B Xu, B Wang, C Zhang, D Yin, D Zhang, D Rojas, G Feng, ...
arXiv preprint arXiv:2406.12793, 2024
Cited by 1193 · 2024
AgentBench: Evaluating LLMs as agents
X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ...
arXiv preprint arXiv:2308.03688, 2023
Cited by 966* · 2023
WebGLM: Towards an efficient web-enhanced question answering system with human preferences
X Liu, H Lai, H Yu, Y Xu, A Zeng, Z Du, P Zhang, Y Dong, J Tang
Proceedings of the 29th ACM SIGKDD conference on knowledge discovery and …, 2023
Cited by 122* · 2023
Middleware for LLMs: Tools are instrumental for language agents in complex environments
Y Gu, Y Shu, H Yu, X Liu, Y Dong, J Tang, J Srinivasa, H Latapie, Y Su
arXiv preprint arXiv:2402.14672, 2024
Cited by 61 · 2024
AndroidLab: Training and systematic benchmarking of Android autonomous agents
Y Xu, X Liu, X Sun, S Cheng, H Yu, H Lai, S Zhang, D Zhang, J Tang, ...
Proceedings of the 63rd Annual Meeting of the Association for Computational …, 2025
Cited by 50 · 2025
AutoWebGLM: A large language model-based web navigating agent
H Lai, X Liu, IL Iong, S Yao, Y Chen, P Shen, H Yu, H Zhang, X Zhang, ...
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and …, 2024
Cited by 39 · 2024
UI-TARS-2 technical report: Advancing GUI agent with multi-turn reinforcement learning
H Wang, H Zou, H Song, J Feng, J Fang, J Lu, L Liu, Q Luo, S Liang, ...
arXiv preprint arXiv:2509.02544, 2025
Cited by 33 · 2025
VisualAgentBench: Towards Large Multimodal Models as Visual Agents
X Liu, T Zhang, Y Gu, IL Iong, S XiXuan, Y Xu, S Zhang, H Lai, J Sun, ...
The Thirteenth International Conference on Learning Representations, 2025
Cited by 28*
OpenWebAgent: An open toolkit to enable web agents on large language models
IL Iong, X Liu, Y Chen, H Lai, S Yao, P Shen, H Yu, Y Dong, J Tang
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
Cited by 27 · 2024
AutoWebGLM: Bootstrap and reinforce a large language model-based web navigating agent
H Lai, X Liu, IL Iong, S Yao, Y Chen, P Shen, H Yu, H Zhang, X Zhang, ...
CoRR, 2024
Cited by 19 · 2024
AlphaVAE: Unified end-to-end RGBA image reconstruction and generation with alpha-aware representation learning
Z Wang, H Yu, J Zhan, C Yuan
arXiv preprint arXiv:2507.09308, 2025
Cited by 3 · 2025
EditThinker: Unlocking iterative reasoning for any image editor
H Li, M Zhang, D Zheng, Z Guo, Y Jia, K Feng, H Yu, Y Liu, Y Feng, P Pei, ...
arXiv preprint arXiv:2512.05965, 2025
Cited by 2 · 2025
SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought
G Li, W Jiang, M Chen, Y Li, H Yu, S Dong, T Ren, M Tang, C Yuan
arXiv preprint arXiv:2505.24181, 2025
Cited by 2 · 2025
ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices
H Yu, T Jiang, S Jia, S Yan, S Liu, H Qian, G Li, S Dong, C Yuan
Proceedings of the Computer Vision and Pattern Recognition Conference, 4508-4517, 2025
Cited by 2 · 2025
WebGLM: Towards an Efficient and Reliable Web-Enhanced Question Answering System
H Lai, X Liu, H Yu, Y Xu, IL Iong, S Yao, A Zeng, Z Du, Y Dong, J Tang
ACM Transactions on Information Systems, 2025
Cited by 1 · 2025
OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation
H Yu, J Zhan, Z Wang, J Wang, H Zhang, H Li, X Chen, Y Wei, C Yuan
arXiv preprint arXiv:2511.20211, 2025
2025