| A multimodal foundation agent for financial trading: Tool-augmented, diversified, and generalist W Zhang, L Zhao, H Xia, S Sun, J Sun, M Qin, X Li, Y Zhao, Y Zhao, X Cai, ... KDD, 4314-4325, 2024 | 135 | 2024 |
| Cradle: Empowering foundation agents towards general computer control W Tan, W Zhang, X Xu, H Xia, Z Ding, B Li, B Zhou, J Yue, J Jiang, Y Li, ... ICML, 2025 | 128* | 2025 |
| Synapse: Trajectory-as-exemplar prompting with memory for computer control L Zheng, R Wang, X Wang, B An ICLR, 2024 | 119 | 2024 |
| True knowledge comes from practice: Aligning llms with embodied environments via reinforcement learning W Tan, W Zhang, S Liu, L Zheng, X Wang, B An ICLR, 2024 | 95 | 2024 |
| RMIX: Learning risk-sensitive policies for cooperative reinforcement learning agents W Qiu, X Wang, R Yu, R Wang, X He, B An, S Obraztsova, Z Rabinovich NeurIPS 34, 23049-23062, 2021 | 73 | 2021 |
| Agentstudio: A toolkit for building general virtual agents L Zheng, Z Huang, Z Xue, X Wang, B An, S Yan ICLR, 2025 | 52 | 2025 |
| Earnhft: Efficient hierarchical reinforcement learning for high frequency trading M Qin, S Sun, W Zhang, H Xia, X Wang, B An AAAI 38 (13), 14669-14676, 2024 | 48 | 2024 |
| Learning to collaborate in multi-module recommendation via multi-agent reinforcement learning without communication X He, B An, Y Li, H Chen, R Wang, X Wang, R Yu, X Li, Z Wang Proceedings of the 14th ACM Conference on Recommender Systems, 210-219, 2020 | 40 | 2020 |
| If multi-agent debate is the answer, what is the question H Zhang, Z Cui, X Wang, Q Zhang, Z Wang, D Wu, S Hu arXiv preprint arXiv:2502.08788, 2025 | 27 | 2025 |
| Mastering stock markets with efficient mixture of diversified trading experts S Sun, X Wang, W Xue, X Lou, B An Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 25 | 2023 |
| Trademaster: A holistic quantitative trading platform empowered by reinforcement learning S Sun, M Qin, W Zhang, H Xia, C Zong, J Ying, Y Xie, L Zhao, X Wang, ... Advances in Neural Information Processing Systems 36, 59047-59061, 2023 | 23 | 2023 |
| Solving large-scale extensive-form network security games via neural fictitious self-play W Xue, Y Zhang, S Li, X Wang, B An, CK Yeo IJCAI, 2021 | 23 | 2021 |
| Solving large-scale pursuit-evasion games using pre-trained strategies S Li, X Wang, Y Zhang, W Xue, J Černý, B An AAAI 37 (10), 11586-11594, 2023 | 20 | 2023 |
| Market-gan: Adding control to financial market data generation with semantic context H Xia, S Sun, X Wang, B An Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 15996 …, 2024 | 18 | 2024 |
| Catching Captain Jack: Efficient time and space dependent patrols to combat oil-siphoning in international waters X Wang, B An, M Strobel, F Kong Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 18 | 2018 |
| Keqing: knowledge-based question answering is a nature chain-of-thought mentor of LLM C Wang, Y Xu, Z Peng, C Zhang, B Chen, X Wang, L Feng, B An arXiv preprint arXiv:2401.00426, 2023 | 17 | 2023 |
| Learning expensive coordination: An event-based deep rl approach Z Shi, R Yu, X Wang, R Wang, Y Zhang, H Lai, B An ICLR, 2019 | 17* | 2019 |
| Reinforcement learning with maskable stock representation for portfolio management in customizable stock pools W Zhang, Y Zhao, S Sun, J Ying, Y Xie, Z Song, X Wang, B An Proceedings of the ACM Web Conference 2024, 187-198, 2024 | 16 | 2024 |
| CFR-MIX: Solving imperfect information extensive-form games with combinatorial action space S Li, Y Zhang, X Wang, W Xue, B An IJCAI, 2021 | 16* | 2021 |
| Grasper: A Generalist Pursuer for Pursuit-Evasion Problems P Li, S Li, X Wang, J Cerny, Y Zhang, S McAleer, H Chan, B An AAMAS, 2024 | 15 | 2024 |