| Reinforcement learning based dynamic model combination for time series forecasting Y Fu, D Wu, B Boulet Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 6639-6647, 2022 | 73 | 2022 |
| Seed diffusion: A large-scale diffusion language model with high-speed inference Y Song, Z Zhang, C Luo, P Gao, F Xia, H Luo, Z Li, Y Yang, H Yu, X Qu, ... arXiv preprint arXiv:2508.02193, 2025 | 42 | 2025 |
| FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning Y Fu, H Zhang, D Wu, W Xu, B Boulet International Conference on Machine Learning 235, 14256-14274, 2024 | 25 | 2024 |
| A closer look at offline RL agents Y Fu, D Wu, B Boulet Advances in Neural Information Processing Systems 35, 8591-8604, 2022 | 25 | 2022 |
| Robot Policy Learning with Temporal Optimal Transport Reward Y Fu, H Zhang, D Wu, W Xu, B Boulet The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024 | 5 | 2024 |