[go: up one dir, main page]

Follow
Xiao Hu
Xiao Hu
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Cited by
Year
Fault diagnosis using novel AdaBoost based discriminant locality preserving projection with resamples
YL He, Y Zhao, X Hu, XN Yan, QX Zhu, Y Xu
Engineering Applications of Artificial Intelligence 91, 103631, 2020
712020
Thyme: Think Beyond Images
YF Zhang, X Lu, S Yin, C Fu, W Chen, X Hu, B Wen, K Jiang, C Liu, ...
arXiv preprint arXiv:2508.11630, 2025
61*2025
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
J Li, X Hu, H Xu, J Liu, X Zhan, YQ Zhang
arXiv preprint arXiv:2305.15669, 2023
392023
Query-Policy Misalignment in Preference-Based Reinforcement Learning
X Hu, J Li, X Zhan, QS Jia, YQ Zhang
International Conference on Learning Representations (ICLR), 2024, Spotlight, 2023
362023
Kwai Keye-VL Technical Report
KK Team, B Yang, B Wen, C Liu, C Chu, C Song, C Rao, C Yi, D Li, ...
arXiv preprint arXiv:2507.01949, 2025
292025
Mind the gap: Offline policy optimization for imperfect rewards
J Li*, X Hu*, H Xu, J Liu, X Zhan, QS Jia, YQ Zhang
International Conference on Learning Representations (ICLR), 2023, 2023
282023
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
YF Zhang, X Lu, X Hu, C Fu, B Wen, T Zhang, C Liu, K Jiang, K Chen, ...
arXiv preprint arXiv:2505.02835, 2025
252025
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
S Huang, Q Gallouédec, F Felten, A Raffin, RFJ Dossa, Y Zhao, ...
arXiv preprint arXiv:2402.03046, 2024
192024
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
J Li, J Zheng, Y Zheng, L Mao, X Hu, S Cheng, H Niu, J Liu, Y Liu, J Liu, ...
ICML 2024, 2024
152024
Kwai Keye-VL 1.5 Technical Report
B Yang, B Wen, B Ding, C Liu, C Chu, C Song, C Rao, C Yi, D Li, D Zang, ...
arXiv preprint arXiv:2509.01563, 2025
142025
Data Center Cooling System Optimization Using Offline Reinforcement Learning
X Zhan, X Zhu, P Cheng, X Hu, Z He, H Geng, J Leng, H Zheng, C Liu, ...
ICLR 2025, 2025
92025
Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
X Hu, X Lu, L Mao, YF Zhang, T Zhang, B Wen, F Yang, T Gao, G Zhou
arXiv preprint arXiv:2505.21067, 2025
72025
Large-Scale Data Center Cooling Control via Sample-Efficient Reinforcement Learning
N Mu, X Hu, QS Jia, X Zhu, X He
2024 IEEE 20th International Conference on Automation Science and …, 2024
72024
Integrating Mechanism and Data: Reinforcement Learning Based on Multi-Fidelity Model for Data Center Cooling Control
N Mu, X Hu, QS Jia
2023 China Automation Congress (CAC), 5283-5288, 2023
52023
CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries
N Mu, H Hu, X Hu, Y Yang, B Xu, QS Jia
ICML 2025, 2025
42025
Novel L2-Discriminant Locality Preserving Projection Integrated with Adaboost and Its Application to Fault Diagnosis
X Hu, Y Zhao, Y Xu, YL He, QX Zhu
2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS …, 2020
22020
Simulation and AI for Critical Infrastructure
QS Jia, C Duan, S Feng, Y Zhu, X Hu
2024 Winter Simulation Conference (WSC), 57-71, 2024
12024
Vehicle Extreme Control based on Offline Reinforcement Leaning
S Zhao, J Li, X Hu, J Zhang, C He
2022 China Automation Congress (CAC), 4539-4543, 2022
12022
面向数据中心绿色可靠运行的强化学习方法
贾庆山, 唐静娴, 吴俊杰, 胡潇, 林依挺, 夏恒
智能科学与技术学报 2 (4), 341-347, 0
The system can't perform the operation now. Try again later.
Articles 1–19