[go: up one dir, main page]

Follow
Yihan Du
Yihan Du
Assistant Professor, SUTD ESD
Verified email at sutd.edu.sg - Homepage
Title
Cited by
Cited by
Year
Provably efficient risk-sensitive reinforcement learning: iterated CVaR and worst path
Y Du, S Wang, L Huang
International Conference on Learning Representations (ICLR), 2023
372023
Combinatorial pure exploration with full-bandit or partial linear feedback
Y Du, Y Kuroki, W Chen
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021
32*2021
Exploration-driven policy optimization in RLHF: Theoretical insights on efficient data utilization
Y Du, A Winnicki, G Dalal, S Mannor, R Srikant
International Conference on Machine Learning (ICML), 2024
282024
Object-adaptive LSTM network for real-time visual tracking with adversarial data augmentation
Y Du, Y Yan, S Chen, Y Hua
Neurocomputing 384, 67-83, 2020
272020
Collaborative pure exploration in kernel bandit
Y Du, W Chen, Y Yuroki, L Huang
International Conference on Learning Representations (ICLR), 2023
202023
Combinatorial pure exploration for dueling bandit
W Chen, Y Du, L Huang, H Zhao (*in alphabetical order)
International Conference on Machine Learning (ICML), 1531-1541, 2020
162020
Provably safe reinforcement learning with step-wise violation constraints
N Xiong, Y Du, L Huang
Advances in Neural Information Processing Systems (NeurIPS) 36, 2024
142024
Multi-task Representation Learning for Pure Exploration in Linear Bandits
Y Du, L Huang, W Sun
International Conference on Machine Learning (ICML), 2023
112023
Continuous mean-covariance bandits
Y Du, S Wang, Z Fang, L Huang
Advances in Neural Information Processing Systems (NeurIPS) 34, 875-886, 2021
82021
A one-size-fits-all solution to conservative bandit problems
Y Du, S Wang, L Huang
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021
82021
Combinatorial pure exploration with bottleneck reward function
Y Du, Y Kuroki, W Chen
Advances in Neural Information Processing Systems (NeurIPS) 34, 23956-23967, 2021
72021
Dueling bandits: from two-dueling to multi-dueling
Y Du, S Wang, L Huang
International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020
72020
Object-adaptive LSTM network for visual tracking
Y Du, Y Yan, S Chen, Y Hua, H Wang
International Conference on Pattern Recognition (ICPR), 1719-1724, 2018
72018
Provably efficient iterated cvar reinforcement learning with function approximation
Y Chen, Y Du, P Hu, S Wang, D Wu, L Huang
International Conference on Learning Representations (ICLR), 2023
52023
Cascading Reinforcement Learning
Y Du, R Srikant, W Chen
International Conference on Learning Representations (ICLR, Spotlight), 2024
22024
Reinforcement Learning with Segment Feedback
Y Du, A Winnicki, G Dalal, S Mannor, R Srikant
arXiv preprint arXiv:2502.01876, 2025
12025
Branching reinforcement learning
Y Du, W Chen
International Conference on Machine Learning (ICML), 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–17