Yihan Du

Cited by

	All	Since 2021
Citations	230	222
h-index	8	8
i10-index	8	8

20202021202220232024202520268 22 25 42 64 68 1

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Longbo HuangProfessor, IIIS, Tsinghua University, ACM Distinguished ScientistVerified email at tsinghua.edu.cn
Wei Chen （陈卫）Microsoft Research, ACM/IEEE FellowVerified email at microsoft.com
Yuko KurokiCENTAI InstituteVerified email at centai.eu
R. SrikantUniversity of Illinois at Urbana-ChampaignVerified email at illinois.edu
Anna WinnickiStanford UniversityVerified email at stanford.edu
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ NvidiaVerified email at technion.ac.il
Gal DalalSr. Research Scientist, NvidiaVerified email at nvidia.com
Haoyu ZhaoPrinceton UniversityVerified email at princeton.edu
Wen SunAssistant Professor, Cornell UniversityVerified email at cornell.edu
Zhixuan FangTsinghua UniversityVerified email at mail.tsinghua.edu.cn

Yihan Du

Assistant Professor, SUTD ESD

Verified email at sutd.edu.sg - Homepage

Reinforcement Learning Online Learning Representation Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Provably efficient risk-sensitive reinforcement learning: iterated CVaR and worst path Y Du, S Wang, L Huang International Conference on Learning Representations (ICLR), 2023	37	2023
Combinatorial pure exploration with full-bandit or partial linear feedback Y Du, Y Kuroki, W Chen Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021	32*	2021
Exploration-driven policy optimization in RLHF: Theoretical insights on efficient data utilization Y Du, A Winnicki, G Dalal, S Mannor, R Srikant International Conference on Machine Learning (ICML), 2024	28	2024
Object-adaptive LSTM network for real-time visual tracking with adversarial data augmentation Y Du, Y Yan, S Chen, Y Hua Neurocomputing 384, 67-83, 2020	27	2020
Collaborative pure exploration in kernel bandit Y Du, W Chen, Y Yuroki, L Huang International Conference on Learning Representations (ICLR), 2023	20	2023
Combinatorial pure exploration for dueling bandit W Chen, Y Du, L Huang, H Zhao (*in alphabetical order) International Conference on Machine Learning (ICML), 1531-1541, 2020	16	2020
Provably safe reinforcement learning with step-wise violation constraints N Xiong, Y Du, L Huang Advances in Neural Information Processing Systems (NeurIPS) 36, 2024	14	2024
Multi-task Representation Learning for Pure Exploration in Linear Bandits Y Du, L Huang, W Sun International Conference on Machine Learning (ICML), 2023	11	2023
Continuous mean-covariance bandits Y Du, S Wang, Z Fang, L Huang Advances in Neural Information Processing Systems (NeurIPS) 34, 875-886, 2021	8	2021
A one-size-fits-all solution to conservative bandit problems Y Du, S Wang, L Huang Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021	8	2021
Combinatorial pure exploration with bottleneck reward function Y Du, Y Kuroki, W Chen Advances in Neural Information Processing Systems (NeurIPS) 34, 23956-23967, 2021	7	2021
Dueling bandits: from two-dueling to multi-dueling Y Du, S Wang, L Huang International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020	7	2020
Object-adaptive LSTM network for visual tracking Y Du, Y Yan, S Chen, Y Hua, H Wang International Conference on Pattern Recognition (ICPR), 1719-1724, 2018	7	2018
Provably efficient iterated cvar reinforcement learning with function approximation Y Chen, Y Du, P Hu, S Wang, D Wu, L Huang International Conference on Learning Representations (ICLR), 2023	5	2023
Cascading Reinforcement Learning Y Du, R Srikant, W Chen International Conference on Learning Representations (ICLR, Spotlight), 2024	2	2024
Reinforcement Learning with Segment Feedback Y Du, A Winnicki, G Dalal, S Mannor, R Srikant arXiv preprint arXiv:2502.01876, 2025	1	2025
Branching reinforcement learning Y Du, W Chen International Conference on Machine Learning (ICML), 2022		2022

The system can't perform the operation now. Try again later.

Articles 1–17

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors