Jian Qian
Verified email at hku.hk
Title
Cited by
Year
The statistical complexity of interactive decision making
DJ Foster, SM Kakade, J Qian, A Rakhlin
arXiv preprint arXiv:2112.13487, 2021
Cited by: 286; Year: 2021
Convex and non-convex optimization under generalized smoothness
H Li, J Qian, Y Tian, A Rakhlin, A Jadbabaie
Advances in Neural Information Processing Systems 36, 40238-40271, 2023
Cited by: 103; Year: 2023
Importance resampling for off-policy prediction
M Schlegel, W Chung, D Graves, J Qian, M White
Advances in Neural Information Processing Systems 32, 2019
Cited by: 52; Year: 2019
Exploration bonus for regret minimization in discrete and continuous average reward MDPs
J Qian, R Fruit, M Pirotta, A Lazaric
Advances in Neural Information Processing Systems 32, 2019
Cited by: 50*; Year: 2019
Model-free reinforcement learning with the decision-estimation coefficient
DJ Foster, N Golowich, J Qian, A Rakhlin, A Sekhari
Thirty-seventh Conference on Neural Information Processing Systems, 2023
Cited by: 36*; Year: 2023
Towards minimax optimal reinforcement learning in factored Markov decision processes
Y Tian, J Qian, S Sra
Advances in Neural Information Processing Systems 33, 19896-19907, 2020
Cited by: 35; Year: 2020
Concentration inequalities for multinoulli random variables
J Qian, R Fruit, M Pirotta, A Lazaric
arXiv preprint arXiv:2001.11595, 2020
Cited by: 24; Year: 2020
Robust learning under clean-label attack
A Blum, S Hanneke, J Qian, H Shao
Conference on Learning Theory, 591-634, 2021
Cited by: 15; Year: 2021
Byzantine-robust federated linear bandits
A Jadbabaie, H Li, J Qian, Y Tian
2022 IEEE 61st Conference on Decision and Control (CDC), 5206-5213, 2022
Cited by: 14; Year: 2022
Online estimation via offline estimation: An information-theoretic framework
DJ Foster, Y Han, J Qian, A Rakhlin
Advances in Neural Information Processing Systems 37, 42840-42898, 2024
Cited by: 13; Year: 2024
How Does Variance Shape the Regret in Contextual Bandits?
Z Jia, J Qian, A Rakhlin, CY Wei
Advances in Neural Information Processing Systems 37, 83730-83785, 2024
Cited by: 10; Year: 2024
Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability
F Chen, DJ Foster, Y Han, J Qian, A Rakhlin, Y Xu
Advances in Neural Information Processing Systems 37, 75585-75641, 2024
Cited by: 10; Year: 2024
Offline oracle-efficient learning for contextual MDPs via layerwise exploration-exploitation tradeoff
J Qian, H Hu, D Simchi-Levi
Advances in Neural Information Processing Systems 37, 133743-133775, 2024
Cited by: 7; Year: 2024
Refined Risk Bounds for Unbounded Losses via Transductive Priors
J Qian, A Rakhlin, N Zhivotovskiy
arXiv preprint arXiv:2410.21621, 2024
Cited by: 7; Year: 2024
Bridging multiple worlds: multi-marginal optimal transport for causal partial-identification problem
Z Gao, S Ge, J Qian
arXiv preprint arXiv:2406.07868, 2024
Cited by: 5; Year: 2024
Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits
Y Gu, Y Han, J Qian
arXiv preprint arXiv:2503.00273, 2025
Cited by: 2; Year: 2025
The Non-linear F-Design and Applications to Interactive Learning
A Agarwal, J Qian, A Rakhlin, T Zhang
Forty-first International Conference on Machine Learning, 2024
Cited by: 1; Year: 2024
Sigmoid-FTRL: Design-Based Adaptive Neyman Allocation for AIPW Estimators
F Chen, S Ge, J Qian, C Harshaw
arXiv preprint arXiv:2511.19905, 2025
Year: 2025
MUSE: Multi-Treatment Experiment Design for Winner Selection and Effect Estimation
J Xu, J Qian, Z Gao
arXiv preprint arXiv:2510.04489, 2025
Year: 2025
To bootstrap or to rollout? An optimal and adaptive interpolation
W Mou, J Qian
arXiv preprint arXiv:2411.09731, 2024
Year: 2024