[go: up one dir, main page]

Follow
Lihong Li (李力鸿)
Lihong Li (李力鸿)
AI Research Scientist, Meta
Verified email at meta.com - Homepage
Title
Cited by
Cited by
Year
A contextual-bandit approach to personalized news article recommendation
L Li, W Chu, J Langford, RE Schapire
Proceedings of the 19th international conference on World wide web, 661-670, 2010
41192010
An empirical evaluation of thompson sampling
O Chapelle, L Li
Advances in neural information processing systems 24, 2011
21452011
Parallelized stochastic gradient descent
M Zinkevich, M Weimer, L Li, A Smola
Advances in neural information processing systems 23, 2010
19372010
Contextual bandits with linear payoff functions
W Chu, L Li, L Reyzin, R Schapire
Proceedings of the fourteenth international conference on artificial …, 2011
15362011
Neural approaches to conversational AI
J Gao, M Galley, L Li
The 41st international ACM SIGIR conference on research & development in …, 2018
11292018
Doubly robust policy evaluation and learning
M Dudík, J Langford, L Li
arXiv preprint arXiv:1103.4601, 2011
10442011
Doubly Robust Policy Evaluation and Learning
M Dudık, J Langford, L Li
1044*
Doubly robust off-policy value evaluation for reinforcement learning
N Jiang, L Li
International conference on machine learning, 652-661, 2016
9882016
Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms
L Li, W Chu, J Langford, X Wang
Proceedings of the fourth ACM international conference on Web search and …, 2011
7592011
Towards a unified theory of state abstraction for MDPs.
L Li, TJ Walsh, ML Littman
AI&M 1 (2), 3, 2006
7412006
PAC model-free reinforcement learning
AL Strehl, L Li, E Wiewiora, J Langford, ML Littman
Proceedings of the 23rd international conference on Machine learning, 881-888, 2006
7192006
Taming the monster: A fast and simple algorithm for contextual bandits
A Agarwal, D Hsu, S Kale, J Langford, L Li, R Schapire
International conference on machine learning, 1638-1646, 2014
6652014
Sparse online learning via truncated gradient
J Langford, L Li, T Zhang
Advances in neural information processing systems 21, 2008
6232008
Towards end-to-end reinforcement learning of dialogue agents for information access
B Dhingra, L Li, X Li, J Gao, YN Chen, F Ahmad, L Deng
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017
591*2017
Doubly robust policy evaluation and optimization
M Dudík, D Erhan, J Langford, L Li
5762014
End-to-end task-completion neural dialogue systems
X Li, YN Chen, L Li, J Gao, A Celikyilmaz
arXiv preprint arXiv:1703.01008, 2017
4952017
Neuro-symbolic program synthesis
E Parisotto, A Mohamed, R Singh, L Li, D Zhou, P Kohli
arXiv preprint arXiv:1611.01855, 2016
4652016
Breaking the curse of horizon: Infinite-horizon off-policy estimation
Q Liu, L Li, Z Tang, D Zhou
Advances in neural information processing systems 31, 2018
4642018
Provably optimal algorithms for generalized linear contextual bandits
L Li, Y Lu, D Zhou
International Conference on Machine Learning, 2071-2080, 2017
4472017
Dualdice: Behavior-agnostic estimation of discounted stationary distribution corrections
O Nachum, Y Chow, B Dai, L Li
Advances in neural information processing systems 32, 2019
4402019
The system can't perform the operation now. Try again later.
Articles 1–20