Marek Petrik

Cited by

	All	Since 2021
Citations	2648	1541
h-index	29	23
i10-index	57	37

380

190

285

2007200820092010201120122013201420152016201720182019202020212022202320242025202614 21 43 51 48 50 46 55 61 53 74 106 170 183 223 270 368 338 338 4

Public access

View all

32 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Mohammad GhavamzadehQualcomm AI ResearchVerified email at qti.qualcomm.com
Shlomo ZilbersteinProfessor of Computer Science, University of Massachusetts AmherstVerified email at cs.umass.edu
Chin Pang HoCity University of Hong KongVerified email at cityu.edu.hk
Dharmashankar SubramanianPrincipal Research Staff Member/Manager, IBM ResearchVerified email at us.ibm.com
Reazul Hasan RusselResearch Scientist at MetaVerified email at wildcats.unh.edu
Bahram BehzadianMetaVerified email at meta.com
Sridhar MahadevanDirector, Adobe Research & Professor, University of Massachusetts, AmherstVerified email at cs.umass.edu
Sechan OhMoloco, Previously at IBM, StanfordVerified email at molocoads.com
Ji LiuMetaVerified email at meta.com
Julien Grand-ClémentHEC ParisVerified email at hec.fr
Jia Lin HauUniversity of New HampshireVerified email at unh.edu
Bo LiuUniversity of Arizona, AAAI SM, IEEE SMVerified email at cs.umass.edu
Daniel S. BrownAssistant Professor, Robotics Center and Kahlert School of Computing, University of UtahVerified email at cs.utah.edu
Wolfram WiesemannProfessor of Analytics and Operations, Imperial College Business SchoolVerified email at imperial.ac.uk
Yinlam ChowResearch Scientist, Google ResearchVerified email at google.com
Qiuhao WangSouthwestern University of Finance and EconomicsVerified email at swufe.edu.cn
Ronny LussIBM ResearchVerified email at us.ibm.com
Xihong SuComputer Science, University of New HampshireVerified email at cs.unh.edu
Elita LoboUniversity of Massachusetts AmherstVerified email at cs.umass.edu
Amit DhurandharPrincipal Research Scientist, IBMVerified email at us.ibm.com

Marek Petrik

University of New Hampshire

Verified email at cs.unh.edu - Homepage

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
An approximate solution method for large risk-averse Markov decision processes M Petrik, D Subramanian arXiv preprint arXiv:1210.4901, 2012	203	2012
Finite-sample analysis of proximal gradient td algorithms B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik arXiv preprint arXiv:2006.14364, 2020	187	2020
Safe policy improvement by minimizing robust baseline regret M Ghavamzadeh, M Petrik, Y Chow Advances in Neural Information Processing Systems 29, 2016	182	2016
An Analysis of Laplacian Methods for Value Function Approximation in MDPs. M Petrik IJCAI, 2574-2579, 2007	98	2007
Feature selection using regularization in approximate linear programs for Markov decision processes M Petrik, G Taylor, R Parr, S Zilberstein arXiv preprint arXiv:1005.1860, 2010	94	2010
Fast Bellman updates for robust MDPs CP Ho, M Petrik, W Wiesemann International Conference on Machine Learning, 1979-1988, 2018	87	2018
Biasing approximate dynamic programming with a lower discount factor M Petrik, B Scherrer Advances in neural information processing systems 21, 2008	82	2008
Beyond confidence regions: Tight bayesian ambiguity sets for robust mdps M Petrik, RH Russel Advances in neural information processing systems 32, 2019	80	2019
Partial policy iteration for l1-robust markov decision processes CP Ho, M Petrik, W Wiesemann Journal of Machine Learning Research 22 (275), 1-46, 2021	79	2021
A practical method for solving contextual bandit problems using decision trees AN Elmachtoub, R McNellis, S Oh, M Petrik arXiv preprint arXiv:1706.04687, 2017	74	2017
Learning parallel portfolios of algorithms M Petrik, S Zilberstein Annals of Mathematics and Artificial Intelligence 48 (1), 85-106, 2006	63	2006
Bayesian robust optimization for imitation learning D Brown, S Niekum, M Petrik Advances in Neural Information Processing Systems 33, 2479-2491, 2020	59	2020
Tight approximations of dynamic risk measures DA Iancu, M Petrik, D Subramanian Mathematics of Operations Research 40 (3), 655-682, 2015	59	2015
RAAM: The benefits of robustness in approximating aggregated MDPs in reinforcement learning M Petrik, D Subramanian Advances in Neural Information Processing Systems 27, 2014	48	2014
Constraint relaxation in approximate linear programs M Petrik, S Zilberstein Proceedings of the 26th Annual International Conference on Machine Learning …, 2009	48	2009
A bilinear programming approach for multiagent planning M Petrik, S Zilberstein Journal of Artificial Intelligence Research 35, 235-274, 2009	47	2009
Average-Reward Decentralized Markov Decision Processes. M Petrik, S Zilberstein IJCAI, 1997-2002, 2007	40	2007
Entropic risk optimization in discounted MDPs JL Hau, M Petrik, M Ghavamzadeh International Conference on Artificial Intelligence and Statistics, 47-76, 2023	39	2023
Policy gradient in robust mdps with global convergence guarantee Q Wang, CP Ho, M Petrik International Conference on Machine Learning, 35763-35797, 2023	38	2023
Fast Algorithms for -constrained S-rectangular Robust MDPs B Behzadian, M Petrik, CP Ho Advances in Neural Information Processing Systems 34, 25982-25992, 2021	38	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors