[go: up one dir, main page]

Follow
Amir-massoud Farahmand
Amir-massoud Farahmand
Polytechnique Montreal, Mila, University of Toronto
Verified email at cs.toronto.edu - Homepage
Title
Cited by
Cited by
Year
Error propagation for approximate policy and value iteration
A Farahmand, C Szepesvári, R Munos
Advances in Neural Information Processing Systems (NeurIPS), 568-576, 2010
3132010
Regularized Policy Iteration
A Farahmand, M Ghavamzadeh, S Mannor, C Szepesvári
Advances in Neural Information Processing Systems 21 (NeurIPS 2008), 441-448, 2009
1742009
Learning from Limited Demonstrations
B Kim, A Farahmand, J Pineau, D Precup
Advances in Neural Information Processing Systems (NeurIPS), 2859-2867, 2013
1652013
Value-aware loss function for model-based reinforcement learning
A Farahmand, A Barreto, D Nikovski
Artificial Intelligence and Statistics (AISTATS), 1486-1494, 2017
1622017
Manifold-adaptive dimension estimation
A Farahmand, C Szepesvári, JY Audibert
Proceedings of the 24th International Conference on Machine Learning (ICML …, 2007
1562007
Regularized policy iteration with nonparametric function spaces
A Farahmand, M Ghavamzadeh, C Szepesvári, S Mannor
Journal of Machine Learning Research (JMLR) 17 (1), 4809-4874, 2016
1442016
Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems
A Farahmand, M Ghavamzadeh, C Szepesvári, S Mannor
American Control Conference (ACC), 725-730, 2009
100*2009
Robust jacobian estimation for uncalibrated visual servoing
A Shademan, A Farahmand, M Jägersand
IEEE International Conference on Robotics and Automation (ICRA), 5564-5569, 2010
932010
Model Selection in Reinforcement Learning
AM Farahmand, C Szepesvári
Machine learning 85 (3), 299-332, 2011
902011
Iterative Value-Aware Model Learning
A Farahmand
Advances in Neural Information Processing Systems (NeurIPS), 9072-9083, 2018
862018
Action-Gap Phenomenon in Reinforcement Learning
AM Farahmand
Neural Information Processing Systems (NeurIPS), 2011
742011
Deep reinforcement learning for partial differential equation control
A Farahmand, S Nabi, DN Nikovski
American Control Conference (ACC), 3120-3127, 2017
592017
Global visual-motor estimation for uncalibrated visual servoing
A Farahmand, A Shademan, M Jagersand
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS …, 2007
55*2007
Regularization in Reinforcement Learning
AM Farahmand
Department of Computing Science, University of Alberta, 2011
522011
Policy-aware model learning for policy gradient methods
R Abachi, M Ghavamzadeh, A Farahmand
arXiv:2003.00030, 2020
482020
Value Gradient weighted Model-Based Reinforcement Learning
CA Voelcker, V Liao, A Garg, A Farahmand
International Conference on Learning Representations (ICLR), 2022
452022
Attentional network for visual object detection
K Hara, MY Liu, O Tuzel, A Farahmand
arXiv preprint arXiv:1702.01478, 2017
432017
Model-based and model-free reinforcement learning for visual servoing
A Farahmand, A Shademan, M Jagersand, C Szepesvári
IEEE International Conference on Robotics and Automation (ICRA), 2917-2924, 2009
39*2009
Understanding and Mitigating the Limitations of Prioritized Experience Replay
Y Pan, J Mei, A Farahmand, M White, H Yao, M Rohani, J Luo
The 38th Conference on Uncertainty in Artificial Intelligence (UAI), 2022
382022
Method for Data-Driven Learning-based Control of HVAC Systems using High-Dimensional Sensory Observations
A Farahmand, S Nabi, P Grover, DN Nikovski
US Patent App. 15/290,038, 2018
382018
The system can't perform the operation now. Try again later.
Articles 1–20