| Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 3439 | 2024 |
| Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 2025 | 1378 | 2025 |
| The option keyboard: Combining skills in reinforcement learning A Barreto, D Borsa, S Hou, G Comanici, E Aygün, P Hamel, D Toyama, ... Advances in Neural Information Processing Systems 32, 2019 | 140 | 2019 |
| Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 2025 | 120 | 2025 |
| AndroidEnv: A reinforcement learning platform for Android D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ... arXiv preprint arXiv:2105.13231, 2021 | 90 | 2021 |
| What can I do here? A theory of affordances in reinforcement learning K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup International Conference on Machine Learning, 5243-5253, 2020 | 84 | 2020 |
| Vision-language models as a source of rewards K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ... arXiv preprint arXiv:2312.09187, 2023 | 59 | 2023 |
| Optimal policy switching algorithms for reinforcement learning G Comanici, D Precup Proceedings of the 9th International Conference on Autonomous Agents and …, 2010 | 52 | 2010 |
| On-the-fly algorithms for bisimulation metrics G Comanici, P Panangaden, D Precup 2012 Ninth International Conference on Quantitative Evaluation of Systems …, 2012 | 24 | 2012 |
| Basis function discovery using spectral clustering and bisimulation metrics G Comanici, D Precup International Workshop on Adaptive and Learning Agents, 85-99, 2011 | 22 | 2011 |
| Representation discovery for MDPs using bisimulation metrics S Ruan, G Comanici, P Panangaden, D Precup Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015 | 21 | 2015 |
| Temporally abstract partial models K Khetarpal, Z Ahmed, G Comanici, D Precup Advances in Neural Information Processing Systems 34, 1979-1991, 2021 | 19 | 2021 |
| Finding increasingly large extremal graphs with AlphaZero and tabu search A Mehrabian, A Anand, H Kim, N Sonnerat, M Balog, G Comanici, ... arXiv preprint arXiv:2311.03583, 2023 | 15 | 2023 |
| An empirical analysis of off-policy learning in discrete MDPs C Păduraru, D Precup, J Pineau, G Comănici European Workshop on Reinforcement Learning, 89-102, 2013 | 13 | 2013 |
| Knowledge representation for reinforcement learning using general value functions G Comanici, D Precup, A Barreto, DK Toyama, E Aygün, P Hamel, ... | 11 | 2018 |
| Basis refinement strategies for linear value function approximation in MDPs G Comanici, D Precup, P Panangaden Advances in Neural Information Processing Systems 28, 2015 | 9 | 2015 |
| An AI system to help scientists write expert-level empirical software E Aygün, A Belyaeva, G Comanici, M Coram, H Cui, J Garrison, RJA Kast, ... arXiv preprint arXiv:2509.06503, 2025 | 8 | 2025 |
| AndroidEnv: A reinforcement learning platform for Android D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ... arXiv preprint arXiv:2105.13231, 2021 | 6 | 2021 |
| What can I do here? A theory of affordances in reinforcement learning K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup arXiv [cs.LG], 2020 | 6 | 2020 |
| A study of off-policy learning in computational sustainability C Paduraru, D Precup, J Pineau, G Comanici European Workshop on Reinforcement Learning (EWRL) 24, 89-102, 2012 | 4 | 2012 |