Gheorghe Comanici
Research Scientist, Google DeepMind
Verified email at google.com
Title
Cited by
Year
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 3439; Year: 2024
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities
G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...
arXiv preprint arXiv:2507.06261, 2025
Cited by: 1378; Year: 2025
The option keyboard: Combining skills in reinforcement learning
A Barreto, D Borsa, S Hou, G Comanici, E Aygün, P Hamel, D Toyama, ...
Advances in Neural Information Processing Systems 32, 2019
Cited by: 140; Year: 2019
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities
G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...
arXiv preprint arXiv:2507.06261
Cited by: 120
Androidenv: A reinforcement learning platform for android
D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ...
arXiv preprint arXiv:2105.13231, 2021
Cited by: 90; Year: 2021
What can I do here? A theory of affordances in reinforcement learning
K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup
International Conference on Machine Learning, 5243-5253, 2020
Cited by: 84; Year: 2020
Vision-language models as a source of rewards
K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ...
arXiv preprint arXiv:2312.09187, 2023
Cited by: 59; Year: 2023
Optimal policy switching algorithms for reinforcement learning
G Comanici, D Precup
Proceedings of the 9th International Conference on Autonomous Agents and …, 2010
Cited by: 52; Year: 2010
On-the-fly algorithms for bisimulation metrics
G Comanici, P Panangaden, D Precup
2012 ninth international conference on quantitative evaluation of systems …, 2012
Cited by: 24; Year: 2012
Basis function discovery using spectral clustering and bisimulation metrics
G Comanici, D Precup
International Workshop on Adaptive and Learning Agents, 85-99, 2011
Cited by: 22; Year: 2011
Representation discovery for mdps using bisimulation metrics
S Ruan, G Comanici, P Panangaden, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015
Cited by: 21; Year: 2015
Temporally abstract partial models
K Khetarpal, Z Ahmed, G Comanici, D Precup
Advances in Neural Information Processing Systems 34, 1979-1991, 2021
Cited by: 19; Year: 2021
Finding increasingly large extremal graphs with alphazero and tabu search
A Mehrabian, A Anand, H Kim, N Sonnerat, M Balog, G Comanici, ...
arXiv preprint arXiv:2311.03583, 2023
Cited by: 15; Year: 2023
An empirical analysis of off-policy learning in discrete mdps
C Păduraru, D Precup, J Pineau, G Comănici
European Workshop on Reinforcement Learning, 89-102, 2013
Cited by: 13; Year: 2013
Knowledge representation for reinforcement learning using general value functions
G Comanici, D Precup, A Barreto, DK Toyama, E Aygün, P Hamel, ...
Cited by: 11; Year: 2018
Basis refinement strategies for linear value function approximation in MDPs
G Comanici, D Precup, P Panangaden
Advances in neural information processing systems 28, 2015
Cited by: 9; Year: 2015
An AI system to help scientists write expert-level empirical software
E Aygün, A Belyaeva, G Comanici, M Coram, H Cui, J Garrison, RJA Kast, ...
arXiv preprint arXiv:2509.06503, 2025
Cited by: 8; Year: 2025
AndroidEnv: A Reinforcement Learning Platform for Android
D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ...
arXiv preprint cs.LG/2105.13231, 2021
Cited by: 6; Year: 2021
What can I do here? A theory of affordances in reinforcement learning
K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup
arXiv [cs.LG], 2020
Cited by: 6; Year: 2020
A study of off-policy learning in computational sustainability
C Paduraru, D Precup, J Pineau, G Comanici
European Workshop on Reinforcement Learning (EWRL) 24, 89-102, 2012
Cited by: 4; Year: 2012