Gheorghe Comanici

Cited by

	All	Since 2021
Citations	5523	5406
h-index	13	11
i10-index	15	12

4100

2050

1025

3075

202020212022202320242025202630 45 93 88 973 4053 151

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Gheorghe Comanici

Research Scientist, Google DeepMind

Verified email at google.com

LLMs for Science Reinforcement Learning Hierarchical Behavior Bisimulation metrics


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024	3439	2024
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 2025	1378	2025
The option keyboard: Combining skills in reinforcement learning A Barreto, D Borsa, S Hou, G Comanici, E Aygün, P Hamel, D Toyama, ... Advances in Neural Information Processing Systems 32, 2019	140	2019
others. 2025. Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 1	120	1
Androidenv: A reinforcement learning platform for android D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ... arXiv preprint arXiv:2105.13231, 2021	90	2021
What can I do here? A theory of affordances in reinforcement learning K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup International Conference on Machine Learning, 5243-5253, 2020	84	2020
Vision-language models as a source of rewards K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ... arXiv preprint arXiv:2312.09187, 2023	59	2023
Optimal policy switching algorithms for reinforcement learning G Comanici, D Precup Proceedings of the 9th International Conference on Autonomous Agents and …, 2010	52	2010
On-the-fly algorithms for bisimulation metrics G Comanici, P Panangaden, D Precup 2012 ninth international conference on quantitative evaluation of systems …, 2012	24	2012
Basis function discovery using spectral clustering and bisimulation metrics G Comanici, D Precup International Workshop on Adaptive and Learning Agents, 85-99, 2011	22	2011
Representation discovery for mdps using bisimulation metrics S Ruan, G Comanici, P Panangaden, D Precup Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015	21	2015
Temporally abstract partial models K Khetarpal, Z Ahmed, G Comanici, D Precup Advances in Neural Information Processing Systems 34, 1979-1991, 2021	19	2021
Finding increasingly large extremal graphs with alphazero and tabu search A Mehrabian, A Anand, H Kim, N Sonnerat, M Balog, G Comanici, ... arXiv preprint arXiv:2311.03583, 2023	15	2023
An empirical analysis of off-policy learning in discrete mdps C Păduraru, D Precup, J Pineau, G Comănici European Workshop on Reinforcement Learning, 89-102, 2013	13	2013
Knowledge representation for reinforcement learning using general value functions G Comanici, D Precup, A Barreto, DK Toyama, E Aygün, P Hamel, ...	11	2018
Basis refinement strategies for linear value function approximation in MDPs G Comanici, D Precup, P Panangaden Advances in neural information processing systems 28, 2015	9	2015
An AI system to help scientists write expert-level empirical software E Aygün, A Belyaeva, G Comanici, M Coram, H Cui, J Garrison, RJA Kast, ... arXiv preprint arXiv:2509.06503, 2025	8	2025
AndroidEnv: A Reinforcement Learning Platform for Android. abs/2105.13231 (2021) D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ... arXiv preprint cs.LG/2105.13231, 2021	6	2021
What can I do here K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup A theory of affordances in reinforcement learning. arXiv [cs. LG], 2020	6	2020
A study of off-policy learning in computational sustainability C Paduraru, D Precup, J Pineau, G Comanici European Workshop on Reinforcement Learning (EWRL) 24, 89-102, 2012	4	2012

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by