Jordi Grau-Moya

Cited by

	All	Since 2021
Citations	1612	1323
h-index	20	17
i10-index	26	24

540

270

135

405

20142015201620172018201920202021202220232024202520266 8 11 24 33 76 126 76 105 176 417 525 19

Public access

View all

8 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Tim GeneweinDeepMindVerified email at google.com
Grégoire DelétangGoogle DeepMindVerified email at google.com
Anian RuossQuadratureVerified email at quadrature.ai
Joel VenessUniversal Artificial IntelligenceVerified email at godemperor.ai
Li Kevin WenliangSenior Research Scientist @ Google DeepMind; Research Fellow @ University College LondonVerified email at google.com
Marcus HutterResearcher@DeepMind & Professor at ANUVerified email at anu.edu.au
Elliot CattResearch Scientist, Google DeepMindVerified email at google.com
Daniel Alexander BraunProfessor, Ulm UniversityVerified email at uni-ulm.de
Pedro A. OrtegaArtificial Intelligence & Machine LearningVerified email at adaptiveagents.org
Laurent OrseauResearch Scientist at Google DeepMindVerified email at google.com
Matthew AitchisonGoogle DeepmindVerified email at deepmind.com
Markus KuneschGoogle DeepMindVerified email at google.com
Chris CundyFAR.AIVerified email at far.ai
Paul-Ambroise DuquenneMeta AI, FAIRVerified email at meta.com
Haitham Bou-AmmarRL-Team Leader, BO-Team Leader, MAS-Team Leader Huawei Noah's Ark Lab, H. Assistant Professor @ UCLVerified email at huawei.com
Peter VrancxPrincipal Scientist at IMECVerified email at imec.be
John ReidGoogle DeepMindVerified email at deepmind.com
Sergio Pascual-DíazAI Product and ResearchVerified email at prysmx.com
Mehdi BennaniGoogle DeepMindVerified email at deepmind.com
Róbert CsordásOpenAIVerified email at openai.com

Jordi Grau-Moya

Research Scientist at Google DeepMind

Verified email at deepmind.com - Homepage

Reinforcement Learning Machine Learning Information Theory


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Language modeling is compression G Delétang, A Ruoss, PA Duquenne, E Catt, T Genewein, C Mattern, ... arXiv preprint arXiv:2309.10668, 2023	291	2023
Neural networks and the chomsky hierarchy G Delétang, A Ruoss, J Grau-Moya, T Genewein, LK Wenliang, E Catt, ... arXiv preprint arXiv:2207.02098, 2022	242	2022
Bounded rationality, abstraction and hierarchical decision-making: an information-theoretic optimality principle T Genewein, F Leibfried, J Grau-Moya, DAB Braun Frontiers in Robotics and AI 2, 27, 2015	153	2015
Randomized positional encodings boost length generalization of transformers A Ruoss, G Delétang, T Genewein, J Grau-Moya, R Csordás, M Bennani, ... arXiv preprint arXiv:2305.16843, 2023	141	2023
Shaking the foundations: delusions in sequence models for interaction and control PA Ortega, M Kunesch, G Delétang, T Genewein, J Grau-Moya, J Veness, ... arXiv preprint arXiv:2110.10819, 2021	80	2021
Soft Q-Learning with Mutual-Information Regularization J Grau-Moya, F Leibfried, P Vrancx International Conference on Learning Representations (ICLR), 2019	70	2019
Balancing Two-Player Stochastic Games with Soft Q-Learning J Grau-Moya, F Leibfried, H Bou-Ammar Proceedings of the 27th International Joint Conference on Artificial …, 2018	64	2018
A unified bellman optimality principle combining reward maximization and empowerment F Leibfried, S Pascual-Diaz, J Grau-Moya Advances in Neural Information Processing Systems, 7869-7880, 2019	54	2019
Signaling equilibria in sensorimotor interactions F Leibfried, J Grau-Moya, DA Braun Cognition 141, 73-86, 2015	52	2015
Grandmaster-level chess without search A Ruoss, G Delétang, S Medapati, J Grau-Moya, LK Wenliang, E Catt, ... CoRR, 2024	43	2024
Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes J Grau-Moya, F Leibfried, T Genewein, DA Braun Joint European Conference on Machine Learning and Knowledge Discovery in …, 2016	42	2016
Amortized planning with large-scale transformers: A case study on chess A Ruoss, G Delétang, S Medapati, J Grau-Moya, LK Wenliang, E Catt, ... Advances in Neural Information Processing Systems 37, 65765-65790, 2024	40	2024
Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning F Leibfried, J Grau-Moya Conference on Robot Learning (CoRL), 2019	32	2019
An information-theoretic optimality principle for deep reinforcement learning F Leibfried, J Grau-Moya, H Bou-Ammar NeurIPS Workshop on Deep Reinforcement Learning, 2017	32	2017
Learning Universal Predictors J Grau-Moya, T Genewein, M Hutter, L Orseau, G Delétang, E Catt, ... Proceedings of the 41st International Conference on Machine Learning, PMLR …, 2024	27	2024
The effect of model uncertainty on cooperation in sensorimotor interactions J Grau-Moya, E Hez, G Pezzulo, DA Braun Journal of The Royal Society Interface 10 (87), 20130554, 2013	24	2013
Disentangled Skill Embeddings for Reinforcement Learning JC Petangoda, S Pascual-Diaz, V Adam, P Vrancx, J Grau-Moya NeurIPS Workshop on Learning Transferable Skills, 2019	22	2019
Language modeling is compression, 2024 G Delétang, A Ruoss, PA Duquenne, E Catt, T Genewein, C Mattern, ... URL https://arxiv. org/abs/2309.10668, 0	22
Llms are greedy agents: Effects of rl fine-tuning on decision-making abilities T Schmied, J Bornschein, J Grau-Moya, M Wulfmeier, R Pascanu arXiv preprint arXiv:2504.16078, 2025	20	2025
Memory-based meta-learning on non-stationary distributions T Genewein, G Delétang, A Ruoss, LK Wenliang, E Catt, V Dutordoir, ... International conference on machine learning, 11173-11195, 2023	20	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors