[go: up one dir, main page]

Follow
Jordi Grau-Moya
Jordi Grau-Moya
Research Scientist at Google DeepMind
Verified email at deepmind.com - Homepage
Title
Cited by
Cited by
Year
Language modeling is compression
G Delétang, A Ruoss, PA Duquenne, E Catt, T Genewein, C Mattern, ...
arXiv preprint arXiv:2309.10668, 2023
2912023
Neural networks and the chomsky hierarchy
G Delétang, A Ruoss, J Grau-Moya, T Genewein, LK Wenliang, E Catt, ...
arXiv preprint arXiv:2207.02098, 2022
2422022
Bounded rationality, abstraction and hierarchical decision-making: an information-theoretic optimality principle
T Genewein, F Leibfried, J Grau-Moya, DAB Braun
Frontiers in Robotics and AI 2, 27, 2015
1532015
Randomized positional encodings boost length generalization of transformers
A Ruoss, G Delétang, T Genewein, J Grau-Moya, R Csordás, M Bennani, ...
arXiv preprint arXiv:2305.16843, 2023
1412023
Shaking the foundations: delusions in sequence models for interaction and control
PA Ortega, M Kunesch, G Delétang, T Genewein, J Grau-Moya, J Veness, ...
arXiv preprint arXiv:2110.10819, 2021
802021
Soft Q-Learning with Mutual-Information Regularization
J Grau-Moya, F Leibfried, P Vrancx
International Conference on Learning Representations (ICLR), 2019
702019
Balancing Two-Player Stochastic Games with Soft Q-Learning
J Grau-Moya, F Leibfried, H Bou-Ammar
Proceedings of the 27th International Joint Conference on Artificial …, 2018
642018
A unified bellman optimality principle combining reward maximization and empowerment
F Leibfried, S Pascual-Diaz, J Grau-Moya
Advances in Neural Information Processing Systems, 7869-7880, 2019
542019
Signaling equilibria in sensorimotor interactions
F Leibfried, J Grau-Moya, DA Braun
Cognition 141, 73-86, 2015
522015
Grandmaster-level chess without search
A Ruoss, G Delétang, S Medapati, J Grau-Moya, LK Wenliang, E Catt, ...
CoRR, 2024
432024
Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes
J Grau-Moya, F Leibfried, T Genewein, DA Braun
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2016
422016
Amortized planning with large-scale transformers: A case study on chess
A Ruoss, G Delétang, S Medapati, J Grau-Moya, LK Wenliang, E Catt, ...
Advances in Neural Information Processing Systems 37, 65765-65790, 2024
402024
Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning
F Leibfried, J Grau-Moya
Conference on Robot Learning (CoRL), 2019
322019
An information-theoretic optimality principle for deep reinforcement learning
F Leibfried, J Grau-Moya, H Bou-Ammar
NeurIPS Workshop on Deep Reinforcement Learning, 2017
322017
Learning Universal Predictors
J Grau-Moya, T Genewein, M Hutter, L Orseau, G Delétang, E Catt, ...
Proceedings of the 41st International Conference on Machine Learning, PMLR …, 2024
272024
The effect of model uncertainty on cooperation in sensorimotor interactions
J Grau-Moya, E Hez, G Pezzulo, DA Braun
Journal of The Royal Society Interface 10 (87), 20130554, 2013
242013
Disentangled Skill Embeddings for Reinforcement Learning
JC Petangoda, S Pascual-Diaz, V Adam, P Vrancx, J Grau-Moya
NeurIPS Workshop on Learning Transferable Skills, 2019
222019
Language modeling is compression, 2024
G Delétang, A Ruoss, PA Duquenne, E Catt, T Genewein, C Mattern, ...
URL https://arxiv. org/abs/2309.10668, 0
22
Llms are greedy agents: Effects of rl fine-tuning on decision-making abilities
T Schmied, J Bornschein, J Grau-Moya, M Wulfmeier, R Pascanu
arXiv preprint arXiv:2504.16078, 2025
202025
Memory-based meta-learning on non-stationary distributions
T Genewein, G Delétang, A Ruoss, LK Wenliang, E Catt, V Dutordoir, ...
International conference on machine learning, 11173-11195, 2023
202023
The system can't perform the operation now. Try again later.
Articles 1–20