| Grandmaster level in StarCraft II using multi-agent reinforcement learning O Vinyals*, I Babuschkin*, WM Czarnecki*, M Mathieu*, A Dudzik*, ... Nature 575, 350–354, 2019 | 6668 | 2019 |
| Value-decomposition networks for cooperative multi-agent learning P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... AAMAS 2018, 2017 | 2665 | 2017 |
| Reinforcement learning with unsupervised auxiliary tasks M Jaderberg*, V Mnih*, WM Czarnecki*, T Schaul, JZ Leibo, D Silver, ... ICLR 2017, 2017 | 1606 | 2017 |
| Progress & Compress: A scalable framework for continual learning J Schwarz, J Luketina, WM Czarnecki, A Grabska-Barwinska, YW Teh, ... ICML 2018, 2018 | 1213 | 2018 |
| On loss functions for deep neural networks in classification K Janocha, WM Czarnecki TFML 2017, 2017 | 1104 | 2017 |
| Population based training of neural networks M Jaderberg, V Dalibard, S Osindero, WM Czarnecki, J Donahue, ... arXiv preprint arXiv:1711.09846, 2017 | 1093 | 2017 |
| Human-level performance in 3D multiplayer games with population-based reinforcement learning M Jaderberg*, WM Czarnecki*, I Dunning*, L Marris, G Lever, ... Science 364 (6443), 859-865, 2019 | 1002 | 2019 |
| Distral: Robust Multitask Reinforcement Learning YW Teh, V Bapst, WM Czarnecki, J Quan, J Kirkpatrick, R Hadsell, ... NIPS 2017, 2017 | 733 | 2017 |
| Decoupled neural interfaces using synthetic gradients M Jaderberg, WM Czarnecki, S Osindero, O Vinyals, A Graves, ... ICML 2017, 2017 | 489 | 2017 |
| Multi-task deep reinforcement learning with popart M Hessel, H Soyer, L Espeholt, W Czarnecki, S Schmitt, H van Hasselt AAAI 2019, 2018 | 395 | 2018 |
| Sobolev Training for Neural Networks WM Czarnecki, S Osindero, M Jaderberg, G Świrszcz, R Pascanu NIPS 2017, 2017 | 380 | 2017 |
| Grounded language learning in a simulated 3d world KM Hermann, F Hill, S Green, F Wang, R Faulkner, H Soyer, D Szepesvari, ... CoRR, abs/1706.06551, 2017 | 373* | 2017 |
| Open-ended Learning in Symmetric Zero-sum Games D Balduzzi, M Garnelo, Y Bachrach, WM Czarnecki, J Perolat, ... ICML 2019, 2019 | 248 | 2019 |
| Open-Ended Learning Leads to Generally Capable Agents OEL Team, A Stooke, A Mahajan, C Barros, C Deck, J Bauer, J Sygnowski, ... arXiv preprint arXiv:2107.12808, 2021 | 237 | 2021 |
| Distilling Policy Distillation WM Czarnecki, R Pascanu, S Osindero, SM Jayakumar, G Swirszcz, ... AISTATS 2019, 2019 | 214 | 2019 |
| Discovering Reinforcement Learning Algorithms J Oh, M Hessel, WM Czarnecki, Z Xu, H van Hasselt, S Singh, D Silver NeurIPS 2020, 2020 | 196 | 2020 |
| Adapting auxiliary losses using gradient similarity Y Du, WM Czarnecki, SM Jayakumar, R Pascanu, B Lakshminarayanan NeurIPS 2018 MetaLearning Workshop, 2018 | 194 | 2018 |
| Human-level performance in first-person multiplayer games with population-based deep reinforcement learning M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... arXiv preprint arXiv:1807.01281, 2018 | 184 | 2018 |
| Kickstarting Deep Reinforcement Learning S Schmitt, JJ Hudson, A Zidek, S Osindero, C Doersch, WM Czarnecki, ... NIPS 2018 DL Workshop, 2018 | 181 | 2018 |
| Multiplicative interactions and where to find them SM Jayakumar, WM Czarnecki, J Menick, J Schwarz, J Rae, S Osindero, ... ICLR 2020, 2019 | 173 | 2019 |