| Sparq attention: Bandwidth-efficient llm inference L Ribar, I Chelombiev, L Hudlass-Galley, C Blake, C Luschi, D Orr arXiv preprint arXiv:2312.04985, 2023 | 96 | 2023 |
| Training and inference of large language models using 8-bit floating point SP Perez, Y Zhang, J Briggs, C Blake, J Levy-Kramer, P Balanca, ... arXiv preprint arXiv:2309.17224, 2023 | 31 | 2023 |
| Snowflake: Scaling GNNs to high-dimensional continuous control via parameter freezing C Blake, V Kurin, M Igl, S Whiteson Advances in Neural Information Processing Systems 34, 23983-23992, 2021 | 23 | 2021 |
| u-P: The Unit-Scaled Maximal Update Parametrization C Blake, C Eichenberg, J Dean, L Balles, LY Prince, B Deiseroth, ... arXiv preprint arXiv:2407.17465, 2024 | 21 | 2024 |
| Unit scaling: Out-of-the-box low-precision training C Blake, D Orr, C Luschi International Conference on Machine Learning, 2548-2576, 2023 | 15 | 2023 |
| The winnability of klondike solitaire and many other patience games C Blake, IP Gent arXiv preprint arXiv:1906.12314, 2019 | 5* | 2019 |
| thecharlesblake/Solvitaire: Release for Zenodo DOI-issuing C Blake, I Gent Zenodo, 0 | | |