Edward Beeching

Cited by

	All	Since 2021
Citations	2813	2801
h-index	19	19
i10-index	22	22

Citations per year

Year	Citations
2021	19
2022	34
2023	141
2024	1065
2025	1505
2026	33

Public access (based on funding mandates)

	Available	Not available
Articles	3	0

Edward Beeching

Research Scientist, Hugging Face

Verified email at insa-lyon.fr

Machine Learning


Title	Cited by	Year
Zephyr: Direct distillation of lm alignment. L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... arXiv preprint arXiv:2310.16944, 2023	831	2023
Trl: Transformer reinforcement learning. L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert, ...	594	2020
Open llm leaderboard. E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ...	421	2023
Numinamath: The largest public dataset in ai4maths with 860k pairs of competition math problems and solutions. J Li, E Beeching, L Tunstall, B Lipkin, R Soletskyi, S Huang, K Rasul, L Yu, ... Hugging Face repository 13 (9), 9, 2024	202	2024
Numinamath. LI Jia, E Beeching, L Tunstall, B Lipkin, R Soletskyi, SC Huang, K Rasul, ...	121	2024
Optimizing test-time compute via meta reinforcement fine-tuning. Y Qu, MYR Yang, A Setlur, L Tunstall, EE Beeching, R Salakhutdinov, ... arXiv preprint arXiv:2503.07572, 2025	84	2025
The alignment handbook. L Tunstall, E Beeching, N Lambert, N Rajani, S Huang, K Rasul, AM Rush, ... URL https://github.com/huggingface/alignment-handbook 6, 2023	71	2023
Learning to plan with uncertain topological maps. E Beeching, J Dibangoye, O Simonin, C Wolf. European Conference on Computer Vision, 473-490, 2020	62	2020
Numinamath. J Li, E Beeching, L Tunstall, B Lipkin, R Soletskyi, SC Huang, K Rasul, ... available at GitHub Repository Project Numina: https://github.com …, 2024	61	2024
No robots. N Rajani, L Tunstall, E Beeching, N Lambert, AM Rush, T Wolf. Hugging Face repository, 2023	50	2023
Zephyr: Direct distillation of lm alignment, 2023. L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... URL https://arxiv.org/abs/2310.16944 6, 2023	40	2023
Scaling test-time compute with open models. E Beeching, L Tunstall, S Rush. URL https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time …, 2024	33	2024
Deep reinforcement learning on a budget: 3d control and reasoning without a supercomputer. E Beeching, J Dibangoye, O Simonin, C Wolf. 2020 25th International Conference on Pattern Recognition (ICPR), 158-165, 2021	33	2021
Egomap: Projective mapping and structured egocentric memory for deep RL. E Beeching, J Dibangoye, O Simonin, C Wolf. Joint European conference on machine learning and knowledge discovery in …, 2020	31	2020
Creating a coding assistant with starcoder. Hugging Face Blog (2023). L Tunstall, N Lambert, N Rajani, E Beeching, T Le Scao, L von Werra, ...	27	2023
Creating a coding assistant with starcoder. L Tunstall, N Lambert, N Rajani, E Beeching, T Le Scao, L von Werra, ... Hugging Face Blog, 2023, 2023	25	2023
Godot reinforcement learning agents. E Beeching, J Dibangoye, O Simonin, C Wolf. arXiv preprint arXiv:2112.03636, 2021	25	2021
StackLLaMA: An RL Finetuned LLaMA Model for Stack Exchange Question and Answering. E Beeching, Y Belkada, K Rasul, L Tunstall, L von Werra, N Rajani, ... See https://huggingface.co/blog/stackllama (accessed 14 April 2023), 2023	22	2023
TRL: transformer reinforcement learning (2020). L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert, ... URL https://github.com/huggingface/trl, 0	20
Jack of all trades, master of some, a multi-purpose transformer agent. Q Gallouédec, E Beeching, C Romac, E Dellandréa. arXiv preprint arXiv:2402.09844, 2024	19	2024
