Yanqi Zhou

Cited by

	All	Since 2021
Citations	39917	38124
h-index	36	33
i10-index	47	43

Citations per year
Year	Citations
2018	194
2019	351
2020	1005
2021	2290
2022	4473
2023	8238
2024	10516
2025	12215
2026	360

Public access (based on funding mandates)

10 articles available

0 articles not available

Yanqi Zhou

Google DeepMind

Verified email at google.com

Scaling LLMs, Co-design, Synthetic Data


Title	Authors	Venue	Cited by	Year
Exploring the limits of transfer learning with a unified text-to-text transformer	C Raffel, N Shazeer, A Roberts, K Lee, S Narang, M Matena, Y Zhou, W Li, ...	Journal of machine learning research 21 (140), 1-67, 2020	29191	2020
Lamda: Language models for dialog applications	R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ...	arXiv preprint arXiv:2201.08239, 2022	2132	2022
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities	G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...	arXiv preprint arXiv:2507.06261, 2025	1337	2025
Deep learning scaling is predictable, empirically	J Hestness, S Narang, N Ardalani, G Diamos, H Jun, H Kianinejad, ...	arXiv preprint arXiv:1712.00409, 2017	1171	2017
Glam: Efficient scaling of language models with mixture-of-experts	N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ...	International conference on machine learning, 5547-5569, 2022	1075	2022
Deep Voice 2: Multi-Speaker Neural Text-to-Speech	YZ Sercan Arik, Gregory Diamos, Andrew Gibiansky, John Miller, Kainan Peng ...	Neural Information Processing Systems (NIPS), 2017	711*	2017
Mixture-of-experts with expert choice routing	Y Zhou, T Lei, H Liu, N Du, Y Huang, V Zhao, AM Dai, QV Le, J Laudon	Advances in Neural Information Processing Systems 35, 7103-7114, 2022	611	2022
Neural voice cloning with a few samples	S Arik, J Chen, K Peng, W Ping, Y Zhou	Advances in neural information processing systems 31, 2018	565	2018
OpenPiton: An open source manycore research framework	J Balkind, M McKeown, Y Fu, T Nguyen, Y Zhou, A Lavrov, M Shahrad, ...	ACM SIGPLAN Notices 51 (4), 217-232, 2016	342	2016
Exploring the limits of transfer learning with a unified text-to-text transformer. arXiv	C Raffel, N Shazeer, A Roberts, K Lee, S Narang, M Matena, Y Zhou, W Li, ...	Access mode: https://arxiv.org/abs, 1910	261	1910
Toju Duke, Lucas Dixon, Kun Zhang, Quoc Le, Yonghui Wu, Zhifeng Chen, and Claire Cui. GLaM: Efficient scaling of language models with mixture-of-experts	N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ...	Proceedings of the 39th International Conference on Machine Learning 162 …, 2022	251	2022
Lamda: Language models for dialog applications. arXiv 2022	R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ...	arXiv preprint arXiv:2201.08239, 2022	139	2022
Do transformer modifications transfer across implementations and applications?	S Narang, HW Chung, Y Tay, L Fedus, T Fevry, M Matena, K Malkan, ...	Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021	137	2021
Atomic In-place Updates for Non-volatile Main Memories with Kamino-Tx	A Memaripour, A Badam, A Phanishayee, Y Zhou, R Alagappan, ...	EuroSys '17 Proceedings of the Twelfth European Conference on Computer …, 2017	133	2017
Mixture-of-experts meets instruction tuning: A winning combination for large language models	S Shen, L Hou, Y Zhou, N Du, S Longpre, J Wei, HW Chung, B Zoph, ...	arXiv preprint arXiv:2305.14705, 2023	131	2023
A learned performance model for tensor processing units	S Kaufman, P Phothilimthana, Y Zhou, C Mendis, S Roy, A Sabne, ...	Proceedings of Machine Learning and Systems 3, 387-400, 2021	122	2021
Systems and methods for multi-speaker neural text-to-speech	G DIAMOS, A GIBIANSKY, J Miller, K PENG, W PING, J RAIMAN, Y ZHOU	US Patent 10,896,669, 2021	119	2021
Renelito Delos Santos	R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ...		117	2022
Conditional adapters: Parameter-efficient transfer learning with fast inference	T Lei, J Bai, S Brahma, J Ainslie, K Lee, Y Zhou, N Du, V Zhao, Y Wu, B Li, ...	Advances in Neural Information Processing Systems 36, 8152-8172, 2023	110	2023
Lifelong language pretraining with distribution-specialized experts	W Chen, Y Zhou, N Du, Y Huang, J Laudon, Z Chen, C Cui	International Conference on Machine Learning, 5383-5395, 2023	92	2023

Articles 1–20
