Joel Hestness

Cited by

	All	Since 2021
Citations	11407	5306
h-index	21	20
i10-index	29	27

1400

700

350

1050

2010201120122013201420152016201720182019202020212022202320242025202636 62 149 329 503 621 714 808 848 927 893 923 880 957 1184 1324 35

Co-authors

David A. WoodUniversity of Wisconsin, MadisonVerified email at cs.wisc.edu
Newsha ArdalaniResearch Scientist, Meta AI Research (FAIR)Verified email at cs.wisc.edu
Steve KecklerVice President of Architecture Research, NVIDIAVerified email at cs.utexas.edu

Joel Hestness

Distinguished Research Scientist, Cerebras Systems

Verified email at cerebras.net

Deep Learning Language Understanding Heterogeneous Systems High-performance Computing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
The gem5 simulator N Binkert, B Beckmann, G Black, SK Reinhardt, A Saidi, A Basu, ... ACM SIGARCH computer architecture news 39 (2), 1-7, 2011	6693	2011
Deep learning scaling is predictable, empirically J Hestness, S Narang, N Ardalani, G Diamos, H Jun, H Kianinejad, ... arXiv preprint arXiv:1712.00409, 2017	1171	2017
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting SO Arik, M Kliegl, R Child, J Hestness, A Gibiansky, C Fougner, ... Interspeech 2017, 2017	903	2017
gem5-gpu: A heterogeneous cpu-gpu simulator J Power, J Hestness, MS Orr, MD Hill, DA Wood IEEE Computer Architecture Letters 14 (1), 34-36, 2014	363	2014
Kilo-NOC: a heterogeneous network-on-chip architecture for scalability and service guarantees B Grot, J Hestness, SW Keckler, O Mutlu ACM SIGARCH computer architecture news 39 (3), 401-412, 2011	293	2011
Express cube topologies for on-chip interconnects B Grot, J Hestness, SW Keckler, O Mutlu 2009 IEEE 15th International Symposium on High Performance Computer …, 2009	277	2009
SlimPajama: A 627B token cleaned and deduplicated version of RedPajama D Soboleva, F Al-Khateeb, R Myers, JR Steeves, J Hestness, N Dey Blog post, 2023	247	2023
Netrace: dependency-driven trace-based network-on-chip simulation J Hestness, B Grot, SW Keckler Proceedings of the Third International Workshop on Network on Chip …, 2010	191	2010
Jais and jais-chat: Arabic-centric foundation and instruction-tuned open generative large language models N Sengupta, SK Sahu, B Jia, S Katipomu, H Li, F Koto, W Marshall, ... arXiv preprint arXiv:2308.16149, 2023	146	2023
Cerebras-gpt: Open compute-optimal language models trained on the cerebras wafer-scale cluster N Dey, G Gosal, H Khachane, W Marshall, R Pathria, M Tom, J Hestness arXiv preprint arXiv:2304.03208, 2023	126	2023
Compositional generalization for primitive substitutions Y Li, L Zhao, J Wang, J Hestness arXiv preprint arXiv:1910.02612, 2019	117	2019
Beyond human-level accuracy: Computational challenges in deep learning J Hestness, N Ardalani, G Diamos Proceedings of the 24th symposium on principles and practice of parallel …, 2019	116	2019
Running PARSEC 2.1 on M5 M Gebhart, J Hestness, E Fatehi, P Gratz, SW Keckler The University of Texas at Austin, Department of Computer Science, Tech. Rep, 2009	96	2009
The Gem5 Simulator. SIGARCH Comput. Archit. News 39, 2 (Aug. 2011), 1–7 N Binkert, B Beckmann, G Black, SK Reinhardt, A Saidi, A Basu, ...	89	2011
A comparative analysis of microarchitecture effects on CPU and GPU memory system behavior J Hestness, SW Keckler, DA Wood 2014 IEEE International Symposium on Workload Characterization (IISWC), 150-160, 2014	82	2014
Slimpajama-dc: Understanding data combinations for llm training Z Shen, T Tao, L Ma, W Neiswanger, Z Liu, H Wang, B Tan, J Hestness, ... arXiv preprint arXiv:2309.10818, 2023	78	2023
GPU computing pipeline inefficiencies and optimization opportunities in heterogeneous CPU-GPU processors J Hestness, SW Keckler, DA Wood 2015 IEEE International Symposium on Workload Characterization, 87-97, 2015	73	2015
Netrace: Dependency-tracking traces for efficient network-on-chip experimentation J Hestness, SW Keckler The University of Texas at Austin, Dept. of Computer Science, Tech. Rep, 2011	60	2011
Pipelined backpropagation at scale: training large models without batches A Kosson, V Chiley, A Venigalla, J Hestness, U Koster Proceedings of Machine Learning and Systems 3, 479-501, 2021	40	2021
Time and the value of data E Valavi, J Hestness, N Ardalani, M Iansiti arXiv preprint arXiv:2203.09118, 2022	31	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors