[go: up one dir, main page]

Follow
Joel Hestness
Joel Hestness
Distinguished Research Scientist, Cerebras Systems
Verified email at cerebras.net
Title
Cited by
Cited by
Year
The gem5 simulator
N Binkert, B Beckmann, G Black, SK Reinhardt, A Saidi, A Basu, ...
ACM SIGARCH computer architecture news 39 (2), 1-7, 2011
66932011
Deep learning scaling is predictable, empirically
J Hestness, S Narang, N Ardalani, G Diamos, H Jun, H Kianinejad, ...
arXiv preprint arXiv:1712.00409, 2017
11712017
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting
SO Arik, M Kliegl, R Child, J Hestness, A Gibiansky, C Fougner, ...
Interspeech 2017, 2017
9032017
gem5-gpu: A heterogeneous cpu-gpu simulator
J Power, J Hestness, MS Orr, MD Hill, DA Wood
IEEE Computer Architecture Letters 14 (1), 34-36, 2014
3632014
Kilo-NOC: a heterogeneous network-on-chip architecture for scalability and service guarantees
B Grot, J Hestness, SW Keckler, O Mutlu
ACM SIGARCH computer architecture news 39 (3), 401-412, 2011
2932011
Express cube topologies for on-chip interconnects
B Grot, J Hestness, SW Keckler, O Mutlu
2009 IEEE 15th International Symposium on High Performance Computer …, 2009
2772009
SlimPajama: A 627B token cleaned and deduplicated version of RedPajama
D Soboleva, F Al-Khateeb, R Myers, JR Steeves, J Hestness, N Dey
Blog post, 2023
2472023
Netrace: dependency-driven trace-based network-on-chip simulation
J Hestness, B Grot, SW Keckler
Proceedings of the Third International Workshop on Network on Chip …, 2010
1912010
Jais and jais-chat: Arabic-centric foundation and instruction-tuned open generative large language models
N Sengupta, SK Sahu, B Jia, S Katipomu, H Li, F Koto, W Marshall, ...
arXiv preprint arXiv:2308.16149, 2023
1462023
Cerebras-gpt: Open compute-optimal language models trained on the cerebras wafer-scale cluster
N Dey, G Gosal, H Khachane, W Marshall, R Pathria, M Tom, J Hestness
arXiv preprint arXiv:2304.03208, 2023
1262023
Compositional generalization for primitive substitutions
Y Li, L Zhao, J Wang, J Hestness
arXiv preprint arXiv:1910.02612, 2019
1172019
Beyond human-level accuracy: Computational challenges in deep learning
J Hestness, N Ardalani, G Diamos
Proceedings of the 24th symposium on principles and practice of parallel …, 2019
1162019
Running PARSEC 2.1 on M5
M Gebhart, J Hestness, E Fatehi, P Gratz, SW Keckler
The University of Texas at Austin, Department of Computer Science, Tech. Rep, 2009
962009
The Gem5 Simulator. SIGARCH Comput. Archit. News 39, 2 (Aug. 2011), 1–7
N Binkert, B Beckmann, G Black, SK Reinhardt, A Saidi, A Basu, ...
892011
A comparative analysis of microarchitecture effects on CPU and GPU memory system behavior
J Hestness, SW Keckler, DA Wood
2014 IEEE International Symposium on Workload Characterization (IISWC), 150-160, 2014
822014
Slimpajama-dc: Understanding data combinations for llm training
Z Shen, T Tao, L Ma, W Neiswanger, Z Liu, H Wang, B Tan, J Hestness, ...
arXiv preprint arXiv:2309.10818, 2023
782023
GPU computing pipeline inefficiencies and optimization opportunities in heterogeneous CPU-GPU processors
J Hestness, SW Keckler, DA Wood
2015 IEEE International Symposium on Workload Characterization, 87-97, 2015
732015
Netrace: Dependency-tracking traces for efficient network-on-chip experimentation
J Hestness, SW Keckler
The University of Texas at Austin, Dept. of Computer Science, Tech. Rep, 2011
602011
Pipelined backpropagation at scale: training large models without batches
A Kosson, V Chiley, A Venigalla, J Hestness, U Koster
Proceedings of Machine Learning and Systems 3, 479-501, 2021
402021
Time and the value of data
E Valavi, J Hestness, N Ardalani, M Iansiti
arXiv preprint arXiv:2203.09118, 2022
312022
The system can't perform the operation now. Try again later.
Articles 1–20