| A view of cloud computing M Armbrust, A Fox, R Griffith, AD Joseph, R Katz, A Konwinski, G Lee, ... Communications of the ACM 53 (4), 50-58, 2010 | 15509 | 2010 |
| Spark: Cluster computing with working sets M Zaharia, M Chowdhury, MJ Franklin, S Shenker, I Stoica 2nd USENIX workshop on hot topics in cloud computing (HotCloud 10), 2010 | 13207* | 2010 |
| Above the clouds: A Berkeley view of cloud computing M Armbrust, A Fox, R Griffith, AD Joseph, RH Katz, A Konwinski, G Lee, ... Technical Report UCB/EECS-2009-28, EECS Department, University of California …, 2009 | 8927 | 2009 |
| On the opportunities and risks of foundation models R Bommasani arXiv preprint arXiv:2108.07258, 2021 | 8178 | 2021 |
| Apache Spark: a unified engine for big data processing M Zaharia, RS Xin, P Wendell, T Das, M Armbrust, A Dave, X Meng, ... Communications of the ACM 59 (11), 56-65, 2016 | 3817 | 2016 |
| Mesos: A platform for fine-grained resource sharing in the data center B Hindman, A Konwinski, M Zaharia, A Ghodsi, AD Joseph, R Katz, ... 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11), 2011 | 2757 | 2011 |
| MLlib: Machine learning in Apache Spark X Meng, J Bradley, B Yavuz, E Sparks, S Venkataraman, D Liu, ... Journal of Machine Learning Research 17 (34), 1-7, 2016 | 2613 | 2016 |
| Improving MapReduce performance in heterogeneous environments M Zaharia, A Konwinski, AD Joseph, RH Katz, I Stoica OSDI 8 (4), 7, 2008 | 2469 | 2008 |
| ColBERT: Efficient and effective passage search via contextualized late interaction over BERT O Khattab, M Zaharia Proceedings of the 43rd International ACM SIGIR conference on research and …, 2020 | 2117 | 2020 |
| Spark SQL: Relational data processing in Spark M Armbrust, RS Xin, C Lian, Y Huai, D Liu, JK Bradley, X Meng, T Kaftan, ... Proceedings of the 2015 ACM SIGMOD international conference on management of …, 2015 | 2091 | 2015 |
| Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling M Zaharia, D Borthakur, J Sen Sarma, K Elmeleegy, S Shenker, I Stoica Proceedings of the 5th European conference on Computer systems, 265-278, 2010 | 2043 | 2010 |
| Dominant resource fairness: Fair allocation of multiple resource types A Ghodsi, M Zaharia, B Hindman, A Konwinski, S Shenker, I Stoica 8th USENIX symposium on networked systems design and implementation (NSDI 11), 2011 | 1818 | 2011 |
| Discretized streams: Fault-tolerant streaming computation at scale M Zaharia, T Das, H Li, T Hunter, S Shenker, I Stoica Proceedings of the twenty-fourth ACM symposium on operating systems …, 2013 | 1590 | 2013 |
| Efficient large-scale language model training on GPU clusters using Megatron-LM D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ... Proceedings of the international conference for high performance computing …, 2021 | 1293 | 2021 |
| PipeDream: Generalized pipeline parallelism for DNN training D Narayanan, A Harlap, A Phanishayee, V Seshadri, NR Devanur, ... Proceedings of the 27th ACM symposium on operating systems principles, 1-15, 2019 | 1278 | 2019 |
| Sparrow: distributed, low latency scheduling K Ousterhout, P Wendell, M Zaharia, I Stoica Proceedings of the twenty-fourth ACM symposium on operating systems …, 2013 | 863 | 2013 |
| Managing data transfers in computer clusters with Orchestra M Chowdhury, M Zaharia, J Ma, MI Jordan, I Stoica SIGCOMM 41 (4), 2011 | 844 | 2011 |
| Discretized streams: an efficient and fault-tolerant model for stream processing on large clusters M Zaharia, T Das, H Li, S Shenker, I Stoica Proceedings of the 4th USENIX conference on Hot Topics in Cloud Computing, 10-10, 2012 | 785 | 2012 |
| Learning Spark: lightning-fast big data analysis H Karau, A Konwinski, P Wendell, M Zaharia O'Reilly Media, Inc., 2015 | 777 | 2015 |
| How is ChatGPT’s behavior changing over time? L Chen, M Zaharia, J Zou Harvard Data Science Review 6 (2), 2024 | 761 | 2024 |