| Efficient large-scale language model training on gpu clusters using megatron-lm D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ... Proceedings of the international conference for high performance computing …, 2021 | 1298 | 2021 |
| PipeDream: generalized pipeline parallelism for DNN training D Narayanan, A Harlap, A Phanishayee, V Seshadri, NR Devanur, ... SOSP 2019: Proceedings of the 27th ACM Symposium on Operating Systems …, 2019 | 1280 | 2019 |
| FAWN: A Fast Array of Wimpy Nodes DG Andersen, J Franklin, M Kaminsky, A Phanishayee, L Tan, ... SOSP 2009: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems …, 2009 | 818 | 2009 |
| The non-IID data quagmire of decentralized machine learning K Hsieh, A Phanishayee, O Mutlu, PB Gibbons ICML 2020: International Conference on Machine Learning (arXiv preprint …, 2019 | 814 | 2019 |
| Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang USENIX ATC 2019 (arXiv preprint arXiv:1901.05758), 2019 | 606* | 2019 |
| Safe and effective fine-grained TCP retransmissions for datacenter communication V Vasudevan, A Phanishayee, H Shah, E Krevat, DG Andersen, ... SIGCOMM 2009: 39 (4), 303-314, 2009 | 604 | 2009 |
| ProjecToR: Agile Reconfigurable Data Center Interconnect M Ghobadi, R Mahajan, A Phanishayee, N Devanur, J Kulkarni, ... SIGCOMM 2016, 216-229, 2016 | 422 | 2016 |
| Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems. A Phanishayee, E Krevat, V Vasudevan, DG Andersen, GR Ganger, ... FAST 2008: 6th USENIX Conference on File and Storage Technologies 8, 1-14, 2008 | 375 | 2008 |
| PipeDream: Fast and efficient pipeline parallel DNN training A Harlap, D Narayanan, A Phanishayee, V Seshadri, N Devanur, ... arXiv preprint arXiv:1806.03377, 2018 | 360 | 2018 |
| Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads D Narayanan, K Santhanam, F Kazhamiaka, A Phanishayee, M Zaharia OSDI 2020: 14th USENIX Symposium on Operating Systems Design and …, 2020 | 351 | 2020 |
| Memory-efficient pipeline-parallel dnn training D Narayanan, A Phanishayee, K Shi, X Chen, M Zaharia International Conference on Machine Learning, 7937-7947, 2021 | 342 | 2021 |
| Themis: Fair and Efficient GPU Cluster Scheduling K Mahajan, A Balasubramanian, A Singhvi, S Venkataraman, A Akella, ... NSDI 2020: 17th USENIX Symposium on Networked Systems Design and …, 2020 | 319 | 2020 |
| TBD: Benchmarking and Analyzing Deep Neural Network Training H Zhu, M Akrout, B Zheng, A Pelegris, A Phanishayee, B Schroeder, ... IISWC 2018 - International Symposium on Workload Characterization - arXiv …, 2018 | 267 | 2018 |
| Gist: Efficient Data Encoding for Deep Neural Network Training A Jain, A Phanishayee, J Mars, L Tang, G Pekhimenko ISCA 2018: Proceedings of The 45th International Symposium on Computer …, 2018 | 193 | 2018 |
| Blink: Fast and generic collectives for distributed ML G Wang, S Venkataraman, A Phanishayee, J Thelin, N Devanur, I Stoica MLSys 2020: Third Conference on Machine Learning and Systems (arXiv preprint …, 2019 | 189 | 2019 |
| Parameter hub: a rack-scale parameter server for distributed deep neural network training L Luo, J Nelson, L Ceze, A Phanishayee, A Krishnamurthy SOCC 2018: Proceedings of the ACM Symposium on Cloud Computing, 41-54, 2018 | 169 | 2018 |
| Analyzing and mitigating data stalls in DNN training J Mohan, A Phanishayee, A Raniwala, V Chidambaram VLDB 2021 (arXiv:2007.06775), 2020 | 166 | 2020 |
| CheckFreq: Frequent, Fine-Grained DNN Checkpointing J Mohan, A Phanishayee, V Chidambaram 19th USENIX Conference on File and Storage Technologies (FAST 21), 203-216, 2021 | 163 | 2021 |
| Atomic In-place Updates for Non-volatile Main Memories with Kamino-Tx A Memaripour, A Badam, A Phanishayee, Y Zhou, R Alagappan, ... EuroSys 2017: Twelfth European Conference on Computer Systems, 499-512, 2017 | 133 | 2017 |
| Data center topology having multiple classes of reliability M Ghobadi, R Mahajan, A Phanishayee, D Zhuo, XK Zou US Patent 10,187,292, 2019 | 115 | 2019 |