| SuperLU users’ guide XS Li, JW Demmel, JR Gilbert, L Grigori, P Sao, M Shao, I Yamazaki Lawrence Berkeley National Laboratory, 1999 | 510* | 1999 |
| Self-stabilizing iterative solvers P Sao, R Vuduc Proceedings of the Workshop on Latest Advances in Scalable Algorithms for …, 2013 | 125 | 2013 |
| A distributed CPU-GPU sparse direct solver P Sao, R Vuduc, XS Li European Conference on Parallel Processing, 487-498, 2014 | 74 | 2014 |
| Traversing large graphs on GPUs with unified memory P Gera, H Kim, P Sao, H Kim, DA Bader Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States) 13 (7), 2020 | 71 | 2020 |
| A distributed kernel summation framework for general-dimension machine learning D Lee, P Sao, R Vuduc, AG Gray Statistical Analysis and Data Mining: The ASA Data Science Journal 7 (1), 1--13, 2014 | 37 | 2014 |
| A supernodal all-pairs shortest path algorithm P Sao, R Kannan, P Gera, R Vuduc Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of …, 2020 | 31 | 2020 |
| A communication-avoiding 3D LU factorization algorithm for sparse matrices P Sao, XS Li, R Vuduc 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018 | 30 | 2018 |
| Newly released capabilities in the distributed-memory SuperLU sparse direct solver XS Li, P Lin, Y Liu, P Sao ACM Transactions on Mathematical Software 49 (1), 1-20, 2023 | 28 | 2023 |
| A sparse direct solver for distributed memory Xeon Phi-accelerated systems P Sao, X Liu, R Vuduc, X Li 2015 IEEE International Parallel and Distributed Processing Symposium, 71-81, 2015 | 27 | 2015 |
| A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems P Sao, XS Li, R Vuduc Journal of Parallel and Distributed Computing 131, 218-234, 2019 | 22 | 2019 |
| A communication-avoiding 3D sparse triangular solver P Sao, R Kannan, XS Li, R Vuduc Proceedings of the ACM International Conference on Supercomputing, 127-137, 2019 | 14 | 2019 |
| Scalable knowledge graph analytics at 136 petaflop/s R Kannan, P Sao, H Lu, D Herrmannova, V Thakkar, R Patton, R Vuduc, ... SC20: International Conference for High Performance Computing, Networking …, 2020 | 13 | 2020 |
| A single-tree algorithm to compute the Euclidean minimum spanning tree on GPUs A Prokopenko, P Sao, D Lebrun-Grandie Proceedings of the 51st International Conference on Parallel Processing, 1-10, 2022 | 9 | 2022 |
| Sparse binary matrix-vector multiplication on neuromorphic computers CD Schuman, B Kay, P Date, R Kannan, P Sao, TE Potok 2021 IEEE International Parallel and Distributed Processing Symposium …, 2021 | 9 | 2021 |
| A self-correcting connected components algorithm P Sao, O Green, C Jain, R Vuduc Proceedings of the ACM Workshop on Fault-Tolerance for HPC at Extreme Scale …, 2016 | 9 | 2016 |
| Unified communication optimization strategies for sparse triangular solver on cpu and gpu clusters Y Liu, N Ding, P Sao, S Williams, XS Li Proceedings of the International Conference for High Performance Computing …, 2023 | 7 | 2023 |
| Exaflops biomedical knowledge graph analytics R Kannan, P Sao, H Lu, J Kurzak, G Schenk, Y Shi, SH Lim, S Israni, ... SC22: International Conference for High Performance Computing, Networking …, 2022 | 6 | 2022 |
| Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters P Sao, H Lu, R Kannan, V Thakkar, R Vuduc, T Potok Proceedings of the 30th International Symposium on High-Performance Parallel …, 2021 | 6 | 2021 |
| Interface for sparse linear algebra operations A Abdelfattah, W Ahrens, H Anzt, C Armstrong, B Brock, A Buluc, F Busato, ... arXiv preprint arXiv:2411.13259, 2024 | 5 | 2024 |
| Pandora: A parallel dendrogram construction algorithm for single linkage clustering on gpu P Sao, A Prokopenko, D Lebrun-Grandié Proceedings of the 53rd International Conference on Parallel Processing, 908-918, 2024 | 5 | 2024 |