Piyush Kumar Sao

Cited by

	All	Since 2021
Citations	1049	373
h-index	12	10
i10-index	12	10

120

199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024202520265 12 9 12 12 16 19 19 25 17 18 11 11 13 16 28 42 70 58 57 56 63 66 65 52 104 56 93 3

Public access

View all

19 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Richard VuducGeorgia Institute of TechnologyVerified email at cc.gatech.edu
Xiaoye Sherry LiLawrence Berkeley National LaboratoryVerified email at lbl.gov
Ramakrishnan KannanOak Ridge National LaboratoryVerified email at ornl.gov
Thomas E. PotokComputational Data Analytics Group Leader, Oak Ridge National LaboratoryVerified email at ornl.gov
Hao LuOak Ridge National labVerified email at ornl.gov
Prasun GeraNvidiaVerified email at nvidia.com
David A. BaderDistinguished Professor, New Jersey Institute of TechnologyVerified email at njit.edu
Andrey ProkopenkoOak Ridge National LaboratoryVerified email at ornl.gov
Xing LiuResearch Scientist, Meta Platforms, Inc.Verified email at meta.com
Christian EngelmannDistinguished Scientist and Research Group Leader, Oak Ridge National LaboratoryVerified email at ornl.gov

Piyush Kumar Sao

Oak Ridge National Laboratory

Verified email at ornl.gov - Homepage

High Performance Computing Numerical Analysis Graph Algorithms Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
SuperLU users’ guide XS Li, JW Demmel, JR Gilbert, L Grigori, P Sao, M Shao, I Yamazaki Lawrence Berkeley National Laboratory, 1999	510*	1999
Self-stabilizing iterative solvers P Sao, R Vuduc Proceedings of the Workshop on Latest Advances in Scalable Algorithms for …, 2013	125	2013
A distributed CPU-GPU sparse direct solver P Sao, R Vuduc, XS Li European Conference on Parallel Processing, 487-498, 2014	74	2014
Traversing large graphs on GPUs with unified memory P Gera, H Kim, P Sao, H Kim, DA Bader Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States) 13 (7), 2020	71	2020
A distributed kernel summation framework for general-dimension machine learning D Lee, P Sao, R Vuduc, AG Gray Statistical Analysis and Data Mining: The ASA Data Science Journal 7 (1), 1--13, 2014	37	2014
A supernodal all-pairs shortest path algorithm P Sao, R Kannan, P Gera, R Vuduc Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of …, 2020	31	2020
A communication-avoiding 3D LU factorization algorithm for sparse matrices P Sao, XS Li, R Vuduc 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018	30	2018
Newly released capabilities in the distributed-memory SuperLU sparse direct solver XS Li, P Lin, Y Liu, P Sao ACM Transactions on Mathematical Software 49 (1), 1-20, 2023	28	2023
A sparse direct solver for distributed memory Xeon Phi-accelerated systems P Sao, X Liu, R Vuduc, X Li 2015 IEEE International Parallel and Distributed Processing Symposium, 71-81, 2015	27	2015
A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems P Sao, XS Li, R Vuduc Journal of Parallel and Distributed Computing 131, 218-234, 2019	22	2019
A communication-avoiding 3D sparse triangular solver P Sao, R Kannan, XS Li, R Vuduc Proceedings of the ACM International Conference on Supercomputing, 127-137, 2019	14	2019
Scalable knowledge graph analytics at 136 petaflop/s R Kannan, P Sao, H Lu, D Herrmannova, V Thakkar, R Patton, R Vuduc, ... SC20: International Conference for High Performance Computing, Networking …, 2020	13	2020
A single-tree algorithm to compute the Euclidean minimum spanning tree on GPUs A Prokopenko, P Sao, D Lebrun-Grandie Proceedings of the 51st International Conference on Parallel Processing, 1-10, 2022	9	2022
Sparse binary matrix-vector multiplication on neuromorphic computers CD Schuman, B Kay, P Date, R Kannan, P Sao, TE Potok 2021 IEEE International Parallel and Distributed Processing Symposium …, 2021	9	2021
A self-correcting connected components algorithm P Sao, O Green, C Jain, R Vuduc Proceedings of the ACM Workshop on Fault-Tolerance for HPC at Extreme Scale …, 2016	9	2016
Unified communication optimization strategies for sparse triangular solver on cpu and gpu clusters Y Liu, N Ding, P Sao, S Williams, XS Li Proceedings of the International Conference for High Performance Computing …, 2023	7	2023
Exaflops biomedical knowledge graph analytics R Kannan, P Sao, H Lu, J Kurzak, G Schenk, Y Shi, SH Lim, S Israni, ... SC22: International Conference for High Performance Computing, Networking …, 2022	6	2022
Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters P Sao, H Lu, R Kannan, V Thakkar, R Vuduc, T Potok Proceedings of the 30th International Symposium on High-Performance Parallel …, 2021	6	2021
Interface for sparse linear algebra operations A Abdelfattah, W Ahrens, H Anzt, C Armstrong, B Brock, A Buluc, F Busato, ... arXiv preprint arXiv:2411.13259, 2024	5	2024
Pandora: A parallel dendrogram construction algorithm for single linkage clustering on gpu P Sao, A Prokopenko, D Lebrun-Grandié Proceedings of the 53rd International Conference on Parallel Processing, 908-918, 2024	5	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors