| A practical automatic polyhedral parallelizer and locality optimizer U Bondhugula, A Hartono, J Ramanujam, P Sadayappan Proceedings of the 29th ACM SIGPLAN Conference on Programming Language …, 2008 | 1282 | 2008 |
| Pluto: A practical and fully automatic polyhedral program optimization system U Bondhugula, A Hartono, J Ramanujam, P Sadayappan Proceedings of the ACM SIGPLAN 2008 Conference on Programming Language …, 2008 | 427 | 2008 |
| Annotation-based empirical performance tuning using Orio A Hartono, B Norris, P Sadayappan 2009 ieee international symposium on parallel & distributed processing, 1-11, 2009 | 214 | 2009 |
| Designing high performance and scalable MPI intra-node communication support for clusters L Chai, A Hartono, DK Panda 2006 IEEE International Conference on Cluster Computing, 1-10, 2006 | 119 | 2006 |
| Parametric multi-level tiling of imperfectly nested loops A Hartono, MM Baskaran, C Bastoul, A Cohen, S Krishnamoorthy, ... Proceedings of the 23rd international conference on Supercomputing, 147-157, 2009 | 115 | 2009 |
| Parameterized tiling revisited MM Baskaran, A Hartono, S Tavarageri, T Henretty, J Ramanujam, ... Proceedings of the 8th annual IEEE/ACM international symposium on Code …, 2010 | 101 | 2010 |
| Dyntile: Parametric tiled loop generation for parallel execution on multicore processors A Hartono, MM Baskaran, J Ramanujam, P Sadayappan 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 76 | 2010 |
| Automated operation minimization of tensor contraction expressions in electronic structure calculations A Hartono, A Sibiryakov, M Nooijen, G Baumgartner, DE Bernholdt, ... International Conference on Computational Science, 155-164, 2005 | 48 | 2005 |
| Performance optimization of tensor contraction expressions for many-body methods in quantum chemistry A Hartono, Q Lu, T Henretty, S Krishnamoorthy, H Zhang, G Baumgartner, ... The Journal of Physical Chemistry A 113 (45), 12715-12723, 2009 | 43 | 2009 |
| Towards effective automatic parallelization for multicore systems U Bondhugula, M Baskaran, A Hartono, S Krishnamoorthy, J Ramanujam, ... 2008 IEEE International Symposium on Parallel and Distributed Processing, 1-5, 2008 | 43 | 2008 |
| Annotations for productivity and performance portability A Hartono, W Gropp, B Norris Chapman & Hall/CRC Computational Science, 443-462, 2007 | 41 | 2007 |
| Parametric tiling of affine loop nests S Tavarageri, A Hartono, M Baskaran, LN Pouchet, J Ramanujam, ... Proc. 15th Workshop on Compilers for Parallel Computers. Vienna, Austria, 2010 | 35 | 2010 |
| Lightweight restricted transactional memory for speculative compiler optimization C Wang, Y Wu, SS Baghsorkhi, A Hartono, R Valentine US Patent 10,324,768, 2019 | 27 | 2019 |
| Methods and systems to vectorize scalar computer program loops having loop-carried dependences J Bharadwaj, N Vasudevan, A Hartono, SS Baghsorkhi US Patent 9,268,541, 2016 | 26 | 2016 |
| Identifying cost-effective common subexpressions to reduce operation count in tensor contraction evaluations A Hartono, Q Lu, X Gao, S Krishnamoorthy, M Nooijen, G Baumgartner, ... International Conference on Computational Science, 267-275, 2006 | 25 | 2006 |
| Loop vectorization methods and apparatus N Vasudevan, J Bharadwaj, CJ Hughes, MB Girkar, MJ Charney, ... US Patent 9,244,677, 2016 | 23 | 2016 |
| Instruction to reduce elements in a vector register with strided access pattern A Hartono, J Bharadwaj, N Vasudevan, SS Baghsorkhi, VW Lee, D Kim US Patent 9,921,832, 2018 | 19 | 2018 |
| Primetile: A parametric multi-level tiler for imperfect loop nests A Hartono, MM Baskaran, C Bastoul, A Cohen, S Krishnamoorthy, ... ACM International Conference on Supercomputing (ICS). New York, 2009 | 18 | 2009 |
| Apparatus and method for selecting elements of a vector computation VW Lee, J Bharadwaj, D Kim, N Vasudevan, TF Ngai, A Hartono, ... US Patent App. 13/992,530, 2013 | 17 | 2013 |
| Method and apparatus for speculative vectorization N Vasudevan, C Wang, Y Wu, A Hartono, SS Baghsorkhi US Patent 9,710,279, 2017 | 16 | 2017 |