| Efficient GPU spatial-temporal multitasking Y Liang, HP Huynh, K Rupnow, RSM Goh, D Chen IEEE Transactions on Parallel and Distributed Systems 26 (3), 748-760, 2014 | 117 | 2014 |
| Improving main memory hash joins on Intel Xeon Phi processors S Jha, B He, M Lu, X Cheng, HP Huynh | 113 | 2015 |
| Improving GPGPU energy-efficiency through concurrent kernel execution and DVFS Q Jiao, M Lu, HP Huynh, T Mitra 2015 IEEE/ACM International Symposium on Code Generation and Optimization …, 2015 | 90 | 2015 |
| Optimizing the MapReduce framework on Intel Xeon Phi coprocessor M Lu, L Zhang, HP Huynh, Z Ong, Y Liang, B He, RSM Goh, R Huynh 2013 IEEE International Conference on Big Data, 125-130, 2013 | 81 | 2013 |
| Optimizing and auto-tuning scale-free sparse matrix-vector multiplication on Intel Xeon Phi WT Tang, R Zhao, M Lu, Y Liang, HP Huynh, X Li, RSM Goh Code Generation and Optimization (CGO), 2015 IEEE/ACM International …, 2015 | 78 | 2015 |
| Scalable framework for mapping streaming applications onto multi-GPU systems HP Huynh, A Hagiescu, WF Wong, RSM Goh Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of …, 2012 | 69 | 2012 |
| Hierarchical parallel algorithm for modularity-based community detection using GPUs CY Cheong, HP Huynh, D Lo, RSM Goh European conference on parallel processing, 775-787, 2013 | 54 | 2013 |
| An efficient framework for dynamic reconfiguration of instruction-set customization HP Huynh, JE Sim, T Mitra Proceedings of the 2007 international conference on Compilers, architecture …, 2007 | 43 | 2007 |
| Automated architecture-aware mapping of streaming applications onto GPUs A Hagiescu, HP Huynh, WF Wong, RSM Goh 2011 IEEE International Parallel & Distributed Processing Symposium, 467-478, 2011 | 41 | 2011 |
| MrPhi: An optimized MapReduce framework on Intel Xeon Phi coprocessors M Lu, Y Liang, HP Huynh, Z Ong, B He, RSM Goh IEEE Transactions on Parallel and Distributed Systems 26 (11), 3066-3078, 2014 | 39 | 2014 |
| Exploiting sparsity to accelerate fully connected layers of CNN-based applications on mobile SoCs X Xie, D Du, Q Li, Y Liang, WT Tang, ZL Ong, M Lu, HP Huynh, RSM Goh ACM Transactions on Embedded Computing Systems (TECS) 17 (2), 1-25, 2017 | 28 | 2017 |
| Runtime Adaptive Extensible Embedded Processors—A Survey HP Huynh, T Mitra International Workshop on Embedded Computer Systems, 215-225, 2009 | 27 | 2009 |
| Scale-free sparse matrix-vector multiplication on many-core architectures Y Liang, WT Tang, R Zhao, M Lu, HP Huynh, RSM Goh IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2017 | 21 | 2017 |
| Mapping streaming applications onto GPU systems HP Huynh, A Hagiescu, OZ Liang, WF Wong, RSM Goh IEEE Transactions on Parallel and Distributed Systems 25 (9), 2374-2385, 2013 | 18 | 2013 |
| Efficient custom instructions generation for system-level design HP Huynh, Y Liang, T Mitra 2010 International Conference on Field-Programmable Technology, 445-448, 2010 | 17 | 2010 |
| Evaluating design trade-offs in customizable processors UD Bordoloi, HP Huynh, S Chakraborty, T Mitra Proceedings of the 46th Annual Design Automation Conference, 244-249, 2009 | 17 | 2009 |
| Runtime reconfiguration of custom instructions for real-time embedded systems HP Huynh, T Mitra 2009 Design, Automation & Test in Europe Conference & Exhibition, 1536-1541, 2009 | 15 | 2009 |
| Instruction-set customization for real-time embedded systems HP Huynh, T Mitra 2007 Design, Automation & Test in Europe Conference & Exhibition, 1-6, 2007 | 13 | 2007 |
| Efficient query processing on many-core architectures: A case study with Intel Xeon Phi processor X Cheng, B He, M Lu, CT Lau, HP Huynh, RSM Goh Proceedings of the 2016 International Conference on Management of Data, 2081 …, 2016 | 12 | 2016 |
| Design space exploration of instruction set customizable MPSoCs for multimedia applications UD Bordoloi, HP Huynh, T Mitra, S Chakraborty 2010 International Conference on Embedded Computer Systems: Architectures …, 2010 | 9 | 2010 |