| Characterization and analysis of dynamic parallelism in unstructured GPU applications J Wang, S Yalamanchili 2014 IEEE International Symposium on Workload Characterization (IISWC), 51-60, 2014 | 122 | 2014 |
| Optimizing data warehousing applications for GPUs using kernel fusion/fission H Wu, G Diamos, J Wang, S Cadambi, S Yalamanchili, S Chakradhar Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW …, 2012 | 94 | 2012 |
| Dynamic thread block launch: A lightweight execution mechanism to support irregular applications on gpus J Wang, N Rubin, A Sidelnik, S Yalamanchili ACM SIGARCH Computer Architecture News 43 (3S), 528-540, 2015 | 91 | 2015 |
| Laperm: Locality aware scheduler for dynamic parallelism on gpus J Wang, N Rubin, A Sidelnik, S Yalamanchili ACM SIGARCH Computer Architecture News 44 (3), 583-595, 2016 | 76 | 2016 |
| Efficient relational algebra algorithms and data structures for GPU G Diamos, H Wu, A Lele, J Wang, S Yalamanchili CERCS, Georgia Institute of Technology, Tech. Rep. GIT-CERCS-12-01, 2012 | 44 | 2012 |
| Relational algorithms for multi-bulk-synchronous processors G Diamos, H Wu, J Wang, A Lele, S Yalamanchili ACM SIGPLAN Notices 48 (8), 301-302, 2013 | 30 | 2013 |
| Accelerating simulation of agent-based models on heterogeneous architectures J Wang, N Rubin, H Wu, S Yalamanchili Proceedings of the 6th Workshop on General Purpose Processor Using Graphics …, 2013 | 19 | 2013 |
| Characterization and transformation of unstructured control flow in bulk synchronous GPU applications H Wu, G Diamos, J Wang, S Li, S Yalamanchili The International Journal of High Performance Computing Applications 26 (2 …, 2012 | 18 | 2012 |
| Paralleljs: An execution framework for javascript on heterogeneous systems J Wang, N Rubin, S Yalamanchili Proceedings of Workshop on General Purpose Processing Using GPUs, 72-80, 2014 | 17 | 2014 |
| General-purpose join algorithms for large graph triangle listing on heterogeneous systems D Zinn, H Wu, J Wang, M Aref, S Yalamanchili Proceedings of the 9th Annual Workshop on General Purpose Processing Using …, 2016 | 10 | 2016 |
| Next-generation consumer audio application specific embedded processor J Kong, P Liu, X Chen, J Wang, X Pan, J Wang, H Xiao, Z Wei, R Ying 2010 IEEE 8th Symposium on Application Specific Processors (SASP), 1-7, 2010 | 10 | 2010 |
| Acceleration and optimization of dynamic parallelism for irregular applications on GPUs J Wang Georgia Institute of Technology, 2016 | 4 | 2016 |
| Split table extension: A low complexity LVQ extension scheme in low bitrate audio coding J Wang, P Liu, J Kong, R Ying IEEE Signal Processing Letters 17 (1), 59-62, 2009 | 3 | 2009 |
| Exploring dynamic parallelism for irregular applications on gpus J Wang, N Rubin, A Sidelnik, S Yalamanchili Vertex 1, 3, 0 | 1 | |