| Effective multi-GPU communication using multiple CUDA streams and threads M Sourouri, T Gillberg, SB Baden, X Cai 2014 20th IEEE International Conference on Parallel and Distributed Systems …, 2014 | 46 | 2014 |
| The READEX formalism for automatic tuning for energy efficiency J Schuchart, M Gerndt, PG Kjeldsberg, M Lysaght, D Horák, L Říha, ... Computing 99 (8), 727-745, 2017 | 44 | 2017 |
| Panda: A Compiler Framework for Concurrent CPU+GPU Execution of 3D Stencil Computations on GPU-accelerated Supercomputers M Sourouri, SB Baden, X Cai International Journal of Parallel Programming 45 (3), 711-729, 2017 | 32 | 2017 |
| Towards fine-grained dynamic tuning of HPC applications on modern multi-core architectures M Sourouri, EB Raknes, N Reissmann, J Langguth, D Hackenberg, ... Proceedings of the International Conference for High Performance Computing …, 2017 | 27 | 2017 |
| Scalable heterogeneous CPU-GPU computations for unstructured tetrahedral meshes J Langguth, M Sourouri, GT Lines, SB Baden, X Cai IEEE Micro 35 (4), 6-15, 2015 | 26 | 2015 |
| Memory bandwidth contention: Communication vs computation tradeoffs in supercomputers with multicore architectures J Langguth, X Cai, M Sourouri 2018 IEEE 24th International Conference on Parallel and Distributed Systems …, 2018 | 24 | 2018 |
| CPU+GPU programming of stencil computations for resource-efficient use of GPU clusters M Sourouri, J Langguth, F Spiga, SB Baden, X Cai 2015 IEEE 18th International Conference on Computational Science and …, 2015 | 24 | 2015 |
| A new parallel 3D front propagation algorithm for fast simulation of geological folds T Gillberg, M Sourouri, X Cai Procedia Computer Science 9, 947-955, 2012 | 18 | 2012 |
| Parallel solutions of static Hamilton-Jacobi equations for simulations of geological folds T Gillberg, AM Bruaset, Ø Hjelle, M Sourouri Journal of Mathematics in Industry 4 (1), 10, 2014 | 13 | 2014 |
| On the performance and energy efficiency of the pgas programming model on multicore architectures J Lagraviere, J Langguth, M Sourouri, PH Ha, X Cai 2016 International Conference on High Performance Computing & Simulation …, 2016 | 9 | 2016 |
| Multi-gpu implementations of parallel 3d sweeping algorithms with application to geological folding E Krishnasamy, M Sourouri, X Cai Procedia Computer Science 51, 1494-1503, 2015 | 8 | 2015 |
| Accelerating 3D Elastic Wave Equations on Knights Landing based Intel Xeon Phi processors M Sourouri, E Birger Raknes 19th EGU General Assembly Conference Abstracts 19, 2017 | 1 | 2017 |
| Scalable Heterogeneous Supercomputing: Programming Methodologies and Automated Code Generation M Sourouri University of Oslo, 2016 | 1 | 2016 |
| A parallel front propagation method: simulating geological folds on parallel architectures M Sourouri | 1 | 2012 |
| A Parallel Front Propagation Method M Sourouri | | 2012 |
| Key exercise 1: Using Finite Difference Method to solve the 2D Wave Equation K Støverud, M Sourouri, I Drøsdal | | 2011 |
| Document history Version Date Author/Editor Description A Gocht, USM TUD, M Lysaght, V Kannan, M Gerndt, A Chowdhury, ... | | |