A taxonomy of task-based parallel programming technologies for high-performance computing P Thoman, K Dichev, T Heller, R Iakymchuk, X Aguilar, K Hasanov, ... The Journal of Supercomputing 74 (4), 1422-1434, 2018 | 195 | 2018 |
A multi-objective auto-tuning framework for parallel codes H Jordan, P Thoman, JJ Durillo, S Pellegrini, P Gschwandtner, ... SC'12: Proceedings of the International Conference on High Performance …, 2012 | 128 | 2012 |
INSPIRE: The Insieme parallel intermediate representation H Jordan, S Pellegrini, P Thoman, K Kofler, T Fahringer Proceedings of the 22nd international conference on Parallel architectures …, 2013 | 66 | 2013 |
Automatic OpenCL device characterization: Guiding optimized kernel design P Thoman, K Kofler, H Studt, J Thomson, T Fahringer Euro-Par 2011 Parallel Processing: 17th International Conference, Euro-Par …, 2011 | 51 | 2011 |
Application-level energy awareness for openmp F Alessi, P Thoman, G Georgakoudis, T Fahringer, DS Nikolopoulos OpenMP: Heterogenous Execution and Data Movements: 11th International …, 2015 | 45 | 2015 |
Automatic OpenMP loop scheduling: a combined compiler and runtime approach P Thoman, H Jordan, S Pellegrini, T Fahringer International Workshop on OpenMP, 88-101, 2012 | 45 | 2012 |
Celerity: High-level c++ for accelerator clusters P Thoman, P Salzmann, B Cosenza, T Fahringer Euro-Par 2019: Parallel Processing: 25th International Conference on …, 2019 | 43 | 2019 |
GPU-based multigrid: Real-time performance in high resolution nonlinear image processing H Grossauer, P Thoman Computer Vision Systems: 6th International Conference, ICVS 2008 Santorini …, 2008 | 35 | 2008 |
Adaptive granularity control in task parallel programs using multiversioning P Thoman, H Jordan, T Fahringer Euro-Par 2013 Parallel Processing: 19th International Conference, Aachen …, 2013 | 31 | 2013 |
SYCL-bench: a versatile cross-platform benchmark suite for heterogeneous computing S Lal, A Alpay, P Salzmann, B Cosenza, A Hirsch, N Stawinoga, ... Euro-Par 2020: Parallel Processing: 26th International Conference on …, 2020 | 26 | 2020 |
ndzip-gpu: efficient lossless compression of scientific floating-point data on GPUs F Knorr, P Thoman, T Fahringer Proceedings of the International Conference for High Performance Computing …, 2021 | 19 | 2021 |
ndzip: A high-throughput parallel lossless compressor for scientific data F Knorr, P Thoman, T Fahringer 2021 Data Compression Conference (DCC), 103-112, 2021 | 19 | 2021 |
Scalo: Scalability-aware parallelism orchestration for multi-threaded workloads G Georgakoudis, H Vandierendonck, P Thoman, BRD Supinski, ... ACM Transactions on Architecture and Code Optimization (TACO) 14 (4), 1-25, 2017 | 19 | 2017 |
On the quality of implementation of the c++ 11 thread support library P Thoman, P Gschwandtner, T Fahringer 2015 23rd euromicro international conference on parallel, distributed, and …, 2015 | 16 | 2015 |
Compiler multiversioning for automatic task granularity control P Thoman, H Jordan, T Fahringer Concurrency and Computation: Practice and Experience 26 (14), 2367-2385, 2014 | 16 | 2014 |
A context-aware primitive for nested recursive parallelism H Jordan, P Thoman, P Zangerl, T Heller, T Fahringer Euro-Par 2016: Parallel Processing Workshops: Euro-Par 2016 International …, 2017 | 14 | 2017 |
Insieme-rs: A compiler-supported parallel runtime system P Thoman na, 2013 | 13 | 2013 |
An asynchronous dataflow-driven execution model for distributed accelerator computing P Salzmann, F Knorr, P Thoman, P Gschwandtner, B Cosenza, ... 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet …, 2023 | 12 | 2023 |
Sycl-bench: A versatile single-source benchmark suite for heterogeneous computing S Lal, A Alpay, P Salzmann, B Cosenza, N Stawinoga, P Thoman, ... Proceedings of the International Workshop on OpenCL, 1-1, 2020 | 11 | 2020 |
Sylkan: towards a Vulkan compute target platform for SYCL P Thoman, D Gogl, T Fahringer Proceedings of the 9th International Workshop on OpenCL, 1-12, 2021 | 10 | 2021 |