Skip to main content

Research Repository

Advanced Search

Outputs (78)

Annotation-guided AoS-to-SoA conversions and GPU offloading with data views in C++ (2025)
Journal Article
Weinzierl, T., & Radtke, P. (in press). Annotation-guided AoS-to-SoA conversions and GPU offloading with data views in C++. Concurrency and Computation: Practice and Experience,

The C++ programming language provides classes and structs as fundamental modeling entities. Consequently, C++ code tends to favour array-of-structs (AoS) for encoding data sequences, even thoughstructure-of-arrays (SoA) yields better performance for... Read More about Annotation-guided AoS-to-SoA conversions and GPU offloading with data views in C++.

Compiler support for semi-manual AoS-to-SoA conversions with data views (2025)
Presentation / Conference Contribution
Radtke, P., & Weinzierl, T. (2024, September). Compiler support for semi-manual AoS-to-SoA conversions with data views. Presented at PPAM 2024 - 15th International Conference on Parallel Processing & Applied Mathematics, Ostrava, Czech Republic

The C programming language and its cousins such as C++ stipulate the static storage of sets of structured data: Developers have to commit to one, invariant data model -- typically a structure-of-arrays (SoA) or an array-of-structs (AoS) -- unles... Read More about Compiler support for semi-manual AoS-to-SoA conversions with data views.

SYCL compute kernels for ExaHyPE (2024)
Presentation / Conference Contribution
Loi, C. M., Bockhorst, H., & Weinzierl, T. (2024, March). SYCL compute kernels for ExaHyPE. Presented at 2024 SIAM Conference on Parallel Processing for Scientific Computing (PP), Baltimore, MD

We discuss three SYCL realisations of a simple Finite Volume scheme over multiple Cartesian patches. The realisation flavours differ in the way how they map the compute steps onto loops and tasks: We compare an implementation that is exclusively usin... Read More about SYCL compute kernels for ExaHyPE.

ExaGRyPE: Numerical general relativity solvers based upon the hyperbolic PDEs solver engine ExaHyPE (2024)
Journal Article
Zhang, H., Li, B., Weinzierl, T., & Barrera-Hinojosa, C. (2025). ExaGRyPE: Numerical general relativity solvers based upon the hyperbolic PDEs solver engine ExaHyPE. Computer Physics Communications, 307, Article 109435. https://doi.org/10.1016/j.cpc.2024.109435

ExaGRyPE describes a suite of solvers and solver ingredients for numerical relativity that are based upon ExaHyPE 2, the second generation of our Exascale Hyperbolic PDE Engine. Numerical relativity simulations are crucial in resolv... Read More about ExaGRyPE: Numerical general relativity solvers based upon the hyperbolic PDEs solver engine ExaHyPE.

Detrimental task execution patterns in mainstream OpenMP runtimes (2024)
Presentation / Conference Contribution
Weinzierl, T., Tuft, A., & Klemm, M. (2024, September). Detrimental task execution patterns in mainstream OpenMP runtimes. Presented at IWOMP 2024, Perth, Australia

The OpenMP API offers both task-based and data-parallel concepts to scientific computing. While it provides descriptive and prescriptive annotations, it is in many places deliberately unspecific how to implement its annotations. As the predomina... Read More about Detrimental task execution patterns in mainstream OpenMP runtimes.

Grundlagen des parallelen wissenschaftlichen Rechnens: Ein erster Leitfaden zu numerischen Konzepten und Programmiermethoden (2024)
Book
Weinzierl, T. (2024). Grundlagen des parallelen wissenschaftlichen Rechnens: Ein erster Leitfaden zu numerischen Konzepten und Programmiermethoden. Springer. https://doi.org/10.1007/978-3-031-49082-8

Neue Erkenntnisse in vielen wissenschaftlichen und technischen Bereichen sind ohne den Einsatz numerischer Simulationen, die auf modernen Computern effizient ablaufen, nicht denkbar. Je schneller wir neue Ergebnisse erhalten, desto größer und genauer... Read More about Grundlagen des parallelen wissenschaftlichen Rechnens: Ein erster Leitfaden zu numerischen Konzepten und Programmiermethoden.

A multiscale optimisation algorithm for shape and material reconstruction from a single X-ray image (2024)
Presentation / Conference Contribution
Westmacott, H., Ivrissimtzis, I., & Weinzierl, T. (2024, January). A multiscale optimisation algorithm for shape and material reconstruction from a single X-ray image. Presented at ICIGP 2024: The 7th International Conference on Image and Graphics Processing, Beijing, China

We produce thickness and bone to soft tissue ratio estimations from a single, 2D medical X-ray image. For this, we simulate the scattering of the rays through a model of the object and embed this simulation into an optimiser which iteratively adjusts... Read More about A multiscale optimisation algorithm for shape and material reconstruction from a single X-ray image.

Efficient GPU Offloading with OpenMP for a Hyperbolic Finite Volume Solver on Dynamically Adaptive Meshes (2023)
Presentation / Conference Contribution
Wille, M., Weinzierl, T., Brito Gadeschi, G., & Bader, M. (2023, December). Efficient GPU Offloading with OpenMP for a Hyperbolic Finite Volume Solver on Dynamically Adaptive Meshes. Presented at ISC High Performance 2023, Hamburg

We identify and show how to overcome an OpenMP bottleneck in the administration of GPU memory. It arises for a wave equation solver on dynamically adaptive block-structured Cartesian meshes, which keeps all CPU threads busy and allows all of them to... Read More about Efficient GPU Offloading with OpenMP for a Hyperbolic Finite Volume Solver on Dynamically Adaptive Meshes.

A multiresolution Discrete Element Method for triangulated objects with implicit time stepping (2022)
Journal Article
Noble, P., & Weinzierl, T. (2022). A multiresolution Discrete Element Method for triangulated objects with implicit time stepping. SIAM Journal on Scientific Computing, 44(4), A2121-A2149. https://doi.org/10.1137/21m1421842

Simulations of many rigid bodies colliding with each other sometimes yield particularly interesting results if the colliding objects differ significantly in size and are nonspherical. The most expensive part within such a simulation code is the colli... Read More about A multiresolution Discrete Element Method for triangulated objects with implicit time stepping.