Skip to main content

Research Repository

Advanced Search

All Outputs (8)

Efficient Implementation of ADER Discontinuous Galerkin Schemes for a Scalable Hyperbolic PDE Engine (2018)
Journal Article
Dumbser, M., Fambri, F., Tavelli, M., Bader, M., & Weinzierl, T. (2018). Efficient Implementation of ADER Discontinuous Galerkin Schemes for a Scalable Hyperbolic PDE Engine. Axioms, 7(3), Article 63. https://doi.org/10.3390/axioms7030063

In this paper we discuss a new and very efficient implementation of high order accurate arbitrary high order schemes using derivatives discontinuous Galerkin (ADER-DG) finite element schemes on modern massively parallel supercomputers. The numerical... Read More about Efficient Implementation of ADER Discontinuous Galerkin Schemes for a Scalable Hyperbolic PDE Engine.

A Multi-Core Ready Discrete Element Method With Triangles Using Dynamically Adaptive Multiscale Grids (2018)
Journal Article
Krestenitis, K., & Weinzierl, T. (2019). A Multi-Core Ready Discrete Element Method With Triangles Using Dynamically Adaptive Multiscale Grids. Concurrency and Computation: Practice and Experience, 31(19), Article e4935. https://doi.org/10.1002/cpe.4935

The simulation of vast numbers of rigid bodies of non‐analytical shapes and of tremendously different sizes that collide with each other is computationally challenging. A bottleneck is the identification of all particle contact points per time step.... Read More about A Multi-Core Ready Discrete Element Method With Triangles Using Dynamically Adaptive Multiscale Grids.

An experience report on (auto-)tuning of mesh-based PDE solvers on shared memory systems (2018)
Conference Proceeding
Charrier, D. E., & Weinzierl, T. (2018). An experience report on (auto-)tuning of mesh-based PDE solvers on shared memory systems. In R. Wyrzykowski, J. . J. Dongarra, E. Deelman, & K. Karczewski (Eds.), Parallel processing and applied mathematics : 12th International Conference, PPAM 2017, Lublin, Poland, September 10-13, 2017 ; revised selected papers. Part I (3-13). https://doi.org/10.1007/978-3-319-78054-2_1

With the advent of manycore systems, shared memory parallelisation has gained importance in high performance computing. Once a code is decomposed into tasks or parallel regions, it becomes crucial to identify reasonable grain sizes, i.e. minimum prob... Read More about An experience report on (auto-)tuning of mesh-based PDE solvers on shared memory systems.

Fast DEM collision checks on multicore nodes (2018)
Conference Proceeding
Krestenitis, K., Weinzierl, T., & Koziara, T. (2018). Fast DEM collision checks on multicore nodes. In R. Wyrzykowski, J. . J. Dongarra, E. Deelman, & K. Karczewski (Eds.), Parallel processing and applied mathematics : 12th International conference, PPAM 2017, Lublin, Poland, September 10-13 ; revised selected papers. Part 1 (123-132). https://doi.org/10.1007/978-3-319-78024-5_12

Many particle simulations today rely on spherical or analytical particle shape descriptions. They find non-spherical, triangulated particle models computationally infeasible due to expensive collision detections. We propose a hybrid collision detecti... Read More about Fast DEM collision checks on multicore nodes.

Quasi-matrix-free hybrid multigrid on dynamically adaptive Cartesian grids (2018)
Journal Article
Weinzierl, M., & Weinzierl, T. (2018). Quasi-matrix-free hybrid multigrid on dynamically adaptive Cartesian grids. ACM Transactions on Mathematical Software, 44(3), Article 32. https://doi.org/10.1145/3165280

We present a family of spacetree-based multigrid realizations using the tree’s multiscale nature to derive coarse grids. They align with matrix-free geometric multigrid solvers as they never assemble the system matrices, which is cumbersome for dynam... Read More about Quasi-matrix-free hybrid multigrid on dynamically adaptive Cartesian grids.

A Case Study for a New Invasive Extension of Intel's Threading Building Blocks (2018)
Conference Proceeding
Schreiber, M., & Weinzierl, T. (2018). A Case Study for a New Invasive Extension of Intel's Threading Building Blocks. In J. Weidendorfer, & C. Trinitis (Eds.), Proceedings of HiPEAC 2018 — 3rd COSH Workshop on Co-Scheduling of HPC Applications (COSH 2018) (21-26). https://doi.org/10.14459/2018md1428538

We study codes deploying multiple MPI ranks to one node where each rank is parallelised with TBB. A static assignment of cores to ranks here is disadvantageous if the load is not perfectly balanced, the runtime is subject to fluctuations or one MPI r... Read More about A Case Study for a New Invasive Extension of Intel's Threading Building Blocks.