D.E. Charrier
Studies on the energy and deep memory behaviour of a cache-oblivious, task-based hyperbolic PDE solver
Charrier, D.E.; Hazelwood, B.; Tutlyaeva, E.; Bader, M.; Dumbser, M.; Kudryavtsev, A.; Moskovsky, A.; Weinzierl, T.
Authors
B. Hazelwood
E. Tutlyaeva
M. Bader
M. Dumbser
A. Kudryavtsev
A. Moskovsky
Professor Tobias Weinzierl tobias.weinzierl@durham.ac.uk
Professor
Abstract
We study the performance behaviour of a seismic simulation using the ExaHyPE engine with a specific focus on memory characteristics and energy needs. ExaHyPE combines dynamically adaptive mesh refinement (AMR) with ADER-DG. It is parallelized using tasks, and it is cache efficient. AMR plus ADER-DG yields a task graph which is highly dynamic in nature and comprises both arithmetically expensive tasks and tasks which challenge the memory’s latency. The expensive tasks and thus the whole code benefit from AVX vectorization, although we suffer from memory access bursts. A frequency reduction of the chip improves the code’s energy-to-solution. Yet, it does not mitigate burst effects. The bursts’ latency penalty becomes worse once we add Intel Optane technology, increase the core count significantly or make individual, computationally heavy tasks fall out of close caches. Thread overbooking to hide away these latency penalties becomes contra-productive with noninclusive caches as it destroys the cache and vectorization character. In cases where memory-intense and computationally expensive tasks overlap, ExaHyPE’s cache-oblivious implementation nevertheless can exploit deep, noninclusive, heterogeneous memory effectively, as main memory misses arise infrequently and slow down only few cores. We thus propose that upcoming supercomputing simulation codes with dynamic, inhomogeneous task graphs are actively supported by thread runtimes in intermixing tasks of different compute character, and we propose that future hardware actively allows codes to downclock the cores running particular task types.
Citation
Charrier, D., Hazelwood, B., Tutlyaeva, E., Bader, M., Dumbser, M., Kudryavtsev, A., Moskovsky, A., & Weinzierl, T. (2019). Studies on the energy and deep memory behaviour of a cache-oblivious, task-based hyperbolic PDE solver. International Journal of High Performance Computing Applications, 33(5), 973-986. https://doi.org/10.1177/1094342019842645
Journal Article Type | Article |
---|---|
Acceptance Date | Mar 11, 2019 |
Online Publication Date | Apr 15, 2019 |
Publication Date | Sep 30, 2019 |
Deposit Date | Mar 11, 2019 |
Publicly Available Date | Apr 28, 2019 |
Journal | International Journal of High Performance Computing Applications |
Print ISSN | 1094-3420 |
Electronic ISSN | 1741-2846 |
Publisher | SAGE Publications |
Peer Reviewed | Peer Reviewed |
Volume | 33 |
Issue | 5 |
Pages | 973-986 |
DOI | https://doi.org/10.1177/1094342019842645 |
Public URL | https://durham-repository.worktribe.com/output/1301452 |
Related Public URLs | https://arxiv.org/abs/1810.03940 |
Files
Published Journal Article
(754 Kb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by-nc/4.0/
Published Journal Article (Advance online version)
(754 Kb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by-nc/4.0/
Copyright Statement
Advance online version
Accepted Journal Article
(1 Mb)
PDF
Copyright Statement
This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
You might also like
ExaHyPE: An engine for parallel dynamically adaptive simulations of wave problems
(2020)
Journal Article
An experience report on (auto-)tuning of mesh-based PDE solvers on shared memory systems
(2018)
Presentation / Conference Contribution
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search