Sukarn Agarwal
Architecting Selective Refresh based Multi-Retention Cache for Heterogeneous System (ARMOUR)
Agarwal, Sukarn; Chakraborty, Shounak; Själander, Magnus
Authors
Abstract
The increasing use of chiplets, and the demand for high-performance yet low-power systems, will result in heterogeneous systems that combine both CPUs and accelerators (e.g., general-purpose GPUs). Chiplet based designs also enable the inclusion of emerging memory technologies, since such technologies can reside on a separate chiplet without requiring complex integration in existing high-performance process technologies. One such emerging memory technology is spin-transfer torque (STT) memory, which has the potential to replace SRAM as the last-level cache (LLC). STT-RAM has the advantage of high density, non-volatility, and reduced leakage power, but suffers from a higher write latency and energy, as compared to SRAM. However, by relaxing the retention time, the write latency and energy can be reduced at the cost of the STT-RAM becoming more volatile. The retention time and write latency/energy can be traded against each other by creating an LLC with multiple retention zones. With a multi-retention LLC, the challenge is to direct the memory accesses to the most advantageous zone, to optimize for overall performance and energy efficiency. We propose ARMOUR, a mechanism for efficient management of memory accesses to a multi-retention LLC, where based on the initial requester (CPU or GPU) the cache blocks are allocated in the high (CPU) or low (GPU) retention zone. Furthermore, blocks that are about to expire are either refreshed (CPU) or written back (GPU). In addition, ARMOUR evicts CPU blocks with an estimated short lifetime, which further improves cache performance by reducing cache pollution. Our evaluation shows that ARMOUR improves average performance by 28.9% compared to a baseline STT-RAM based LLC and reduces the energy-delay product (EDP) by 74.5% compared to an iso-area SRAM LLC.
Citation
Agarwal, S., Chakraborty, S., & Själander, M. (2023, July). Architecting Selective Refresh based Multi-Retention Cache for Heterogeneous System (ARMOUR). Presented at 2023 60th ACM/IEEE Design Automation Conference (DAC), San Francisco, CA, USA
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 2023 60th ACM/IEEE Design Automation Conference (DAC) |
Start Date | Jul 9, 2023 |
End Date | Jul 13, 2023 |
Acceptance Date | Dec 15, 2022 |
Online Publication Date | Sep 15, 2023 |
Publication Date | Jul 9, 2023 |
Deposit Date | Jan 9, 2025 |
Publisher | Institute of Electrical and Electronics Engineers |
Peer Reviewed | Peer Reviewed |
Book Title | 2023 60th ACM/IEEE Design Automation Conference (DAC) |
DOI | https://doi.org/10.1109/dac56929.2023.10247878 |
Public URL | https://durham-repository.worktribe.com/output/3329021 |
You might also like
MAFin: Maximizing Accuracy in FinFET based Approximated Real-Time Computing
(2024)
Presentation / Conference Contribution
ARCTIC: Approximate Real-Time Computing in a Cache-Conscious Multicore Environment
(2024)
Journal Article
TREAFET: Temperature-Aware Real-Time Task Scheduling for FinFET based Multicores
(2024)
Journal Article
TEEMO: Temperature Aware Energy Efficient Multi-Retention STT-RAM Cache Architecture
(2024)
Presentation / Conference Contribution
NTHPC: Embracing Near-Threshold Operation for High Performance Multi-core Systems
(2023)
Presentation / Conference Contribution
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search