Umit Demirbaga
Energy-based Predictive Root Cause Analysis for Real-Time Anomaly Detection in Big Data Systems
Demirbaga, Umit; Singh Aujla, Gagangeet; Sun, Hongjian
Authors
Dr Gagangeet Aujla gagangeet.s.aujla@durham.ac.uk
Associate Professor in Computer Science
Professor Hongjian Sun hongjian.sun@durham.ac.uk
Professor
Abstract
As the scale of data continues to grow exponentially, managing resource allocation and energy consumption in big data systems becomes increasingly complex and critical. Moreover, with big data systems, energy efficiency is more important daily. In cloud environments, it can be the determining factor between reduced costs and lowered environmental damage. This paper presents a deep learning-based framework for accurately predicting instant energy consumption in real-time and detecting anomalies of different sizes in big data clusters. We use SmartMonit to gather task execution and real-time infrastructure data. A Feedforward Neural Network (FNN) predicts energy consumption from CPU utilisation, memory usage, and task profiling research. The system will track any deviation from predicted consumption with root cause analysis (RCA) if there are significant anomalies. We also integrate an Autoencoder to identify straggler tasks and inefficient resource utilisation. User-defined functions are next applied to examine these anomalies and try to detect the underlying reasons, like distributed data processing, locality of computation exploitation, or resource waste. Given the scale and heterogeneity of big data workloads, the system's ability to dynamically adjust and optimise resource usage is essential for handling complex processing tasks. The experimental results prove that the proposed system effectively enhances resource allocation and decreases wasted energy.
Citation
Demirbaga, U., Singh Aujla, G., & Sun, H. (2025, June). Energy-based Predictive Root Cause Analysis for Real-Time Anomaly Detection in Big Data Systems. Presented at 2025 IEEE International Conference on Communications (ICC), Montreal, Canada
Presentation Conference Type | Edited Proceedings |
---|---|
Conference Name | 2025 IEEE International Conference on Communications (ICC) |
Start Date | Jun 8, 2025 |
End Date | Jun 12, 2025 |
Acceptance Date | Jan 17, 2025 |
Deposit Date | Jan 18, 2025 |
Peer Reviewed | Peer Reviewed |
Keywords | Index Terms-Energy efficiency; Cloud computing; Predictive analytics; Big Data; Anomaly detection |
Public URL | https://durham-repository.worktribe.com/output/3342090 |
Publisher URL | https://icc2025.ieee-icc.org/ |
This file is under embargo due to copyright reasons.
You might also like
Uncovering hidden and complex relations of pandemic dynamics using an AI driven system
(2024)
Journal Article
Trusted Explainable AI for 6G-Enabled Edge Cloud Ecosystem
(2023)
Journal Article
Compliance Checking of Cloud Providers: Design and Implementation
(2023)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search