Matthew Poyser matthew.poyser@durham.ac.uk
Academic Visitor
On the Impact of Lossy Image and Video Compression on the Performance of Deep Convolutional Neural Network Architectures
Poyser, M.; Atapour-Abarghouei, A.; Breckon, T.P.
Authors
Dr Amir Atapour-Abarghouei amir.atapour-abarghouei@durham.ac.uk
Assistant Professor
Professor Toby Breckon toby.breckon@durham.ac.uk
Professor
Abstract
Recent advances in generalized image understanding have seen a surge in the use of deep convolutional neural networks (CNN) across a broad range of image-based detection, classification and prediction tasks. Whilst the reported performance of these approaches is impressive, this study investigates the hitherto unapproached question of the impact of commonplace image and video compression techniques on the performance of such deep learning architectures. Focusing on JPEG and H.264 (MPEG-4 AVC) as representative proxies for contemporary lossy image/video compression techniques that are in common use within network-connected image/video devices and infrastructure, we examine the impact on performance across five discrete tasks: human pose estimation, semantic segmentation, object detection, action recognition, and monocular depth estimation. As such, within this study we include a variety of network architectures and domains spanning end-to-end convolution, encoder-decoder, region-based CNN (R-CNN), dual-stream, and generative adversarial networks (GAN). Our results show a non-linear and non-uniform relationship between network performance and the level of lossy compression applied. Notably, performance decreases significantly below a JPEG quality (quantization) level of 15% and an H.264 Constant Rate Factor (CRF) of 40. However, retraining said architectures on pre-compressed imagery conversely recovers network performance by up to 78.4% in some cases. Furthermore, there is a correlation between architectures employing an encoder-decoder pipeline and those that demonstrate resilience to lossy image compression. The characteristics of the relationship between input compression and output task performance can be used to inform design decisions within future image/video devices and infrastructure.
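The kind of evaluation the abstract describes (compress the inputs at several levels, then measure task performance at each level) might be sketched as below. This is a minimal illustration and not the authors' code: the function names `h264_crf_command`, `sweep_compression` and `relative_drop` are hypothetical, as are the `compress` and `evaluate` callables the caller would supply. The only external detail assumed is the standard ffmpeg invocation for libx264 with a `-crf` value.

```python
from typing import Callable, Dict, Iterable, List, TypeVar

T = TypeVar("T")


def h264_crf_command(src: str, dst: str, crf: int) -> List[str]:
    """Build (but do not run) an ffmpeg command that re-encodes `src`
    as H.264 at the given Constant Rate Factor. Higher CRF = more loss."""
    if not 0 <= crf <= 51:
        # 0..51 is the CRF range libx264 accepts for 8-bit video.
        raise ValueError("CRF must be between 0 and 51")
    return ["ffmpeg", "-y", "-i", src, "-c:v", "libx264", "-crf", str(crf), dst]


def sweep_compression(
    levels: Iterable[int],
    compress: Callable[[int], T],
    evaluate: Callable[[T], float],
) -> Dict[int, float]:
    """Score a model at each compression level.

    `compress(level)` is assumed to return the test data compressed at
    that level; `evaluate(data)` is assumed to return a task metric
    where larger is better. Both are supplied by the caller.
    """
    return {level: evaluate(compress(level)) for level in levels}


def relative_drop(scores: Dict[int, float], baseline: float) -> Dict[int, float]:
    """Fraction of the uncompressed baseline score lost at each level."""
    if baseline <= 0:
        raise ValueError("baseline score must be positive")
    return {level: (baseline - score) / baseline for level, score in scores.items()}
```

A command list from `h264_crf_command` can be passed to `subprocess.run` when ffmpeg is installed. Keeping compression and evaluation as caller-supplied callables means the same sweep could cover both JPEG quality levels and H.264 CRF values.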
Citation
Poyser, M., Atapour-Abarghouei, A., & Breckon, T. (2021, January). On the Impact of Lossy Image and Video Compression on the Performance of Deep Convolutional Neural Network Architectures. Presented at 25th International Conference on Pattern Recognition (ICPR2020), Milan, Italy
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 25th International Conference on Pattern Recognition (ICPR2020) |
Start Date | Jan 10, 2021 |
End Date | Jan 15, 2021 |
Acceptance Date | Jun 22, 2020 |
Online Publication Date | May 5, 2021 |
Publication Date | 2021 |
Deposit Date | Aug 12, 2020 |
Publicly Available Date | Aug 13, 2020 |
Series ISSN | 1051-4651 |
DOI | https://doi.org/10.1109/icpr48806.2021.9412455 |
Public URL | https://durham-repository.worktribe.com/output/1140363 |
Additional Information | Conference website: https://www.micc.unifi.it/icpr2020/ |
Files
Accepted Conference Proceeding (PDF, 2.8 Mb)
Copyright Statement
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.