Chao Song
WDFSR: Normalizing Flow based on Wavelet-Domain for Super-Resolution
Song, Chao; Li, Shaobang; Li, Frederick W. B.; Yang, Bailin
Abstract
We propose a Normalizing flow based on the wavelet framework for super-resolution called WDFSR. It learns the conditional distribution mapping between low-resolution images in the RGB domain and high-resolution images in the wavelet domain to generate high-resolution images of different styles simultaneously. To address the issue of some flow-based models being sensitive to datasets, resulting in training fluctuations that reduce the mapping ability of the model and weaken generalization, we designed a method that combines a T-distribution and QR decomposition layer. This method alleviates the problem while maintaining the model's ability to map different distributions and produce higher quality images. Good contextual conditional features can promote model training and enhance distribution mapping capabilities for conditional distribution mapping. Therefore, we propose a Refinement layer combined with an attention mechanism to refine and fuse the extracted condition features for improving image quality. Extensive experiments on many super-resolution datasets show that WDFSR outperforms most general CNN models and flow-based models in terms of PSNR and perception quality. We also demonstrate that our framework works well for other low-level vision tasks, such as low-light enhancement. The pre-trained models and source code with guidance for reference are available at https://github.com/Lisbegin/WDFSR.
Citation
Song, C., Li, S., Li, F. W. B., & Yang, B. (in press). WDFSR: Normalizing Flow based on Wavelet-Domain for Super-Resolution. Computational Visual Media,
Journal Article Type | Article |
---|---|
Acceptance Date | Aug 22, 2023 |
Deposit Date | Sep 12, 2023 |
Journal | Computational Visual Media |
Print ISSN | 2096-0433 |
Publisher | SpringerOpen |
Peer Reviewed | Peer Reviewed |
Public URL | https://durham-repository.worktribe.com/output/1735734 |
Publisher URL | https://www.springer.com/journal/41095 |
You might also like
Advances in Web-Based Learning - ICWL 2015
(-0001)
Book
Tackling Data Bias in Painting Classification with Style Transfer
(2023)
Presentation / Conference Contribution
Aesthetic Enhancement via Color Area and Location Awareness
(2022)
Presentation / Conference Contribution
STIT: Spatio-Temporal Interaction Transformers for Human-Object Interaction Recognition in Videos
(2022)
Presentation / Conference Contribution
STGAE: Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising
(2021)
Presentation / Conference Contribution
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search