Zhaoyi Jiang
An end-to-end dynamic point cloud geometry compression in latent space
Jiang, Zhaoyi; Wang, Guoliang; Tam, Gary K. L.; Song, Chao; Yang, Bailin; Li, Frederick W. B.
Authors
Guoliang Wang
Gary K. L. Tam
Chao Song
Bailin Yang
Dr Frederick Li frederick.li@durham.ac.uk
Associate Professor
Abstract
Dynamic point clouds are widely used for 3D data representation in various applications such as immersive and mixed reality, robotics and autonomous driving. However, their irregularity and large scale make efficient compression and transmission a challenge. Existing methods require high bitrates to encode point clouds since temporal correlation is not well considered. This paper proposes an end-to-end dynamic point cloud compression network that operates in latent space, resulting in more accurate motion estimation and more effective motion compensation. Specifically, a multi-scale motion estimation network is introduced to obtain accurate motion vectors. Motion information computed at a coarser level is upsampled and warped to the finer level based on cost volume analysis for motion compensation. Additionally, a residual compression network is designed to mitigate the effects of noise and inaccurate predictions by encoding latent residuals, resulting in smaller conditional entropy and better results. The proposed method achieves an average 12.09% and 14.76% (D2) BD-Rate gain over state-of-the-art Deep Dynamic Point Cloud Compression (D-DPCC) in experimental results. Compared to V-PCC, our framework showed an average improvement of 81.29% (D1) and 77.57% (D2).
Citation
Jiang, Z., Wang, G., Tam, G. K. L., Song, C., Yang, B., & Li, F. W. B. (2023). An end-to-end dynamic point cloud geometry compression in latent space. Displays, 80, Article 102528. https://doi.org/10.1016/j.displa.2023.102528
Journal Article Type | Article |
---|---|
Acceptance Date | Aug 28, 2023 |
Online Publication Date | Sep 14, 2023 |
Publication Date | 2023-12 |
Deposit Date | Sep 12, 2023 |
Publicly Available Date | Sep 20, 2023 |
Journal | Displays |
Print ISSN | 0141-9382 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 80 |
Article Number | 102528 |
DOI | https://doi.org/10.1016/j.displa.2023.102528 |
Public URL | https://durham-repository.worktribe.com/output/1735649 |
Files
Accepted Journal Article
(1.5 Mb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Copyright Statement
For the purpose of Open Access the author has applied a CC BY copyright licence to any Author Accepted Manuscript version arising from this submission.
You might also like
Advances in Web-Based Learning - ICWL 2015
(-0001)
Book
Tackling Data Bias in Painting Classification with Style Transfer
(2023)
Presentation / Conference Contribution
Aesthetic Enhancement via Color Area and Location Awareness
(2022)
Presentation / Conference Contribution
STIT: Spatio-Temporal Interaction Transformers for Human-Object Interaction Recognition in Videos
(2022)
Presentation / Conference Contribution
STGAE: Spatial-Temporal Graph Auto-Encoder for Hand Motion Denoising
(2021)
Presentation / Conference Contribution
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search