A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction

Men, Qianhui; Ho, Edmond S.L.; Shum, Hubert P.H.; Leung, Howard

doi:10.1109/tcsvt.2020.3038145

A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction

Men, Qianhui; Ho, Edmond S.L.; Shum, Hubert P.H.; Leung, Howard

Authors

Qianhui Men

Edmond S.L. Ho

Professor Hubert Shum hubert.shum@durham.ac.uk
Professor

Howard Leung

Abstract

Recurrent neural network (RNN) has become popular for human motion prediction thanks to its ability to capture temporal dependencies. However, it has limited capacity in modeling the complex spatial relationship in the human skeletal structure. In this work, we present a novel diffusion convolutional recurrent predictor for spatial and temporal movement forecasting, with multi-step random walks traversing bidirectionally along an adaptive graph to model interdependency among body joints. In the temporal domain, existing methods rely on a single forward predictor with the produced motion deflecting to the drift route, which leads to error accumulations over time. We propose to supplement the forward predictor with a forward discriminator to alleviate such motion drift in the long term under adversarial training. The solution is further enhanced by a backward predictor and a backward discriminator to effectively reduce the error, such that the system can also look into the past to improve the prediction at early frames. The two-way spatial diffusion convolutions and two-way temporal predictors together form a quadruple network. Furthermore, we train our framework by modeling the velocity from observed motion dynamics instead of static poses to predict future movements that effectively reduces the discontinuity problem at early prediction. Our method outperforms the state of the arts on both 3D and 2D datasets, including the Human3.6M, CMU Motion Capture and Penn Action datasets. The results also show that our method correctly predicts both high-dynamic and low-dynamic moving trends with less motion drift.

Citation

Men, Q., Ho, E. S., Shum, H. P., & Leung, H. (2021). A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction. IEEE Transactions on Circuits and Systems for Video Technology, 31(9), 3417-3432. https://doi.org/10.1109/tcsvt.2020.3038145

Journal Article Type	Article
Acceptance Date	Nov 2, 2020
Online Publication Date	Nov 16, 2020
Publication Date	2021-09
Deposit Date	Nov 3, 2020
Publicly Available Date	Nov 24, 2020
Journal	IEEE Transactions on Circuits and Systems for Video Technology
Print ISSN	1051-8215
Electronic ISSN	1558-2205
Publisher	Institute of Electrical and Electronics Engineers
Peer Reviewed	Peer Reviewed
Volume	31
Issue	9
Pages	3417-3432
DOI	https://doi.org/10.1109/tcsvt.2020.3038145
Public URL	https://durham-repository.worktribe.com/output/1287741

Files

Accepted Journal Article (8.1 Mb)
PDF

Copyright Statement
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.