HOMEto discover JOINING USto achieve PUBLICATIONSto innovate GRANTSto establish ACTIVITIESto engage PEOPLEto collaborate TEACHINGto inspire CONTACTSto explore

A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction

Qianhui Men, Edmond S. L. Ho, Hubert P. H. Shum and Howard Leung
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2021

Impact Factor: 11.1^† Top 10% Journal in Engineering, Electrical & Electronic^† Citation: 30^#

A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction

† According to Journal Citation Reports 2024

# According to Google Scholar 2025

Abstract

Recurrent neural network (RNN) has become popular for human motion prediction thanks to its ability to capture temporal dependencies. However, it has limited capacity in modeling the complex spatial relationship in the human skeletal structure. In this work, we present a novel diffusion convolutional recurrent predictor for spatial and temporal movement forecasting, with multi-step random walks traversing bidirectionally along an adaptive graph to model interdependency among body joints. In the temporal domain, existing methods rely on a single forward predictor with the produced motion deflecting to the drift route, which leads to error accumulations over time. We propose to supplement the forward predictor with a forward discriminator to alleviate such motion drift in the long term under adversarial training. The solution is further enhanced by a backward predictor and a backward discriminator to effectively reduce the error, such that the system can also look into the past to improve the prediction at early frames. The two-way spatial diffusion convolutions and two-way temporal predictors together form a quadruple network. Furthermore, we train our framework by modeling the velocity from observed motion dynamics instead of static poses to predict future movements that effectively reduces the discontinuity problem at early prediction. Our method outperforms the state of the arts on both 3D and 2D datasets, including the Human3.6M, CMU Motion Capture and Penn Action datasets. The results also show that our method correctly predicts both high-dynamic and low-dynamic moving trends with less motion drift.

Downloads

Paper (5.1MB)

Video (38.5MB)

Presentation Slides (43.2MB)

GitHub

DOI - Publisher's Page

YouTube

Cite This Research

Plain Text

Qianhui Men, Edmond S. L. Ho, Hubert P. H. Shum and Howard Leung, "A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction," IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 9, pp. 3417-3432, IEEE, 2021.

BibTeX

@article{men21quadruple,
author={Men, Qianhui and Ho, Edmond S. L. and Shum, Hubert P. H. and Leung, Howard},
journal={IEEE Transactions on Circuits and Systems for Video Technology},
title={A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction},
year={2021},
volume={31},
number={9},
pages={3417--3432},
numpages={16},
doi={10.1109/TCSVT.2020.3038145},
issn={1051-8215},
publisher={IEEE},
}

RIS

TY  - JOUR
AU  - Men, Qianhui
AU  - Ho, Edmond S. L.
AU  - Shum, Hubert P. H.
AU  - Leung, Howard
T2  - IEEE Transactions on Circuits and Systems for Video Technology
TI  - A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction
PY  - 2021
VL  - 31
IS  - 9
SP  - 3417
EP  - 3432
DO  - 10.1109/TCSVT.2020.3038145
SN  - 1051-8215
PB  - IEEE
ER  -

Similar Research

He Wang, Edmond S. L. Ho, Hubert P. H. Shum and Zhanxing Zhu, "Spatio-Temporal Manifold Learning for Human Motions via Long-Horizon Modeling", IEEE Transactions on Visualization and Computer Graphics (TVCG), 2021

Edmond S. L. Ho, Hubert P. H. Shum, He Wang and Li Yi, "Synthesizing Motion with Relative Emotion Strength", Proceedings of the 2017 ACM SIGGRAPH Asia Workshop on Data-Driven Animation Techniques (D2AT), 2017

Qianhui Men, Hubert P. H. Shum, Edmond S. L. Ho and Howard Leung, "GAN-Based Reactive Motion Synthesis with Class-Aware Discriminators for Human-Human Interaction", Computers and Graphics (C&G), 2022

Liuyang Zhou, Lifeng Shang, Hubert P. H. Shum and Howard Leung, "Human Motion Variation Synthesis with Multivariate Gaussian Processes", Computer Animation and Virtual Worlds (CAVW) - Proceedings of the 2014 International Conference on Computer Animation and Social Agents (CASA), 2014

Hubert P. H. Shum, Ludovic Hoyet, Edmond S. L. Ho, Taku Komura and Franck Multon, "Natural Preparation Behavior Synthesis", Computer Animation and Virtual Worlds (CAVW), 2013

Hubert P. H. Shum, Ludovic Hoyet, Edmond S. L. Ho, Taku Komura and Franck Multon, "Preparation Behaviour Synthesis with Reinforcement Learning", Proceedings of the 2013 International Conference on Computer Animation and Social Agents (CASA), 2013

Hubert P. H. Shum, Taku Komura and Pranjul Yadav, "Angular Momentum Guided Motion Concatenation", Computer Animation and Virtual Worlds (CAVW) - Proceedings of the 2009 International Conference on Computer Animation and Social Agents (CASA), 2009

HomeGoogle ScholarLinkedInYouTubeGitHubORCIDResearchGateEmail

Last updated on 1 February 2026
RSS Feed