Learning Trajectory Dependencies for Human Motion Prediction

Human motion prediction, i.e., forecasting future body poses given observed pose sequence, has typically been tackled with recurrent neural networks (RNNs). However, as evidenced by prior work, the resulted RNN models suffer from prediction errors accumulation, leading to undesired discontinuities in motion prediction. In this paper, we propose a simple feed-forward deep network for motion prediction, which takes into account both temporal smoothness and spatial dependencies among human body joints. In this context, we then propose to encode temporal information by working in trajectory space, instead of the traditionally-used pose space. This alleviates us from manually defining the range of temporal dependencies (or temporal convolutional filter size, as done in previous work). Moreover, spatial dependency of human pose is encoded by treating a human pose as a generic graph (rather than a human skeletal kinematic tree) formed by links between every pair of body joints. Instead of using a pre-defined graph structure, we design a new graph convolutional network to learn graph connectivity automatically. This allows the network to capture long range dependencies beyond that of human kinematic tree. We evaluate our approach on several standard benchmark datasets for motion prediction, including Human3.6M, the CMU motion capture dataset and 3DPW. Our experiments clearly demonstrate that the proposed approach achieves state of the art performance, and is applicable to both angle-based and position-based pose representations. The code is available at https://github.com/wei-mao-2019/LearnTrajDep

PDF Abstract ICCV 2019 PDF ICCV 2019 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Multi-Person Pose forecasting Expi - common actions split LTD Average MPJPE (mm) @ 1000 ms 303 # 6
Average MPJPE (mm) @ 600 ms 226 # 6
Average MPJPE (mm) @ 400 ms 169 # 6
Average MPJPE (mm) @ 200 ms 90 # 6
Multi-Person Pose forecasting Expi - unseen actions split LTD Average MPJPE (mm) @ 800 ms 272 # 4
Average MPJPE (mm) @ 600 ms 233 # 5
Average MPJPE (mm) @ 400 ms 177 # 5
Human Pose Forecasting Human3.6M LTD-GCN MAR, walking, 400ms 0.56 # 3
MAR, walking, 1,000ms 0.67 # 1
Average MPJPE (mm) @ 1000 ms 113.0 # 7
Average MPJPE (mm) @ 400ms 63.5 # 8

Methods


No methods listed for this paper. Add relevant methods here