Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation

7 Apr 2020Zhe WangDaeyun ShinCharless C. Fowlkes

Monocular estimation of 3d human pose has attracted increased attention with the availability of large ground-truth motion capture datasets. However, the diversity of training data available is limited and it is not clear to what extent methods generalize outside the specific datasets they are trained on... (read more)

PDF Abstract

Results from the Paper


 Ranked #1 on 3D Human Pose Estimation on Surreal (using extra training data)

     Get a GitHub badge
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK USES EXTRA
TRAINING DATA
RESULT BENCHMARK
3D Human Pose Estimation 3DPW Cross Dataset Generalization PA-MPJPE 65.2 # 3
3D Human Pose Estimation Human3.6M Cross Dataset Generalization Average MPJPE (mm) 52.0 # 24
Using 2D ground-truth joints Yes # 1
Multi-View or Monocular Monocular # 1
3D Human Pose Estimation MPI-INF-3DHP Cross Dataset Generalization 3DPCK 84.3 # 8
MJPE 90.3 # 6
3D Human Pose Estimation Surreal Cross Dataset Generalization MPJPE 37.1 # 1
PCK3D 97.3 # 1

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet