Human Pose Regression by Combining Indirect Part Detection and Contextual Information

6 Oct 2017  ·  Diogo C. Luvizon, Hedi Tabia, David Picard

In this paper, we propose an end-to-end trainable regression approach for human pose estimation from still images. We use the proposed Soft-argmax function to convert feature maps directly to joint coordinates, resulting in a fully differentiable framework. Our method is able to learn heat map representations indirectly, without additional steps of artificial ground truth generation. Consequently, contextual information can be included in the pose predictions in a seamless way. We evaluated our method on two very challenging datasets, the Leeds Sports Poses (LSP) and the MPII Human Pose datasets, reaching the best performance among all the existing regression methods and results comparable to the state-of-the-art detection-based approaches.
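To illustrate how a soft-argmax can turn a feature map into coordinates, here is a minimal sketch in plain Python. It is not the authors' implementation: the function name `soft_argmax`, the list-of-rows input, and the normalization of coordinates to [0, 1] by dividing by (size - 1) are all assumptions made for this example. The idea is to apply a softmax over all spatial positions and then take the probability-weighted average of the pixel coordinates, which is differentiable, unlike a hard argmax.

```python
import math


def soft_argmax(heatmap):
    """Return the expected (x, y) location of a 2D score map.

    `heatmap` is a list of rows of floats (hypothetical input format).
    Both outputs are normalized to [0, 1]; this normalization is an
    assumption of the sketch, not something stated in the abstract.
    """
    height = len(heatmap)
    width = len(heatmap[0])

    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(max(row) for row in heatmap)
    weights = [[math.exp(v - peak) for v in row] for row in heatmap]
    total = sum(sum(row) for row in weights)

    x = 0.0
    y = 0.0
    for r, row in enumerate(weights):
        for c, w in enumerate(row):
            p = w / total  # softmax probability of this position
            x += p * c / max(width - 1, 1)   # expected column, normalized
            y += p * r / max(height - 1, 1)  # expected row, normalized
    return x, y


# A sharply peaked map: the result approaches the hard argmax location.
peaked = [[0.0, 0.0, 0.0],
          [0.0, 0.0, 50.0],
          [0.0, 0.0, 0.0]]
x_peak, y_peak = soft_argmax(peaked)

# A flat map: the result is the center of the map.
flat = [[1.0, 1.0, 1.0] for _ in range(3)]
x_flat, y_flat = soft_argmax(flat)
```

Because the output is a smooth function of every score in the map, a coordinate loss can in principle be back-propagated through it, which is consistent with the abstract's claim that heat maps are learned indirectly rather than from artificial ground truth.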

Task | Dataset | Model | Metric Name | Metric Value | Global Rank
Pose Estimation | Leeds Sports Poses | Soft-argmax + contextual information | PCK | 90.5% | # 11
