TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Monocular 3D Human Pose Estimation	Human3.6M	Attention3DHumanPose	Average MPJPE (mm)	45.1	# 15
Monocular 3D Human Pose Estimation	Human3.6M	Attention3DHumanPose	Use Video Sequence	Yes	# 1
Monocular 3D Human Pose Estimation	Human3.6M	Attention3DHumanPose	Frames Needed	243	# 33
Monocular 3D Human Pose Estimation	Human3.6M	Attention3DHumanPose	Need Ground Truth 2D Pose	No	# 1
Monocular 3D Human Pose Estimation	Human3.6M	Attention3DHumanPose	2D detector	CPN	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/attention-mechanism-exploits-temporal/monocular-3d-human-pose-estimation-on-human3)](https://paperswithcode.com/sota/monocular-3d-human-pose-estimation-on-human3?p=attention-mechanism-exploits-temporal)`

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

CVPR 2020 · Ruixu Liu, Ju Shen, He Wang, Chen Chen, Sen-ching Cheung, Vijayan Asari ·

We propose a novel attention-based framework for 3D human pose estimation from a monocular video. Despite the general success of end-to-end deep learning paradigms, our approach is based on two key observations: (1) temporal incoherence and jitter are often yielded from a single frame prediction; (2) error rate can be remarkably reduced by increasing the receptive field in a video. Therefore, we design an attentional mechanism to adaptively identify significant frames and tensor outputs from each deep neural net layer, leading to a more optimal estimation. To achieve large temporal receptive fields, multi-scale dilated convolutions are employed to model long-range dependencies among frames. The architecture is straightforward to implement and can be flexibly adopted for real-time applications. Any off-the-shelf 2D pose estimation system, e.g. Mocap libraries, can be easily integrated in an ad-hoc fashion. We both quantitatively and qualitatively evaluate our method on various standard benchmark datasets (e.g. Human3.6M, HumanEva). Our method considerably outperforms all the state-of-the-art algorithms up to 8% error reduction (average mean per joint position error: 34.7) as compared to the best-reported results. Code is available at: (https://github.com/lrxjason/Attention3DHumanPose)

PDF Abstract

Code

Add Remove Mark official

lrxjason/Attention3DHumanPose official

152

Tasks

Add Remove

2D Pose Estimation

3D Human Pose Estimation

Monocular 3D Human Pose Estimation

Pose Estimation

Datasets

Human3.6M

Results from the Paper

Add Remove

Ranked #15 on Monocular 3D Human Pose Estimation on Human3.6M

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Monocular 3D Human Pose Estimation	Human3.6M	Attention3DHumanPose	Average MPJPE (mm)	45.1	# 15	Compare
			Use Video Sequence	Yes	# 1	Compare
			Frames Needed	243	# 33	Compare
			Need Ground Truth 2D Pose	No	# 1	Compare
			2D detector	CPN	# 1	Compare

Methods

Add Remove

Mish • Softplus • Tanh Activation

Edit Social Preview

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove