TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Human Pose Estimation	Human3.6M	GAST (T=81)	Average MPJPE (mm)	45.7	# 118
3D Human Pose Estimation	Human3.6M	GAST (T=81)	PA-MPJPE	35.9	# 28
3D Human Pose Estimation	Human3.6M	GAST (T=27)	Average MPJPE (mm)	46.2	# 121
3D Human Pose Estimation	Human3.6M	GAST (T=27)	PA-MPJPE	36	# 29
3D Human Pose Estimation	Human3.6M	GAST (T=9)	Average MPJPE (mm)	49	# 148
3D Human Pose Estimation	Human3.6M	GAST (T=9)	PA-MPJPE	37.4	# 37
3D Human Pose Estimation	HumanEva-I	GAST	Mean Reconstruction Error (mm)	21.2	# 11
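
For readers unfamiliar with the metric in the table, MPJPE (mean per-joint position error) is the average Euclidean distance between estimated and ground-truth joint positions. The sketch below is a minimal illustration rather than the benchmark's evaluation code. The function name `mpjpe` and the toy coordinates are hypothetical, and it assumes both poses are already expressed in the same root-aligned coordinate frame. PA-MPJPE additionally applies a rigid (Procrustes) alignment before measuring the error and is not shown here.

```python
import math


def mpjpe(predicted, target):
    """Mean per-joint position error: the average Euclidean distance
    between corresponding joints, in the unit of the inputs (e.g. mm)."""
    if len(predicted) != len(target) or not predicted:
        raise ValueError("poses must be non-empty and have the same joint count")
    return sum(math.dist(p, t) for p, t in zip(predicted, target)) / len(predicted)


# Toy 3-joint pose with made-up coordinates in mm (not taken from any dataset).
ground_truth = [(0.0, 0.0, 0.0), (100.0, 0.0, 0.0), (100.0, 200.0, 0.0)]
estimate = [(0.0, 0.0, 0.0), (103.0, 4.0, 0.0), (100.0, 200.0, 12.0)]

# Per-joint errors are 0, 5 and 12 mm, so the mean is 17/3 mm.
error_mm = mpjpe(estimate, ground_truth)
```

For a video, the per-frame values would be averaged over all frames and, for the "Average" figure, over the evaluated actions.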

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/gast-net-graph-attention-spatio-temporal/3d-human-pose-estimation-on-humaneva-i)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-humaneva-i?p=gast-net-graph-attention-spatio-temporal)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/gast-net-graph-attention-spatio-temporal/3d-human-pose-estimation-on-human36m)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-human36m?p=gast-net-graph-attention-spatio-temporal)`

A Graph Attention Spatio-temporal Convolutional Network for 3D Human Pose Estimation in Video

11 Mar 2020 · Junfa Liu, Juan Rojas, Zhijun Liang, Yihui Li, Yisheng Guan

Spatio-temporal information is key to resolving occlusion and depth ambiguity in 3D pose estimation. Previous methods have focused on either temporal contexts or local-to-global architectures that embed fixed-length spatio-temporal information. To date, there have been no effective proposals that simultaneously and flexibly capture varying spatio-temporal sequences and achieve real-time 3D pose estimation. In this work, we improve the learning of kinematic constraints in the human skeleton (posture, local kinematic connections, and symmetry) by modeling local and global spatial information via attention mechanisms. To adapt to single- and multi-frame estimation, a dilated temporal model is employed to process varying skeleton sequences. Importantly, we also carefully design the interleaving of spatial semantics with temporal dependencies to achieve a synergistic effect. To this end, we propose a simple yet effective graph attention spatio-temporal convolutional network (GAST-Net) that comprises interleaved temporal convolutional and graph attention blocks. Experiments on two challenging benchmark datasets (Human3.6M and HumanEva-I) and YouTube videos demonstrate that our approach effectively mitigates depth ambiguity and self-occlusion, generalizes to half upper body estimation, and achieves competitive performance on 2D-to-3D video pose estimation. Code, video, and supplementary information are available at: http://www.juanrojas.net/gast/
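
As a rough illustration of how a dilated temporal model can cover sequences of different lengths, the sketch below computes the receptive field of a stack of dilated 1D convolutions. It is a hedged example, not the authors' implementation. The kernel width of 3 and the dilation that triples per layer are assumptions commonly used in dilated temporal models for pose lifting, and `receptive_field` is a hypothetical helper name. Under those assumptions, stacks of 2, 3, and 4 layers see 9, 27, and 81 frames, which would match the T values listed in the results table if T is read as the number of input frames.

```python
def receptive_field(num_layers, kernel_size=3, dilation_base=3):
    """Number of input frames seen by one output frame of a stack of
    dilated 1D convolutions (stride 1, dilation = dilation_base ** layer)."""
    field = 1
    for layer in range(num_layers):
        dilation = dilation_base ** layer
        # Each layer widens the field by (kernel_size - 1) * dilation frames.
        field += (kernel_size - 1) * dilation
    return field


# Assumed layer counts; each extra layer triples the temporal context.
fields = [receptive_field(n) for n in (2, 3, 4)]
```

With zero layers the field is a single frame, which corresponds to the single-frame estimation case mentioned in the abstract.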

Code

fabro66/GAST-Net-3DPoseEstimation official

308

Tasks

3D Human Pose Estimation

3D Pose Estimation

Graph Attention

Pose Estimation

Temporal Sequences

Datasets

Human3.6M

Results from the Paper

Ranked #11 on 3D Human Pose Estimation on HumanEva-I

Methods

Convolution
