TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-L	Average MPJPE (mm)	38.4	# 2
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-L	Use Video Sequence	Yes	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-L	Frames Needed	243	# 33
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-L	Need Ground Truth 2D Pose	No	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-L	2D detector	SH	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-XS	Average MPJPE (mm)	45.1	# 15
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-XS	Use Video Sequence	Yes	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-XS	Frames Needed	27	# 27
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-XS	Need Ground Truth 2D Pose	No	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-XS	2D detector	SH	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-S	Average MPJPE (mm)	42.5	# 11
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-S	Use Video Sequence	Yes	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-S	Frames Needed	81	# 29
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-S	Need Ground Truth 2D Pose	No	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-S	2D detector	SH	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-B	Average MPJPE (mm)	38.4	# 2
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-B	Use Video Sequence	Yes	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-B	Frames Needed	243	# 33
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-B	Need Ground Truth 2D Pose	No	# 1
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-B	2D detector	SH	# 1
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-B (T=81)	AUC	84.2	# 4
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-B (T=81)	MPJPE	18.2	# 4
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-B (T=81)	PCK	98.3	# 5
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-XS (T=27)	AUC	83.5	# 6
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-XS (T=27)	MPJPE	19.2	# 5
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-XS (T=27)	PCK	98.2	# 7
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-S (T=81)	AUC	84.5	# 3
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-S (T=81)	MPJPE	17.1	# 3
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-S (T=81)	PCK	98.3	# 5
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-L (T=81)	AUC	85.3	# 2
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-L (T=81)	MPJPE	16.2	# 1
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-L (T=81)	PCK	98.2	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motionagformer-enhancing-3d-human-pose/3d-human-pose-estimation-on-mpi-inf-3dhp)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-mpi-inf-3dhp?p=motionagformer-enhancing-3d-human-pose)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/motionagformer-enhancing-3d-human-pose/monocular-3d-human-pose-estimation-on-human3)](https://paperswithcode.com/sota/monocular-3d-human-pose-estimation-on-human3?p=motionagformer-enhancing-3d-human-pose)`

MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network

25 Oct 2023 · Soroush Mehraban, Vida Adeli, Babak Taati ·

Recent transformer-based approaches have demonstrated excellent performance in 3D human pose estimation. However, they have a holistic view and by encoding global relationships between all the joints, they do not capture the local dependencies precisely. In this paper, we present a novel Attention-GCNFormer (AGFormer) block that divides the number of channels by using two parallel transformer and GCNFormer streams. Our proposed GCNFormer module exploits the local relationship between adjacent joints, outputting a new representation that is complementary to the transformer output. By fusing these two representation in an adaptive way, AGFormer exhibits the ability to better learn the underlying 3D structure. By stacking multiple AGFormer blocks, we propose MotionAGFormer in four different variants, which can be chosen based on the speed-accuracy trade-off. We evaluate our model on two popular benchmark datasets: Human3.6M and MPI-INF-3DHP. MotionAGFormer-B achieves state-of-the-art results, with P1 errors of 38.4mm and 16.2mm, respectively. Remarkably, it uses a quarter of the parameters and is three times more computationally efficient than the previous leading model on Human3.6M dataset. Code and models are available at https://github.com/TaatiTeam/MotionAGFormer.

PDF Abstract

Code

Add Remove Mark official

taatiteam/motionagformer official

Tasks

Add Remove

3D Human Pose Estimation

Monocular 3D Human Pose Estimation

Pose Estimation

Datasets

Human3.6M

MPI-INF-3DHP

Results from the Paper

Edit

Ranked #1 on 3D Human Pose Estimation on MPI-INF-3DHP

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-L	Average MPJPE (mm)	38.4	# 2	Compare
			Use Video Sequence	Yes	# 1	Compare
			Frames Needed	243	# 33	Compare
			Need Ground Truth 2D Pose	No	# 1	Compare
			2D detector	SH	# 1	Compare
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-XS	Average MPJPE (mm)	45.1	# 15	Compare
			Use Video Sequence	Yes	# 1	Compare
			Frames Needed	27	# 27	Compare
			Need Ground Truth 2D Pose	No	# 1	Compare
			2D detector	SH	# 1	Compare
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-S	Average MPJPE (mm)	42.5	# 11	Compare
			Use Video Sequence	Yes	# 1	Compare
			Frames Needed	81	# 29	Compare
			Need Ground Truth 2D Pose	No	# 1	Compare
			2D detector	SH	# 1	Compare
Monocular 3D Human Pose Estimation	Human3.6M	MotionAGFormer-B	Average MPJPE (mm)	38.4	# 2	Compare
			Use Video Sequence	Yes	# 1	Compare
			Frames Needed	243	# 33	Compare
			Need Ground Truth 2D Pose	No	# 1	Compare
			2D detector	SH	# 1	Compare
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-B (T=81)	AUC	84.2	# 4	Compare
			MPJPE	18.2	# 4	Compare
			PCK	98.3	# 5	Compare
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-XS (T=27)	AUC	83.5	# 6	Compare
			MPJPE	19.2	# 5	Compare
			PCK	98.2	# 7	Compare
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-S (T=81)	AUC	84.5	# 3	Compare
			MPJPE	17.1	# 3	Compare
			PCK	98.3	# 5	Compare
3D Human Pose Estimation	MPI-INF-3DHP	MotionAGFormer-L (T=81)	AUC	85.3	# 2	Compare
			MPJPE	16.2	# 1	Compare
			PCK	98.2	# 7	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove