CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation

3D human pose estimation can be handled by encoding the geometric dependencies between the body parts and enforcing the kinematic constraints. Recently, Transformers have been adopted to encode the long-range dependencies between the joints in the spatial and temporal domains. While they excel at modeling long-range dependencies, studies have noted the need to improve the locality of vision Transformers. In this direction, we propose a novel pose estimation Transformer featuring rich representations of body joints critical for capturing subtle changes across frames (i.e., inter-feature representation). Specifically, through two novel interaction modules, Cross-Joint Interaction and Cross-Frame Interaction, the model explicitly encodes the local and global dependencies between the body joints. The proposed architecture achieves state-of-the-art performance on two popular 3D human pose estimation datasets, Human3.6M and MPI-INF-3DHP. In particular, our proposed CrossFormer method boosts performance by 0.9% and 0.3% over the closest counterpart, PoseFormer, under the detected 2D-pose and ground-truth 2D-pose settings, respectively.
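The abstract gives no implementation details, so the snippet below is only a hedged sketch of the two attention patterns it names: attention across joints within a frame (cross-joint) and attention across frames for each joint (cross-frame). The class name `CrossInteractionSketch` and all sizes (17 joints, 81 frames, 64-dim tokens) are illustrative assumptions, not the authors' code.

```python
# A minimal sketch (not the authors' released code) of cross-joint and
# cross-frame attention over per-joint feature tokens. All names and
# dimensions are assumptions for illustration.
import torch
import torch.nn as nn

class CrossInteractionSketch(nn.Module):
    def __init__(self, dim=64, heads=4):
        super().__init__()
        self.joint_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.frame_attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x):
        # x: (batch, frames, joints, dim) per-joint feature tokens
        b, f, j, d = x.shape
        # Cross-joint: every joint attends to all joints in the same frame.
        s = x.reshape(b * f, j, d)
        s, _ = self.joint_attn(s, s, s)
        # Cross-frame: every joint attends to itself across all frames.
        t = s.reshape(b, f, j, d).permute(0, 2, 1, 3).reshape(b * j, f, d)
        t, _ = self.frame_attn(t, t, t)
        return t.reshape(b, j, f, d).permute(0, 2, 1, 3)

x = torch.randn(2, 81, 17, 64)  # 2 clips, 81 frames (cf. T=81), 17 joints
print(CrossInteractionSketch()(x).shape)  # torch.Size([2, 81, 17, 64])
```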

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| 3D Human Pose Estimation | Human3.6M | CrossFormer (T=81 CPN GT) | Average MPJPE (mm) | 28.3 | #24 |
| | | | Using 2D ground-truth joints | Yes | #2 |
| | | | Multi-View or Monocular | Monocular | #1 |
| 3D Human Pose Estimation | Human3.6M | CrossFormer (T=81) | Average MPJPE (mm) | 43.7 | #93 |
| | | | Using 2D ground-truth joints | No | #2 |
| | | | Multi-View or Monocular | Monocular | #1 |
| 3D Human Pose Estimation | MPI-INF-3DHP | CrossFormer | AUC | 57.5 | #27 |
| | | | MPJPE | 76.3 | #31 |
| | | | PCK | 89.1 | #29 |
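MPJPE, the headline metric in the table above, is the mean per-joint position error: the average Euclidean distance, in millimetres, between predicted and ground-truth 3D joint locations, averaged over joints and frames. A minimal NumPy sketch (array shapes are assumptions):

```python
# Hedged sketch of the MPJPE metric: mean Euclidean distance (mm) between
# predicted and ground-truth 3D joints, averaged over joints and frames.
import numpy as np

def mpjpe(pred, gt):
    # pred, gt: (frames, joints, 3) arrays of 3D coordinates in millimetres
    return np.linalg.norm(pred - gt, axis=-1).mean()

pred = np.random.randn(81, 17, 3) * 10
gt = np.random.randn(81, 17, 3) * 10
print(f"MPJPE: {mpjpe(pred, gt):.1f} mm")
```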
