TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Human Pose Estimation	3DPW	HeatER	PA-MPJPE	45.9	# 41
3D Human Pose Estimation	3DPW	HeatER	MPJPE	73.4	# 30
3D Human Pose Estimation	3DPW	HeatER	MPVPE	86.9	# 29
3D Human Pose Estimation	Human3.6M	HeatER	Average MPJPE (mm)	49.9	# 163
3D Human Pose Estimation	Human3.6M	HeatER	PA-MPJPE	32.8	# 15

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/heater-an-efficient-and-unified-network-for/3d-human-pose-estimation-on-3dpw)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-3dpw?p=heater-an-efficient-and-unified-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/heater-an-efficient-and-unified-network-for/3d-human-pose-estimation-on-human36m)](https://paperswithcode.com/sota/3d-human-pose-estimation-on-human36m?p=heater-an-efficient-and-unified-network-for)`

FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER

CVPR 2023 · Ce Zheng, Matias Mendieta, Taojiannan Yang, Guo-Jun Qi, Chen Chen ·

Recently, vision transformers have shown great success in a set of human reconstruction tasks such as 2D human pose estimation (2D HPE), 3D human pose estimation (3D HPE), and human mesh reconstruction (HMR) tasks. In these tasks, feature map representations of the human structural information are often extracted first from the image by a CNN (such as HRNet), and then further processed by transformer to predict the heatmaps (encodes each joint's location into a feature map with a Gaussian distribution) for HPE or HMR. However, existing transformer architectures are not able to process these feature map inputs directly, forcing an unnatural flattening of the location-sensitive human structural information. Furthermore, much of the performance benefit in recent HPE and HMR methods has come at the cost of ever-increasing computation and memory needs. Therefore, to simultaneously address these problems, we propose FeatER, a novel transformer design that preserves the inherent structure of feature map representations when modeling attention while reducing memory and computational costs. Taking advantage of FeatER, we build an efficient network for a set of human reconstruction tasks including 2D HPE, 3D HPE, and HMR. A feature map reconstruction module is applied to improve the performance of the estimated human pose and mesh. Extensive experiments demonstrate the effectiveness of FeatER on various human pose and mesh datasets. For instance, FeatER outperforms the SOTA method MeshGraphormer by requiring 5% of Params and 16% of MACs on Human3.6M and 3DPW datasets. The project webpage is https://zczcwh.github.io/feater_page/.

PDF Abstract CVPR 2023 PDF CVPR 2023 Abstract

Code

Add Remove Mark official

zczcwh/feater official

Tasks

Add Remove

2D Human Pose Estimation

3D Human Pose Estimation

Pose Estimation

Datasets

MS COCO

Human3.6M

3DPW

Results from the Paper

Edit

Ranked #29 on 3D Human Pose Estimation on 3DPW

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Human Pose Estimation	3DPW	HeatER	PA-MPJPE	45.9	# 41	Compare
			MPJPE	73.4	# 30	Compare
			MPVPE	86.9	# 29	Compare
3D Human Pose Estimation	Human3.6M	HeatER	Average MPJPE (mm)	49.9	# 163	Compare
3D Human Pose Estimation	Human3.6M	HeatER	PA-MPJPE	32.8	# 15	Compare

Methods

Add Remove

Heatmap

Edit Social Preview

FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove