HDFormer: High-order Directed Transformer for 3D Human Pose Estimation

Human pose estimation is a challenging task due to the structured, sequential nature of pose data. Existing methods primarily focus on pair-wise interactions of body joints, which is insufficient for scenarios involving overlapping joints and rapidly changing poses. To overcome these issues, we introduce a novel approach, the High-order Directed Transformer (HDFormer), which leverages high-order bone and joint relationships for improved pose estimation. Specifically, HDFormer incorporates both self-attention and high-order attention to formulate a multi-order attention module. This module facilitates first-order "joint$\leftrightarrow$joint", second-order "bone$\leftrightarrow$joint", and high-order "hyperbone$\leftrightarrow$joint" interactions, effectively addressing issues in complex and occlusion-heavy situations. In addition, modern CNN techniques are integrated into the transformer-based architecture, balancing the trade-off between performance and efficiency. HDFormer significantly outperforms state-of-the-art (SOTA) models on the Human3.6M and MPI-INF-3DHP datasets, requiring only 1/10 of the parameters and significantly lower computational costs. Moreover, HDFormer demonstrates broad real-world applicability, enabling real-time, accurate 3D pose estimation. The source code is available at https://github.com/hyer/HDFormer
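
To make the multi-order attention idea concrete, here is a minimal PyTorch sketch of three parallel attention streams whose outputs are fused. This is not the authors' implementation: the module structure, the linear fusion, and the way bone/hyperbone features are constructed are all illustrative assumptions.

```python
import torch
import torch.nn as nn


class MultiOrderAttention(nn.Module):
    """Toy multi-order attention: joint features attend to joint, bone,
    and hyperbone features in parallel streams, then fuse the results."""

    def __init__(self, dim: int, num_heads: int = 4):
        super().__init__()
        self.joint_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.bone_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.hyper_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.fuse = nn.Linear(3 * dim, dim)

    def forward(self, joints, bones, hyperbones):
        # joints: (B, J, C), bones: (B, E, C), hyperbones: (B, H, C)
        j, _ = self.joint_attn(joints, joints, joints)          # joint <-> joint
        b, _ = self.bone_attn(joints, bones, bones)             # bone -> joint
        h, _ = self.hyper_attn(joints, hyperbones, hyperbones)  # hyperbone -> joint
        return self.fuse(torch.cat([j, b, h], dim=-1))


# Bone features as differences of connected joint features; a hyperbone
# pools a chain of bones. Both constructions are assumptions for this demo.
B, J, C = 2, 17, 64
joints = torch.randn(B, J, C)
parent = torch.zeros(J, dtype=torch.long)   # dummy parent indices
bones = joints - joints[:, parent]          # (B, J, C) "bone" features
hyperbones = bones.mean(dim=1, keepdim=True).expand(-1, 4, -1)  # 4 dummy hyperbones
out = MultiOrderAttention(C)(joints, bones, hyperbones)
print(out.shape)  # torch.Size([2, 17, 64])
```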

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| 3D Human Pose Estimation | Human3.6M | HDFormer (HR-Net, T=96) | Average MPJPE (mm) | 40.3 | #74 |
| 3D Human Pose Estimation | Human3.6M | HDFormer (CPN, T=96) | Average MPJPE (mm) | 42.6 | #82 |
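
The reported metric, Average MPJPE, is the mean Euclidean distance between predicted and ground-truth joint positions (evaluation protocols differ in how poses are aligned first; this sketch omits alignment and is not the paper's evaluation code):

```python
import numpy as np


def mpjpe(pred: np.ndarray, gt: np.ndarray) -> float:
    """Mean Per-Joint Position Error: average Euclidean distance (in mm)
    between predicted and ground-truth joints. Shapes: (N, J, 3)."""
    return float(np.linalg.norm(pred - gt, axis=-1).mean())


# Example: 8 frames of a 17-joint skeleton with a small perturbation.
gt = np.random.randn(8, 17, 3) * 100.0
pred = gt + np.random.randn(8, 17, 3)
print(f"MPJPE: {mpjpe(pred, gt):.1f} mm")
```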
