Pose Estimation

1331 papers with code • 28 benchmarks • 113 datasets

Pose Estimation is a computer vision task where the goal is to detect the position and orientation of a person or an object. Usually, this is done by predicting the location of specific keypoints like hands, head, elbows, etc. in case of Human Pose Estimation.

A common benchmark for this task is MPII Human Pose

( Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose )

Benchmarks

Add a Result

These leaderboards are used to track progress in Pose Estimation

Dataset	Best Model	Compare
MPII Human Pose	PCT (swin-l, test set)	See all
COCO test-dev	ViTPose (ViTAE-G, ensemble)	See all
Leeds Sports Poses	OmniPose	See all
OCHuman	ViTPose (ViTAE-G, GT bounding boxes)	See all
CrowdPose	BUCTD-W48 (w/cond. input from PETR, and generative sampling)	See all
MS COCO	OmniPose (WASPv2)	See all
AIC	Hulk(Finetune, ViT-L)	See all
ITOP front-view	AdaPose	See all
ITOP top-view	DECA-D3	See all
UPenn Action	OmniPose	See all
J-HMDB	SimpleBaseline + HANet	See all
MPII Single Person	4xRSN-50	See all
COCO val2017	MogaNet-B (384x288)	See all
300W (Full)	SPIGA	See all
DensePose-COCO	Parsing R-CNN + ResNext101	See all
FLIC Elbows	Stacked Hourglass Networks	See all
FLIC Wrists	Stacked Hourglass Networks	See all
UAV-Human	AlphaPose	See all
BRACE	HRNet fine-tuned on BRACE	See all
COCO minival	MSPN	See all
3DPW	HybridCap	See all
MPII	OmniPose (WASPv2)	See all
ApolloCar3D	GSNet	See all
Pix3D	Mid-Level based	See all
KITTI 2015	GeoNet	See all
MERL-RAV	SPIGA	See all
MS-COCO	UniHCP (finetune)	See all
COCO 2017 val	MogaNet-S (384x288)	See all

Show all 28 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Pose Estimation models and implementations

open-mmlab/mmpose

32 papers

4,966

PaddlePaddle/PaddleDetection

9 papers

12,029

DeepLabCut/DeepLabCut

6 papers

4,274

osmr/imgclsmob

6 papers

2,917

Datasets

Subtasks

6D Pose Estimation

Hand Pose Estimation

6D Pose Estimation using RGB

Multi-Person Pose Estimation

Head Pose Estimation

Human Pose Forecasting

6D Pose Estimation using RGBD

Animal Pose Estimation

Vehicle Pose Estimation

RF-based Pose Estimation

Car Pose Estimation

Hand Joint Reconstruction

Activeness Detection

Semi-supervised 2D and 3D landmark labeling

Most implemented papers

Most implemented Social Latest No code

DensePose: Dense Human Pose Estimation In The Wild

facebookresearch/detectron2 • • CVPR 2018

In this work, we establish dense correspondences between RGB image and a surface-based representation of the human body, a task we refer to as dense human pose estimation.

Paper
Code

HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation

HRNet/Higher-HRNet-Human-Pose-Estimation • • CVPR 2020

HigherHRNet even surpasses all top-down methods on CrowdPose test (67. 6% AP), suggesting its robustness in crowded scene.

Paper
Code

SuperGlue: Learning Feature Matching with Graph Neural Networks

magicleap/SuperGluePretrainedNetwork • • CVPR 2020

This paper introduces SuperGlue, a neural network that matches two sets of local features by jointly finding correspondences and rejecting non-matchable points.

Paper
Code

Visual Attention Network

Visual-Attention-Network/VAN-Classification • • 20 Feb 2022

In this paper, we propose a novel linear attention named large kernel attention (LKA) to enable self-adaptive and long-range correlations in self-attention while avoiding its shortcomings.

Paper
Code

DeeperCut: A Deeper, Stronger, and Faster Multi-Person Pose Estimation Model

DeepLabCut/DeepLabCut • • 10 May 2016

The goal of this paper is to advance the state-of-the-art of articulated pose estimation in scenes with multiple people.

Paper
Code

Lite-HRNet: A Lightweight High-Resolution Network

HRNet/Lite-HRNet • • CVPR 2021

We introduce a lightweight unit, conditional channel weighting, to replace costly pointwise (1x1) convolutions in shuffle blocks.

Paper
Code

RMPE: Regional Multi-person Pose Estimation

MVIG-SJTU/AlphaPose • • ICCV 2017

In this paper, we propose a novel regional multi-person pose estimation (RMPE) framework to facilitate pose estimation in the presence of inaccurate human bounding boxes.

Paper
Code

ArtTrack: Articulated Multi-person Tracking in the Wild

eldar/pose-tensorflow • • CVPR 2017

In this paper we propose an approach for articulated tracking of multiple people in unconstrained videos.

Paper
Code

A simple yet effective baseline for 3d human pose estimation

una-dinosauria/3d-pose-baseline • • ICCV 2017

Following the success of deep convolutional networks, state-of-the-art methods for 3d human pose estimation have focused on deep end-to-end systems that predict 3d joint locations given raw image pixels.

Paper
Code

Fine-Grained Head Pose Estimation Without Keypoints

natanielruiz/deep-head-pose • • 2 Oct 2017

Estimating the head pose of a person is a crucial problem that has a large amount of applications such as aiding in gaze estimation, modeling attention, fitting 3D models to video and performing face alignment.

Paper
Code

Pose Estimation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result