Pose Estimation

1366 papers with code • 28 benchmarks • 114 datasets

Pose Estimation is a computer vision task where the goal is to detect the position and orientation of a person or an object. Usually, this is done by predicting the location of specific keypoints like hands, head, elbows, etc. in case of Human Pose Estimation.

A common benchmark for this task is MPII Human Pose

( Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose )

Benchmarks

Add a Result

These leaderboards are used to track progress in Pose Estimation

Dataset	Best Model	Compare
MPII Human Pose	PCT (swin-l, test set)	See all
COCO test-dev	ViTPose (ViTAE-G, ensemble)	See all
Leeds Sports Poses	OmniPose	See all
OCHuman	ViTPose (ViTAE-G, GT bounding boxes)	See all
CrowdPose	BUCTD-W48 (w/cond. input from PETR, and generative sampling)	See all
MS COCO	OmniPose (WASPv2)	See all
AIC	Hulk(Finetune, ViT-L)	See all
ITOP front-view	AdaPose	See all
ITOP top-view	DECA-D3	See all
UPenn Action	OmniPose	See all
J-HMDB	SimpleBaseline + HANet	See all
MPII Single Person	4xRSN-50	See all
COCO val2017	MogaNet-B (384x288)	See all
300W (Full)	SPIGA	See all
DensePose-COCO	Parsing R-CNN + ResNext101	See all
FLIC Elbows	Stacked Hourglass Networks	See all
FLIC Wrists	Stacked Hourglass Networks	See all
UAV-Human	AlphaPose	See all
BRACE	HRNet fine-tuned on BRACE	See all
COCO minival	MSPN	See all
3DPW	HybridCap	See all
MPII	OmniPose (WASPv2)	See all
ApolloCar3D	GSNet	See all
Pix3D	Mid-Level based	See all
KITTI 2015	GeoNet	See all
MERL-RAV	SPIGA	See all
MS-COCO	UniHCP (finetune)	See all
COCO 2017 val	MogaNet-S (384x288)	See all

Show all 28 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Pose Estimation models and implementations

open-mmlab/mmpose

32 papers

5,099

PaddlePaddle/PaddleDetection

9 papers

12,173

DeepLabCut/DeepLabCut

6 papers

4,321

osmr/imgclsmob

6 papers

2,926

Datasets

Subtasks

6D Pose Estimation

Hand Pose Estimation

6D Pose Estimation using RGB

Multi-Person Pose Estimation

Head Pose Estimation

Human Pose Forecasting

Animal Pose Estimation

6D Pose Estimation using RGBD

Vehicle Pose Estimation

RF-based Pose Estimation

Car Pose Estimation

Hand Joint Reconstruction

Activeness Detection

Semi-supervised 2D and 3D landmark labeling

Most implemented papers

Most implemented Social Latest No code

PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data

NVlabs/PAMTRI • • ICCV 2019

In comparison with person re-identification (ReID), which has been widely studied in the research community, vehicle ReID has received less attention.

Paper
Code

DETRs with Hybrid Matching

HDETR/H-Deformable-DETR • • CVPR 2023

One-to-one set matching is a key design for DETR to establish its end-to-end capability, so that object detection does not require a hand-crafted NMS (non-maximum suppression) to remove duplicate detections.

Paper
Code

DeepPose: Human Pose Estimation via Deep Neural Networks

open-mmlab/mmpose • • CVPR 2014

We propose a method for human pose estimation based on Deep Neural Networks (DNNs).

Paper
Code

Rethinking on Multi-Stage Networks for Human Pose Estimation

megvii-detection/MSPN • • 1 Jan 2019

Existing pose estimation approaches fall into two categories: single-stage and multi-stage methods.

Paper
Code

Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation

hughw19/NOCS_CVPR2019 • • CVPR 2019

The goal of this paper is to estimate the 6D pose and dimensions of unseen object instances in an RGB-D image.

Paper
Code

Improvements to Target-Based 3D LiDAR to Camera Calibration

UMich-BipedLab/extrinsic_lidar_camera_calibration • 7 Oct 2019

The homogeneous transformation between a LiDAR and monocular camera is required for sensor fusion tasks, such as SLAM.

Paper
Code

BlazePose: On-device Real-time Body Pose tracking

google/mediapipe • • 17 Jun 2020

We present BlazePose, a lightweight convolutional neural network architecture for human pose estimation that is tailored for real-time inference on mobile devices.

Paper
Code

UniFormer: Unifying Convolution and Self-attention for Visual Recognition

sense-x/uniformer • • 24 Jan 2022

Different from the typical transformer blocks, the relation aggregators in our UniFormer block are equipped with local and global token affinity respectively in shallow and deep layers, allowing to tackle both redundancy and dependency for efficient and effective representation learning.

Paper
Code

AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time

MVIG-SJTU/AlphaPose • • 7 Nov 2022

Accurate whole-body multi-person pose estimation and tracking is an important yet challenging topic in computer vision.

Paper
Code

Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach

xingyizhou/pose-hg-3d • • ICCV 2017

We propose a weakly-supervised transfer learning method that uses mixed 2D and 3D labels in a unified deep neutral network that presents two-stage cascaded structure.

Paper
Code

Pose Estimation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result