Pose Estimation
1351 papers with code • 28 benchmarks • 114 datasets
Pose Estimation is a computer vision task whose goal is to detect the position and orientation of a person or an object. In Human Pose Estimation, this is usually done by predicting the locations of specific keypoints such as the head, elbows, and hands.
A common benchmark for this task is MPII Human Pose.
(Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose)
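Many pose estimators predict one heatmap per keypoint and then decode each heatmap's peak into image coordinates. The following is a minimal sketch of that decoding step (the function name `decode_heatmaps` and the toy data are illustrative, not taken from any particular model):

```python
import numpy as np

def decode_heatmaps(heatmaps):
    """Convert per-keypoint heatmaps of shape (K, H, W) into
    (x, y, score) triples by taking each heatmap's argmax.

    This is the simplest common decoding used by heatmap-based
    pose estimators; real systems often add sub-pixel refinement.
    """
    K, H, W = heatmaps.shape
    keypoints = []
    for k in range(K):
        flat_idx = np.argmax(heatmaps[k])
        y, x = divmod(int(flat_idx), W)       # row-major unravel
        keypoints.append((x, y, float(heatmaps[k, y, x])))
    return keypoints

# Toy example: a single 4x4 heatmap with its peak at (x=2, y=1)
hm = np.zeros((1, 4, 4))
hm[0, 1, 2] = 0.9
print(decode_heatmaps(hm))  # [(2, 1, 0.9)]
```

The per-keypoint score (the heatmap value at the peak) is typically kept alongside the coordinates so that low-confidence joints can be filtered downstream.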
Libraries
Use these libraries to find Pose Estimation models and implementations.
Subtasks
- 3D Human Pose Estimation
- Keypoint Detection
- 3D Pose Estimation
- 6D Pose Estimation
- Hand Pose Estimation
- 6D Pose Estimation using RGB
- Multi-Person Pose Estimation
- Head Pose Estimation
- Human Pose Forecasting
- Animal Pose Estimation
- 6D Pose Estimation using RGBD
- Vehicle Pose Estimation
- RF-based Pose Estimation
- Car Pose Estimation
- Hand Joint Reconstruction
- Activeness Detection
- Semi-supervised 2D and 3D landmark labeling
Latest papers
DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector
First, we find that DeDoDe keypoints tend to cluster together, which we fix by performing non-max suppression on the target distribution of the detector during training.
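Non-max suppression for keypoints, as referenced above, keeps only locally dominant detections so that points do not cluster. A minimal greedy sketch (this is a generic illustration, not the DeDoDe v2 training-time procedure, and `keypoint_nms` is a hypothetical helper):

```python
import numpy as np

def keypoint_nms(points, scores, radius=4.0):
    """Greedy NMS over 2D keypoints: repeatedly keep the
    highest-scoring remaining point and suppress every point
    within `radius` pixels of it. Returns kept indices."""
    order = np.argsort(scores)[::-1]          # high score first
    suppressed = np.zeros(len(points), dtype=bool)
    keep = []
    for i in order:
        if suppressed[i]:
            continue
        keep.append(int(i))
        dists = np.linalg.norm(points - points[i], axis=1)
        suppressed |= dists < radius          # drop close neighbors
    return keep

pts = np.array([[10.0, 10.0], [11.0, 10.0], [50.0, 50.0]])
sc = np.array([0.9, 0.8, 0.7])
print(keypoint_nms(pts, sc))  # [0, 2]
```

Here the second point is suppressed because it lies within the radius of the stronger first detection, which is exactly the clustering behavior NMS is meant to remove.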
EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams
In response to the existing limitations, this paper 1) introduces a new problem, i.e., 3D human motion capture from an egocentric monocular event camera with a fisheye lens, and 2) proposes the first approach to it, called EventEgo3D (EE3D).
DepthMOT: Depth Cues Lead to a Strong Multi-Object Tracker
Inspired by this, even though the bounding boxes of objects are close on the camera plane, we can differentiate them in the depth dimension, thereby establishing a 3D perception of the objects.
HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud
Extracting keypoint locations from input hand frames, known as 3D hand pose estimation, is a critical task in various human-computer interaction applications.
SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation
To mitigate the problem of under-fitting, we design a transformer module named Multi-Cycled Transformer (MCT), based on multiple cycled forward passes, to more fully exploit the potential of small model parameters.
Semi-Supervised Unconstrained Head Pose Estimation in the Wild
Existing head pose estimation datasets are either composed of numerous samples by non-realistic synthesis or lab collection, or limited images by labor-intensive annotating.
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation
Unlike current state-of-the-art fully-supervised methods, our approach does not require any 2d or 3d ground-truth poses and uses only the multi-view input images from a calibrated camera setup and 2d pseudo poses generated from an off-the-shelf 2d human pose estimator.
KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation
This paper presents a novel Kinematics and Trajectory Prior Knowledge-Enhanced Transformer (KTPFormer), which overcomes a weakness of existing transformer-based methods for 3D human pose estimation: the derivation of the Q, K, and V vectors in their self-attention mechanisms is based on simple linear mapping.
Video-Based Human Pose Regression via Decoupled Space-Time Aggregation
In light of this, we propose a novel Decoupled Space-Time Aggregation network (DSTA) to separately capture the spatial contexts between adjacent joints and the temporal cues of each individual joint, thereby avoiding the conflation of spatiotemporal dimensions.
Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation
The second design is a Geometric-Aware Feature Aggregation module, which can efficiently integrate local and global geometric information into keypoint features.