Pose Estimation
1334 papers with code • 28 benchmarks • 113 datasets
Pose Estimation is a computer vision task whose goal is to detect the position and orientation of a person or an object. In Human Pose Estimation, this is usually done by predicting the locations of specific keypoints such as the head, elbows, and hands.
A common benchmark for this task is MPII Human Pose
(Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose)
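Keypoint-based pose estimators commonly output one heatmap per keypoint and read off each joint's location as the heatmap's peak. The sketch below is a minimal, illustrative decoder for this standard scheme (the function name, array shapes, and threshold are assumptions for the example, not any particular library's API):

```python
import numpy as np

def decode_keypoints(heatmaps, threshold=0.3):
    """Decode per-keypoint heatmaps of shape (K, H, W) into (x, y, score) tuples.

    Each keypoint location is taken as the argmax of its heatmap; peaks
    scoring below `threshold` are marked as not detected with (-1, -1, 0.0).
    """
    keypoints = []
    for hm in heatmaps:
        # Convert the flat argmax index back into (row, col) coordinates.
        y, x = np.unravel_index(np.argmax(hm), hm.shape)
        score = float(hm[y, x])
        if score >= threshold:
            keypoints.append((int(x), int(y), score))
        else:
            keypoints.append((-1, -1, 0.0))
    return keypoints

# Toy example: 2 keypoints on a 4x4 grid.
hm = np.zeros((2, 4, 4))
hm[0, 1, 2] = 0.9   # keypoint 0 peaks at (x=2, y=1), above threshold
hm[1, 3, 0] = 0.1   # keypoint 1's peak is below threshold
print(decode_keypoints(hm))  # [(2, 1, 0.9), (-1, -1, 0.0)]
```

Real systems refine the argmax with sub-pixel offsets and, for multi-person settings, group keypoints into individual skeletons, but the heatmap-to-coordinate step above is the common core.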
Libraries
Use these libraries to find Pose Estimation models and implementations.
Subtasks
- 3D Human Pose Estimation
- Keypoint Detection
- 3D Pose Estimation
- 6D Pose Estimation
- Hand Pose Estimation
- 6D Pose Estimation using RGB
- Multi-Person Pose Estimation
- Head Pose Estimation
- Human Pose Forecasting
- 6D Pose Estimation using RGBD
- Animal Pose Estimation
- Vehicle Pose Estimation
- RF-based Pose Estimation
- Car Pose Estimation
- Hand Joint Reconstruction
- Activeness Detection
- Semi-Supervised 2D and 3D Landmark Labeling
Latest papers with no code
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
This paper proposes a GeneraLIst encoder-Decoder (GLID) pre-training method for better handling various downstream computer vision tasks.
Measuring proximity to standard planes during fetal brain ultrasound scanning
This paper introduces a novel pipeline designed to bring ultrasound (US) plane pose estimation closer to clinical use for more effective navigation to the standard planes (SPs) in the fetal brain.
Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes
For pose estimation, a feature-metric bundle adjustment (FBA) method is designed for accurate and robust camera tracking in large-scale scenes.
Learning 3D-Aware GANs from Unposed Images with Template Feature Field
Collecting accurate camera poses of training images has been shown to well serve the learning of 3D-aware generative adversarial networks (GANs) yet can be quite expensive in practice.
Learning a Category-level Object Pose Estimator without Pose Annotations
Instead of using manually annotated images, we leverage diffusion models (e.g., Zero-1-to-3) to generate a set of images under controlled pose differences and propose to learn our object pose estimator with those images.
Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks
This work addresses the intersection of hands by exploiting an occupancy network that represents the hand's volume as a continuous manifold.
Multi Positive Contrastive Learning with Pose-Consistent Generated Images
Model pre-training has become essential in various recognition tasks.
3D Congealing: 3D-Aware Image Alignment in the Wild
The framework optimizes for the canonical representation together with the pose for each input image, and a per-image coordinate map that warps 2D pixel coordinates to the 3D canonical frame to account for the shape matching.
Marrying NeRF with Feature Matching for One-step Pose Estimation
Given the image collection of an object, we aim at building a real-time image-based pose estimation method, which requires neither its CAD model nor hours of object-specific training.
OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos
Omnidirectional cameras are extensively used in various applications to provide a wide field of vision.