Pose Estimation

1351 papers with code • 28 benchmarks • 114 datasets

Pose Estimation is a computer vision task where the goal is to detect the position and orientation of a person or an object. Usually, this is done by predicting the location of specific keypoints like hands, head, elbows, etc. in case of Human Pose Estimation.

A common benchmark for this task is MPII Human Pose

( Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose )

Libraries

Use these libraries to find Pose Estimation models and implementations
32 papers
5,023
6 papers
2,918

DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector

parskatt/dedode 13 Apr 2024

First, we find that DeDoDe keypoints tend to cluster together, which we fix by performing non-max suppression on the target distribution of the detector during training.

306
13 Apr 2024

EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams

faceonlive/ai-research 12 Apr 2024

In response to the existing limitations, this paper 1) introduces a new problem, i. e., 3D human motion capture from an egocentric monocular event camera with a fisheye lens, and 2) proposes the first approach to it called EventEgo3D (EE3D).

181
12 Apr 2024

DepthMOT: Depth Cues Lead to a Strong Multi-Object Tracker

faceonlive/ai-research 8 Apr 2024

Inspired by this, even though the bounding boxes of objects are close on the camera plane, we can differentiate them in the depth dimension, thereby establishing a 3D perception of the objects.

181
08 Apr 2024

HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud

faceonlive/ai-research 4 Apr 2024

Extracting keypoint locations from input hand frames, known as 3D hand pose estimation, is a critical task in various human-computer interaction applications.

181
04 Apr 2024

SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation

martyrpenink/sdpose 4 Apr 2024

To mitigate the problem of under-fitting, we design a transformer module named Multi-Cycled Transformer(MCT) based on multiple-cycled forwards to more fully exploit the potential of small model parameters.

5
04 Apr 2024

Semi-Supervised Unconstrained Head Pose Estimation in the Wild

hnuzhy/semiuhpe 3 Apr 2024

Existing head pose estimation datasets are either composed of numerous samples by non-realistic synthesis or lab collection, or limited images by labor-intensive annotating.

2
03 Apr 2024

SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation

camma-public/selfpose3d 2 Apr 2024

Unlike current state-of-the-art fully-supervised methods, our approach does not require any 2d or 3d ground-truth poses and uses only the multi-view input images from a calibrated camera setup and 2d pseudo poses generated from an off-the-shelf 2d human pose estimator.

8
02 Apr 2024

KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation

JihuaPeng/KTPFormer 31 Mar 2024

This paper presents a novel Kinematics and Trajectory Prior Knowledge-Enhanced Transformer (KTPFormer), which overcomes the weakness in existing transformer-based methods for 3D human pose estimation that the derivation of Q, K, V vectors in their self-attention mechanisms are all based on simple linear mapping.

8
31 Mar 2024

Video-Based Human Pose Regression via Decoupled Space-Time Aggregation

zgspose/dsta 29 Mar 2024

In light of this, we propose a novel Decoupled Space-Time Aggregation network (DSTA) to separately capture the spatial contexts between adjacent joints and the temporal cues of each individual joint, thereby avoiding the conflation of spatiotemporal dimensions.

104
29 Mar 2024

Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation

leeiieeo/ag-pose 28 Mar 2024

(2) The second design is a Geometric-Aware Feature Aggregation module, which can efficiently integrate the local and global geometric information into keypoint features.

14
28 Mar 2024