Keypoint Detection

150 papers with code • 7 benchmarks • 11 datasets

Keypoint Detection involves simultaneously detecting people and localizing their keypoints. Keypoints are the same thing as interest points. They are spatial locations, or points in the image that define what is interesting or what stand out in the image. They are invariant to image rotation, shrinkage, translation, distortion, and so on.

( Image credit: PifPaf: Composite Fields for Human Pose Estimation; "Learning to surf" by fotologic, license: CC-BY-2.0 )

Benchmarks

Add a Result

These leaderboards are used to track progress in Keypoint Detection

Dataset	Best Model	Compare
MS COCO	4xRSN-50(384×288)	See all
COCO test-dev	HRNet*	See all
MPII Multi-Person	AlphaPose	See all
OCHuman	MIPNet (HRNet-W48)	See all
COCO test-challenge	Simple Base+*	See all
Pascal3D+	ConvNet + deformable shape model	See all
ApolloCar3D	GSNet	See all

Libraries

Use these libraries to find Keypoint Detection models and implementations

open-mmlab/mmpose

12 papers

5,009

osmr/imgclsmob

6 papers

2,917

PaddlePaddle/PaddleDetection

5 papers

12,074

CMU-Perceptual-Computing-Lab/openpo…

3 papers

29,885

See all 10 libraries.

Datasets

Most implemented papers

Most implemented Social Latest No code

Rotate to Attend: Convolutional Triplet Attention Module

LandskapeAI/triplet-attention • • 6 Oct 2020

In this paper, we investigate light-weight but effective attention mechanisms and present triplet attention, a novel method for computing attention weights by capturing cross-dimension interaction using a three-branch structure.

Paper
Code

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation

vitae-transformer/vitpose • • 26 Apr 2022

In this paper, we show the surprisingly good capabilities of plain vision transformers for pose estimation from various aspects, namely simplicity in model structure, scalability in model size, flexibility in training paradigm, and transferability of knowledge between models, through a simple baseline model called ViTPose.

Paper
Code

Data Distillation: Towards Omni-Supervised Learning

facebookresearch/detectron • • CVPR 2018

We investigate omni-supervised learning, a special regime of semi-supervised learning in which the learner exploits all available labeled data plus internet-scale sources of unlabeled data.

Paper
Code

MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network

salihkaragoz/pose-residual-network-pytorch • • ECCV 2018

In this paper, we present MultiPoseNet, a novel bottom-up multi-person pose estimation architecture that combines a multi-task model with a novel assignment method.

Paper
Code

Slimmable Neural Networks

JiahuiYu/slimmable_networks • • ICLR 2019

Instead of training individual networks with different width configurations, we train a shared network with switchable batch normalization.

Paper
Code

Learning Delicate Local Representations for Multi-Person Pose Estimation

caiyuanhao1998/RSN • • ECCV 2020

To tackle this problem, we propose an efficient attention mechanism - Pose Refine Machine (PRM) to make a trade-off between local and global representations in output features and further refine the keypoint locations.

Paper
Code

Deep Alignment Network: A convolutional neural network for robust face alignment

MarekKowalski/DeepAlignmentNetwork • • 6 Jun 2017

Our method uses entire face images at all stages, contrary to the recently proposed face alignment methods that rely on local patches.

Paper
Code

AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding

AIChallenger/AI_Challenger • 17 Nov 2017

Significant progress has been achieved in Computer Vision by leveraging large-scale image datasets.

Paper
Code

PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model

google-coral/project-posenet • • ECCV 2018

We present a box-free bottom-up approach for the tasks of pose estimation and instance segmentation of people in multi-person images using an efficient single-shot model.

Paper
Code

CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark

open-mmlab/mmpose • • CVPR 2019

In this paper, we propose a novel and efficient method to tackle the problem of pose estimation in the crowd and a new dataset to better evaluate algorithms.

Paper
Code

Keypoint Detection

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result