TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Unsupervised Human Pose Estimation	DeepFashion	StableKeypoints	PCK	70	# 1
Unsupervised Human Pose Estimation	Human3.6M	StableKeypoints	NME	4.45	# 5
Unsupervised Human Pose Estimation	Tai-Chi-HD	StableKeypoints	MAE	234.89	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unsupervised-keypoints-from-pretrained/unsupervised-human-pose-estimation-on)](https://paperswithcode.com/sota/unsupervised-human-pose-estimation-on?p=unsupervised-keypoints-from-pretrained)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unsupervised-keypoints-from-pretrained/unsupervised-human-pose-estimation-on-tai-chi)](https://paperswithcode.com/sota/unsupervised-human-pose-estimation-on-tai-chi?p=unsupervised-keypoints-from-pretrained)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/unsupervised-keypoints-from-pretrained/unsupervised-human-pose-estimation-on-human3)](https://paperswithcode.com/sota/unsupervised-human-pose-estimation-on-human3?p=unsupervised-keypoints-from-pretrained)`

Unsupervised Keypoints from Pretrained Diffusion Models

29 Nov 2023 · Eric Hedlin, Gopal Sharma, Shweta Mahajan, Xingzhe He, Hossam Isack, Abhishek Kar Helge Rhodin, Andrea Tagliasacchi, Kwang Moo Yi ·

Unsupervised learning of keypoints and landmarks has seen significant progress with the help of modern neural network architectures, but performance is yet to match the supervised counterpart, making their practicability questionable. We leverage the emergent knowledge within text-to-image diffusion models, towards more robust unsupervised keypoints. Our core idea is to find text embeddings that would cause the generative model to consistently attend to compact regions in images (i.e. keypoints). To do so, we simply optimize the text embedding such that the cross-attention maps within the denoising network are localized as Gaussians with small standard deviations. We validate our performance on multiple datasets: the CelebA, CUB-200-2011, Tai-Chi-HD, DeepFashion, and Human3.6m datasets. We achieve significantly improved accuracy, sometimes even outperforming supervised ones, particularly for data that is non-aligned and less curated. Our code is publicly available and can be found through our project page: https://ubc-vision.github.io/StableKeypoints/

PDF Abstract

Code

Add Remove Mark official

ubc-vision/StableKeypoints official

↳ Quickstart in

Colab

Tasks

Add Remove

Denoising

Unsupervised Human Pose Estimation

Unsupervised Keypoints

Datasets

CelebA

CUB-200-2011

Human3.6M

DeepFashion

Tai-Chi-HD

Results from the Paper

Edit

Ranked #1 on Unsupervised Human Pose Estimation on Tai-Chi-HD

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Unsupervised Human Pose Estimation	DeepFashion	StableKeypoints	PCK	70	# 1	Compare
Unsupervised Human Pose Estimation	Human3.6M	StableKeypoints	NME	4.45	# 5	Compare
Unsupervised Human Pose Estimation	Tai-Chi-HD	StableKeypoints	MAE	234.89	# 1	Compare

Methods

Add Remove

Diffusion

Edit Social Preview

Unsupervised Keypoints from Pretrained Diffusion Models

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove