The Pose Knows: Video Forecasting by Generating Pose Futures

Current approaches in video forecasting attempt to generate videos directly in pixel space using Generative Adversarial Networks (GANs) or Variational Autoencoders (VAEs). However, since these approaches try to model all the structure and scene dynamics at once, in unconstrained settings they often generate uninterpretable results. Our insight is to model the forecasting problem at a higher level of abstraction. Specifically, we exploit human pose detectors as a free source of supervision and break the video forecasting problem into two discrete steps. First, we explicitly model the high-level structure of active objects in the scene---humans---and use a VAE to model the possible future movements of humans in pose space. We then use the generated future poses as conditional input to a GAN that predicts the future frames of the video in pixel space. By using the structured space of pose as an intermediate representation, we sidestep the problems that GANs have in generating video pixels directly. We show through quantitative and qualitative evaluation that our method outperforms state-of-the-art methods for video prediction.
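The two-stage pipeline described above can be sketched in code. This is a minimal illustration of the data flow only, not the paper's implementation: the dimensions, the random linear "decoder" standing in for the learned pose VAE, and the keypoint-stamping "generator" standing in for the pose-conditioned GAN are all hypothetical placeholders.

```python
import numpy as np

# Hypothetical toy dimensions, chosen only for illustration.
N_JOINTS = 18        # assumed number of 2D pose keypoints
Z_DIM = 8            # assumed latent size of the pose VAE
H, W = 16, 16        # toy frame resolution

rng = np.random.default_rng(0)

def pose_vae_sample_future(past_poses, n_steps, z=None):
    """Stage 1 (sketch): sample a future pose sequence from a latent z.
    A real model would decode z with learned weights conditioned on the
    past poses; here a fixed random linear map shows the data flow."""
    if z is None:
        z = rng.standard_normal(Z_DIM)          # sampling z yields diverse futures
    W_dec = rng.standard_normal((Z_DIM, N_JOINTS * 2)) * 0.01
    last = past_poses[-1]                       # extrapolate from the last seen pose
    future = [last + (z @ W_dec).reshape(N_JOINTS, 2) * (t + 1)
              for t in range(n_steps)]
    return np.stack(future)                     # shape (n_steps, N_JOINTS, 2)

def conditional_frame_generator(past_frame, future_pose):
    """Stage 2 (sketch): render a future frame conditioned on a pose.
    Stands in for the pose-conditioned GAN generator."""
    frame = past_frame.copy()
    for x, y in future_pose:                    # stamp each keypoint onto the frame
        xi = int(np.clip(x, 0, W - 1))
        yi = int(np.clip(y, 0, H - 1))
        frame[yi, xi] = 1.0
    return frame

# End-to-end flow: past poses -> sampled pose future -> predicted frames.
past_poses = rng.uniform(0, W, size=(4, N_JOINTS, 2))
past_frame = np.zeros((H, W))
future_poses = pose_vae_sample_future(past_poses, n_steps=3)
future_frames = [conditional_frame_generator(past_frame, p) for p in future_poses]
print(len(future_frames), future_frames[0].shape)
```

The key design point the sketch preserves is that all stochasticity lives in the pose-space latent z: drawing different z values produces different pose futures, while the frame generator is a deterministic function of the pose.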

ICCV 2017

Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Human Pose Forecasting | AMASS | ThePoseKnows | ADE | 0.656 | # 4 |
| | | | FDE | 0.675 | # 4 |
| | | | APD | 9.283 | # 4 |
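The metrics in these tables follow the usual conventions of the human pose forecasting literature: ADE (Average Displacement Error) is the mean L2 joint error over all predicted frames, FDE (Final Displacement Error) is the error at the last frame, and APD (Average Pairwise Distance) measures diversity across multiple sampled futures. A minimal sketch of these definitions, assuming predictions and ground truth are arrays of shape (frames, joints, coords); note that leaderboards for stochastic methods often report the best-of-K variant over many samples rather than a single prediction:

```python
import numpy as np

def ade(pred, gt):
    """Average Displacement Error: mean L2 joint error over all frames."""
    return float(np.linalg.norm(pred - gt, axis=-1).mean())

def fde(pred, gt):
    """Final Displacement Error: mean L2 joint error at the last frame."""
    return float(np.linalg.norm(pred[-1] - gt[-1], axis=-1).mean())

def apd(samples):
    """Average Pairwise Distance: mean L2 distance between every pair of
    K sampled futures; higher means more diverse predictions."""
    K = len(samples)
    flat = samples.reshape(K, -1)
    dists = [np.linalg.norm(flat[i] - flat[j])
             for i in range(K) for j in range(i + 1, K)]
    return float(np.mean(dists))
```

For example, a prediction that is uniformly one unit off in both coordinates of every joint gives an ADE and FDE of sqrt(2).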

Results from Other Papers


| Task | Dataset | Model | Metric | Value | Rank |
|---|---|---|---|---|---|
| Human Pose Forecasting | Human3.6M | Pose-Knows | APD | 6723 | # 9 |
| | | | ADE | 461 | # 8 |
| | | | FDE | 560 | # 8 |
| | | | MMADE | 522 | # 7 |
| | | | MMFDE | 569 | # 8 |
| | | | CMD | 6.326 | # 2 |
| | | | FID | 0.538 | # 2 |
| Human Pose Forecasting | HumanEva-I | Pose-Knows | APD@2000ms | 2308 | # 8 |
| | | | ADE@2000ms | 269 | # 5 |
| | | | FDE@2000ms | 296 | # 7 |
