TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Quality Assessment	MSU FR VQA Database	LPIPS	SRCC	0.7538	# 19
Video Quality Assessment	MSU FR VQA Database	LPIPS	PLCC	0.8128	# 10
Video Quality Assessment	MSU FR VQA Database	LPIPS	KLCC	0.5846	# 19
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (VGG)	SROCC	0.52868	# 32
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (VGG)	PLCC	0.52820	# 32
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (VGG)	KLCC	0.41471	# 33
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (VGG)	Type	FR	# 1
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (Alex)	SROCC	0.54461	# 28
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (Alex)	PLCC	0.52385	# 34
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (Alex)	KLCC	0.43158	# 28
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (Alex)	Type	FR	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-unreasonable-effectiveness-of-deep/video-quality-assessment-on-msu-video-quality-1)](https://paperswithcode.com/sota/video-quality-assessment-on-msu-video-quality-1?p=the-unreasonable-effectiveness-of-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/the-unreasonable-effectiveness-of-deep/video-quality-assessment-on-msu-sr-qa-dataset)](https://paperswithcode.com/sota/video-quality-assessment-on-msu-sr-qa-dataset?p=the-unreasonable-effectiveness-of-deep)`

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

CVPR 2018 · Richard Zhang, Phillip Isola, Alexei A. Efros, Eli Shechtman, Oliver Wang ·

While it is nearly effortless for humans to quickly assess the perceptual similarity between two images, the underlying processes are thought to be quite complex. Despite this, the most widely used perceptual metrics today, such as PSNR and SSIM, are simple, shallow functions, and fail to account for many nuances of human perception. Recently, the deep learning community has found that features of the VGG network trained on ImageNet classification has been remarkably useful as a training loss for image synthesis. But how perceptual are these so-called "perceptual losses"? What elements are critical for their success? To answer these questions, we introduce a new dataset of human perceptual similarity judgments. We systematically evaluate deep features across different architectures and tasks and compare them with classic metrics. We find that deep features outperform all previous metrics by large margins on our dataset. More surprisingly, this result is not restricted to ImageNet-trained VGG features, but holds across different deep architectures and levels of supervision (supervised, self-supervised, or even unsupervised). Our results suggest that perceptual similarity is an emergent property shared across deep visual representations.

PDF Abstract CVPR 2018 PDF CVPR 2018 Abstract

Code

Add Remove Mark official

richzhang/PerceptualSimilarity official

3,373

Puzer/stylegan-encoder

1,069

pbaylies/stylegan-encoder

738

ak9250/stylegan-art

376

woctezuma/stylegan2-projecting-imag…

↳ Quickstart in

Colab

288

See all 24 implementations

Tasks

Add Remove

Image Quality Assessment

SSIM

Video Quality Assessment

Datasets

Introduced in the Paper:

Perceptual Similarity

Used in the Paper:

CSIQ MSU SR-QA Dataset

MSU NR VQA Database

MSU FR VQA Database

Results from the Paper

Edit

Ranked #19 on Video Quality Assessment on MSU FR VQA Database

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Quality Assessment	MSU FR VQA Database	LPIPS	SRCC	0.7538	# 19	Compare
			PLCC	0.8128	# 10	Compare
			KLCC	0.5846	# 19	Compare
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (VGG)	SROCC	0.52868	# 32	Compare
			PLCC	0.52820	# 32	Compare
			KLCC	0.41471	# 33	Compare
			Type	FR	# 1	Compare
Video Quality Assessment	MSU SR-QA Dataset	LPIPS (Alex)	SROCC	0.54461	# 28	Compare
			PLCC	0.52385	# 34	Compare
			KLCC	0.43158	# 28	Compare
			Type	FR	# 1	Compare

Methods

Add Remove

Convolution • Dense Connections • Dropout • Max Pooling • ReLU • Softmax • VGG

Edit Social Preview

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove