WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation

Visual anomaly classification and segmentation are vital for automating industrial quality inspection. Prior research in the field has focused on training custom models for each quality inspection task, which requires task-specific images and annotation. In this paper, we move away from this regime and address zero-shot and few-normal-shot anomaly classification and segmentation. Recently, CLIP, a vision-language model, has shown revolutionary generality, with competitive zero-/few-shot performance compared to full supervision. However, CLIP falls short on anomaly classification and segmentation tasks. Hence, we propose window-based CLIP (WinCLIP) with (1) a compositional ensemble on state words and prompt templates and (2) efficient extraction and aggregation of window-, patch-, and image-level features aligned with text. We also propose its few-normal-shot extension, WinCLIP+, which uses complementary information from normal images. On MVTec-AD (and VisA), without further tuning, WinCLIP achieves 91.8%/85.1% (78.1%/79.6%) AUROC in zero-shot anomaly classification and segmentation, while WinCLIP+ achieves 93.1%/95.2% (83.8%/96.4%) in the 1-normal-shot setting, surpassing the state-of-the-art by large margins.
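As a rough illustration of the compositional prompt ensemble and zero-shot anomaly scoring described above, the sketch below pairs normal and anomalous state words with prompt templates, averages the resulting text embeddings per class, and scores an image by its similarity to the two class embeddings. It uses OpenAI's `clip` package for brevity; the template/state lists, the object name, and the backbone are illustrative placeholders rather than the paper's exact configuration, and the window/patch-level aggregation that WinCLIP uses for segmentation is omitted here.

```python
import torch
import clip  # OpenAI CLIP package, used here only as a stand-in backbone
from PIL import Image

# Illustrative (not the paper's full) compositional prompt ensemble:
# every combination of a state phrase and a prompt template is used.
TEMPLATES = ["a photo of a {}", "a cropped photo of the {}", "a close-up photo of a {}"]
NORMAL_STATES = ["{}", "flawless {}", "perfect {}"]
ANOMALY_STATES = ["damaged {}", "{} with a defect", "{} with a flaw"]

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/16", device=device)

def class_text_embedding(states, templates, obj):
    """Encode all state/template combinations and average into one class embedding."""
    prompts = [t.format(s.format(obj)) for s in states for t in templates]
    tokens = clip.tokenize(prompts).to(device)
    with torch.no_grad():
        feats = model.encode_text(tokens)
    feats = feats / feats.norm(dim=-1, keepdim=True)
    emb = feats.mean(dim=0)
    return emb / emb.norm()

def zero_shot_anomaly_score(image_path, obj="bottle"):
    """Image-level anomaly score: softmax over similarity to normal vs. anomalous text."""
    image = preprocess(Image.open(image_path)).unsqueeze(0).to(device)
    with torch.no_grad():
        img_feat = model.encode_image(image)
    img_feat = img_feat / img_feat.norm(dim=-1, keepdim=True)
    t_normal = class_text_embedding(NORMAL_STATES, TEMPLATES, obj)
    t_anomaly = class_text_embedding(ANOMALY_STATES, TEMPLATES, obj)
    logits = 100.0 * img_feat @ torch.stack([t_normal, t_anomaly]).T
    return logits.softmax(dim=-1)[0, 1].item()  # probability of the "anomalous" prompt class
```

A few-normal-shot extension in the spirit of WinCLIP+ would additionally compare the query image's features against a small memory of features extracted from the given normal images and combine that distance with the language-guided score above.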

CVPR 2023

Datasets

MVTec-AD, VisA
Results from the Paper


Task               Dataset  Model              Metric Name                         Metric Value  Global Rank
Anomaly Detection  VisA     WinCLIP+ (4-shot)  Detection AUROC                     87.3          #16
Anomaly Detection  VisA     WinCLIP+ (4-shot)  Segmentation AUPRO (until 30% FPR)  87.6          #9
Anomaly Detection  VisA     WinCLIP+ (2-shot)  Detection AUROC                     84.6          #17
Anomaly Detection  VisA     WinCLIP+ (2-shot)  Segmentation AUPRO (until 30% FPR)  86.2          #10
Anomaly Detection  VisA     WinCLIP+ (1-shot)  Detection AUROC                     83.8          #18
Anomaly Detection  VisA     WinCLIP+ (1-shot)  Segmentation AUPRO (until 30% FPR)  85.1          #12
Anomaly Detection  VisA     WinCLIP (0-shot)   Detection AUROC                     78.1          #22
Anomaly Detection  VisA     WinCLIP (0-shot)   Segmentation AUPRO (until 30% FPR)  56.8          #23

Methods

CLIP