Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models

We propose a conceptually simple and lightweight framework for improving the robustness of vision models through the combination of knowledge distillation and data augmentation. We address the conjecture that larger models do not make for better teachers by showing strong gains in out-of-distribution robustness when distilling from pretrained foundation models. Following this finding, we propose Discrete Adversarial Distillation (DAD), which leverages a robust teacher to generate adversarial examples and a VQGAN to discretize them, creating more informative samples than standard data augmentation techniques. We provide a theoretical framework for the use of a robust teacher in the setting of knowledge distillation with data augmentation, and demonstrate strong gains in out-of-distribution robustness and clean accuracy across different student architectures. Notably, our method adds minor computational overhead compared to similar techniques and can be easily combined with other data augmentations for further improvements.
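As a rough illustration of the pipeline described in the abstract, the sketch below shows what one DAD-style training step could look like in PyTorch. It is not the authors' implementation: the helper name `dad_training_step`, the PGD attack settings (`eps`, `step_size`, `pgd_steps`), the loss weighting and distillation temperature, and the assumption that `vqgan` is a frozen module whose forward pass returns the discretized (codebook-quantized) reconstruction of its input are all illustrative choices.

```python
import torch
import torch.nn.functional as F

def dad_training_step(student, teacher, vqgan, images, labels,
                      eps=8/255, step_size=2/255, pgd_steps=3,
                      alpha=0.5, temperature=2.0):
    """One hypothetical DAD-style training step (a sketch, not the authors' code).

    1. Craft adversarial examples that maximize the robust teacher's loss.
    2. Discretize them with a frozen VQGAN encode/quantize/decode cycle.
    3. Distill the teacher's predictions on the discretized adversarial
       images into the student, alongside the usual cross-entropy loss.
    """
    # --- 1. adversarial example generation against the robust teacher (PGD) ---
    adv = images.clone().detach()
    for _ in range(pgd_steps):
        adv.requires_grad_(True)
        loss = F.cross_entropy(teacher(adv), labels)
        (grad,) = torch.autograd.grad(loss, adv)
        adv = adv.detach() + step_size * grad.sign()
        adv = images + (adv - images).clamp(-eps, eps)   # project into eps-ball
        adv = adv.clamp(0.0, 1.0)                        # keep valid pixel range

    # --- 2. discretization via the VQGAN (assumed interface, see lead-in) ---
    with torch.no_grad():
        discrete_adv = vqgan(adv)          # discretized reconstruction of adv
        teacher_logits = teacher(discrete_adv)

    # --- 3. distillation on discretized adversarial samples + clean CE ---
    student_logits = student(discrete_adv)
    kd_loss = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=1),
        F.softmax(teacher_logits / temperature, dim=1),
        reduction="batchmean",
    ) * temperature ** 2
    ce_loss = F.cross_entropy(student(images), labels)
    return alpha * kd_loss + (1 - alpha) * ce_loss
```

In this sketch the attack is run against the teacher rather than the student, matching the abstract's description of using the robust teacher to generate the adversarial examples; how the clean and distillation terms are actually weighted is a detail the abstract does not specify.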

| Task | Dataset | Model | Metric | Value (%) | Global Rank |
| --- | --- | --- | --- | --- | --- |
| Image Classification | ImageNet | Discrete Adversarial Distillation (ViT-B, 224) | Top-1 Accuracy | 81.9 | #543 |
| Domain Generalization | ImageNet-A | Discrete Adversarial Distillation (ResNet-50) | Top-1 Accuracy | 7.7 | #33 |
| Domain Generalization | ImageNet-A | Discrete Adversarial Distillation (ViT-B, 224) | Top-1 Accuracy | 31.8 | #26 |
| Domain Generalization | ImageNet-R | Discrete Adversarial Distillation (ViT-B, 224) | Top-1 Error Rate | 34.9 | #14 |
| Domain Generalization | ImageNet-Sketch | Discrete Adversarial Distillation (ViT-B, 224) | Top-1 Accuracy | 46.1 | #13 |
| Image Classification | ImageNet V2 | Discrete Adversarial Distillation (ViT-B, 224) | Top-1 Accuracy | 71.7 | #22 |

Methods