TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Adversarial Robustness	CIFAR-10	Mixed classifier	Attack: AutoAttack	68.06	# 3
Adversarial Robustness	CIFAR-10	Mixed classifier	Accuracy	95.23	# 1
Adversarial Robustness	CIFAR-10	Mixed classifier	Robust Accuracy	68.06	# 3
Adversarial Robustness	CIFAR-100	Mixed Classifier	Clean Accuracy	85.21	# 1
Adversarial Robustness	CIFAR-100	Mixed Classifier	AutoAttacked Accuracy	38.72	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-the-accuracy-robustness-trade-off/adversarial-robustness-on-cifar-10)](https://paperswithcode.com/sota/adversarial-robustness-on-cifar-10?p=improving-the-accuracy-robustness-trade-off)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-the-accuracy-robustness-trade-off/adversarial-robustness-on-cifar-100)](https://paperswithcode.com/sota/adversarial-robustness-on-cifar-100?p=improving-the-accuracy-robustness-trade-off)`

Improving the Accuracy-Robustness Trade-Off of Classifiers via Adaptive Smoothing

29 Jan 2023 · Yatong Bai, Brendon G. Anderson, Aerin Kim, Somayeh Sojoudi ·

While prior research has proposed a plethora of methods that build neural classifiers robust against adversarial robustness, practitioners are still reluctant to adopt them due to their unacceptably severe clean accuracy penalties. This paper significantly alleviates this accuracy-robustness trade-off by mixing the output probabilities of a standard classifier and a robust classifier, where the standard network is optimized for clean accuracy and is not robust in general. We show that the robust base classifier's confidence difference for correct and incorrect examples is the key to this improvement. In addition to providing intuitions and empirical evidence, we theoretically certify the robustness of the mixed classifier under realistic assumptions. Furthermore, we adapt an adversarial input detector into a mixing network that adaptively adjusts the mixture of the two base models, further reducing the accuracy penalty of achieving robustness. The proposed flexible method, termed "adaptive smoothing", can work in conjunction with existing or even future methods that improve clean accuracy, robustness, or adversary detection. Our empirical evaluation considers strong attack methods, including AutoAttack and adaptive attack. On the CIFAR-100 dataset, our method achieves an 85.21% clean accuracy while maintaining a 38.72% $\ell_\infty$-AutoAttacked ($\epsilon = 8/255$) accuracy, becoming the second most robust method on the RobustBench CIFAR-100 benchmark as of submission, while improving the clean accuracy by ten percentage points compared with all listed models. The code that implements our method is available at https://github.com/Bai-YT/AdaptiveSmoothing.

PDF Abstract

Code

Add Remove Mark official

bai-yt/adaptivesmoothing official

Tasks

Add Remove

Adversarial Robustness

Datasets

CIFAR-10

CIFAR-100 RobustBench

Results from the Paper

Edit

Ranked #1 on Adversarial Robustness on CIFAR-100 (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Adversarial Robustness	CIFAR-10	Mixed classifier	Attack: AutoAttack	68.06	# 3	Compare
			Accuracy	95.23	# 1	Compare
			Robust Accuracy	68.06	# 3	Compare
Adversarial Robustness	CIFAR-100	Mixed Classifier	Clean Accuracy	85.21	# 1	Compare
Adversarial Robustness	CIFAR-100	Mixed Classifier	AutoAttacked Accuracy	38.72	# 1	Compare

Methods

Add Remove

BASE

Edit Social Preview

Improving the Accuracy-Robustness Trade-Off of Classifiers via Adaptive Smoothing

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove