MaxUp: A Simple Way to Improve Generalization of Neural Network Training

20 Feb 2020  ·  Chengyue Gong, Tongzheng Ren, Mao Ye, Qiang Liu

We propose \emph{MaxUp}, an embarrassingly simple yet highly effective technique for improving the generalization performance of machine learning models, especially deep neural networks. The idea is to generate a set of augmented copies of each training example using random perturbations or transforms, and to minimize the maximum, i.e., worst-case, loss over the augmented copies. Doing so implicitly introduces a smoothness or robustness regularization against the random perturbations, and hence improves generalization performance. For example, in the case of Gaussian perturbation, \emph{MaxUp} is asymptotically equivalent to using the gradient norm of the loss as a penalty that encourages smoothness. We test \emph{MaxUp} on a range of tasks, including image classification, language modeling, and adversarial certification, on which \emph{MaxUp} consistently outperforms the best existing baseline methods without introducing substantial computational overhead. In particular, we improve ImageNet classification from the state-of-the-art top-1 accuracy of $85.5\%$ without extra data to $85.8\%$. Code will be released soon.
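Concretely, the objective is $\min_\theta \mathbb{E}_{x \sim \mathcal{D}}\big[\max_{1 \le i \le m} L(\theta;\, x_i')\big]$, where $x_1', \dots, x_m'$ are randomly augmented copies of $x$. Below is a minimal PyTorch sketch of one MaxUp training step, not the authors' released code; the helper names (`maxup_step`, `augment`) and the use of cross-entropy are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def maxup_step(model, x, y, augment, m, optimizer):
    """One MaxUp update: draw m random augmentations per example and
    back-propagate only the worst-case (maximum) loss.

    `augment` is any stochastic transform mapping a batch to an
    equally-shaped augmented batch (hypothetical helper)."""
    batch = x.size(0)
    # Stack m independently augmented copies of the batch: (m * batch, ...)
    x_aug = torch.cat([augment(x) for _ in range(m)], dim=0)
    y_rep = y.repeat(m)
    logits = model(x_aug)
    # Per-example losses, reshaped so row i holds the i-th augmentation
    losses = F.cross_entropy(logits, y_rep, reduction="none").view(m, batch)
    # Minimize the maximum loss over the m augmented copies;
    # the gradient flows only through each example's worst copy
    loss = losses.max(dim=0).values.mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

A small number of copies suffices in the paper's experiments, so the extra cost over standard training is modest (roughly an $m$-fold larger effective forward batch per step).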


Results from the Paper


Task: Image Classification
Dataset: ImageNet
Model: Fix-EfficientNet-B8 (MaxUp + CutMix)

Metric             Value    Global Rank
Top 1 Accuracy     85.8%    #187
Number of params   87.42M   #829
