TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Sparse Learning	ImageNet	Resnet-50: 80% Sparse 100 epochs	Top-1 Accuracy	75.84	# 4
Sparse Learning	ImageNet	Resnet-50: 90% Sparse 100 epochs	Top-1 Accuracy	73.82	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/do-we-actually-need-dense-over/sparse-learning-on-imagenet)](https://paperswithcode.com/sota/sparse-learning-on-imagenet?p=do-we-actually-need-dense-over)`

Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training

4 Feb 2021 · Shiwei Liu, Lu Yin, Decebal Constantin Mocanu, Mykola Pechenizkiy ·

In this paper, we introduce a new perspective on training deep neural networks capable of state-of-the-art performance without the need for the expensive over-parameterization by proposing the concept of In-Time Over-Parameterization (ITOP) in sparse training. By starting from a random sparse network and continuously exploring sparse connectivities during training, we can perform an Over-Parameterization in the space-time manifold, closing the gap in the expressibility between sparse training and dense training. We further use ITOP to understand the underlying mechanism of Dynamic Sparse Training (DST) and indicate that the benefits of DST come from its ability to consider across time all possible parameters when searching for the optimal sparse connectivity. As long as there are sufficient parameters that have been reliably explored during training, DST can outperform the dense neural network by a large margin. We present a series of experiments to support our conjecture and achieve the state-of-the-art sparse training performance with ResNet-50 on ImageNet. More impressively, our method achieves dominant performance over the overparameterization-based sparse methods at extreme sparsity levels. When trained on CIFAR-100, our method can match the performance of the dense model even at an extreme sparsity (98%). Code can be found https://github.com/Shiweiliuiiiiiii/In-Time-Over-Parameterization.

PDF Abstract

Code

Add Remove Mark official

Shiweiliuiiiiiii/In-Time-Over-Param… official

vita-group/granet

Shiweiliuiiiiiii/GraNet

stevenboys/agent

Tasks

Add Remove

Image Classification

Sparse Learning

Datasets

CIFAR-10

ImageNet

CIFAR-100

Results from the Paper

Edit

Ranked #4 on Sparse Learning on ImageNet

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Sparse Learning	ImageNet	Resnet-50: 80% Sparse 100 epochs	Top-1 Accuracy	75.84	# 4		Compare
Sparse Learning	ImageNet	Resnet-50: 90% Sparse 100 epochs	Top-1 Accuracy	73.82	# 6		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove