TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	Caltech-256	WaveMixLite-256/7	Accuracy	54.62	# 5
Image Classification	CIFAR-10	WaveMixLite-144/7	Percentage correct	97.29	# 83
Image Classification	CIFAR-100	WaveMixLite-256/7	Percentage correct	85.09	# 69
Image Classification	CIFAR-100	WaveMix-Lite-256/7	Percentage correct	70.20	# 164
Semantic Segmentation	Cityscapes val	WaveMix	mIoU	82.7	# 28
Semantic Segmentation	Cityscapes val	WaveMix-256/16 (Level-4)	mIoU	82.60	# 30
Image Classification	EMNIST-Balanced	WaveMixLite-128/7	Accuracy	91.06	# 1
Image Classification	EMNIST-Byclass	WaveMixLite-128/7	Accuracy	88.43	# 1
Image Classification	EMNIST-Bymerge	WaveMixLite-128/16	Accuracy	91.80	# 1
Image Classification	EMNIST-Digits	WaveMixLite-112/16	Accuracy (%)	99.82	# 1
Image Classification	EMNIST-Letters	WaveMixLite-112/16	Accuracy	95.96	# 1
Image Classification	Fashion-MNIST	WaveMixLite	Percentage error	5.68	# 8
Image Classification	Galaxy10 DECals	WaveMix	Top-1 Accuracy (%)	95.42	# 1
Image Classification	Galaxy10 DECals	WaveMix	PARAMS (M)	28	# 1
Image Classification	ImageNet	WaveMix-192/16 (level 3)	Top 1 Accuracy	74.93	# 892
Image Classification	iNat2021-mini	WaveMix-256/16 (level 2)	Top 1 Accuracy	61.75	# 1
Image Classification	MNIST	WaveMixLite	Percentage error	0.25	# 1
Scene Classification	Places365-Standard	WaveMix	Top 1 Error	43.55	# 1
Image Classification	Places365-Standard	WaveMix-240/12 (level 4)	Top 1 Accuracy	56.45	# 4
Image Classification	STL-10	WaveMixLite-256/7	Percentage correct	70.88	# 90
Image Classification	SVHN	WaveMixLite-144/15	Percentage error	1.27	# 6
Image Classification	Tiny ImageNet Classification	WaveMixLite-144/7	Validation Acc	77.47	# 9

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-emnist-balanced)](https://paperswithcode.com/sota/image-classification-on-emnist-balanced?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-emnist-byclass)](https://paperswithcode.com/sota/image-classification-on-emnist-byclass?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-emnist-bymerge)](https://paperswithcode.com/sota/image-classification-on-emnist-bymerge?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-emnist-digits)](https://paperswithcode.com/sota/image-classification-on-emnist-digits?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-emnist-letters)](https://paperswithcode.com/sota/image-classification-on-emnist-letters?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-galaxy10-decals)](https://paperswithcode.com/sota/image-classification-on-galaxy10-decals?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-inat2021-mini)](https://paperswithcode.com/sota/image-classification-on-inat2021-mini?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-mnist-1)](https://paperswithcode.com/sota/image-classification-on-mnist-1?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/scene-classification-on-places365-standard)](https://paperswithcode.com/sota/scene-classification-on-places365-standard?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-places365-standard)](https://paperswithcode.com/sota/image-classification-on-places365-standard?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-caltech-256)](https://paperswithcode.com/sota/image-classification-on-caltech-256?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-svhn)](https://paperswithcode.com/sota/image-classification-on-svhn?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-fashion-mnist)](https://paperswithcode.com/sota/image-classification-on-fashion-mnist?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-tiny-imagenet-1)](https://paperswithcode.com/sota/image-classification-on-tiny-imagenet-1?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/semantic-segmentation-on-cityscapes-val)](https://paperswithcode.com/sota/semantic-segmentation-on-cityscapes-val?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-cifar-100)](https://paperswithcode.com/sota/image-classification-on-cifar-100?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-cifar-10)](https://paperswithcode.com/sota/image-classification-on-cifar-10?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-stl-10)](https://paperswithcode.com/sota/image-classification-on-stl-10?p=wavemix-lite-a-resource-efficient-neural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/wavemix-lite-a-resource-efficient-neural/image-classification-on-imagenet)](https://paperswithcode.com/sota/image-classification-on-imagenet?p=wavemix-lite-a-resource-efficient-neural)`
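
The badge lines above all follow one URL pattern, differing only in the benchmark slug. A minimal sketch of that pattern, assuming the paper slug and URL layout seen in the listed badges stay the same; the function name `pwc_badge` is hypothetical and not part of any Papers with Code tooling:

```python
# Paper slug copied from the badge URLs listed above.
PAPER_SLUG = "wavemix-lite-a-resource-efficient-neural"


def pwc_badge(task_slug: str, paper_slug: str = PAPER_SLUG) -> str:
    """Build the Markdown for one badge from a benchmark slug.

    The URL layout is inferred from the badges listed above,
    not taken from a documented API.
    """
    badge_url = (
        "https://img.shields.io/endpoint.svg"
        f"?url=https://paperswithcode.com/badge/{paper_slug}/{task_slug}"
    )
    link_url = f"https://paperswithcode.com/sota/{task_slug}?p={paper_slug}"
    return f"[![PWC]({badge_url})]({link_url})"


# Example: regenerate the SVHN badge line.
svhn_badge = pwc_badge("image-classification-on-svhn")
```

Note that some benchmark slugs carry a numeric suffix (for example `image-classification-on-mnist-1`), so a slug should be copied from an existing badge rather than derived from the dataset name.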

WaveMix: A Resource-efficient Neural Network for Image Analysis

28 May 2022 · Pranav Jeevan, Kavitha Viswanathan, Anandu A S, Amit Sethi

We propose a novel neural architecture for computer vision, WaveMix, that is resource-efficient yet generalizable and scalable. While using fewer trainable parameters, less GPU RAM, and fewer computations, WaveMix networks achieve accuracy comparable to or better than state-of-the-art convolutional neural networks, vision transformers, and token mixers on several tasks. This efficiency can translate into savings in time, cost, and energy. To achieve these gains, we use a multi-level two-dimensional discrete wavelet transform (2D-DWT) in WaveMix blocks, which has the following advantages: (1) it reorganizes spatial information based on three strong image priors (scale-invariance, shift-invariance, and sparseness of edges), (2) it does so in a lossless manner without adding parameters, (3) it reduces the spatial sizes of feature maps, which reduces the memory and time required for forward and backward passes, and (4) it expands the receptive field faster than convolutions do. The whole architecture is a stack of self-similar and resolution-preserving WaveMix blocks, which allows architectural flexibility for various tasks and levels of resource availability. WaveMix establishes new benchmarks for segmentation on Cityscapes and for classification on Galaxy10 DECals, Places-365, five EMNIST datasets, and iNat2021-mini, and performs competitively on other benchmarks. Our code and trained models are publicly available.
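
The claims that a 2D-DWT is lossless, parameter-free, and halves the spatial size can be checked on a small array. The following is a minimal NumPy sketch, not the authors' implementation: it assumes the Haar wavelet (the abstract does not name one), works on a single-channel array with even height and width, and uses the hypothetical helper names `haar_dwt2`, `haar_idwt2`, and `multilevel_approx`.

```python
import numpy as np


def haar_dwt2(x):
    """One level of an orthonormal 2D Haar DWT on an (H, W) array, H and W even.

    Returns the approximation band and three detail bands, each (H/2, W/2).
    No trainable parameters are involved: the filters are fixed.
    """
    a = x[0::2, 0::2]  # top-left pixel of every 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    approx = (a + b + c + d) / 2.0
    det1 = (a - b + c - d) / 2.0  # difference across columns
    det2 = (a + b - c - d) / 2.0  # difference across rows
    det3 = (a - b - c + d) / 2.0  # diagonal difference
    return approx, det1, det2, det3


def haar_idwt2(approx, det1, det2, det3):
    """Exact inverse of haar_dwt2: rebuilds the (H, W) input."""
    h, w = approx.shape
    x = np.empty((2 * h, 2 * w), dtype=approx.dtype)
    x[0::2, 0::2] = (approx + det1 + det2 + det3) / 2.0
    x[0::2, 1::2] = (approx - det1 + det2 - det3) / 2.0
    x[1::2, 0::2] = (approx + det1 - det2 - det3) / 2.0
    x[1::2, 1::2] = (approx - det1 - det2 + det3) / 2.0
    return x


def multilevel_approx(x, levels):
    """Apply the transform repeatedly to the approximation band."""
    for _ in range(levels):
        x = haar_dwt2(x)[0]
    return x


image = np.arange(64, dtype=np.float64).reshape(8, 8)
bands = haar_dwt2(image)
restored = haar_idwt2(*bands)          # equals `image` up to rounding
coarse = multilevel_approx(image, 2)   # 8x8 -> 4x4 -> 2x2
```

Each level halves both spatial dimensions, and the four bands together hold exactly as many values as the input, which is why the inverse can recover it. After two levels, each value in `coarse` depends on a 4x4 patch of the input, illustrating how quickly the receptive field grows.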

Code

pranavphoenix/WaveMix official

Tasks

Efficient Neural Network

Image Classification

Scene Classification

Semantic Segmentation

Spatial Token Mixer

Datasets

CIFAR-10

ImageNet

CIFAR-100

MNIST

Cityscapes

SVHN

Fashion-MNIST

Places

STL-10

Tiny ImageNet

DIV2K

iNaturalist

Caltech-256

EMNIST

Places365

iNat2021

Galaxy Zoo DECaLS

Results from the Paper

Ranked #1 on Image Classification on Galaxy10 DECals (using extra training data)

Methods

2D DWT • Average Pooling • Dense Connections • Dropout • GELU • Global Average Pooling • Layer Normalization • MLP-Mixer • PoolFormer • RAM • Residual Connection
