TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	ImageNet	Oct-ResNet-152 (SE)	Top 1 Accuracy	82.9%	# 445
Image Classification	ImageNet	Oct-ResNet-152 (SE)	Number of params	66.8M	# 784
Image Classification	ImageNet	Oct-ResNet-152 (SE)	Hardware Burden	20771G	# 1
Image Classification	ImageNet	Oct-ResNet-152 (SE)	Operations per network pass	2.22G	# 1
Image Classification	ImageNet	Oct-ResNet-152 (SE)	GFLOPs	22.2	# 370
Action Classification	Kinetics-400	Oct-I3D + NL	Acc@1	75.7	# 147

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution

ICCV 2019 · Yunpeng Chen, Haoqi Fan, Bing Xu, Zhicheng Yan, Yannis Kalantidis, Marcus Rohrbach, Shuicheng Yan, Jiashi Feng

In natural images, information is conveyed at different frequencies, where higher frequencies are usually encoded with fine details and lower frequencies are usually encoded with global structures. Similarly, the output feature maps of a convolution layer can also be seen as a mixture of information at different frequencies. In this work, we propose to factorize the mixed feature maps by their frequencies, and design a novel Octave Convolution (OctConv) operation to store and process feature maps that vary spatially "slower" at a lower spatial resolution, reducing both memory and computation cost. Unlike existing multi-scale methods, OctConv is formulated as a single, generic, plug-and-play convolutional unit that can be used as a direct replacement of (vanilla) convolutions without any adjustments in the network architecture. It is also orthogonal and complementary to methods that suggest better topologies or reduce channel-wise redundancy like group or depth-wise convolutions. We experimentally show that by simply replacing convolutions with OctConv, we can consistently boost accuracy for both image and video recognition tasks, while reducing memory and computational cost. An OctConv-equipped ResNet-152 can achieve 82.9% top-1 classification accuracy on ImageNet with merely 22.2 GFLOPs.
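
The abstract's claim that holding slowly varying feature maps at a lower resolution reduces memory and computation can be made concrete with a back-of-the-envelope sketch. It assumes, following the paper but not stated on this page, that a fraction `alpha` of the channels is stored one octave lower (half the height and half the width), that the same `alpha` applies to input and output channels, and that pooling and upsampling costs are negligible. The function names are illustrative and are not taken from any of the listed repositories.

```python
def octconv_relative_flops(alpha: float) -> float:
    """Approximate FLOPs of one OctConv layer relative to a vanilla
    convolution with the same kernel size and channel counts.

    Assumption: `alpha` is the fraction of channels kept at half resolution,
    used for both input and output.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    high, low = 1.0 - alpha, alpha
    # Convolution cost scales with in_channels * out_channels * spatial positions.
    # A half-resolution map has 1/4 of the spatial positions.
    high_to_high = high * high        # computed at full resolution
    high_to_low = high * low / 4.0    # input is pooled, then convolved at half resolution
    low_to_high = low * high / 4.0    # convolved at half resolution, then upsampled
    low_to_low = low * low / 4.0      # computed at half resolution
    return high_to_high + high_to_low + low_to_high + low_to_low


def octconv_relative_memory(alpha: float) -> float:
    """Approximate feature-map storage relative to keeping all channels
    at full resolution, under the same assumption about `alpha`."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    return (1.0 - alpha) + alpha / 4.0


if __name__ == "__main__":
    for alpha in (0.0, 0.25, 0.5, 0.75, 1.0):
        flops = octconv_relative_flops(alpha)
        memory = octconv_relative_memory(alpha)
        print(f"alpha={alpha:.2f}  flops={flops:.4f}  memory={memory:.4f}")
```

Under these assumptions the four paths sum to 1 - 0.75 * alpha * (2 - alpha), so the cost falls from that of a vanilla convolution at `alpha = 0` to one quarter of it at `alpha = 1`. The sketch covers a single layer only and does not reproduce the 22.2 GFLOPs figure reported for the full network.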

Code

Repository	Stars
facebookresearch/OctConv (official)	559
osmr/imgclsmob	2,917
lxtGH/OctaveConv_pytorch	577
terrychenism/OctaveConv	494
d-li14/octconv.pytorch	287

See all 28 implementations

Tasks

Action Classification

Image Classification

Video Recognition

Datasets

ImageNet

Kinetics

Kinetics 400

Results from the Paper

Ranked #147 on Action Classification on Kinetics-400

Methods

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Cosine Annealing • Dense Connections • Depthwise Convolution • Depthwise Separable Convolution • Global Average Pooling • Grouped Convolution • Inverted Residual Block • Kaiming Initialization • Label Smoothing • Max Pooling • Mixup • MobileNetV2 • Octave Convolution • Pointwise Convolution • ReLU • Residual Block • Residual Connection • ResNet • ResNeXt • ResNeXt Block • SGD • Sigmoid Activation • Softmax • Squeeze-and-Excitation Block
