Scale-invariant scale-channel networks: Deep networks that generalise to previously unseen scales

11 Jun 2021 · Ylva Jansson, Tony Lindeberg

The ability to handle large scale variations is crucial for many real-world visual tasks. A straightforward approach for handling scale in a deep network is to process an image at several scales simultaneously in a set of scale channels. Scale invariance can then, in principle, be achieved by using weight sharing between the scale channels together with max or average pooling over the outputs from the scale channels. The ability of such scale channel networks to generalise to scales not present in the training set over significant scale ranges has, however, not previously been explored. In this paper, we present a systematic study of this methodology by implementing different types of scale channel networks and evaluating their ability to generalise to previously unseen scales. We develop a formalism for analysing the covariance and invariance properties of scale channel networks, and explore how different design choices, unique to scaling transformations, affect the overall performance of scale channel networks. We first show that two previously proposed scale channel network designs do not generalise well to scales not present in the training set. We explain theoretically and demonstrate experimentally why generalisation fails in these cases. We then propose a new type of foveated scale channel architecture, where the scale channels process increasingly larger parts of the image with decreasing resolution. This new type of scale channel network is shown to generalise extremely well, provided sufficient image resolution and the absence of boundary effects. Our proposed FovMax and FovAvg networks perform almost identically over a scale range of 8, even when trained on single-scale training data, and also give improved performance when learning from datasets with large scale variations in the small-sample regime.
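
The scale-channel idea described in the abstract can be illustrated with a minimal 1-D sketch. This is not the authors' implementation: the function names (`foveated_channel`, `shared_features`, `scale_channel_output`), the linear interpolation, the single shared filter, and the zero padding are all assumptions chosen for brevity. Each channel samples a fixed number of points around the signal centre with spacing `1/scale`, so channels with smaller scale factors cover a larger part of the input at lower resolution. The same filter weights are applied in every channel, and the channel outputs are pooled with max or average, loosely mirroring the FovMax and FovAvg variants.

```python
import math


def foveated_channel(signal, scale, size):
    """Sample `size` points around the centre of `signal` with spacing 1/scale.

    Uses linear interpolation and treats everything outside the signal as zero.
    A smaller `scale` covers a wider part of the signal at coarser resolution.
    """
    centre = (len(signal) - 1) / 2.0
    out = []
    for i in range(size):
        pos = centre + (i - (size - 1) / 2.0) / scale
        lo = math.floor(pos)
        frac = pos - lo
        left = signal[lo] if 0 <= lo < len(signal) else 0.0
        right = signal[lo + 1] if 0 <= lo + 1 < len(signal) else 0.0
        out.append((1.0 - frac) * left + frac * right)
    return out


def shared_features(window, kernel):
    """Shared-weight feature: valid cross-correlation, ReLU, global max pooling."""
    k = len(kernel)
    responses = [
        sum(kernel[j] * window[i + j] for j in range(k))
        for i in range(len(window) - k + 1)
    ]
    return max(0.0, max(responses))


def scale_channel_output(signal, scales, kernel, size, pool="max"):
    """Apply the same feature extractor in every scale channel, then pool over channels."""
    outputs = [
        shared_features(foveated_channel(signal, s, size), kernel) for s in scales
    ]
    if pool == "max":
        return max(outputs)
    if pool == "avg":
        return sum(outputs) / len(outputs)
    raise ValueError("pool must be 'max' or 'avg'")


# A small pattern and the same pattern upsampled by a factor of 2.
small = [0, 0, 1, 3, 1, 0, 0]
large = [0, 0, 0, 0.5, 1, 2, 3, 2, 1, 0.5, 0, 0, 0]

# Covariance: rescaling the input moves the response to a neighbouring channel.
assert foveated_channel(large, 0.5, 7) == foveated_channel(small, 1.0, 7)
```

In this toy setting, doubling the size of the input shifts each channel's content to the channel with half the scale factor, which is why pooling over channels can give scale invariance as long as the relevant channels lie inside the chosen scale set. Channels at the ends of the scale set have no counterpart after rescaling, a simple instance of the boundary effects the abstract mentions.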

Code

No code implementations yet.

Tasks

Image Classification

Scale Generalisation

Datasets

CIFAR-10

MNIST

MNIST Large Scale dataset

Results from the Paper

Ranked #1 on Scale Generalisation on MNIST Large Scale dataset

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Scale Generalisation	MNIST Large Scale dataset	FovMax Single-scale training	Average Accuracy	99.32	#1
Scale Generalisation	MNIST Large Scale dataset	FovAvg Single-scale training	Average Accuracy	99.32	#1

Methods

Average Pooling