Self-Supervised Image Classification
85 papers with code • 2 benchmarks • 1 dataset
This is the task of image classification using representations learnt with self-supervised learning. Self-supervised methods generally involve a pretext task that is solved to learn a good representation, together with a loss function to learn with. One example of a loss function is an autoencoder-based loss, where the goal is to reconstruct an image pixel by pixel. A more popular recent example is a contrastive loss, which measures the similarity of sample pairs in a representation space and allows the target to vary rather than being a fixed reconstruction target (as in the case of autoencoders).
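As an illustration of a contrastive loss, here is a minimal sketch of an NT-Xent-style objective (in the spirit of SimCLR) in PyTorch; the function name, temperature value and batch layout are assumptions for the example, not any paper's exact implementation.

```python
import torch
import torch.nn.functional as F

def nt_xent_loss(z1: torch.Tensor, z2: torch.Tensor, temperature: float = 0.5) -> torch.Tensor:
    """z1, z2: (n, d) embeddings of two augmented views of the same n images."""
    n = z1.size(0)
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)   # (2n, d), unit norm
    sim = z @ z.t() / temperature                        # pairwise cosine similarities
    # Mask self-similarity so an embedding is never its own positive.
    sim = sim.masked_fill(torch.eye(2 * n, dtype=torch.bool, device=z.device), float('-inf'))
    # For row i, the positive is the other augmented view of the same image.
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)]).to(z.device)
    return F.cross_entropy(sim, targets)

# Usage: z1, z2 = encoder(augment(x)), encoder(augment(x)); loss = nt_xent_loss(z1, z2)
```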
A common evaluation protocol is to train a linear classifier on top of (frozen) representations learnt by self-supervised methods. The leaderboards for the linear evaluation protocol can be found below. In practice, it is more common to fine-tune features on a downstream task. An alternative evaluation protocol therefore uses semi-supervised learning and fine-tunes on a percentage of the labels. The leaderboards for the fine-tuning protocol can be accessed here.
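The snippet below is a minimal sketch of the linear evaluation protocol (not the exact recipe behind any leaderboard entry): the backbone is frozen and only a linear classifier is trained on its features. The ResNet-50 backbone, 2048-d feature size, 1000 classes and `train_loader` are placeholder assumptions.

```python
import torch
import torch.nn as nn
import torchvision

# Placeholder backbone; in practice you would load self-supervised weights.
encoder = torchvision.models.resnet50()
encoder.fc = nn.Identity()           # expose the 2048-d pooled features
encoder.eval()
for p in encoder.parameters():
    p.requires_grad = False          # representations stay frozen

classifier = nn.Linear(2048, 1000)   # feature dim / class count are examples
optimizer = torch.optim.SGD(classifier.parameters(), lr=0.1, momentum=0.9)
criterion = nn.CrossEntropyLoss()

for images, labels in train_loader:  # assumed labelled downstream loader
    with torch.no_grad():
        features = encoder(images)   # frozen features, no gradients
    loss = criterion(classifier(features), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```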
You may want to read some blog posts before reading the papers and checking the leaderboards:
- Contrastive Self-Supervised Learning - Ankesh Anand
- The Illustrated Self-Supervised Learning - Amit Chaudhary
- Self-supervised learning and computer vision - Jeremy Howard
- Self-Supervised Representation Learning - Lilian Weng
There is also Yann LeCun's talk at AAAI-20, which you can watch here (35:00+).
(Image credit: A Simple Framework for Contrastive Learning of Visual Representations)
Libraries
Use these libraries to find Self-Supervised Image Classification models and implementations.
Latest papers
Masked Siamese Networks for Label-Efficient Learning
We propose Masked Siamese Networks (MSN), a self-supervised learning framework for learning image representations.
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training
Image BERT pre-training with masked image modeling (MIM) has become a popular approach to self-supervised representation learning.
Mugs: A Multi-Granular Self-Supervised Learning Framework
It provides supervision complementary to instance discrimination supervision (IDS) via an extra alignment on local neighbors, and scatters different local groups separately to increase discriminability.
CaCo: Both Positive and Negative Samples are Directly Learnable via Cooperative-adversarial Contrastive Learning
It trains an encoder by distinguishing positive samples from negative ones given query anchors.
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision
Discriminative self-supervised learning allows training models on any random group of internet images, potentially recovering salient information that helps differentiate between the images.
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
In this work, we pursue a unified paradigm for multimodal pretraining to break the scaffolds of complex task/modality-specific customization.
Context Autoencoder for Self-Supervised Representation Learning
The pretraining comprises two tasks: masked representation prediction (predict the representations of the masked patches) and masked patch reconstruction (reconstruct the masked patches); see the masking sketch after this list.
When Do Flat Minima Optimizers Work?
Recently, flat-minima optimizers, which seek to find parameters in low-loss neighborhoods, have been shown to improve a neural network's generalization performance over stochastic and adaptive gradient-based optimizers.
Max-Margin Contrastive Learning
Standard contrastive learning approaches usually require a large number of negatives for effective unsupervised learning and often exhibit slow convergence.
Masked Feature Prediction for Self-Supervised Visual Pre-Training
We present Masked Feature Prediction (MaskFeat) for self-supervised pre-training of video models.
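Several entries above (mc-BEiT, Context Autoencoder, MaskFeat) build on masked image modeling, where a random subset of image patches is hidden and the model is trained to predict something about the hidden patches. Below is a generic sketch of random patch masking; the function name, tensor layout and mask ratio are illustrative assumptions, not any single paper's setup.

```python
import torch

def random_patch_mask(patches: torch.Tensor, mask_ratio: float = 0.5):
    """patches: (batch, num_patches, dim) flattened image patches.

    Returns the visible patches and a boolean mask (True = masked); a
    masked-image-modeling objective then predicts the masked patches.
    """
    b, n, d = patches.shape
    num_masked = int(n * mask_ratio)
    # Rank patches by random noise and mask the lowest-ranked ones.
    ids = torch.rand(b, n).argsort(dim=1)
    mask = torch.zeros(b, n, dtype=torch.bool)
    mask.scatter_(1, ids[:, :num_masked], True)
    visible = patches[~mask].view(b, n - num_masked, d)
    return visible, mask

# Usage: visible, mask = random_patch_mask(patches); the encoder sees only
# `visible`, and the loss is computed on the patches where `mask` is True.
```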