Few-Shot Image Classification
202 papers with code • 88 benchmarks • 23 datasets
Few-Shot Image Classification is a computer vision task that involves training machine learning models to classify images into predefined categories using only a few labeled examples per category (typically fewer than six). The goal is to enable models to recognize and classify new images with minimal supervision and limited data, without training on large datasets.
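In the common N-way K-shot episodic setup, a model receives K labeled support examples for each of N classes and must classify unlabeled queries, for example by assigning each query to the nearest class prototype (mean support embedding). A minimal sketch of that idea, with toy 2-D "embeddings" standing in for real feature vectors:

```python
from math import dist  # Euclidean distance (Python 3.8+)

def prototypes(support):
    """Mean embedding per class from a few labeled support examples.
    support: {class_label: [embedding vectors]} -- K vectors per class."""
    return {c: tuple(sum(x) / len(vecs) for x in zip(*vecs))
            for c, vecs in support.items()}

def classify(query, protos):
    """Assign a query embedding to the class with the nearest prototype."""
    return min(protos, key=lambda c: dist(query, protos[c]))

# 2-way 2-shot toy episode
support = {"cat": [(0.0, 0.0), (0.2, 0.1)],
           "dog": [(1.0, 1.0), (0.9, 1.2)]}
protos = prototypes(support)
print(classify((0.1, 0.2), protos))  # -> "cat"
```

In practice the embeddings come from a network trained across many such episodes; the nearest-prototype rule above is one common classifier choice (as in Prototypical Networks), not the only one.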
(Image credit: Learning Embedding Adaptation for Few-Shot Learning)
Libraries
Use these libraries to find Few-Shot Image Classification models and implementations
Datasets
Subtasks
Latest papers with no code
Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification
However, all the CAM-based methods (e.g., CAM, Grad-CAM, and Relevance-CAM) can only be used to interpret CNN models with fully-connected (FC) layers as the classifier.
FILM: How can Few-Shot Image Classification Benefit from Pre-Trained Language Models?
Few-shot learning aims to train models that can be generalized to novel classes with only a few samples.
Distilling Self-Supervised Vision Transformers for Weakly-Supervised Few-Shot Classification & Segmentation
For this mixed setup, we propose to improve the pseudo-labels using a pseudo-label enhancer that was trained using the available ground-truth pixel-level labels.
SuSana Distancia is all you need: Enforcing class separability in metric learning via two novel distance-based loss functions for few-shot image classification
Few-shot learning is a challenging area of research that aims to learn new concepts with only a few labeled samples of data.
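Distance-based losses of the kind this line of work builds on pull same-class embeddings together and push different-class embeddings apart. As a generic illustration (not the paper's two proposed losses), a standard triplet margin loss can be sketched as:

```python
from math import dist  # Euclidean distance (Python 3.8+)

def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Generic triplet loss: the anchor-positive distance should be
    smaller than the anchor-negative distance by at least `margin`."""
    return max(0.0, dist(anchor, positive) - dist(anchor, negative) + margin)

# Positive already much closer than the negative -> zero loss
print(triplet_margin_loss((0.0, 0.0), (0.1, 0.0), (3.0, 0.0)))  # 0.0
# Negative closer than the positive -> positive loss
print(triplet_margin_loss((0.0, 0.0), (2.0, 0.0), (1.0, 0.0)))  # 2.0
```

Metric-learning approaches to few-shot classification typically optimize such a loss over many episodes so that the learned embedding space separates novel classes well.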
Strong Baselines for Parameter Efficient Few-Shot Fine-tuning
Through our controlled empirical study, we have two main findings: (i) fine-tuning just the LayerNorm parameters (which we call LN-Tune) during few-shot adaptation is an extremely strong baseline across ViTs pre-trained with both self-supervised and supervised objectives; (ii) for self-supervised ViTs, simply learning a set of scaling parameters for each attention matrix (which we call AttnScale), along with a domain-residual adapter (DRA) module, leads to state-of-the-art performance on MD while being ~9x more parameter-efficient.
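The LN-Tune idea, fine-tuning only the LayerNorm parameters and freezing everything else, can be sketched framework-agnostically as a name filter over a model's parameter tensors. The parameter names below are illustrative, following common PyTorch ViT naming conventions (`norm1.weight` etc.), not taken from the paper:

```python
def select_layernorm_params(named_params):
    """Mark only LayerNorm parameters as trainable, freeze the rest.
    named_params: iterable of parameter names from a ViT-style model.
    Assumes LayerNorm tensors contain 'norm' in their name (common in
    PyTorch ViT implementations); adjust the filter for other codebases."""
    return {name: ("norm" in name) for name in named_params}

params = [
    "patch_embed.proj.weight",
    "blocks.0.norm1.weight", "blocks.0.norm1.bias",
    "blocks.0.attn.qkv.weight",
    "blocks.0.norm2.weight",
    "head.weight",
]
trainable = select_layernorm_params(params)
print(sum(trainable.values()), "of", len(params), "tensors trainable")  # 3 of 6
```

Since LayerNorm contributes only a tiny fraction of a ViT's parameters, this kind of filter yields a very parameter-efficient adaptation baseline.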
Boosting Few-Shot Text Classification via Distribution Estimation
Distribution estimation has been demonstrated as one of the most effective approaches to few-shot image classification, as low-level patterns and underlying representations transfer easily across tasks in the computer vision domain.
RotoGBML: Towards Out-of-Distribution Generalization for Gradient-Based Meta-Learning
OOD exacerbates inconsistencies in magnitudes and directions of task gradients, which brings challenges for GBML to optimize the meta-knowledge by minimizing the sum of task gradients in each minibatch.
Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning
Hence we advocate that the key to better performance lies in meaningful latent modality structures rather than perfect modality alignment.
CovidExpert: A Triplet Siamese Neural Network framework for the detection of COVID-19
Patients with COVID-19 infection may have pneumonia-like symptoms as well as respiratory problems, which may harm the lungs.
Explore the Power of Dropout on Few-shot Learning
The generalization power of the pre-trained model is key to few-shot deep learning.