Few-Shot Image Classification
202 papers with code • 88 benchmarks • 23 datasets
Few-Shot Image Classification is a computer vision task in which machine learning models are trained to classify images into predefined categories using only a few labeled examples per category (typically fewer than six). The goal is to recognize and classify new images with minimal supervision, without training on large labeled datasets.
(Image credit: Learning Embedding Adaptation for Few-Shot Learning)
Latest papers
Logarithm-transform aided Gaussian Sampling for Few-Shot Learning
These methods rely on transforming the empirical feature distributions so that they approximately follow Gaussians.
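The general recipe is to Gaussianize each class's support features, fit a Gaussian, and sample synthetic features to enlarge the support set. A minimal NumPy sketch of that idea, assuming non-negative backbone features and a plain log transform (the paper's exact transform differs):

```python
import numpy as np

def log_transform(features, eps=1e-6):
    # Hypothetical Gaussianizing transform: compress a skewed, non-negative
    # feature distribution toward a more Gaussian shape.
    return np.log(features + eps)

def sample_augmented(support, n_samples=100):
    # Fit a Gaussian to one class's transformed support features (n_shots, dim)
    # and draw extra synthetic samples from it.
    z = log_transform(support)
    mu = z.mean(axis=0)
    cov = np.cov(z, rowvar=False) + 1e-6 * np.eye(z.shape[1])  # ridge: few shots
    return np.random.multivariate_normal(mu, cov, n_samples)
```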
PRE: Vision-Language Prompt Learning with Reparameterization Encoder
In this work, we present Prompt Learning with Reparameterization Encoder (PRE), a simple and efficient method that enhances the generalization of the learnable prompt to unseen classes while maintaining the capacity to learn base classes.
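The core idea is that the learnable context tokens are not optimized directly but are passed through a small encoder network before reaching the text encoder. A minimal PyTorch sketch of that reparameterization, with illustrative layer sizes not taken from the paper:

```python
import torch
import torch.nn as nn

class ReparamPrompt(nn.Module):
    """Learnable context tokens passed through a small residual encoder."""
    def __init__(self, n_ctx=16, dim=512):
        super().__init__()
        self.ctx = nn.Parameter(torch.randn(n_ctx, dim) * 0.02)  # raw prompt tokens
        self.encoder = nn.Sequential(                            # reparameterization encoder
            nn.Linear(dim, dim // 4), nn.ReLU(), nn.Linear(dim // 4, dim)
        )

    def forward(self):
        # Residual connection keeps the optimized prompt close to the raw tokens.
        return self.ctx + self.encoder(self.ctx)
```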
Language Models as Black-Box Optimizers for Vision-Language Models
We highlight the advantage of conversational feedback that incorporates both positive and negative prompts, suggesting that LLMs can utilize the implicit gradient direction in textual feedback for a more efficient search.
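Schematically, the search loop ranks previously tried prompts by their few-shot accuracy and asks the LLM to propose a new prompt given the best and worst ones as conversational feedback. A sketch under those assumptions, where `llm` and `evaluate` are hypothetical stand-ins for an LLM call and an accuracy evaluation:

```python
def llm_prompt_search(llm, evaluate, init_prompt, steps=10, k=5):
    # llm(text) -> str proposes a new classification prompt;
    # evaluate(prompt) -> float returns few-shot accuracy on a validation set.
    history = [(init_prompt, evaluate(init_prompt))]
    for _ in range(steps):
        ranked = sorted(history, key=lambda p: p[1], reverse=True)
        good = [p for p, _ in ranked[:k]]    # positive examples
        bad = [p for p, _ in ranked[-k:]]    # negative examples
        feedback = (f"Good prompts: {good}\nBad prompts: {bad}\n"
                    "Propose one improved prompt.")
        candidate = llm(feedback)
        history.append((candidate, evaluate(candidate)))
    return max(history, key=lambda p: p[1])[0]
```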
Cross-Image Context Matters for Bongard Problems
Current machine learning methods struggle to solve Bongard problems, an IQ-test-style task that requires deriving an abstract "concept" from a set of positive and negative "support" images, and then classifying whether a new query image depicts that concept.
DiffKendall: A Novel Approach for Few-Shot Learning with Differentiable Kendall's Rank Correlation
By replacing geometric similarity with differentiable Kendall's rank correlation, our method can be integrated with numerous existing few-shot approaches and is ready to combine with future state-of-the-art methods that rely on geometric similarity metrics.
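Kendall's rank correlation counts concordant versus discordant pairs of feature channels, which involves a non-differentiable sign; a common relaxation replaces the sign with a tanh. A sketch of such a smoothed Kendall correlation (the paper's exact relaxation may differ):

```python
import torch

def soft_kendall(x, y, temperature=0.01):
    """Smoothed Kendall rank correlation between two 1-D feature vectors."""
    dx = x.unsqueeze(0) - x.unsqueeze(1)   # pairwise channel differences of x
    dy = y.unsqueeze(0) - y.unsqueeze(1)   # pairwise channel differences of y
    n = x.shape[0]
    # tanh approximates sign(); diagonal terms are zero and drop out of the sum.
    concordance = torch.tanh(dx / temperature) * torch.tanh(dy / temperature)
    return concordance.sum() / (n * (n - 1))
```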
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability
Model distillation, the process of creating smaller, faster models that preserve the performance of larger ones, is a promising direction toward a solution.
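For context, the standard knowledge-distillation objective (Hinton et al.) blends a temperature-softened KL term against the teacher with the usual cross-entropy on ground-truth labels. A generic sketch of that baseline loss, not the paper's exact OOD-aware objective:

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    # KL between temperature-softened teacher and student distributions,
    # scaled by T^2 to keep gradient magnitudes comparable across temperatures.
    soft = F.kl_div(F.log_softmax(student_logits / T, dim=-1),
                    F.softmax(teacher_logits / T, dim=-1),
                    reduction="batchmean") * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```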
Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning
The image and text encoders are used to compute prototypes of image classes for classification.
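Classification by prototypes then reduces to averaging each class's normalized support embeddings and scoring queries by cosine similarity. A minimal sketch assuming CLIP-style image features (the paper additionally combines image and text prototypes):

```python
import torch
import torch.nn.functional as F

def prototype_logits(support_feats, support_labels, query_feats, n_classes):
    """Nearest-prototype classification over L2-normalized embeddings."""
    support = F.normalize(support_feats, dim=-1)   # (n_support, dim)
    query = F.normalize(query_feats, dim=-1)       # (n_query, dim)
    protos = torch.stack([support[support_labels == c].mean(0)
                          for c in range(n_classes)])   # one prototype per class
    return query @ F.normalize(protos, dim=-1).T        # cosine-similarity logits
```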
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Despite its simplicity, this baseline is competitive with meta-learning methods under a variety of conditions and can imitate target policies trained on unseen variations of the original environment.
Multistage Relation Network With Dual-Metric for Few-Shot Hyperspectral Image Classification
In addition, an adaptive weighting strategy is designed to fuse the obtained relation scores, and classification is achieved by assigning each query sample to the class with the highest fused relation score.
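The fusion step itself is simple: combine the two metrics' relation scores with a weight and take the argmax over classes. A sketch with a single scalar weight as a stand-in for the paper's adaptive, input-dependent weighting:

```python
import torch

def fuse_and_classify(score_a, score_b, w=0.5):
    # score_a, score_b: (n_query, n_classes) relation scores from the two metrics;
    # w is an illustrative fusion weight in [0, 1].
    fused = w * score_a + (1 - w) * score_b
    return fused.argmax(dim=-1)   # predicted class index per query
```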
ESPT: A Self-Supervised Episodic Spatial Pretext Task for Improving Few-Shot Learning
With this definition, the ESPT-augmented FSL objective promotes learning more transferable feature representations that capture the local spatial features of different images and their inter-relational structure within each episode. This enables the model to generalize better to new categories from only a few samples.
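Concretely, the training objective adds a weighted self-supervised pretext loss, computed on the same episode, to the standard few-shot classification loss. A schematic sketch using rotation prediction as an illustrative stand-in for the spatial pretext task; `lam` is an assumed trade-off weight:

```python
import torch
import torch.nn as nn

class EpisodicPretextLoss(nn.Module):
    """Few-shot loss plus a self-supervised pretext loss on the same episode."""
    def __init__(self, feat_dim, lam=0.5):
        super().__init__()
        self.rot_head = nn.Linear(feat_dim, 4)   # predict one of 4 rotations
        self.lam = lam
        self.ce = nn.CrossEntropyLoss()

    def forward(self, fsl_loss, rotated_feats, rot_labels):
        # fsl_loss: scalar classification loss for the episode;
        # rotated_feats: features of rotated episode images, (n, feat_dim).
        pretext = self.ce(self.rot_head(rotated_feats), rot_labels)
        return fsl_loss + self.lam * pretext
```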