Zero-Shot Learning

576 papers with code • 19 benchmarks • 29 datasets

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Benchmarks

Add a Result

These leaderboards are used to track progress in Zero-Shot Learning

Dataset	Best Model	Compare
MedConceptsQA	gpt-4-0125-preview	See all
CUB-200-2011	DUET	See all
SUN Attribute	SPOT (VAEGAN)	See all
AwA2	ZSL-KG	See all
Oxford 102 Flower	SPOT	See all
VOC-MLT	CLIP(ResNet-50)	See all
COCO-MLT	ResNet-50	See all
CUB-200 - 0-Shot Learning	zsl_ADA	See all
PASCAL Context	ZS3Net	See all
iVQA	FrozenBiLM	See all
SNIPS	ZSL-KG	See all
aPY - 0-Shot	ZSL-KG	See all
LSMDC	FrozenBiLM	See all
MSRVTT-QA	HiTeA	See all
MSVD-QA	HiTeA	See all
TVQA	FrozenBiLM	See all
MIT-States	CZSL	See all
ImageNet_CN	$M^2$-Encoder	See all
How2QA	SeViLA	See all

Show all 19 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Zero-Shot Learning models and implementations

mlfoundations/open_clip

3 papers

8,609

faceonlive/ai-research

3 papers

225

alibaba/EasyNLP

2 papers

1,959

sicara/easy-few-shot-learning

2 papers

924

Datasets

Subtasks

Multi-label zero-shot learning

GZSL Video Classification

Latest papers with no code

Most implemented Social Latest No code

Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification

no code yet • 17 Apr 2024

This study is part of the debate on the efficiency of large versus small language models for text classification by prompting. We assess the performance of small language models in zero-shot text classification, challenging the prevailing dominance of large models. Across 15 datasets, our investigation benchmarks language models from 77M to 40B parameters using different architectures and scoring functions.

Paper
Add Code

Evolving Interpretable Visual Classifiers with Large Language Models

no code yet • 15 Apr 2024

To address these limitations, we present a novel method that discovers interpretable yet discriminative sets of attributes for visual recognition.

Paper
Add Code

OTTER: Improving Zero-Shot Classification via Optimal Transport

no code yet • 12 Apr 2024

Popular zero-shot models suffer due to artifacts inherited from pretraining.

Paper
Add Code

`Eyes of a Hawk and Ears of a Fox': Part Prototype Network for Generalized Zero-Shot Learning

no code yet • 12 Apr 2024

Current approaches in Generalized Zero-Shot Learning (GZSL) are built upon base models which consider only a single class attribute vector representation over the entire image.

Paper
Add Code

Connecting NeRFs, Images, and Text

no code yet • 11 Apr 2024

Neural Radiance Fields (NeRFs) have emerged as a standard framework for representing 3D scenes and objects, introducing a novel data type for information exchange and storage.

Paper
Add Code

Progressive Semantic-Guided Vision Transformer for Zero-Shot Learning

no code yet • 11 Apr 2024

ZSLViT mainly considers two properties in the whole network: i) discover the semantic-related visual representations explicitly, and ii) discard the semantic-unrelated visual information.

Paper
Add Code

Anchor-based Robust Finetuning of Vision-Language Models

no code yet • 9 Apr 2024

Specifically, two types of anchors are elaborated in our method, including i) text-compensated anchor which uses the images from the finetune set but enriches the text supervision from a pretrained captioner, ii) image-text-pair anchor which is retrieved from the dataset similar to pretraining data of CLIP according to the downstream task, associating with the original CLIP text with rich semantics.

Paper
Add Code

Condition Monitoring with Incomplete Data: An Integrated Variational Autoencoder and Distance Metric Framework

no code yet • 8 Apr 2024

Condition monitoring of industrial systems is crucial for ensuring safety and maintenance planning, yet notable challenges arise in real-world settings due to the limited or non-existent availability of fault samples.

Paper
Add Code

High-Discriminative Attribute Feature Learning for Generalized Zero-Shot Learning

no code yet • 7 Apr 2024

However, current attention-based models may overlook the transferability of visual features and the distinctiveness of attribute localization when learning regional features in images.

Paper
Add Code

Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models

no code yet • 7 Apr 2024

In this paper, we explore the feasibility of leveraging language as a naturally high-quality supervision for chest CT imaging.

Paper
Add Code

Zero-Shot Learning

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result