Zero-Shot Object Detection

26 papers with code • 7 benchmarks • 6 datasets

Zero-shot object detection (ZSD) is the task of object detection where no visual training data is available for some of the target object classes.

( Image credit: Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts )

Benchmarks

Add a Result

These leaderboards are used to track progress in Zero-Shot Object Detection

Dataset	Best Model	Compare
MS-COCO	SeeDS	See all
PASCAL VOC'07	SeeDS	See all
LVIS v1.0 minival	OWLv2 (OWL-ST+FT)	See all
LVIS v1.0 val	OWLv2 (OWL-ST+FT)	See all
ODinW	Grounding DINO	See all
MSCOCO	Grounding DINO (without COCO data)	See all
ImageNet Detection	SUZOD	See all

Libraries

Use these libraries to find Zero-Shot Object Detection models and implementations

microsoft/GLIP

2 papers

1,947

Datasets

Most implemented papers

Most implemented Social Latest No code

ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models

computer-vision-in-the-wild/cvinw_readings • 19 Apr 2022

In general, these language-augmented visual models demonstrate strong transferability to a variety of datasets and tasks.

Paper
Code

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

idea-research/groundingdino • • 9 Mar 2023

To effectively fuse language and vision modalities, we conceptually divide a closed-set detector into three phases and propose a tight fusion solution, which includes a feature enhancer, a language-guided query selection, and a cross-modality decoder for cross-modality fusion.

Paper
Code

Zero-Shot Instance Segmentation

zhengye1995/Zero-shot-Instance-Segmentation • • CVPR 2021

We follow this motivation and propose a new task set named zero-shot instance segmentation (ZSI).

Paper
Code

Open-vocabulary Object Detection via Vision and Language Knowledge Distillation

tensorflow/tpu • • ICLR 2022

On COCO, ViLD outperforms the previous state-of-the-art by 4. 8 on novel AP and 11. 4 on overall AP.

Paper
Code

Polarity Loss for Zero-shot Object Detection

KennithLi/Awesome-Zero-Shot-Object-Detection • 22 Nov 2018

This setting gives rise to the need for correct alignment between visual and semantic concepts, so that the unseen objects can be identified using only their semantic attributes.

Paper
Code

Learning Open-World Object Proposals without Learning to Classify

mcahny/object_localization_network • • 15 Aug 2021

In this paper, we identify that the problem is that the binary classifiers in existing proposal methods tend to overfit to the training categories.

Paper
Code

Zero-Shot Object Detection by Hybrid Region Embedding

KennithLi/Awesome-Zero-Shot-Object-Detection • 16 May 2018

Object detection is considered as one of the most challenging problems in computer vision, since it requires correct prediction of both classes and locations of objects in images.

Paper
Code

Synthesizing the Unseen for Zero-shot Object Detection

nasir6/zero_shot_detection • • 19 Oct 2020

The existing zero-shot detection approaches project visual features to the semantic domain for seen objects, hoping to map unseen objects to their corresponding semantics during inference.

Paper
Code

Grounded Language-Image Pre-training

microsoft/GLIP • • CVPR 2022

The unification brings two benefits: 1) it allows GLIP to learn from both detection and grounding data to improve both tasks and bootstrap a good grounding model; 2) GLIP can leverage massive image-text pairs by generating grounding boxes in a self-training fashion, making the learned representation semantic-rich.

Paper
Code

Efficient Feature Distillation for Zero-shot Annotation Object Detection

dragonlzm/ezad • • 21 Mar 2023

We propose a new setting for detecting unseen objects called Zero-shot Annotation object Detection (ZAD).

Paper
Code

Zero-Shot Object Detection

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result