An Open and Comprehensive Pipeline for Unified Object Grounding and Detection

4 Jan 2024  ·  Xiangyu Zhao, Yicheng Chen, Shilin Xu, Xiangtai Li, Xinjiang Wang, Yining Li, Haian Huang

Grounding-DINO is a state-of-the-art open-set detection model that tackles multiple vision tasks including Open-Vocabulary Detection (OVD), Phrase Grounding (PG), and Referring Expression Comprehension (REC). Its effectiveness has led to its widespread adoption as a mainstream architecture for various downstream applications. However, despite its significance, the original Grounding-DINO model lacks comprehensive public technical details because its training code has not been released. To bridge this gap, we present MM-Grounding-DINO, an open-source, comprehensive, and user-friendly baseline built with the MMDetection toolbox. It adopts abundant vision datasets for pre-training and various detection and grounding datasets for fine-tuning. We give a comprehensive analysis of each reported result along with detailed settings for reproduction. Extensive experiments on the benchmarks mentioned above demonstrate that our MM-Grounding-DINO-Tiny outperforms the Grounding-DINO-Tiny baseline. We release all our models to the research community; code and trained models are available at https://github.com/open-mmlab/mmdetection/tree/main/configs/mm_grounding_dino.
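
Since the released checkpoints are intended to be used through MMDetection, the sketch below shows one way open-vocabulary inference could be run with the toolbox's DetInferencer API (MMDetection 3.x). The config path, checkpoint filename, and text prompt are placeholders, not the paper's released artifacts; consult the repository linked above for the exact configs and weights.

```python
# Minimal inference sketch with MMDetection's DetInferencer (mmdet >= 3.x).
# The config path and checkpoint name are placeholders; substitute the actual
# MM-Grounding-DINO config/weights from the linked repository.
from mmdet.apis import DetInferencer

inferencer = DetInferencer(
    model='configs/mm_grounding_dino/<chosen_config>.py',  # placeholder config
    weights='mm_grounding_dino_tiny.pth',                   # placeholder checkpoint
    device='cuda:0',
)

# Open-vocabulary prompt: category names separated by ' . ', as in the
# Grounding-DINO family of models.
results = inferencer(
    inputs='demo/demo.jpg',
    texts='person . bicycle . traffic light .',
    out_dir='outputs/',
)

# Predictions contain labels, scores, and boxes for the prompted categories.
print(results['predictions'][0]['labels'][:5])
```

The same interface would apply to fine-tuned checkpoints; only the config and weights change, while the text prompt controls which categories are detected at inference time.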

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Described Object Detection | Description Detection Dataset | MM-Grounding-DINO | Intra-scenario FULL mAP | 22.9 | #1 |
| Described Object Detection | Description Detection Dataset | MM-Grounding-DINO | Intra-scenario PRES mAP | 21.9 | #2 |
| Described Object Detection | Description Detection Dataset | MM-Grounding-DINO | Intra-scenario ABS mAP | 26.0 | #1 |
