TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Object Detection	COCO test-dev	MAL (ResNeXt101, multi-scale)	box mAP	47.0	# 116
Object Detection	COCO test-dev	MAL (ResNeXt101, multi-scale)	Hardware Burden	None	# 1
Object Detection	COCO test-dev	MAL (ResNeXt101, multi-scale)	Operations per network pass	None	# 1
Object Detection	COCO test-dev	MAL (ResNet50, single-scale)	box mAP	39.2	# 200
Object Detection	COCO test-dev	MAL (ResNet50, single-scale)	Hardware Burden	None	# 1
Object Detection	COCO test-dev	MAL (ResNet50, single-scale)	Operations per network pass	None	# 1
Object Detection	COCO test-dev	MAL (ResNeXt101, single-scale)	box mAP	45.9	# 129
Object Detection	COCO test-dev	MAL (ResNeXt101, single-scale)	Hardware Burden	None	# 1
Object Detection	COCO test-dev	MAL (ResNeXt101, single-scale)	Operations per network pass	None	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multiple-anchor-learning-for-visual-object/object-detection-on-coco)](https://paperswithcode.com/sota/object-detection-on-coco?p=multiple-anchor-learning-for-visual-object)`

Multiple Anchor Learning for Visual Object Detection

CVPR 2020 · Wei Ke, Tianliang Zhang, Zeyi Huang, Qixiang Ye, Jianzhuang Liu, Dong Huang

Classification and localization are two pillars of visual object detectors. However, in CNN-based detectors, these two modules are usually optimized under a fixed set of candidate (or anchor) bounding boxes. This configuration significantly limits the ability to jointly optimize classification and localization. In this paper, we propose a Multiple Instance Learning (MIL) approach that selects anchors and jointly optimizes the two modules of a CNN-based object detector. Our approach, referred to as Multiple Anchor Learning (MAL), constructs anchor bags and selects the most representative anchors from each bag. Such an iterative selection process is potentially NP-hard to optimize. To address this issue, we solve MAL by repeatedly depressing the confidence of selected anchors by perturbing their corresponding features. In an adversarial selection-depression manner, MAL not only pursues optimal solutions but also fully leverages multiple anchors/features to learn a detection model. Experiments show that MAL improves the baseline RetinaNet by significant margins on the commonly used MS-COCO object detection benchmark and achieves new state-of-the-art detection performance compared with recent methods.
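To make the selection-depression idea in the abstract more concrete, here is a toy sketch in Python. It is not the authors' implementation. It assumes a linear scorer with a sigmoid as the anchor confidence, and it models "perturbing features" as zeroing the single feature component that contributes most to the score. All names (`confidence`, `select_top_k`, `depress`, `selection_depression`) and the example numbers are hypothetical.

```python
import math


def confidence(weights, feature):
    """Toy anchor confidence: sigmoid of a linear score (an assumption)."""
    score = sum(w * x for w, x in zip(weights, feature))
    return 1.0 / (1.0 + math.exp(-score))


def select_top_k(weights, bag, k):
    """Return the indices of the k most confident anchors in a bag."""
    order = sorted(
        range(len(bag)),
        key=lambda i: confidence(weights, bag[i]),
        reverse=True,
    )
    return order[:k]


def depress(weights, feature):
    """Return a copy of the feature with its strongest contribution zeroed."""
    contributions = [w * x for w, x in zip(weights, feature)]
    peak = max(range(len(feature)), key=lambda j: contributions[j])
    depressed = list(feature)
    depressed[peak] = 0.0
    return depressed


def selection_depression(weights, bag, rounds):
    """Alternate between selecting the top anchor and depressing it."""
    bag = [list(f) for f in bag]  # work on a copy, leave the input intact
    history = []
    for _ in range(rounds):
        best = select_top_k(weights, bag, 1)[0]
        history.append(best)
        bag[best] = depress(weights, bag[best])
    return history, bag


# Hypothetical bag of three anchors, each with a 3-dimensional feature.
weights = [1.0, 0.5, 2.0]
bag = [[1.0, 0.5, 1.0], [0.5, 1.0, 0.75], [0.5, 0.5, 0.25]]
history, bag_after = selection_depression(weights, bag, rounds=2)
print(history)  # [0, 1]
```

In this toy run, depressing the first selected anchor lowers its confidence enough that a different anchor is selected in the next round, which is the sense in which depression lets several anchors in a bag take part in learning. In an actual detector, the scores would come from the network and this loop would sit inside training rather than run on fixed features.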

Code

KevinKecc/MAL

DeLightCMU/MAL

DeLightCMU/MAL-inference

Tasks

General Classification

Multiple Instance Learning

Object

Object Detection

Datasets

MS COCO

ssd

Results from the Paper

Ranked #105 on Object Detection on COCO test-dev

Methods

1x1 Convolution • Convolution • Focal Loss • FPN • RetinaNet
