Focal Loss for Dense Object Detection

The highest accuracy object detectors to date are based on a two-stage approach popularized by R-CNN, where a classifier is applied to a sparse set of candidate object locations. In contrast, one-stage detectors that are applied over a regular, dense sampling of possible object locations have the potential to be faster and simpler, but have trailed the accuracy of two-stage detectors thus far. In this paper, we investigate why this is the case. We discover that the extreme foreground-background class imbalance encountered during training of dense detectors is the central cause. We propose to address this class imbalance by reshaping the standard cross entropy loss such that it down-weights the loss assigned to well-classified examples. Our novel Focal Loss focuses training on a sparse set of hard examples and prevents the vast number of easy negatives from overwhelming the detector during training. To evaluate the effectiveness of our loss, we design and train a simple dense detector we call RetinaNet. Our results show that when trained with the focal loss, RetinaNet is able to match the speed of previous one-stage detectors while surpassing the accuracy of all existing state-of-the-art two-stage detectors. Code is at: https://github.com/facebookresearch/Detectron.

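Since the abstract describes the loss only in words, the sketch below shows the focal-loss idea in PyTorch: the standard binary cross-entropy term is scaled by a modulating factor (1 - p_t)^gamma plus an alpha balancing weight, using the paper's default settings gamma = 2 and alpha = 0.25. This is a minimal illustration, not the Detectron implementation linked above; the function name and tensor shapes are assumptions made for the example.

```python
import torch
import torch.nn.functional as F

def focal_loss(logits, targets, alpha=0.25, gamma=2.0):
    """Binary focal loss: FL(p_t) = -alpha_t * (1 - p_t)**gamma * log(p_t).

    logits:  raw per-anchor classification scores, shape (N,)
    targets: binary labels in {0., 1.} as floats, shape (N,)
    alpha, gamma: the paper's defaults (alpha = 0.25, gamma = 2).
    """
    p = torch.sigmoid(logits)
    # Plain cross entropy, i.e. -log(p_t), computed per example.
    ce = F.binary_cross_entropy_with_logits(logits, targets, reduction="none")
    # Probability assigned to the ground-truth class.
    p_t = p * targets + (1 - p) * (1 - targets)
    # Class-balancing weight alpha_t.
    alpha_t = alpha * targets + (1 - alpha) * (1 - targets)
    # Modulating factor down-weights well-classified (easy) examples.
    loss = alpha_t * (1 - p_t) ** gamma * ce
    # Normalize by the number of positive anchors, as described in the paper.
    return loss.sum() / targets.sum().clamp(min=1)
```

In a dense detector such as RetinaNet, `logits` would be the classification scores of every anchor across all feature-map locations, so the very large pool of easy background anchors contributes little to the summed loss and training is dominated by the hard examples.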

Results from the Paper


| Task | Dataset | Model | Metric | Value | Rank |
|---|---|---|---|---|---|
| Long-tail Learning | COCO-MLT | Focal Loss (ResNet-50) | Average mAP | 49.46 | #8 |
| Object Detection | COCO-O | RetinaNet (ResNet-50) | Average mAP | 16.6 | #39 |
| | | | Effective Robustness | 0.18 | #33 |
| Object Detection | COCO test-dev | RetinaNet (ResNeXt-101-FPN) | box mAP | 40.8 | #171 |
| | | | AP50 | 61.1 | #113 |
| | | | AP75 | 44.1 | #122 |
| | | | APS | 24.1 | #97 |
| | | | APM | 44.2 | #108 |
| | | | APL | 51.2 | #122 |
| | | | Hardware Burden | 4G | #1 |
| | | | Operations per network pass | None | #1 |
| Region Proposal | COCO test-dev | RPN + Focal Loss | AR100 | 50.2 | #3 |
| | | | AR1000 | 60.9 | #3 |
| | | | ARL | 67.5 | #3 |
| | | | ARM | 58.2 | #3 |
| | | | ARS | 33.9 | #2 |
| | | | AR300 | 56.6 | #2 |
| Object Detection | COCO test-dev | RetinaNet (ResNet-101-FPN) | box mAP | 39.1 | #191 |
| | | | AP50 | 59.1 | #132 |
| | | | AP75 | 42.3 | #132 |
| | | | APS | 21.8 | #122 |
| | | | APM | 42.7 | #119 |
| | | | APL | 50.2 | #131 |
| | | | Hardware Burden | 4G | #1 |
| | | | Operations per network pass | None | #1 |
| Long-tail Learning | EGTEA | Focal Loss (3D-ResNeXt-101) | Average Precision | 59.09 | #3 |
| | | | Average Recall | 59.17 | #3 |
| 2D Object Detection | SARDet-100K | RetinaNet | box mAP | 47.4 | #10 |
| Pedestrian Detection | TJU-Ped-campus | RetinaNet | R (miss rate) | 34.73 | #5 |
| | | | RS (miss rate) | 82.99 | #3 |
| | | | HO (miss rate) | 71.31 | #3 |
| | | | R+HO (miss rate) | 42.26 | #5 |
| | | | ALL (miss rate) | 44.34 | #5 |
| Pedestrian Detection | TJU-Ped-traffic | RetinaNet | R (miss rate) | 23.89 | #5 |
| | | | RS (miss rate) | 37.92 | #4 |
| | | | HO (miss rate) | 61.60 | #5 |
| | | | R+HO (miss rate) | 28.45 | #4 |
| | | | ALL (miss rate) | 41.40 | #5 |
| Face Verification | Trillion Pairs Dataset | F-Softmax | Accuracy | 37.14 | #5 |
| Face Identification | Trillion Pairs Dataset | F-Softmax | Accuracy | 39.80 | #5 |
| Long-tail Learning | VOC-MLT | Focal Loss (ResNet-50) | Average mAP | 73.88 | #10 |

Results from Other Papers


| Task | Dataset | Model | Metric | Value | Rank |
|---|---|---|---|---|---|
| Object Counting | CARPK | RetinaNet (2018) | MAE | 24.58 | #11 |
| Dense Object Detection | SKU-110K | RetinaNet | AP | 45.5 | #5 |
| | | | AP75 | 0.389 | #1 |

Methods