Single Shot Text Detector with Regional Attention

ICCV 2017 · Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, Xiaolin Li

We present a novel single-shot text detector that directly outputs word-level bounding boxes in a natural image. We propose an attention mechanism which roughly identifies text regions via an automatically learned attentional map. This substantially suppresses background interference in the convolutional features, which is the key to producing accurate inference of words, particularly at extremely small sizes. This results in a single model that essentially works in a coarse-to-fine manner. It departs from recent FCN-based text detectors which cascade multiple FCN models to achieve an accurate prediction. Furthermore, we develop a hierarchical inception module which efficiently aggregates multi-scale inception features. This enhances local details, and also encodes strong context information, allowing the detector to work reliably on multi-scale and multi-orientation text with single-scale images. Our text detector achieves an F-measure of 77% on the ICDAR 2015 benchmark, advancing the state-of-the-art results in [18, 28]. Demo is available at: http://sstd.whuang.org/.
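To make the two ideas in the abstract concrete, below is a minimal sketch in PyTorch of (a) an attentional map used to re-weight convolutional features and (b) a multi-branch block that aggregates multi-scale features. Module names, layer sizes, and the wiring are illustrative assumptions, not the authors' released implementation.

```python
# Illustrative sketch only: shapes, layer choices, and names are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class TextAttentionModule(nn.Module):
    """Predicts a coarse text/non-text attention map and uses it to
    re-weight the convolutional features (background suppression)."""

    def __init__(self, in_channels):
        super().__init__()
        # Per-pixel 2-class (text vs. background) score map.
        self.score = nn.Conv2d(in_channels, 2, kernel_size=1)

    def forward(self, feats):
        # Softmax over the two classes; keep the "text" probability channel.
        attn = F.softmax(self.score(feats), dim=1)[:, 1:2]   # (N, 1, H, W)
        # Element-wise re-weighting of every feature channel by the map.
        return feats * attn, attn


class MultiScaleAggregationBlock(nn.Module):
    """Parallel branches with different receptive fields, fused by a 1x1 conv;
    a simplified stand-in for the paper's hierarchical inception module."""

    def __init__(self, in_channels, branch_channels=64):
        super().__init__()
        self.b1 = nn.Conv2d(in_channels, branch_channels, kernel_size=1)
        self.b3 = nn.Conv2d(in_channels, branch_channels, kernel_size=3, padding=1)
        self.b5 = nn.Conv2d(in_channels, branch_channels, kernel_size=3,
                            padding=2, dilation=2)  # larger receptive field
        self.fuse = nn.Conv2d(3 * branch_channels, in_channels, kernel_size=1)

    def forward(self, feats):
        out = torch.cat([self.b1(feats), self.b3(feats), self.b5(feats)], dim=1)
        return F.relu(self.fuse(out))


if __name__ == "__main__":
    feats = torch.randn(1, 256, 64, 64)                 # backbone feature map
    attended, attn_map = TextAttentionModule(256)(feats)
    fused = MultiScaleAggregationBlock(256)(attended)
    print(attn_map.shape, fused.shape)
```

The attention map here acts as a soft mask applied before detection heads, which is the coarse-to-fine behaviour the abstract describes; the multi-branch block shows one common way to mix fine local detail with wider context in a single forward pass.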


Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
| --- | --- | --- | --- | --- | --- |
| Scene Text Detection | COCO-Text | SSTD | F-Measure | 37 | # 4 |
| Scene Text Detection | COCO-Text | SSTD | Precision | 46 | # 4 |
| Scene Text Detection | COCO-Text | SSTD | Recall | 31 | # 4 |
| Scene Text Detection | ICDAR 2013 | SSTD | F-Measure | 87% | # 10 |
| Scene Text Detection | ICDAR 2013 | SSTD | Precision | 88 | # 12 |
| Scene Text Detection | ICDAR 2013 | SSTD | Recall | 86 | # 7 |
| Scene Text Detection | ICDAR 2015 | EAST + PVANET2x RBOX (multi-scale) | F-Measure | 80.7 | # 36 |
| Scene Text Detection | ICDAR 2015 | EAST + PVANET2x RBOX (multi-scale) | Precision | 83.3 | # 37 |
| Scene Text Detection | ICDAR 2015 | EAST + PVANET2x RBOX (multi-scale) | Recall | 78.3 | # 34 |
