SPTS v2: Single-Point Scene Text Spotting

End-to-end scene text spotting has made significant progress due to the intrinsic synergy between text detection and recognition. Previous methods commonly regard manual annotations such as horizontal rectangles, rotated rectangles, quadrangles, and polygons as a prerequisite, which are far more expensive than single-point annotations. Our new framework, SPTS v2, allows us to train high-performing text-spotting models using single-point annotations alone. SPTS v2 retains the advantage of the auto-regressive Transformer through an Instance Assignment Decoder (IAD) that sequentially predicts the center points of all text instances within a single sequence, while a Parallel Recognition Decoder (PRD) recognizes the text of each instance in parallel, significantly reducing the required sequence length. The two decoders share the same parameters and are interactively connected through a simple but effective information-transmission process that passes gradients and information between them. Comprehensive experiments on various existing benchmark datasets demonstrate that SPTS v2 outperforms previous state-of-the-art single-point text spotters with fewer parameters while achieving 19$\times$ faster inference. Within the SPTS v2 framework, our experiments suggest a potential preference for the single-point representation in scene text spotting over other representations. Such an attempt opens significant opportunities for scene text spotting applications beyond existing paradigms. Code is available at: https://github.com/Yuliang-Liu/SPTSv2.
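
To make the two-decoder design concrete, below is a minimal PyTorch sketch of the idea described in the abstract: an autoregressive pass over quantized center-point tokens (the IAD role), whose hidden states condition a per-instance recognition pass run in parallel (the PRD role), with both passes sharing one decoder's parameters. Everything here (the class name `SPTSv2Sketch`, the coordinate-bin vocabulary, and the mean-pooling used as the "information transmission" step) is a hypothetical simplification for illustration, not the authors' implementation; consult the linked repository for the real code.

```python
# Minimal sketch of the SPTS v2 two-decoder idea -- NOT the authors' code.
# Assumptions (hypothetical): (x, y) coordinates are quantized into num_bins
# tokens, image features arrive as a flat sequence, and one shared
# nn.TransformerDecoder plays both the IAD and PRD roles.
import torch
import torch.nn as nn

class SPTSv2Sketch(nn.Module):
    def __init__(self, num_bins=1000, vocab_size=100, d_model=256,
                 nhead=8, num_layers=6):
        super().__init__()
        # One token table covers coordinate bins plus character classes.
        self.embed = nn.Embedding(num_bins + vocab_size, d_model)
        layer = nn.TransformerDecoderLayer(d_model, nhead, batch_first=True)
        # Shared parameters: the same decoder serves both decoding passes.
        self.decoder = nn.TransformerDecoder(layer, num_layers)
        self.head = nn.Linear(d_model, num_bins + vocab_size)

    def forward(self, img_feats, point_tokens, char_tokens):
        # --- IAD role: autoregressive pass over the center-point sequence.
        # point_tokens: (B, 2*N) quantized (x, y) pairs for N instances.
        T = point_tokens.size(1)
        causal = nn.Transformer.generate_square_subsequent_mask(T)
        pt_hidden = self.decoder(self.embed(point_tokens), img_feats,
                                 tgt_mask=causal)
        point_logits = self.head(pt_hidden)

        # --- Information transmission (simplified here to mean pooling):
        # pair the (x, y) features into one query per instance (B, N, D).
        B, _, D = pt_hidden.shape
        inst_feats = pt_hidden.view(B, -1, 2, D).mean(dim=2)

        # --- PRD role: transcriptions of all instances decoded in parallel.
        # char_tokens: (B, N, L); fold instances into the batch dimension.
        N = inst_feats.size(1)
        ch_emb = self.embed(char_tokens).flatten(0, 1)        # (B*N, L, D)
        ch_emb = ch_emb + inst_feats.reshape(B * N, 1, D)     # inject point cue
        mem = img_feats.repeat_interleave(N, dim=0)
        causal_c = nn.Transformer.generate_square_subsequent_mask(ch_emb.size(1))
        ch_hidden = self.decoder(ch_emb, mem, tgt_mask=causal_c)
        char_logits = self.head(ch_hidden).view(B, N, -1,
                                                self.head.out_features)
        return point_logits, char_logits

# Toy usage: one image, 2 instances (4 point tokens), 25 characters each.
model = SPTSv2Sketch()
feats = torch.randn(1, 49, 256)                  # e.g. a 7x7 backbone grid
pts = torch.randint(0, 1000, (1, 4))
chars = torch.randint(1000, 1100, (1, 2, 25))
point_logits, char_logits = model(feats, pts, chars)
print(point_logits.shape, char_logits.shape)     # (1, 4, 1100) (1, 2, 25, 1100)
```

Because the recognition pass folds all instances into the batch dimension, its cost no longer grows with the number of instances in sequence length, which is the intuition behind the abstract's claim that the PRD significantly shortens the required sequence.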


Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Text Spotting | ICDAR 2015 | SPTS v2 | F-measure (%) - Strong Lexicon | 82.3 | #15 |
| Text Spotting | ICDAR 2015 | SPTS v2 | F-measure (%) - Weak Lexicon | 77.7 | #14 |
| Text Spotting | ICDAR 2015 | SPTS v2 | F-measure (%) - Generic Lexicon | 72.6 | #11 |