The goal of COCO-Text is to advance state-of-the-art in text detection and recognition in natural images.
In this paper, we investigate the problem of scene text recognition, which is among the most important and challenging tasks in image-based sequence recognition.
Many new proposals for scene text recognition (STR) models have been introduced in recent years.
Due to the fact that there are large geometrical margins among the minimal scale kernels, our method is effective to split the close text instances, making it easier to use segmentation-based methods to detect arbitrary-shaped text instances.
#4 best model for Scene Text Detection on SCUT-CTW1500
Yet, the widely adopted horizontal bounding box representation is not appropriate for ubiquitous oriented objects such as objects in aerial images and scene texts.
In this paper, we present an end-to-end trainable fast scene text detector, named TextBoxes++, which detects arbitrary-oriented scene text with both high accuracy and efficiency in a single network forward pass.
#2 best model for Scene Text Detection on COCO-Text
Scene text image contains two levels of contents: visual texture and semantic information.