GDB: Gated convolutions-based Document Binarization

4 Feb 2023  ยท  Zongyuan Yang, Yongping Xiong, Guibin Wu ยท

Document binarization is a key pre-processing step for many document analysis tasks. However, existing methods can not extract stroke edges finely, mainly due to the fair-treatment nature of vanilla convolutions and the extraction of stroke edges without adequate supervision by boundary-related information. In this paper, we formulate text extraction as the learning of gating values and propose an end-to-end gated convolutions-based network (GDB) to solve the problem of imprecise stroke edge extraction. The gated convolutions are applied to selectively extract the features of strokes with different attention. Our proposed framework consists of two stages. Firstly, a coarse sub-network with an extra edge branch is trained to get more precise feature maps by feeding a priori mask and edge. Secondly, a refinement sub-network is cascaded to refine the output of the first stage by gated convolutions based on the sharp edge. For global information, GDB also contains a multi-scale operation to combine local and global features. We conduct comprehensive experiments on ten Document Image Binarization Contest (DIBCO) datasets from 2009 to 2019. Experimental results show that our proposed methods outperform the state-of-the-art methods in terms of all metrics on average and achieve top ranking on six benchmark datasets.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Binarization DIBCO 2009 GDB F-Measure 94.79 # 2
Pseudo-F-measure 96.36 # 2
PSNR 20.68 # 2
DRD 1.7 # 2
Binarization DIBCO 2011 GDB PSNR 20.74 # 3
F-Measure 94.64 # 4
DRD 1.66 # 4
Pseudo-F-measure 96.74 # 4
Binarization DIBCO 2013 GDB F-Measure 95.86 # 3
Pseudo-F-measure 97.22 # 4
PSNR 22.89 # 2
DRD 1.27 # 2
Binarization DIBCO 2017 GDB F-Measure 94.32 # 1
DRD 1.79 # 1
PSNR 20.04 # 1
Pseudo-F-measure 96.58 # 1
Binarization DIBCO 2019 GDB F-Measure 73.88 # 2
Pseudo-F-measure 74.96 # 2
PSNR 14.8 # 2
DRD 10.41 # 2
Binarization H-DIBCO 2010 GDB F-Measure 95.19 # 1
Pseudo-F-measure 96.62 # 1
PSNR 21.98 # 1
DRD 1.32 # 1
Binarization H-DIBCO 2012 GDB PSNR 22.62 # 2
F-Measure 95.8 # 2
DRD 1.32 # 2
Pseudo-F-measure 97.03 # 2
Binarization H-DIBCO 2014 GDB F-Measure 97.58 # 1
Pseudo-F-measure 98.27 # 3
PSNR 23.74 # 1
DRD 0.72 # 2
Binarization H-DIBCO 2016 GDB F-Measure 90.41 # 6
PSNR 19 # 6
DRD 3.34 # 6
Pseudo-F-measure 94.7 # 4
Binarization H-DIBCO 2018 GDB PSNR 19.92 # 4
F-Measure 91.09 # 4
DRD 3.07 # 4
Pseudo-F-measure 94.57 # 3

Methods