An efficient way for segmentation of Bangla characters in printed document using curved scanning

13 May 2016 · Ahnaf Farhan Rownak; Md. Fazle Rabby; Sabir Ismail; Md. Saiful Islam

The main cause of poor output in Optical Character Recognition (OCR) for Bangla text is segmentation error. Varying character shapes, connected characters, modifiers at the top and bottom, and overlapping regions between consecutive characters are the main obstacles to effective segmentation of printed Bangla text. This paper introduces an efficient strategy for segmenting characters whose regions overlap with those of other characters. The proposed strategy achieves 99.8% accuracy in line segmentation, 99.5% accuracy in word segmentation, and 99% accuracy in character segmentation. The remaining errors occur when two consecutive characters have multiple touching points.
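As a rough illustration of the line segmentation stage mentioned in the abstract, the sketch below uses a plain horizontal projection scan, a common baseline technique. It is not the curved scanning method of the paper, whose details the abstract does not give. The function name `segment_lines` and the 0/1 list-of-rows image format are assumptions made for this example.

```python
from typing import List, Tuple


def segment_lines(image: List[List[int]]) -> List[Tuple[int, int]]:
    """Return (start_row, end_row) pairs, end exclusive, one per text line.

    `image` is assumed to be a binarized page: 1 = ink pixel, 0 = background.
    A text line is taken to be a maximal run of rows that contain any ink.
    """
    lines = []
    start = None  # first row of the text line currently being scanned
    for y, row in enumerate(image):
        has_ink = any(row)
        if has_ink and start is None:
            start = y                  # a new text line begins
        elif not has_ink and start is not None:
            lines.append((start, y))   # a blank row closes the line
            start = None
    if start is not None:              # page ends while still inside a line
        lines.append((start, len(image)))
    return lines


# Tiny hypothetical page: two text lines separated by one blank row.
page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 0, 0, 1],
]
print(segment_lines(page))  # [(1, 3), (4, 5)]
```

The same scan applied to columns separates words or characters only where a fully blank column exists between them. Overlapping regions between consecutive characters, the case the abstract targets, leave no such column, which is presumably the motivation for scanning along a curve instead of a straight line.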
