Adaptive script-independent text line xtraction

Majid Ziaratban; Karim Faez

Journal ArticleOPEN ACCESS

Adaptive script-independent text line xtraction

IEICE Transactions on Information and Systems (2011) E94-D(4) 866-877

DOI: 10.1587/transinf.E94.D.866

2Citations

6Readers

Abstract

In this paper, an adaptive block-based text line extraction algorithm is proposed. Three global and two local parameters are defined to adapt the method to various handwritings in different languages. A document image is segmented into several overlapping blocks. The skew of each block is estimated. Text block is de-skewed by using the estimated skew angle. Text regions are detected in the de-skewed text block. A number of data points are extracted from the detected text regions in each block. These data points are used to estimate the paths of text lines. By thinning the background of the image including text line paths, text line boundaries or separators are estimated. Furthermore, an algorithm is proposed to assign to the extracted text lines the connected components which have intersections with the estimated separators. Extensive experiments on different standard datasets in various languages demonstrate that the proposed algorithm outperforms previous methods. © 2011 The Institute of Electronics, Information and Communication Engineers.

Author supplied keywords

Cite

CITATION STYLE

APA

Ziaratban, M., & Faez, K. (2011). Adaptive script-independent text line xtraction. IEICE Transactions on Information and Systems, E94-D(4), 866–877. https://doi.org/10.1587/transinf.E94.D.866

Adaptive script-independent text line xtraction

Abstract

Author supplied keywords

Cite

Register to see more suggestions