Adaptive script-independent text line xtraction

2Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

In this paper, an adaptive block-based text line extraction algorithm is proposed. Three global and two local parameters are defined to adapt the method to various handwritings in different languages. A document image is segmented into several overlapping blocks. The skew of each block is estimated. Text block is de-skewed by using the estimated skew angle. Text regions are detected in the de-skewed text block. A number of data points are extracted from the detected text regions in each block. These data points are used to estimate the paths of text lines. By thinning the background of the image including text line paths, text line boundaries or separators are estimated. Furthermore, an algorithm is proposed to assign to the extracted text lines the connected components which have intersections with the estimated separators. Extensive experiments on different standard datasets in various languages demonstrate that the proposed algorithm outperforms previous methods. © 2011 The Institute of Electronics, Information and Communication Engineers.

Cite

CITATION STYLE

APA

Ziaratban, M., & Faez, K. (2011). Adaptive script-independent text line xtraction. IEICE Transactions on Information and Systems, E94-D(4), 866–877. https://doi.org/10.1587/transinf.E94.D.866

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free