Abstract
Text/graphics separation aims at segmenting the document into two layers: a layer assumed to contain text and a layer containing graphical objects. In this paper, we present a consolidation of a method proposed by Fletcher and Kasturi, with a number of improvements to make it more suitable for graphics-rich documents. We discuss the right choice of thresholds for this method, and their stability. We also propose a post-processing step for retrieving text components touching the graphics, through local segmentation of the distance skeleton.
Cite
Tombre, K., Tabbone, S., Pélissier, L., Lamiroy, B., & Dosch, P. (2002). Text/graphics separation revisited. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2423, pp. 200–211). Springer Verlag. https://doi.org/10.1007/3-540-45869-7_24