Page segmentation using document model

11Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Transforming a paper document to its electronic version in a form suitable for efficient storage, retrieval and interpretation continues to be a challenging problem. An efficient document model is necessary to solve this problem. Document modeling involves techniques of thresholding, skew detection, geometric layout analysis and logical layout analysis. The derived model can then be used in document storage and retrieval. In this paper, we use the traditional bottom-up approach based on the connected component extraction to efficiently implement page segmentation and region identification. A new document model which preserves top-down generation information is proposed based on which a document is logically represented for interactive editing storage, retrieval, transfer and logical analysis.

Cite

CITATION STYLE

APA

Jain, A. K., & Yu, B. (1997). Page segmentation using document model. In Proceedings of the International Conference on Document Analysis and Recognition, ICDAR (Vol. 1, pp. 34–38). IEEE. https://doi.org/10.1109/icdar.1997.619809

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free