Feature learning for footnote-based document image classification

Abstract

Classifying document images is challenging, chiefly because of the reliance on hand-designed features and the scarcity of labeled data. This paper presents a new approach to classifying document images according to whether they contain footnotes. The approach relies on a Deep Belief Network (DBN) trained in two phases: unsupervised pre-training and supervised fine-tuning. Its main advantage is that it learns the features to extract directly from the raw document image, producing an efficient representation without manual feature engineering. This feature learning exploits the large amount of available unlabeled data alongside the limited labeled data. The results show that the proposed approach provides an effective document image classification framework with highly reliable performance.
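The two-phase idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes binarized, flattened image patches, stacks restricted Boltzmann machines (RBMs) trained with one-step contrastive divergence as the unsupervised phase, and replaces full supervised fine-tuning with a simpler step that trains only a logistic output layer on the frozen features. The layer sizes, learning rates, and the random synthetic data are all hypothetical placeholders.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """Bernoulli RBM trained with one-step contrastive divergence (CD-1)."""

    def __init__(self, n_visible, n_hidden, rng):
        self.rng = rng
        self.W = 0.01 * rng.standard_normal((n_visible, n_hidden))
        self.b_v = np.zeros(n_visible)
        self.b_h = np.zeros(n_hidden)

    def hidden_probs(self, v):
        return sigmoid(v @ self.W + self.b_h)

    def visible_probs(self, h):
        return sigmoid(h @ self.W.T + self.b_v)

    def cd1_step(self, v0, lr=0.1):
        # Positive phase: hidden activations driven by the data.
        h0 = self.hidden_probs(v0)
        h0_sample = (self.rng.random(h0.shape) < h0).astype(float)
        # Negative phase: one Gibbs step back to the visible layer and up again.
        v1 = self.visible_probs(h0_sample)
        h1 = self.hidden_probs(v1)
        n = v0.shape[0]
        self.W += lr * (v0.T @ h0 - v1.T @ h1) / n
        self.b_v += lr * (v0 - v1).mean(axis=0)
        self.b_h += lr * (h0 - h1).mean(axis=0)

def pretrain(unlabeled, layer_sizes, epochs=20, seed=0):
    """Greedy layer-wise pre-training: each RBM learns from the layer below."""
    rng = np.random.default_rng(seed)
    rbms, data = [], unlabeled
    for n_hidden in layer_sizes:
        rbm = RBM(data.shape[1], n_hidden, rng)
        for _ in range(epochs):
            rbm.cd1_step(data)
        rbms.append(rbm)
        data = rbm.hidden_probs(data)
    return rbms

def encode(rbms, x):
    """Pass inputs through the stacked RBMs to get learned features."""
    for rbm in rbms:
        x = rbm.hidden_probs(x)
    return x

def train_classifier(features, labels, epochs=200, lr=0.5):
    """Logistic regression on frozen features (simplified supervised phase)."""
    w, b = np.zeros(features.shape[1]), 0.0
    for _ in range(epochs):
        grad = sigmoid(features @ w + b) - labels
        w -= lr * features.T @ grad / len(labels)
        b -= lr * grad.mean()
    return w, b

# Synthetic stand-ins: many unlabeled samples, few labeled ones (assumption).
rng = np.random.default_rng(1)
unlabeled = (rng.random((200, 64)) < 0.5).astype(float)
labeled_x = (rng.random((20, 64)) < 0.5).astype(float)
labeled_y = rng.integers(0, 2, size=20).astype(float)  # 1 = has footnote

rbms = pretrain(unlabeled, layer_sizes=[32, 16])
features = encode(rbms, labeled_x)
w, b = train_classifier(features, labeled_y)
predictions = (sigmoid(features @ w + b) > 0.5).astype(int)
```

In a complete DBN the supervised phase would also update the pre-trained RBM weights by backpropagation; freezing them here keeps the sketch short while still showing how unlabeled data shapes the features used by the classifier.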

Cite

Abuelwafa, S., Mhiri, M., Hedjam, R., Zhalehpour, S., Piper, A., Wellmon, C., & Cheriet, M. (2017). Feature learning for footnote-based document image classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10317 LNCS, pp. 643–650). Springer Verlag. https://doi.org/10.1007/978-3-319-59876-5_71
