Improving accuracy and speeding up document image classification through parallel systems

Javier Ferrando; Juan Luis Domínguez; Jordi Torres; Raúl García; David García; Daniel Garrido; Jordi Cortada; Mateo Valero

Conference Proceedings (Open Access)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12138 LNCS 387-400

DOI: 10.1007/978-3-030-50417-5_29

29 Citations

40 Readers

Abstract

This paper presents a study showing the benefits of EfficientNet models over heavier Convolutional Neural Networks (CNNs) in the document classification task, an essential problem in the digitalization process of institutions. We show on the RVL-CDIP dataset that previous results can be improved with a much lighter model, and we present its transfer learning capabilities on a smaller in-domain dataset, Tobacco3482. Moreover, we present an ensemble pipeline that improves on image-only input by combining the image model's predictions with those generated by a BERT model on text extracted by OCR. We also show that the batch size can be effectively increased without hindering accuracy, so that training can be sped up by parallelizing across multiple GPUs, decreasing the computational time needed. Lastly, we report the training performance differences between the PyTorch and TensorFlow deep learning frameworks.
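
The abstract describes the ensemble only at a high level, so the following is a minimal sketch of one common way to combine two classifiers, not the authors' actual method. It assumes each model outputs one logit per document class and that the predictions are fused by a weighted average of softmax probabilities. The function names and the `image_weight` parameter are hypothetical and do not come from the paper.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (shifted by the max for numerical stability)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_predict(image_logits, text_logits, image_weight=0.5):
    """Fuse image-model and text-model predictions (assumed late-fusion scheme).

    image_logits: per-class scores from the image model (e.g. a CNN).
    text_logits:  per-class scores from the text model run on OCR output.
    image_weight: hypothetical mixing weight in [0, 1] for the image model.
    Returns (predicted_class_index, combined_probabilities).
    """
    if len(image_logits) != len(text_logits):
        raise ValueError("both models must score the same set of classes")
    if not 0.0 <= image_weight <= 1.0:
        raise ValueError("image_weight must lie in [0, 1]")

    p_image = softmax(image_logits)
    p_text = softmax(text_logits)
    combined = [
        image_weight * pi + (1.0 - image_weight) * pt
        for pi, pt in zip(p_image, p_text)
    ]
    return combined.index(max(combined)), combined


if __name__ == "__main__":
    # Toy scores for three classes: the image model favors class 0,
    # while the text model strongly favors class 1.
    label, probs = ensemble_predict([2.0, 1.0, 0.0], [0.0, 3.0, 0.0])
    print(label, [round(p, 3) for p in probs])
```

In this toy case the text model's confident vote overrides the image model's weaker preference, which illustrates how text extracted by OCR can correct an image-only prediction. In practice the mixing weight would be tuned on a validation set.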

Citation (APA)

Ferrando, J., Domínguez, J. L., Torres, J., García, R., García, D., Garrido, D., … Valero, M. (2020). Improving accuracy and speeding up document image classification through parallel systems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12138 LNCS, pp. 387–400). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-50417-5_29
