This paper proposes a novel approach for scanned document categorization using a deep neural network architecture. The proposed approach leverages the strengths of both convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to extract features from the scanned documents and model the dependencies between words in the documents. The pre-processed documents are first fed into a CNN, which learns and extracts features from the images. The extracted features are then passed through an RNN, which models the sequential nature of the text. The RNN produces a probability distribution over the predefined categories, and the document is classified into the category with the highest probability. The proposed approach is evaluated on a dataset of scanned documents, where each document is categorized into one of four predefined categories. The experimental results demonstrate that the proposed approach achieves high accuracy and outperforms existing methods. The proposed approach achieves an overall accuracy of 97.3%, which is significantly higher than the existing methods' accuracy. Additionally, the proposed approach's performance was robust to variations in the quality of the scanned documents and the OCR accuracy. The contributions of this paper are twofold. Firstly, it proposes a novel approach for scanned document categorization using deep neural networks that leverages the strengths of CNNs and RNNs. Secondly; it demonstrates the effectiveness of the proposed approach on a dataset of scanned documents, highlighting its potential applications in various domains, such as information retrieval, data mining, and document management. The proposed approach can help organizations manage and analyze large volumes of data efficiently.
CITATION STYLE
Baimakhanova, A., Zhumadillayeva, A., Avdarsol, S., Zhabayev, Y., Revshenova, M., Aimeshov, Z., & Uxikbayev, Y. (2023). Automatic Classification of Scanned Electronic University Documents using Deep Neural Networks with Conv2D Layers. International Journal of Advanced Computer Science and Applications, 14(5), 694–701. https://doi.org/10.14569/IJACSA.2023.0140573
Mendeley helps you to discover research relevant for your work.