This paper presents a real database for the Arabic printed text recognition, APTID / MF (Arabic Printed Text Image Database / Multi-Font).This database can be used to evaluate the system that recognizes Arabic printed texts with an open vocabulary. APTID / MF may be also used for research in word segmentation and font identification. APTID / MF is obtained from 387 pages of Arabic printed documents scanned with grayscale format and 300 dpi resolutions. From this documents, 1,845 text-blocks have been extracted. In addition ground truth file is provided for each texts-block. APTID / MF also includes an Arabic printed character image dataset made up of 27,402 samples. The database is freely available to interested researchers. © 2013 Springer-Verlag.
Mendeley helps you to discover research relevant for your work.
CITATION STYLE
Jaiem, F. K., Kanoun, S., Khemakhem, M., El Abed, H., & Kardoun, J. (2013). Database for Arabic printed text recognition research. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8156 LNCS, pp. 251–259). https://doi.org/10.1007/978-3-642-41181-6_26