Spot words in printed historical arabic documents

Fattah Zirari; Abdel Ennaji; Driss Mammass; Stéphane Nicolas

Conference ProceedingsOPEN ACCESS

Spot words in printed historical arabic documents

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8509 LNCS 289-296

DOI: 10.1007/978-3-319-07998-1_33

0Citations

1Readers

Abstract

Libraries contain huge amounts of arabic printed historical documents which cannot be available on-line because they do not have a searchable index. The word spotting idea has previously been suggested as a solution to create indexes for such a collecton of documents by matching word images. In this paper we present a word spotting method for arabic printed historical document. We start with word segmentation using run length smoothing algorithm. The description of the features selected to represent the words images is given afterwards. Elastic Dynamic Time Warping is used for matching the features of the two words. This method was tested on the arabic historical printed document database of Moroccan National Library. © 2014 Springer International Publishing.

Author supplied keywords

Cite

CITATION STYLE

APA

Zirari, F., Ennaji, A., Mammass, D., & Nicolas, S. (2014). Spot words in printed historical arabic documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8509 LNCS, pp. 289–296). Springer Verlag. https://doi.org/10.1007/978-3-319-07998-1_33

Spot words in printed historical arabic documents

Abstract

Author supplied keywords

Cite

Register to see more suggestions