A new large-scale multi-purpose handwritten farsi database

Puntis Jifroodian Haghighi; Nicola Nobile; Chun Lei He; Ching Y. Suen

Conference Proceedings

A new large-scale multi-purpose handwritten farsi database

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5627 LNCS 278-286

DOI: 10.1007/978-3-642-02611-9_28

17Citations

14Readers

Get full text

Abstract

This paper introduces the Center for Pattern Recognition and Machine Intelligence (CENPARMI) Farsi dataset which can be used to measure the performance of handwritten recognition and word spotting systems. This dataset is unique in terms of its large number of gray and binary images (432,357 each) consisting of dates, words, isolated letters, isolated digits, numeral strings, special symbols, and documents. The data was collected from 400 native Farsi writers. The selection of Farsi words has been based on their high frequency in financial documents. The dataset is divided into grouped and ungrouped subsets which will give the user the flexibility of whether or not to use CENPARMI's pre-divided dataset (60% of the images are used as the Training set, 20% of the images as the Validation set, and the rest as the Testing set). Finally, experiments have been conducted on the Farsi isolated digits with a recognition rate of 96.85%. © 2009 Springer Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Haghighi, P. J., Nobile, N., He, C. L., & Suen, C. Y. (2009). A new large-scale multi-purpose handwritten farsi database. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5627 LNCS, pp. 278–286). https://doi.org/10.1007/978-3-642-02611-9_28

A new large-scale multi-purpose handwritten farsi database

Abstract

Author supplied keywords

Cite

Register to see more suggestions