Abstract
This paper proposes an innovative approach to improve the performance of Persian text classification. The proposed method uses a thesaurus as a helpful knowledge to obtain the real frequencies of words in the corpus. Three types of relationships are considered in our thesaurus. This is the first attempt to use a Persian thesaurus in the field of Persian information retrieval. Experimental results show a significant improvement in the case of employing Persian thesaurus rather common methods. © 2011 Springer-Verlag.
Author supplied keywords
Cite
CITATION STYLE
Parvin, H., Minaei-Bidgoli, B., & Dahbashi, A. (2011). Improving Persian text classification using Persian thesaurus. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7042 LNCS, pp. 391–398). https://doi.org/10.1007/978-3-642-25085-9_46
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.