Performance of Turkish information retrieval: Evaluating the impact of linguistic parameters and compound nouns

Abstract

Turkish is an agglutinative language in which linguistic parameters can significantly affect information retrieval performance. In this paper, several Turkish linguistic parameters (truncation, stemming, stop words, etc.) are studied and their impact on the performance of an information retrieval system is investigated. Three fixed-length word truncations (3, 4 and 5 characters) are studied, and the results are compared with those of the Snowball and Zemberek stemmers. Moreover, the effect of using compound nouns, in addition to simple keywords, to index queries and documents is studied. In the experimental part, the Milliyet test collection is tested with three information retrieval models. Performance is compared using the traditional information retrieval metrics and the bpref metric, since the test collection is built on incomplete relevance judgments. © 2014 Springer-Verlag Berlin Heidelberg.
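
To make two of the techniques named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It shows fixed-length truncation at 3, 4 and 5 characters and a bpref computation that ignores unjudged documents. The function names `truncate` and `bpref`, the sample word and the toy judgments are assumptions chosen for illustration, and the bpref variant (non-relevant count capped at R, divided by min(R, N)) is the commonly used formulation, which the paper may or may not follow exactly.

```python
def truncate(token: str, length: int) -> str:
    """Keep only the first `length` characters of a token.

    Tokens shorter than `length` are returned unchanged.
    """
    return token[:length]


def bpref(ranking, relevant, nonrelevant):
    """Compute bpref for one query.

    ranking     -- list of document ids in retrieved order
    relevant    -- set of ids judged relevant
    nonrelevant -- set of ids judged non-relevant
    Documents in neither set are unjudged and are skipped, which is
    why the measure suits collections with incomplete judgments.
    """
    r_count = len(relevant)
    n_count = len(nonrelevant)
    if r_count == 0:
        return 0.0
    denom = min(r_count, n_count)
    nonrel_seen = 0  # judged non-relevant documents ranked so far
    total = 0.0
    for doc in ranking:
        if doc in relevant:
            if denom == 0:
                # No judged non-relevant documents: nothing can outrank it.
                total += 1.0
            else:
                total += 1.0 - min(nonrel_seen, r_count) / denom
        elif doc in nonrelevant:
            nonrel_seen += 1
    return total / r_count


# Hypothetical example word: an inflected Turkish form sharing the prefix "kitap".
word = "kitaplarımızdan"
prefixes = [truncate(word, n) for n in (3, 4, 5)]

# Toy judgments (made up): d3 and d5 are unjudged.
score = bpref(["d1", "d2", "d3", "d4", "d5"], {"d1", "d4"}, {"d2"})
```

In this toy run the three prefixes are "kit", "kita" and "kitap", and the score is 0.5: the first relevant document has no judged non-relevant document above it, while the second has one out of min(R, N) = 1.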

Cite

Haddad, H., & Bechikh Ali, C. (2014). Performance of Turkish information retrieval: Evaluating the impact of linguistic parameters and compound nouns. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8404 LNCS, pp. 381–391). Springer Verlag. https://doi.org/10.1007/978-3-642-54903-8_32
