Italian monolingual information retrieval with PROSIT

Gianni Amati; Claudio Carpineto; Giovanni Romano

Journal Article

Italian monolingual information retrieval with PROSIT

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2003) 2785 257-264

DOI: 10.1007/978-3-540-45237-9_21

10Citations

2Readers

Get full text

Abstract

PROSIT (PRObabilistic Sifting of Information Terms) is a novel probabilistic information retrieval system that combines a term-weighting model based on deviation from randomness with information-theoretic query expansion. We report on the application of PROSIT to the Italian monolingual task at CLEF. We experimented with both standard PROSIT and with enhanced versions. In particular, we studied the use of bigrams and coordination level-based retrieval within the PROSIT framework. The main findings of our research are that (i) standard PROSIT was quite effective, with an average precision of 0.5116 on CLEF 2001 queries and 0.5019 on CLEF 2002 queries, (ii) bigrams were useful provided that they were incorporated into the main algorithm, and (iii) the benefits of coordination level-based retrieval were unclear. © Springer-Verlag Berlin Heidelberg 2003.

Cite

CITATION STYLE

APA

Amati, G., Carpineto, C., & Romano, G. (2003). Italian monolingual information retrieval with PROSIT. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2785, 257–264. https://doi.org/10.1007/978-3-540-45237-9_21

Italian monolingual information retrieval with PROSIT

Abstract

Cite

Register to see more suggestions