Performance comparison of Ad-Hoc retrieval models over full-text vs. titles of documents

Ahmed Saleh; Tilman Beck; Lukas Galke; Ansgar Scherp

Conference Proceedings

Performance comparison of Ad-Hoc retrieval models over full-text vs. titles of documents

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11279 LNCS 290-303

DOI: 10.1007/978-3-030-04257-8_30

0Citations

2Readers

Get full text

Abstract

While there are many studies on information retrieval models using full-text, there are presently no comparison studies of full-text retrieval vs. retrieval only over the titles of documents. On the one hand, the full-text of documents like scientific papers is not always available due to, e.g., copyright policies of academic publishers. On the other hand, conducting a search based on titles alone has strong limitations. Titles are short and therefore may not contain enough information to yield satisfactory search results. In this paper, we compare different retrieval models regarding their search performance on the full-text vs. only titles of documents. We use different datasets, including the three digital library datasets: EconBiz, IREON, and PubMed. The results show that it is possible to build effective title-based retrieval models that provide competitive results comparable to full-text retrieval. The difference between the average evaluation results of the best title-based retrieval models is only 3% less than those of the best full-text-based retrieval models.

Author supplied keywords

Cite

CITATION STYLE

APA

Saleh, A., Beck, T., Galke, L., & Scherp, A. (2018). Performance comparison of Ad-Hoc retrieval models over full-text vs. titles of documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11279 LNCS, pp. 290–303). Springer Verlag. https://doi.org/10.1007/978-3-030-04257-8_30

Performance comparison of Ad-Hoc retrieval models over full-text vs. titles of documents

Abstract

Author supplied keywords

Cite

Register to see more suggestions