Abstract
For ordinary users, the task of accessing knowledge graphs through structured query languages like SPARQL is rather demanding. As a result, various approaches exploit the simpler and widely used keyword-based search paradigm, either by translating keyword queries to structured queries, or by adopting classical information retrieval (IR) techniques. In this paper, we study and adapt Elasticsearch, an out-of-the-box document-centric IR system, for supporting keyword search over RDF datasets. Contrary to other works that mainly retrieve entities, we opt for retrieving triples, due to their expressiveness and informativeness. We specify the set of functional requirements and study the emerging questions related to the selection and weighting of the triple data to index, and the structuring and ranking of the retrieved results. Finally, we perform an extensive evaluation of the different factors that affect the IR performance for four different query types. The reported results are promising and offer useful insights on how different Elasticsearch configurations affect the retrieval effectiveness and efficiency.
Cite
CITATION STYLE
Kadilierakis, G., Fafalios, P., Papadakos, P., & Tzitzikas, Y. (2020). Keyword Search over RDF Using Document-Centric Information Retrieval Systems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12123 LNCS, pp. 121–137). Springer. https://doi.org/10.1007/978-3-030-49461-2_8
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.