For more than a decade, research on OLAP and multidimensional databases has generated methodologies, tools and resource management systems for the analysis of numeric data. With the growing availability of digital documents, there is a need to incorporate text-rich documents into multidimensional databases, together with an adapted framework for their analysis. This paper presents a new aggregation function that aggregates textual data in an OLAP environment. The Top_Keyword function (Top_Kw for short) represents a set of documents by their most significant terms using a weighting function from information retrieval: tf.idf. © 2008 Springer-Verlag Berlin Heidelberg.
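To illustrate the idea of summarising a group of documents by their highest-weighted terms, here is a minimal Python sketch. It is not the authors' definition of Top_Kw: the abstract does not give the exact formula, so the sketch assumes one common tf.idf variant (relative term frequency within the aggregated group, multiplied by the natural log of inverse document frequency over the whole collection). The function name `top_keywords`, its parameters, and the sample documents are hypothetical.

```python
import math
from collections import Counter


def top_keywords(group, corpus, k=2):
    """Return the k terms of `group` with the highest tf.idf score.

    Assumed weighting (not taken from the paper):
    tf  = occurrences of the term in the group / total terms in the group
    idf = ln(number of corpus documents / documents containing the term)
    """
    # Term frequency over all documents being aggregated together.
    tf = Counter(term for doc in group for term in doc.lower().split())
    total = sum(tf.values())

    # Document frequency over the whole collection.
    df = Counter(term for doc in corpus for term in set(doc.lower().split()))
    n = len(corpus)

    # Terms absent from the corpus are skipped to avoid dividing by zero.
    scores = {
        term: (count / total) * math.log(n / df[term])
        for term, count in tf.items()
        if df[term] > 0
    }

    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores, key=lambda term: (-scores[term], term))[:k]


# Hypothetical collection; the first two documents form one aggregated cell.
corpus = [
    "olap cube query",
    "olap cube aggregation",
    "olap text document",
    "xml text document",
    "olap xml storage",
]
group = corpus[:2]

print(top_keywords(group, corpus, k=1))
```

In this toy collection "olap" occurs in almost every document, so its idf is low and it ranks below "cube", which is frequent in the group but rare elsewhere. That is the behaviour a tf.idf-based aggregate is meant to capture.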
CITATION STYLE
Ravat, F., Teste, O., Tournier, R., & Zurfluh, G. (2008). Top_keyword: An aggregation function for textual document OLAP. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5182 LNCS, pp. 55–64). https://doi.org/10.1007/978-3-540-85836-2_6