Abstract
With the emergence of semi-structured data format (such as XML), the storage of documents in centralized facilities has slowly appeared as a natural adaptation of data warehousing technology. Nowadays, OLAP (On-Line Analytical Processing) systems face growing non-numeric data. This paper presents a framework for the multidimensional analysis of textual data in an OLAP sense. Document structure, document meta-data and document contents are converted into subjects of analysis (facts) and analysis axes (dimensions) within an adapted star schema. This allows greater multidimensional analysis possibilities. This framework allows a user to gain insight within a collection of documents. © Lavoisier.
Author supplied keywords
Cite
CITATION STYLE
Ravat, F., Teste, O., & Tournier, R. (2007). Analyse multidimensionnelle de documents via des dimensions OLAP. Document Numerique, 10(2), 85–104. https://doi.org/10.3166/dn.10.2.85-104
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.