Abstract
Inspired by the relational algebra of data processing, this paper addresses the foundations of data analytical processing from a linear algebra perspective. The paper investigates, in particular, how aggregation operations such as cross tabulations and data cubes essential to quantitative analysis of data can be expressed solely in terms of matrix multiplication, transposition and the Khatri–Rao variant of the Kronecker product. The approach offers a basis for deriving an algebraic theory of data consolidation, handling the quantitative as well as qualitative sides of data science in a natural, elegant and typed way. It also shows potential for parallel analytical processing, as the parallelization theory of such matrix operations is well acknowledged.
Author supplied keywords
Cite
CITATION STYLE
Macedo, H. D., & Oliveira, J. N. (2015). A linear algebra approach to OLAP. Formal Aspects of Computing, 27(2), 283–307. https://doi.org/10.1007/s00165-014-0316-9
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.