Discovering knowledge from multi-relational data based on information retrieval theory

Rayner Alfred

Conference Proceedings

Discovering knowledge from multi-relational data based on information retrieval theory

Alfred R

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5678 LNAI 409-416

DOI: 10.1007/978-3-642-03348-3_39

0Citations

4Readers

Get full text

Abstract

Although the TF-IDF weighted frequency matrix (vector space model) has been widely studied and used in document clustering or document categorisation, there has been no attempt to extend this application to relational data that contain one-to-many associations between records. This paper explains the rationale for using TF-IDF (term frequency inverse document frequency), a technique for weighting data attributes, borrowed from Information Retrieval theory, to summarise datasets stored in a multi-relational setting with one-to-many relationships. A novel data summarisation algorithm based on TF-IDF is introduced, which is referred to as Dynamic Aggregation of Relational Attributes (DARA). The DARA algorithm applies clustering techniques in order to summarise these datasets. The experimental results show that using the DARA algorithm finds solutions with much greater accuracy. © 2009 Springer.

Author supplied keywords

Cite

CITATION STYLE

APA

Alfred, R. (2009). Discovering knowledge from multi-relational data based on information retrieval theory. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5678 LNAI, pp. 409–416). https://doi.org/10.1007/978-3-642-03348-3_39

Discovering knowledge from multi-relational data based on information retrieval theory

Abstract

Author supplied keywords

Cite

Register to see more suggestions