Towards flexible similarity analysis of XML data

Jesús M. Almendros-Jiménez; Alfredo Cuzzocrea

Conference Proceedings

Towards flexible similarity analysis of XML data

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9416 573-576

DOI: 10.1007/978-3-319-26138-6_61

1Citations

1Readers

Get full text

Abstract

The problem of supporting similarity analysis of XML data is a major problem in the data fusion research area. Several approaches have been proposed in literature, but lack of flexibility represents a hard challenge to be faced-off, especially in modern Cloud Computing environments. Inspired by this motivation, we propose SemSynX, a novel technique for supporting similarity analysis of XML data via semantic and syntactic heterogeneity/homogeneity detection. SemSynX retrieves several similarity scores over input XML documents, thus enabling flexible management and “customization” of similarity tools over XML data. In particular, the proposed technique is highly customizable, and it permits the specification of thresholds for the requested degree of similarity for paths and values as well as for the degree of relevance for path and value matching. Also, selection of paths and semantics-based comparison of label content are supported. It thus makes possible to “adjust” the similarity analysis depending on the nature of the input XML documents.

Cite

CITATION STYLE

APA

Almendros-Jiménez, J. M., & Cuzzocrea, A. (2015). Towards flexible similarity analysis of XML data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9416, pp. 573–576). Springer Verlag. https://doi.org/10.1007/978-3-319-26138-6_61

Towards flexible similarity analysis of XML data

Abstract

Cite

Register to see more suggestions