XML-SIM-CHANGE: Structure and content semantic similarity detection among XML document versions

5Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

XML documents from different sources may represent the same or similar information with respect to content and structure. Being able to integrate similar XML documents is important to query systems and search engines. However, information changes periodically, therefore, it is important to detect the changes among different versions of an XML document and use the changed information to discover semantic similarity among XML documents. In this paper, we introduce such an approach to detect XML similarity using the change detection mechanism to join XML document versions. In our approach, keys in subtrees play an important role in order to avoid unnecessary comparisons of subtrees within different XML versions of the same document. We use relational database to store XML versions and apply SQL for detecting similarities. We show that our approach is highly scalable and has better efficiency in terms of execution time and provides comparable result quality. © 2010 Springer-Verlag.

Cite

CITATION STYLE

APA

Viyanon, W., & Madria, S. K. (2010). XML-SIM-CHANGE: Structure and content semantic similarity detection among XML document versions. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6427 LNCS, pp. 1061–1078). https://doi.org/10.1007/978-3-642-16949-6_29

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free