Clustering XML documents semantically has become a major challenge in XML data managements. The key research issue is to find the similarity functions of XML documents. However, previous work gave more importance to the topology structure than to the semantic information. In this paper, the computation of similarity between two XML documents is based on both structural and semantic information. Then a minimal spanning tree clustering method is used to cluster XML documents. The experiment results show that the new method performs better than baseline similarity measure in terms of purity and rand index. © 2009 Springer-Verlag.
CITATION STYLE
Song, L., Ma, J., Lei, J., Zhang, D., & Wang, Z. (2009). Semantic structural similarity measure for clustering XML documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5854 LNCS, pp. 232–241). https://doi.org/10.1007/978-3-642-05250-7_25
Mendeley helps you to discover research relevant for your work.