A framework for clustering and dynamic maintenance of xml documents

Ahmed Al-Shammari; Chengfei Liu; Mehdi Naseriparsa; Bao Quoc Vo; Tarique Anwar; Rui Zhou

Conference Proceedings

A framework for clustering and dynamic maintenance of xml documents

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10604 LNAI 399-412

DOI: 10.1007/978-3-319-69179-4_28

6Citations

4Readers

Get full text

Abstract

Web data clustering has been widely studied in the data mining communities. However, dynamic maintenance of the web data clusters is still a challenging task. In this paper, we propose a novel framework called XClusterMaint which serves for both clustering and maintenance of the XML documents. For clustering, we take both structure and content into account and propose an efficient solution for grouping the documents based on the combination of structure and content similarity. For maintenance, we propose an incremental approach for maintaining the existing clusters dynamically when we receive new incoming XML documents. Since the dynamic maintenance of the clusters is computationally expensive, we also propose an improved approach which uses a lazy maintenance scheme to improve the performance of the clusters maintenance. The experimental results on real datasets verify the efficiency of the proposed clustering and maintenance model.

Author supplied keywords

Cite

CITATION STYLE

APA

Al-Shammari, A., Liu, C., Naseriparsa, M., Vo, B. Q., Anwar, T., & Zhou, R. (2017). A framework for clustering and dynamic maintenance of xml documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10604 LNAI, pp. 399–412). Springer Verlag. https://doi.org/10.1007/978-3-319-69179-4_28

A framework for clustering and dynamic maintenance of xml documents

Abstract

Author supplied keywords

Cite

Register to see more suggestions