XML documents possess inherent semi-structured property, consisting of structural and content features. Most existing methods for XML documents clustering consider only one aspect of them. In this paper, we propose a fuzzy XML documents projected clustering algorithm, which can be used to cluster XML documents efficiently by combining the structural and content features. Another contribution is the adoption of some fuzzy techniques in a way that each frequent induced substructure has a fuzzy parameter associated with each cluster. Experimental results on both synthetic and real datasets show its effectiveness, especially when applying to large schemaless XML document collections. © 2011 Springer-Verlag.
CITATION STYLE
Ji, T., Bao, X., & Yang, D. (2011). FXProj - A fuzzy XML documents projected clustering based on structure and content. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7120 LNAI, pp. 406–419). https://doi.org/10.1007/978-3-642-25853-4_31
Mendeley helps you to discover research relevant for your work.