A bloom filter based approach for evaluating structural similarity of XML documents

1Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Evaluating Similar structure of XML is a key issue for building the core algorithms for XML document clustering, XML classification and the extraction of schema or DTD from a corpus of XML documents. This evaluation is based on the structural similarity between XML documents. This work employs Bloom filter to represent an XML document with two structures: one is Tag-based Bloom filter (TBF) which describes an XML document with the tags of elements, and the other is Path-based Bloom filter (PBF) which describes hierarchical structure of the XML document. Based on this two structures, an approach is developed to evaluate the similarity of XML documents. A group of experiments was conducted to investigate the performance of the proposed approach. © 2009 Springer-Verlag.

Cite

CITATION STYLE

APA

Peng, D., Hou, H., & Lu, J. (2009). A bloom filter based approach for evaluating structural similarity of XML documents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5854 LNCS, pp. 242–251). https://doi.org/10.1007/978-3-642-05250-7_26

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free