An Optimistic approach for clustering multi-version XML documents using compressed delta

Abstract

With the standardization of XML as an information exchange format on the web, a huge amount of information is now stored in XML documents. XML documents are large: the amount of information that has to be transmitted, processed, stored, and queried is often larger than for other data formats. In real-world applications, XML documents are also dynamic. The growing use of XML in different fields of information maintenance and management increases the demand to store different versions of XML documents over time. However, storing all versions of an XML document may introduce redundancy, and the self-describing nature of XML makes documents verbose and therefore large. This paper proposes an optimistic approach to recluster multi-version XML documents that change over time, reassessing the distances between them using knowledge from the initial clustering solution and the changes stored in a compressed delta. The size of the evolving XML documents is reduced by applying homomorphic compression, which retains the original document structure, before clustering. The compressed delta stores the changes responsible for the document versions without decompressing them. Test results show that our approach performs much better than full pairwise document comparison.
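
The idea of reassessing a distance from stored changes, instead of comparing two documents in full again, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: documents are modeled as multisets of root-to-node paths, the distance is one minus the overlap divided by the larger profile size, a delta is a list of hypothetical `("insert" | "delete", path)` pairs, and the function names are invented. The sketch also works on uncompressed paths, so it does not show the homomorphic compression or the compressed delta format.

```python
from collections import Counter


def overlap(a, b):
    """Size of the multiset intersection of two path profiles (full comparison)."""
    return sum((a & b).values())


def distance(size_a, size_b, common):
    """Assumed distance: 1 - common / max(size). 0 means identical profiles."""
    biggest = max(size_a, size_b)
    return 0.0 if biggest == 0 else 1.0 - common / biggest


def apply_delta(profile, other, common, delta):
    """Apply a delta to `profile` in place and return the updated overlap
    with `other`, touching only the paths named in the delta."""
    for op, path in delta:
        if op == "insert":
            profile[path] += 1
            # The overlap grows only if `other` still has an unmatched copy.
            if profile[path] <= other[path]:
                common += 1
        elif op == "delete":
            if profile[path] <= 0:
                raise ValueError(f"cannot delete missing path: {path}")
            # The overlap shrinks only if the removed copy was matched.
            if profile[path] <= other[path]:
                common -= 1
            profile[path] -= 1
        else:
            raise ValueError(f"unknown delta operation: {op}")
    return common


# Initial clustering step: one full comparison, whose result is kept.
doc_a = Counter(["/db/item", "/db/item/name", "/db/item/price"])
doc_b = Counter(["/db/item", "/db/item/name", "/db/item/qty"])
common = overlap(doc_a, doc_b)
before = distance(sum(doc_a.values()), sum(doc_b.values()), common)

# A new version of doc_a arrives as a delta; only the changed paths are visited.
delta = [("delete", "/db/item/price"), ("insert", "/db/item/qty")]
common = apply_delta(doc_a, doc_b, common, delta)
after = distance(sum(doc_a.values()), sum(doc_b.values()), common)
print(before, after)
```

The cost of the update depends on the number of changes in the delta rather than on the size of the documents, which is the kind of saving over full pairwise comparison that the abstract describes.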

Cite

Sonawane, V., & Rajeswara Rao, D. (2015). An Optimistic approach for clustering multi-version XML documents using compressed delta. International Journal of Electrical and Computer Engineering, 5(6), 1472–1479. https://doi.org/10.11591/ijece.v5i6.pp1472-1479
