Starting with Microsoft Office 2007, the Office Open XML file formats have become the default file format of Microsoft Office. As each day a lot of office documents have to be stored and transferred, reducing the document size will yield a benefit when storing and transferring these files. We present a compressed format for XML-based office documents that omits that data from an office document that is already defined by the Office Open XML format. Our evaluation shows that our compressed format reduces the - already compressed - office documents to a data size down to 41% of the original document size. Furthermore, for search operations tested in our evaluation, searching is faster on our compressed office documents than it is on the original documents. © 2010 Springer-Verlag.
CITATION STYLE
Böttcher, S., Hartel, R., & Messinger, C. (2010). Searchable compression of office documents by XML schema subtraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6309 LNCS, pp. 103–112). https://doi.org/10.1007/978-3-642-15684-7_9
Mendeley helps you to discover research relevant for your work.