A study of practical deduplication

242Citations
Citations of this article
106Readers
Mendeley users who have this article in their library.

Abstract

We collected file system content data from 857 desktop computers at Microsoft over a span of 4 weeks. We analyzed the data to determine the relative efficacy of data deduplication, particularly considering whole-file versus block-level elimination of redundancy. We found that whole-file deduplication achieves about three quarters of the space savings of the most aggressive block-level deduplication for storage of live file systems, and 87% of the savings for backup images. We also studied file fragmentation finding that it is not prevalent, and updated prior file system metadata studies, finding that the distribution of file sizes continues to skew toward very large unstructured files.

Cite

CITATION STYLE

APA

Meyer, D. T., & Bolosky, W. J. (2011). A study of practical deduplication. In Proceedings of FAST 2011: 9th USENIX Conference on File and Storage Technologies (pp. 1–13). USENIX Association.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free