Dynamic models for file sizes and double pareto distributions

97Citations
Citations of this article
75Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we introduce and analyze a new, dynamic generative user model to explain the behavior of file size distributions. Our Recursive Forest File model combines multiplicative models that generate lognormal distributions with recent work on random graph models for the web. Unlike similar previous work, our Recursive Forest File model allows new files to be created and oldfiles to be deleted over time, and our analysis covers problematic issues such as correlation among file sizes. Moreover, our model allows natural variations where files that are copied or modified are more likely to be copied or modified subsequently. Previous empirical work suggests thatfile sizes tend to have a lognormal body but a Pareto tail. The Recursive Forest File model explains this behavior, yielding a double Pareto distribution, which has a Pareto tail but close to a lognormal body. We believe the Recursive Forest model may be useful for describing other power law phenomena in computer systems as well as other fields. © A K Peters, Ltd.

References Powered by Scopus

On the Self-Similar Nature of Ethernet Traffic (Extended Version)

4026Citations
N/AReaders
Get full text

Mean-field theory for scale-free random networks

2052Citations
N/AReaders
Get full text

Graph structure in the Web

2014Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Opcodes as predictor for malware

251Citations
N/AReaders
Get full text

A five-year study of file-system metadata

181Citations
N/AReaders
Get full text

Workload modeling for computer systems performance evaluation

170Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Mitzenmacher, M. (2004). Dynamic models for file sizes and double pareto distributions. Internet Mathematics, 1(3), 305–333. https://doi.org/10.1080/15427951.2004.10129092

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 24

41%

Professor / Associate Prof. 17

29%

Researcher 16

27%

Lecturer / Post doc 2

3%

Readers' Discipline

Tooltip

Computer Science 29

56%

Physics and Astronomy 14

27%

Social Sciences 5

10%

Engineering 4

8%

Save time finding and organizing research with Mendeley

Sign up for free