Architecture of the internet archive

6Citations
Citations of this article
36Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The Internet Archive is a live production system supporting close to a petabyte of data and delivering an average of 2.3Gb/sec of data to Internet users. We describe the architecture of this system with an emphasis on its robustness and how it is managed by a very small team of systems personnel. Notably, the current system does not employ a cache. We analyze the reasons for this decision and show that an effective cache could not be built until now. However, new solid state disk technology may offer promising new cache implementations.

Cite

CITATION STYLE

APA

Jaffe, E., & Kirkpatrick, S. (2009). Architecture of the internet archive. In ACM International Conference Proceeding Series (p. 11). https://doi.org/10.1145/1534530.1534545

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free