Impact of Failure on Interconnection Networks for Large Storage Systems

  • Xin Q
  • Miller E
  • Schwarz T
 et al. 

Recent advances in large-capacity, low-cost storage devices have led to active research in the design of large-scale storage systems built from commodity devices for supercomputing applications. Such storage systems, composed of thousands of storage devices, are required to provide high system bandwidth and petabyte-scale data storage. A robust network interconnection is essential to achieve high bandwidth, low latency, and reliable delivery during data transfers. However, failures, such as temporary link outages and node crashes, are inevitable. We discuss the impact of potential failures on network interconnections in very large-scale storage systems and analyze the trade-offs among several storage network topologies through simulation. Our results suggest that a good interconnect topology is essential to the fault tolerance of a petabyte-scale storage system.
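
The abstract does not say which topologies or failure model the authors simulated, so the following is only a minimal sketch of the general idea, not the paper's simulator. It assumes two hypothetical example topologies (a ring and a 2D mesh), assumes each link fails independently with a fixed probability, and measures the fraction of nodes still reachable from one node. All function names and parameters are illustrative.

```python
import random
from collections import deque


def ring_links(n):
    """Links of a ring of n nodes (hypothetical example topology)."""
    return [(i, (i + 1) % n) for i in range(n)]


def mesh_links(side):
    """Links of a side x side 2D mesh (hypothetical example topology)."""
    links = []
    for r in range(side):
        for c in range(side):
            node = r * side + c
            if c + 1 < side:          # link to the right-hand neighbour
                links.append((node, node + 1))
            if r + 1 < side:          # link to the neighbour below
                links.append((node, node + side))
    return links


def reachable_fraction(n, links, source=0):
    """Fraction of the n nodes reachable from source, found by BFS."""
    adj = {i: [] for i in range(n)}
    for a, b in links:
        adj[a].append(b)
        adj[b].append(a)
    seen = {source}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return len(seen) / n


def mean_reachability(n, links, fail_prob, trials, seed=0):
    """Average reachable fraction when each link fails independently
    with probability fail_prob (an assumed failure model)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        alive = [link for link in links if rng.random() >= fail_prob]
        total += reachable_fraction(n, alive)
    return total / trials


if __name__ == "__main__":
    n = 16
    for name, links in (("ring", ring_links(n)), ("4x4 mesh", mesh_links(4))):
        print(name, mean_reachability(n, links, fail_prob=0.1, trials=1000))
```

Running the script prints one averaged reachability figure per topology, which gives a rough way to compare how the two example layouts degrade under the same link failure rate. A fuller study would also need to model node crashes, bandwidth, and latency, which this sketch leaves out.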

Author-supplied keywords

  • Analytical models
  • Bandwidth
  • Computer crashes
  • Delay
  • Failure analysis
  • Large-scale systems
  • Memory
  • Multiprocessor interconnection networks
  • Network topology
  • Robustness
  • fault tolerant computing
  • fault-tolerance computing
  • interconnection network
  • large storage system
  • petabyte-scale data storage
  • storage area networks
  • storage device
  • storage management
  • storage network topology
  • system recovery


Authors

  • Qin Xin
  • Ethan L. Miller
  • Thomas J. E. Schwarz
  • Darrell D. E. Long
