Maintaining quality of service with dynamic fault tolerance in fat-trees

0Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A very important ingredient in the computing landscape is Utility Computing Data Centres (UCDCs), large-scale computing systems that offer computational services to concurrently running jobs through virtual servers. As UCDC systems increase in size and the mean time between failure decreases, it is becoming an increasingly important challenge to expediently tolerate failures (dynamically), while distributing the effects of the failure amongst the virtual servers according to their service level agreements. We propose and evaluate a strategy for offering predictable service in fat-trees experiencing faults, by reprioritising packets. The strategy is able to distribute the effect of network faults in order to satisfy a number of quality-of-service demands. Which demands to favour depends on the computer system and the characteristics of the jobs it is running, and in the presence of a moderate number of faults it is to some degree possible to meet the demands. © 2008 Springer Berlin Heidelberg.

Cite

CITATION STYLE

APA

Sem-Jacobsen, F. O., & Skeie, T. (2008). Maintaining quality of service with dynamic fault tolerance in fat-trees. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5374 LNCS, pp. 451–464). Springer Verlag. https://doi.org/10.1007/978-3-540-89894-8_40

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free