Measurement and Modeling of Computer Reliability as Affected by System Activity

129Citations
Citations of this article
23Readers
Mendeley users who have this article in their library.

Abstract

This paper demonstrates a practical approach to the study of the failure behavior of computer systems. Particular attention is devoted to the analysis of permanent failures. A number of important techniques, which may have general applicability in both failure and workload analysis, are brought together in this presentation. These include: smeared averaging of the workload data, clustering of like failures, and joint analysis of workload and failures. Approximately 17 percent of all failures affecting the CPU were estimated to be permanent. The manifestation of a permanent failure was found to be strongly correlated with the level and type of workload prior to the failure. Although, in strict terms, the results only relate to the manifestation of permanent failures and not to their occurrence, there are strong indications that permanent failures are both caused and discovered by increased activity. More measurements and experiments are necessary to determine their respective contributions to the measured workload/failure relationship. © 1986, ACM. All rights reserved.

Cite

CITATION STYLE

APA

Iyer, R. K., Rossetti, D. J., & Hsueh, M. C. (1986). Measurement and Modeling of Computer Reliability as Affected by System Activity. ACM Transactions on Computer Systems (TOCS), 4(3), 214–237. https://doi.org/10.1145/6420.6422

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free