The failure prediction of cluster systems based on system logs

5Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The failure prediction of cluster systems is an effective approach to improve the reliability of the cluster systems, which is becoming a new research hotspot of high performance computing, especially with the growth of cluster systems and applications both in scale and complexity. A classification sequential rule model is proposed to predict cluster system failures. The system logs of BlueGene/L, Red Storm, and Spirit are used as experimental datasets to predict cluster system failures. The results show that sequential rule approach outperforms SVM and HSMM in terms of precision and F-measure in 5hr prediction window, and in 1hr or 12hr prediction window, sequential rules, SVM and HSMM have their own strengths and weaknesses respectively. © 2013 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Xu, J., & Li, H. (2013). The failure prediction of cluster systems based on system logs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8041 LNAI, pp. 526–537). Springer Verlag. https://doi.org/10.1007/978-3-642-39787-5_44

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free