The failure prediction of cluster systems is an effective approach to improve the reliability of the cluster systems, which is becoming a new research hotspot of high performance computing, especially with the growth of cluster systems and applications both in scale and complexity. A classification sequential rule model is proposed to predict cluster system failures. The system logs of BlueGene/L, Red Storm, and Spirit are used as experimental datasets to predict cluster system failures. The results show that sequential rule approach outperforms SVM and HSMM in terms of precision and F-measure in 5hr prediction window, and in 1hr or 12hr prediction window, sequential rules, SVM and HSMM have their own strengths and weaknesses respectively. © 2013 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Xu, J., & Li, H. (2013). The failure prediction of cluster systems based on system logs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8041 LNAI, pp. 526–537). Springer Verlag. https://doi.org/10.1007/978-3-642-39787-5_44
Mendeley helps you to discover research relevant for your work.