Predicting hard disk failure by means of automatized labeling and machine learning approach

7Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

Today, cloud systems provide many key services to development and production environ-ments; reliable storage services are crucial for a multitude of applications ranging from commercial manufacturing, distribution and sales up to scientific research, which is often at the forefront of computing resource demands. In large-scale computer centers, the storage system requires particular attention and investment; usually, a large number of diverse storage devices need to be deployed in order to match the varying performance and volume requirements of changing user applications. As of today, magnetic drives still play a dominant role in terms of deployed storage volume and of service outages due to device failure. In this paper, we study methods to facilitate automated proac-tive disk replacement. We propose a method to identify disks with media failures in a production environment and describe an application of supervised machine learning to predict disk failures. In particular, a proper stage to automatically label (healthy/at-risk) the disks during the training and validation stage is presented along with tuning strategy to optimize the hyperparameters of the associated machine learning classifier. The approach is trained and validated against a large set of 65,000 hard drives in the CERN computer center, and the achieved results are discussed.

Cite

CITATION STYLE

APA

Gargiulo, F., Duellmann, D., Arpaia, P., & Schiano Lo Moriello, R. (2021). Predicting hard disk failure by means of automatized labeling and machine learning approach. Applied Sciences (Switzerland), 11(18). https://doi.org/10.3390/app11188293

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free