Root Cause Analysis of Incidents Using Text Clustering and Classification Algorithms

14Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The purpose of this study is to cluster the injury narratives to extract the root causes behind the accidents. Analysis is done on incident data collected from the database of an integrated steel plant. Key terms generated from the clustering of incident scenario help us in finding root causes of that particular incident. This study also proposed specific measures to the management that would improve the safety performance. This study uses text document clustering to discover the hidden factors and causes behind the incidents. Understanding previous accidents is necessary to avoid future accidents. However, for companies, management of large accident databases, and accurately classifying accident narratives are very challenging issues. Therefore, the aim of this study is to accurately classify accident reports using text classification approaches and evaluate their usefulness. The study used two machine learning (ML) algorithms, namely random forest (RF), and support vector machine (SVM) and found that SVM performed best in classifying the accident narratives. Further, SVM was experimented with different tokenization of the preprocessed narratives to get more reliable results.

Cite

CITATION STYLE

APA

Sarkar, S., Ejaz, N., Kumar, M., & Maiti, J. (2020). Root Cause Analysis of Incidents Using Text Clustering and Classification Algorithms. In Lecture Notes in Electrical Engineering (Vol. 605, pp. 707–718). Springer. https://doi.org/10.1007/978-3-030-30577-2_63

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free