In occupational accident analysis, the application of machine learning is still unexplored. In addition, the presence of unstructured text data makes the analysis really difficult. Therefore, the aim of the paper is to utilize the information within accident texts using structural topic model (STM) and predict loss time injury (LTI) and non-LTI. Random forest (RF) has been used in this study for prediction as well as rule extraction. The performance of RF has also been compared with that of support vector machine (SVM), and k-nearest neighbor (KNN). The experimental results reveal that RF outperforms other classifiers in terms of accuracy. Moreover, a set of interpretable nine rules are extracted exploring the causes of the occurrence of both LTI and non-LTI.
CITATION STYLE
Sarkar, S., Gaine, S., Deshmukh, A., Khatedi, N., & Maiti, J. (2020). A Structural Topic Modeling-Based Machine Learning Approach for Pattern Extraction from Accident Data. In Advances in Intelligent Systems and Computing (Vol. 1079, pp. 555–564). Springer. https://doi.org/10.1007/978-981-15-1097-7_46
Mendeley helps you to discover research relevant for your work.