BoltVision: A Comparative Analysis of CNN, CCT, and ViT in Achieving High Accuracy for Missing Bolt Classification in Train Components

Mujadded Al Rabbani Alif; Muhammad Hussain; Gareth Tucker; Simon Iwnicki

Journal Article, Open Access

Machines (2024) 12(2)

DOI: 10.3390/machines12020093

Abstract

Maintenance and safety inspection of trains is a critical element of providing a safe and reliable train service. Checking for the presence of bolts is an essential part of train inspection and is currently carried out, typically, during visual inspections. There is an opportunity to automate bolt inspection using machine vision on edge devices. One particular challenge is implementing such inspection mechanisms on edge devices, which necessitates lighter models to ensure efficiency. Traditional methods have often fallen short of the required object detection performance, demonstrating the need for a more advanced approach. To address this challenge, researchers have been exploring deep learning algorithms and computer vision techniques to improve the accuracy and reliability of bolt detection on edge devices. High precision in identifying absent bolts in train components is essential to avoid potential mishaps and system malfunctions. This paper presents “BoltVision”, a comparative analysis of three cutting-edge machine learning models: convolutional neural networks (CNNs), vision transformers (ViTs), and compact convolutional transformers (CCTs). The study illustrates the assessment capabilities of these models and discusses their effectiveness under the constraints typical of edge devices. Results show that BoltVision, utilising a pre-trained ViT base, achieves 93% accuracy in classifying missing bolts. These results underscore the potential of BoltVision in tackling specific safety inspection challenges for trains and highlight its effectiveness when deployed on edge devices characterised by constrained computational resources. This attests to the pivotal role of transformer-based architectures in predictive maintenance and safety assurance within the rail transportation industry.
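The abstract reports classification accuracy for a missing-bolt classifier. As a minimal sketch of how such a figure is typically computed, the Python below counts correct and incorrect predictions for a two-class problem and derives accuracy and recall for the "missing" class. The label names, function names, and sample predictions are hypothetical illustrations and are not data or code from the paper.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive="missing"):
    """Count TP, FP, FN, TN, treating `positive` as the class of interest."""
    counts = Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            counts["tp" if pred == positive else "fn"] += 1
        else:
            counts["fp" if pred == positive else "tn"] += 1
    return counts


def accuracy(counts):
    """Fraction of all samples that were classified correctly."""
    total = sum(counts.values())
    return (counts["tp"] + counts["tn"]) / total if total else 0.0


def recall(counts):
    """Fraction of truly missing bolts that the classifier flagged."""
    denom = counts["tp"] + counts["fn"]
    return counts["tp"] / denom if denom else 0.0


# Hypothetical labels for eight image crops (not results from the paper).
y_true = ["missing", "present", "present", "missing",
          "present", "missing", "present", "present"]
y_pred = ["missing", "present", "missing", "missing",
          "present", "present", "present", "present"]

counts = confusion_counts(y_true, y_pred)
print(f"accuracy={accuracy(counts):.2f}, recall={recall(counts):.2f}")
```

In a safety inspection setting, recall on the "missing" class is worth reporting alongside accuracy, since an undetected missing bolt is usually the costlier error.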

Cite

Alif, M. A. R., Hussain, M., Tucker, G., & Iwnicki, S. (2024). BoltVision: A Comparative Analysis of CNN, CCT, and ViT in Achieving High Accuracy for Missing Bolt Classification in Train Components. Machines, 12(2). https://doi.org/10.3390/machines12020093
