BoltVision: A Comparative Analysis of CNN, CCT, and ViT in Achieving High Accuracy for Missing Bolt Classification in Train Components

7Citations
Citations of this article
22Readers
Mendeley users who have this article in their library.

Abstract

Maintenance and safety inspection of trains is a critical element of providing a safe and reliable train service. Checking for the presence of bolts is an essential part of train inspection, which is currently, typically carried out during visual inspections. There is an opportunity to automate bolt inspection using machine vision with edge devices. One particular challenge is the implementation of such inspection mechanisms on edge devices, which necessitates using lighter models to ensure efficiency. Traditional methods have often fallen short of the required object detection performance, thus demonstrating the need for a more advanced approach. To address this challenge, researchers have been exploring the use of deep learning algorithms and computer vision techniques to improve the accuracy and reliability of bolt detection on edge devices. High precision in identifying absent bolts in train components is essential to avoid potential mishaps and system malfunctions. This paper presents “BoltVision”, a comparative analysis of three cutting-edge machine learning models: convolutional neural networks (CNNs), vision transformers (ViTs), and compact convolutional transformers (CCTs). This study illustrates the superior assessment capabilities of these models and discusses their effectiveness in addressing the prevalent issue of edge devices. Results show that BoltVision, utilising a pre-trained ViT base, achieves a remarkable 93% accuracy in classifying missing bolts. These results underscore the potential of BoltVision in tackling specific safety inspection challenges for trains and highlight its effectiveness when deployed on edge devices characterised by constrained computational resources. This attests to the pivotal role of transformer-based architectures in revolutionising predictive maintenance and safety assurance within the rail transportation industry.

References Powered by Scopus

Matplotlib: A 2D graphics environment

28264Citations
N/AReaders
Get full text

Rethinking the Inception Architecture for Computer Vision

26561Citations
N/AReaders
Get full text

A survey on deep transfer learning

2468Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Comparative Analysis of YOLOv8 and YOLOv10 in Vehicle Detection: Performance Metrics and Model Efficacy

45Citations
N/AReaders
Get full text

Applications of Artificial Intelligence, Deep Learning, and Machine Learning to Support the Analysis of Microscopic Images of Cells and Tissues

22Citations
N/AReaders
Get full text

Lightweight Convolutional Network with Integrated Attention Mechanism for Missing Bolt Detection in Railways

5Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Alif, M. A. R., Hussain, M., Tucker, G., & Iwnicki, S. (2024). BoltVision: A Comparative Analysis of CNN, CCT, and ViT in Achieving High Accuracy for Missing Bolt Classification in Train Components. Machines, 12(2). https://doi.org/10.3390/machines12020093

Readers over time

‘24‘2505101520

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 6

75%

Lecturer / Post doc 2

25%

Readers' Discipline

Tooltip

Engineering 4

57%

Computer Science 1

14%

Earth and Planetary Sciences 1

14%

Environmental Science 1

14%

Article Metrics

Tooltip
Mentions
Blog Mentions: 1
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free
0