Ensemble Vision Transformer for Dementia Diagnosis

Fei Huang; Anqi Qiu

Journal Article

Ensemble Vision Transformer for Dementia Diagnosis

IEEE Journal of Biomedical and Health Informatics (2024) 28(9) 5551-5561

DOI: 10.1109/JBHI.2024.3412812

18Citations

17Readers

Get full text

Abstract

In recent years, deep learning has gained momentum in computer-aided Alzheimer's Disease (AD) diagnosis. This study introduces a novel approach, Monte Carlo Ensemble Vision Transformer (MC-ViT), which develops an ensemble approach with Vision transformer (ViT). Instead of using traditional ensemble methods that deploy multiple learners, our approach employs a single vision transformer learner. By harnessing Monte Carlo sampling, this method produces a broad spectrum of classification decisions, enhancing the MC-ViT performance. This novel technique adeptly overcomes the limitation of 3D patch convolutional neural networks that only characterize partial of the whole brain anatomy, paving the way for a neural network adept at discerning 3D inter-feature correlations. Evaluations using the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset with 7199 scans and Open Access Series of Imaging Studies-3 (OASIS-3) with 1992 scans showcased its performance. With minimal preprocessing, our approach achieved an impressive 90% accuracy in AD classification, surpassing both 2D-slice CNNs and 3D CNNs.

Author supplied keywords

Cite

CITATION STYLE

APA

Huang, F., & Qiu, A. (2024). Ensemble Vision Transformer for Dementia Diagnosis. IEEE Journal of Biomedical and Health Informatics, 28(9), 5551–5561. https://doi.org/10.1109/JBHI.2024.3412812

Ensemble Vision Transformer for Dementia Diagnosis

Abstract

Author supplied keywords

Cite

Register to see more suggestions