Speech paralinguistic approach for detecting dementia using gated convolutional neural network

9Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.

Abstract

We propose a non-invasive and cost-effective method to automatically detect dementia by utilizing solely speech audio data. We extract paralinguistic features for a short speech segment and use Gated Convolutional Neural Networks (GCNN) to classify it into dementia or healthy. We evaluate our method on the Pitt Corpus and on our own dataset, the PROMPT Database. Our method yields the accuracy of 73.1% on the Pitt Corpus using an average of 114 seconds of speech data. In the PROMPT Database, our method yields the accuracy of 74.7% using 4 seconds of speech data and it improves to 80.8% when we use all the patient’s speech data. Furthermore, we evaluate our method on a three-class classification problem in which we included the Mild Cognitive Impairment (MCI) class and achieved the accuracy of 60.6% with 40 seconds of speech data.

References Powered by Scopus

"Mini-mental state". A practical method for grading the cognitive state of patients for the clinician

78022Citations
N/AReaders
Get full text

Long Short-Term Memory

77856Citations
N/AReaders
Get full text

The Montreal Cognitive Assessment, MoCA: A brief screening tool for mild cognitive impairment

18024Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Deep learning-based speech analysis for Alzheimer’s disease detection: a literature review

37Citations
N/AReaders
Get full text

SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing

36Citations
N/AReaders
Get full text

Intelligent speech technologies for transcription, disease diagnosis, and medical equipment interactive control in smart hospitals: A review

31Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Rodrigues Makiuchi, M., Warnita, T., Inoue, N., Shinoda, K., Yoshimura, M., Kitazawa, M., … Kishimoto, T. (2021). Speech paralinguistic approach for detecting dementia using gated convolutional neural network. IEICE Transactions on Information and Systems, 104(11), 1930–1940. https://doi.org/10.1587/TRANSINF.2020EDP7196

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

67%

Lecturer / Post doc 1

17%

Researcher 1

17%

Readers' Discipline

Tooltip

Computer Science 5

56%

Engineering 3

33%

Arts and Humanities 1

11%

Save time finding and organizing research with Mendeley

Sign up for free