Analysis of smartphone recordings in time, frequency, and cepstral domains to classify Parkinson’s disease

13Citations
Citations of this article
61Readers
Mendeley users who have this article in their library.

Abstract

Objectives: Parkinson’s disease (PD) is the second most common neurodegenerative disorder; it affects more than 10 million people worldwide. Detecting PD usually requires a professional assessment by an expert, and investigation of the voice as a biomarker of the disease could be effective in speeding up the diagnostic process. Methods: We present our methodology in which we distinguish PD patients from healthy controls (HC) using a large sample of 18,210 smartphone recordings. Those recordings were processed by an audio processing technique to create a final dataset of 80,594 instances and 138 features from the time, frequency, and cepstral domains. This dataset was preprocessed and normalized to create baseline machine-learning models using four classifiers, namely, linear support vector machine, K-nearest neighbor, random forest, and ex-treme gradient boosting (XGBoost). We divided our dataset into training and held-out test sets. Then we used stratified 5-fold cross-validation and four performance measures: accuracy, sensitivity, specificity, and F1-score to assess the performance of the models. We applied two feature selection methods, analysis of variance (ANOVA) and least absolute shrinkage and selection operator (LASSO), to reduce the dimensionality of the dataset by selecting the best subset of features that maxi-mizes the performance of the classifiers. Results: LASSO outperformed ANOVA with almost the same number of features. With 33 features, XGBoost achieved a maximum accuracy of 95.31% on training data, and 95.78% by predicting unseen data. Conclusions: Developing a smartphone-based system that implements machine-learning techniques is an effective way to diagnose PD using the voice as a biomarker.

References Powered by Scopus

A survey on feature selection methods

3996Citations
N/AReaders
Get full text

Suitability of dysphonia measurements for telemonitoring of Parkinson's disease

707Citations
N/AReaders
Get full text

Speech impairment in a large sample of patients with Parkinson's disease

532Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Internet of Things Technologies and Machine Learning Methods for Parkinson’s Disease Diagnosis, Monitoring and Management: A Systematic Review

59Citations
N/AReaders
Get full text

An improved framework for Parkinson's disease prediction using Variational Mode Decomposition-Hilbert spectrum of speech signal

40Citations
N/AReaders
Get full text

Machine learning- and statistical-based voice analysis of Parkinson's disease patients: A survey

28Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Tougui, I., Jilbab, A., & Mhamdi, J. E. (2020). Analysis of smartphone recordings in time, frequency, and cepstral domains to classify Parkinson’s disease. Healthcare Informatics Research, 26(4), 274–283. https://doi.org/10.4258/hir.2020.26.4.274

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 24

73%

Researcher 6

18%

Professor / Associate Prof. 3

9%

Readers' Discipline

Tooltip

Computer Science 8

33%

Engineering 8

33%

Nursing and Health Professions 4

17%

Neuroscience 4

17%

Save time finding and organizing research with Mendeley

Sign up for free