The Use of Deep Learning Software in the Detection of Voice Disorders: A Systematic Review

Joshua Barlow; Zara Sragi; Gabriel Rivera-Rivera; Abdurrahman Al-Awady; Ümit Daşdöğen; Mark S. Courey; Diana N. Kirke

Article (Open Access)

Otolaryngology - Head and Neck Surgery (United States)

DOI: 10.1002/ohn.636

Abstract

Objective: To summarize the use of deep learning in the detection of voice disorders using acoustic and laryngoscopic input, compare specific neural networks in terms of accuracy, and assess their effectiveness compared to expert clinical visual examination.

Data Sources: Embase, MEDLINE, and Cochrane Central.

Review Methods: Databases were screened through November 11, 2023 for relevant studies. The inclusion criteria required studies to utilize a specified deep learning method, use laryngoscopy or acoustic input, and measure the accuracy of binary classification between healthy patients and those with voice disorders.

Results: Thirty-four studies met the inclusion criteria, with 18 focusing on voice analysis, 15 on imaging analysis, and 1 on both. Across the 18 acoustic studies, 21 programs were used for identification of organic and functional voice disorders. These technologies included 10 convolutional neural networks (CNNs), 6 multilayer perceptrons (MLPs), and 5 other neural networks. The binary classification systems yielded a mean accuracy of 89.0% overall, including 93.7% for MLP programs and 84.5% for CNNs. Among the 15 imaging analysis studies, a total of 23 programs were utilized, resulting in a mean accuracy of 91.3%. Specifically, the 20 CNNs achieved a mean accuracy of 92.6% compared to 83.0% for the 3 MLPs.

Conclusion: Deep learning models were shown to be highly accurate in the detection of voice pathology, with CNNs most effective for assessing laryngoscopy images and MLPs most effective for assessing acoustic input. While deep learning methods outperformed expert clinical examination in limited comparisons, further studies integrating external validation are necessary.
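As a quick arithmetic check on the imaging figures in the Results, the sketch below recomputes the overall mean from the two per-architecture means. It assumes the reported 91.3% is a simple per-program average across the 23 programs (the abstract does not state how results were pooled), and `pooled_mean` is a hypothetical helper name used only for illustration.

```python
def pooled_mean(groups):
    """Program-weighted mean accuracy from (n_programs, mean_accuracy) pairs.

    Assumption: each program contributes equally to the overall mean;
    the abstract does not specify the pooling method.
    """
    total_programs = sum(n for n, _ in groups)
    return sum(n * acc for n, acc in groups) / total_programs


# Imaging analysis figures as reported in the abstract:
# 20 CNNs at 92.6% mean accuracy, 3 MLPs at 83.0% mean accuracy.
imaging = [(20, 92.6), (3, 83.0)]

pooled = pooled_mean(imaging)
print(f"{pooled:.1f}")  # prints 91.3
```

Under that assumption, the two subgroup means reproduce the reported overall imaging accuracy of 91.3%. The same check cannot be completed for the acoustic programs, because the abstract does not report a mean accuracy for the 5 other neural networks.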

Citation (APA)

Barlow, J., Sragi, Z., Rivera-Rivera, G., Al-Awady, A., Daşdöğen, Ü., Courey, M. S., & Kirke, D. N. (2024, June 1). The Use of Deep Learning Software in the Detection of Voice Disorders: A Systematic Review. Otolaryngology - Head and Neck Surgery (United States). John Wiley and Sons Inc. https://doi.org/10.1002/ohn.636
