Gaussian Mixture Model Based Classification of Stuttering Dysfluencies

Abstract

Classifying dysfluencies is an important step in the objective measurement of stuttering. This work investigates whether an automatic speaker recognition (ASR) method can be applied to stuttering dysfluency recognition. The system designed for this task relies on the Gaussian mixture model (GMM), the most widely used probabilistic modeling technique in ASR, with the GMM parameters estimated from Mel frequency cepstral coefficients (MFCCs). This statistical speaker-modeling technique represents the fundamental characteristic sounds of the speech signal. Using this model, we build a dysfluency recognizer that recognizes dysfluencies independently of the speaker and of what is being said. The performance of the system is evaluated for different types of dysfluencies, such as syllable repetition, word repetition, prolongation, and interjection, using speech samples from the University College London Archive of Stuttered Speech (UCLASS).
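
The abstract does not spell out the model, so the following is only a sketch of the standard GMM formulation that such a recognizer could use. It assumes one GMM per dysfluency class, frame-wise independence of the MFCC vectors, and a maximum-likelihood decision with equal class priors. The symbols (M, w_m, \mu_m, \Sigma_m, \lambda_c, T) are illustrative notation, not taken from the paper.

```latex
% Assumed notation: x_t is the MFCC vector of frame t, and
% \lambda_c = \{ w_m, \mu_m, \Sigma_m \}_{m=1}^{M} is the GMM of dysfluency class c.

% Mixture density of a single frame under class c
p(x_t \mid \lambda_c) = \sum_{m=1}^{M} w_m \, \mathcal{N}(x_t ;\, \mu_m, \Sigma_m),
\qquad \sum_{m=1}^{M} w_m = 1, \quad w_m \ge 0

% Log-likelihood of a segment X = \{x_1, \dots, x_T\}, assuming independent frames
\log p(X \mid \lambda_c) = \sum_{t=1}^{T} \log p(x_t \mid \lambda_c)

% Assumed decision rule: pick the class whose model scores the segment highest
\hat{c} = \arg\max_{c} \, \log p(X \mid \lambda_c)
```

In this formulation the parameters of each \lambda_c would typically be fitted with the expectation-maximization algorithm on MFCC frames from segments labeled with that dysfluency type. Whether the authors train and score their models in exactly this way is not stated in the abstract.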

Cite

Mahesha, P., & Vinod, D. S. (2016). Gaussian Mixture Model Based Classification of Stuttering Dysfluencies. Journal of Intelligent Systems, 25(3), 387–399. https://doi.org/10.1515/jisys-2014-0140
