Dysfluency recognition by using spectral entropy features

1Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In recent decades, speech recognition technology has improved effectively and significantly, but it is restricted only for a stream of words. These recognition systems assist human’s effectively for structured speech but for unstructured speech this assistance is not so effective for humans to communicate with machines, because unstructured stream of word lacks in providing useful information about pronunciation and punctuation. Recovering of such structural information by detecting the position of each phones in a sentence by locating the sentence boundaries, repeated words and missing phones in each phrase. The proposed work investigates the spectral entropy features, for the automatic detection of voiced and non-voiced regions, in the process of dysfluent speech recognition. The entropy features are estimated by normalizing the Fourier transform spectrum as Probability mass function (PMF). For clear formants of speech, the value of entropy is low and the value of entropy is high for flat distribution of silence part or if there is any noise in speech sample. A comparison of entropy features with Word Error Rate is presented in the proposed work.

Cite

CITATION STYLE

APA

Vinay, N. A., & Bharathi, S. H. (2019). Dysfluency recognition by using spectral entropy features. International Journal of Engineering and Advanced Technology, 8(6), 517–520. https://doi.org/10.35940/ijeat.F7881.088619

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free