Detecting non-modal phonation in telephone speech

5Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

Non-modal phonation conveys both linguistic and paralinguistic information, and is distinguished by acoustic source and filter features. Detecting non-modal phonation in speech requires reliable F0 analysis, a problem for telephone-band speech, where F0 analysis frequently fails. We demonstrate an approach to the detection of creaky phonation in telephone speech based on robust F0 and spectral analysis. Our F0 analysis relies on an autocorrelation algorithm applied to the intensity-boosted and inverse-filtered speech signal and succeeds in regions of nonmodal phonation where the non-filtered F0 analysis typically fails. In addition to the extracted F0 values, spectral amplitude is measured at the first two harmonics (H1, H2) and the first three formants (A1, A2, A3). Visual and spectral inspection of the detected creaky phonation confirms the findings reported from laboratory setting. Statistical analysis using oneway ANOVA and classification using Support Vector Machine (SVM) reveals promising results which lead to further improvement for automatic detection of non-modal phonation in telephone speech.

Cite

CITATION STYLE

APA

Yoon, T. J., Cole, J., & Hasegawa-Johnson, M. (2008). Detecting non-modal phonation in telephone speech. In Proceedings of the 4th International Conference on Speech Prosody, SP 2008 (pp. 33–36). International Speech Communications Association. https://doi.org/10.21437/speechprosody.2008-7

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free