Analysis of acoustic parameters for consonant voicing classification in clean and telephone speech

  • Lee S
  • Choi J
4Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

This paper describes acoustic cues for classification of consonant voicing in a distinctive feature-based speech recognition system. Initial acoustic cues are selected by studying consonant production mechanisms. Spectral representations, band-limited energies, and correlation values, along with Mel-frequency cepstral coefficients features (MFCCs) are also examined. Analysis of variance is performed to assess relative significance of features. Overall, 82.2 %, 80.6 %, and 78.4 % classification rates are obtained on the TIMIT database for stops, fricatives, and affricates, respectively. Combining acoustic parameters with MFCCs shows performance improvement in all cases. Also, performance in the NTIMIT telephone channel speech shows that acoustic parameters are more robust than MFCCs.

Cite

CITATION STYLE

APA

Lee, S.-M., & Choi, J.-Y. (2012). Analysis of acoustic parameters for consonant voicing classification in clean and telephone speech. The Journal of the Acoustical Society of America, 131(3), EL197–EL202. https://doi.org/10.1121/1.3678667

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free