The effect of fuzzy training targets on voice quality classification

Abstract

The dynamic use of voice qualities in spoken language can reveal useful information about a speaker's attitude, mood and affective states. This information may be desirable for a range of speech technology applications. However, annotation of voice quality is frequently inconsistent across raters. But whom should one trust, or is the truth somewhere in between? The current study first describes a voice quality feature set that is suitable for differentiating voice qualities on a tense-to-breathy dimension. These features are used as inputs to a fuzzy-input fuzzy-output support vector machine (F2SVM) algorithm to automatically classify the voice qualities. The F2SVM is compared to standard approaches and shows promising results. Performance of around 90% is achieved in cross-validation, leave-one-speaker-out, and cross-corpus experiments. © 2013 Springer-Verlag.

Cite

Scherer, S., Kane, J., Gobl, C., & Schwenker, F. (2013). The effect of fuzzy training targets on voice quality classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7742 LNAI, pp. 43–51). https://doi.org/10.1007/978-3-642-37081-6_6
