When and why listeners disagree in voice quality assessment tasks

Jody Kreiman; Bruce R. Gerratt; Mika Ito

Journal ArticleOPEN ACCESS

When and why listeners disagree in voice quality assessment tasks

Kreiman J
Gerratt B
Ito M

The Journal of the Acoustical Society of America (2007) 122(4) 2354-2364

DOI: 10.1121/1.2770547

149Citations

134Readers

Get full text

Abstract

Modeling sources of listener variability in voice quality assessment is the first step in developing reliable, valid protocols for measuring quality, and provides insight into the reasons that listeners disagree in their quality assessments. This study examined the adequacy of one such model by quantifying the contributions of four factors to interrater variability: instability of listeners’ internal standards for different qualities, difficulties isolating individual attributes in voice patterns, scale resolution, and the magnitude of the attribute being measured. One hundred twenty listeners in six experiments assessed vocal quality in tasks that differed in scale resolution, in the presence/absence of comparison stimuli, and in the extent to which the comparison stimuli (if present) matched the target voices. These factors accounted for 84.2% of the variance in the likelihood that listeners would agree exactly in their assessments. Providing listeners with comparison stimuli that matched the target voices doubled the likelihood that they would agree exactly. Listeners also agreed significantly better when assessing quality on continuous versus six-point scales. These results indicate that interrater variability is an issue of task design, not of listener unreliability.

Cite

CITATION STYLE

APA

Kreiman, J., Gerratt, B. R., & Ito, M. (2007). When and why listeners disagree in voice quality assessment tasks. The Journal of the Acoustical Society of America, 122(4), 2354–2364. https://doi.org/10.1121/1.2770547

When and why listeners disagree in voice quality assessment tasks

Abstract

Cite

Register to see more suggestions