Voice stress extraction

Grazyna Demenko

Conference ProceedingsOPEN ACCESS

Voice stress extraction

Demenko G

Proceedings of the 4th International Conference on Speech Prosody, SP 2008 (2008) 53-56

DOI: 10.21437/speechprosody.2008-12

7Citations

11Readers

Abstract

The aim of the research was to assess the possibility of voice stress extraction and classification. It was assumed that the study's results could be applied in call centers and could be useful for security services. The authentic Poznan police database with the recordings of the 997 emergency phone calls was used for analysis. Out of 60 000 recordings collected in the database, 20 000 were automatically selected, a few hundred of which were eventually chosen for acoustic evaluation, the basis for that selection being a perceptual assessment. The MDVP analysis confirmed statistical significance of such parameters as fundamental frequency, energy and pitch variations for stress categorization. Some segmental parameters such as tremor and noise parameters were also confirmed to be of some importance. In case of highly stressful conditions a systematic over-oneoctave shift in pitch was observed. It was concluded that the range of F0 per se does not seem to correlate with stress whereas the shift in F0 register constitutes the primary indicator of stress. Linear Discriminant Analysis based on 12 acoustic features showed it is possible to categorize the following classes: neutral, depressive, stressed, highly stressed speech.

Cite

CITATION STYLE

APA

Demenko, G. (2008). Voice stress extraction. In Proceedings of the 4th International Conference on Speech Prosody, SP 2008 (pp. 53–56). International Speech Communications Association. https://doi.org/10.21437/speechprosody.2008-12

Voice stress extraction

Abstract

Cite

Register to see more suggestions