Active learning for speech event detection in HCI

Patrick Thiam; Sascha Meudt; Friedhelm Schwenker; Günther Palm

Conference ProceedingsOPEN ACCESS

Active learning for speech event detection in HCI

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9896 LNAI 285-297

DOI: 10.1007/978-3-319-46182-3_24

7Citations

8Readers

Abstract

In this work, a pool-based active learning approach combining outlier detection methods with uncertainty sampling is proposed for speech event detection. Events in this case are regarded as atypical utterances (e.g. laughter, heavy breathing) occurring sporadically during a Human Computer Interaction (HCI) scenario. The proposed approach consists in using rank aggregation to select informative speech segments which have previously been ranked using different outlier detection techniques combined with an uncertainty sampling technique. The uncertainty sampling method is based on the distance to the boundary of a Support Vector Machine with Radial Basis Function kernel trained on the available annotated samples. Extensive experimental results prove the effectiveness of the proposed approach.

Author supplied keywords

Cite

CITATION STYLE

APA

Thiam, P., Meudt, S., Schwenker, F., & Palm, G. (2016). Active learning for speech event detection in HCI. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9896 LNAI, pp. 285–297). Springer Verlag. https://doi.org/10.1007/978-3-319-46182-3_24

Active learning for speech event detection in HCI

Abstract

Author supplied keywords

Cite

Register to see more suggestions