The influence of errors in phonetic annotations on performance of speech recognition system

Radek Šafařík; Lukáš Matějů; Lenka Weingartová

Conference Proceedings

The influence of errors in phonetic annotations on performance of speech recognition system

Lecture Notes in Computer Science (2018) 11107 LNAI 419-427

DOI: 10.1007/978-3-030-00794-2_45

0Citations

1Readers

Get full text

Abstract

This paper deals with errors in acoustic training data and the influence on speech recognition performance. The training data can be prepared manually, automatically or by combination of these two. In all cases, some mislabeled phonemes can appear in phonetic annotations. We conducted series of experiments which simulate some common errors. The experiments deal with various amount of changes in phonetic annotations such as different types of changes in voicing of obstruents, random substitution of consonants or vowels and random deleting of phonemes. All experiments were done for Czech language using GlobalPhone speech data set and both Gaussian mixture models and deep neural networks were used for acoustic modeling. The results show that some amount of such errors in training data does not influence speech recognition accuracy. The accuracy is significantly influenced only by large amount of errors (more than 50%).

Author supplied keywords

Cite

CITATION STYLE

APA

Šafařík, R., Matějů, L., & Weingartová, L. (2018). The influence of errors in phonetic annotations on performance of speech recognition system. In Lecture Notes in Computer Science (Vol. 11107 LNAI, pp. 419–427). Springer Verlag. https://doi.org/10.1007/978-3-030-00794-2_45

The influence of errors in phonetic annotations on performance of speech recognition system

Abstract

Author supplied keywords

Cite

Register to see more suggestions