Using random forests for prosodic break prediction based on automatic speech labeling

Olga Khomitsevich; Pavel Chistikov; Dmitriy Zakharov

Conference Proceedings

Using random forests for prosodic break prediction based on automatic speech labeling

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8773 467-474

DOI: 10.1007/978-3-319-11581-8_58

3Citations

3Readers

Get full text

Abstract

In this paper we present a system for automatically predicting prosodic breaks in synthesized speech using the Random Forests classifier. In our experiments the classifier is trained on a large dataset consisting of audiobooks, which is automatically labeled with phone, word, and pause labels. To provide part of speech (POS) tags in the text, a rule-based POS tagger is used. We use crossvalidation in order to be able to examine not only the results for a specific subset of data but also the systems reliability across the dataset. The experimental results demonstrate that the system shows good and consistent results on the audiobook database; the results are poorer and less robust on a smaller database of read speech even though part of that database was labeled manually.

Author supplied keywords

Cite

CITATION STYLE

APA

Khomitsevich, O., Chistikov, P., & Zakharov, D. (2014). Using random forests for prosodic break prediction based on automatic speech labeling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 467–474). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_58

Using random forests for prosodic break prediction based on automatic speech labeling

Abstract

Author supplied keywords

Cite

Register to see more suggestions