A multi-criteria text selection approach for building a speech corpus

Chiragkumar Patel; Sunil Kumar Kopparapu

Conference Proceedings

A multi-criteria text selection approach for building a speech corpus

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9302 15-22

DOI: 10.1007/978-3-319-24033-6_2

2Citations

4Readers

Get full text

Abstract

Speech corpus is an important and primary requirement for several speech tasks. Building a speech corpora is a lengthy, time consuming and expensive process, it typically involves collection of a large set of textual utterances and then selective distribution of these text utterances among a set of speakers, called speaker sheets. These speaker sheets are articulated by speakers to generate the speech corpora. Depending on the task at hand the speech corpora needs to satisfy certain criteria; For example, a phonetically balanced speech corpora is essential for building an automatic speech recognition (ASR) engine, while for a text dependent speaker recognition engine there is a need for several spoken repetition of the same text by several speakers. In this paper, we formulate a method that enables creation of speaker sheets from a predetermined set of text utterances such that the speech corpora satisfies the desired requirement.

Author supplied keywords

Cite

CITATION STYLE

APA

Patel, C., & Kopparapu, S. K. (2015). A multi-criteria text selection approach for building a speech corpus. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9302, pp. 15–22). Springer Verlag. https://doi.org/10.1007/978-3-319-24033-6_2

A multi-criteria text selection approach for building a speech corpus

Abstract

Author supplied keywords

Cite

Register to see more suggestions