A multi-criteria text selection approach for building a speech corpus

2Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Speech corpus is an important and primary requirement for several speech tasks. Building a speech corpora is a lengthy, time consuming and expensive process, it typically involves collection of a large set of textual utterances and then selective distribution of these text utterances among a set of speakers, called speaker sheets. These speaker sheets are articulated by speakers to generate the speech corpora. Depending on the task at hand the speech corpora needs to satisfy certain criteria; For example, a phonetically balanced speech corpora is essential for building an automatic speech recognition (ASR) engine, while for a text dependent speaker recognition engine there is a need for several spoken repetition of the same text by several speakers. In this paper, we formulate a method that enables creation of speaker sheets from a predetermined set of text utterances such that the speech corpora satisfies the desired requirement.

Cite

CITATION STYLE

APA

Patel, C., & Kopparapu, S. K. (2015). A multi-criteria text selection approach for building a speech corpus. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9302, pp. 15–22). Springer Verlag. https://doi.org/10.1007/978-3-319-24033-6_2

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free