OPTIMISING SELECTION OF UNITS FROM SPEECH DATABASES FOR CONCATENATIVE SYNTHESIS

Alan W. Black; Nick Campbell

Conference Proceedings

4th European Conference on Speech Communication and Technology, EUROSPEECH 1995 (1995) 581-584

DOI: 10.21437/eurospeech.1995-148

164 Citations

40 Readers

Abstract

Concatenating units of natural speech is one method of speech synthesis. Most such systems use an inventory of fixed-length units, typically diphones or triphones, with one instance of each type. An alternative is to use more varied, non-uniform units extracted from large speech databases containing multiple instances of each unit type. The greater variability in such natural speech segments allows closer modeling of naturalness and of differences in speaking styles, and eliminates the need for specially recorded, single-use databases. However, with the greater variability comes the problem of how to select among the many instances of units in the database. This paper addresses that issue and presents a general method for unit selection.

Cite

Black, A. W., & Campbell, N. (1995). OPTIMISING SELECTION OF UNITS FROM SPEECH DATABASES FOR CONCATENATIVE SYNTHESIS. In 4th European Conference on Speech Communication and Technology, EUROSPEECH 1995 (pp. 581–584). International Speech Communication Association (ISCA). https://doi.org/10.21437/eurospeech.1995-148
