A New Unit Selection Optimisation Algorithm for Corpus-Based TTS Systems Using the RBF-Based Data Compression Technique

Matej Rojc; Izidor Mlakar

Journal Article (Open Access)

IEEE Access (2019) 7 108035-108048

DOI: 10.1109/ACCESS.2019.2932750

Abstract

A major drawback of corpus-based speech synthesis systems is their reliance on large acoustic inventories, and one of the main current challenges is the optimal representation of the concatenation costs associated with units in the acoustic inventory. These concatenation costs are used to evaluate spectral mismatches between the acoustic units to be concatenated. The combinatorics of costs grows exponentially with the size of the acoustic inventories and can result in hundreds of millions or even billions of concatenation costs to be processed. Therefore, in this paper, we present a novel unit selection optimization algorithm, which minimizes the size of the concatenation costs through a vector quantization-based compression technique and tuple structures. Furthermore, the proposed optimization algorithm is designed to be used as an objective measure to optimize the performance of the unit selection cost function with regard to the quality of the speech output, and to evaluate the effect of the vector quantization-based compression technique on its performance. The results obtained show that even when data compression is above 50%, the effect on the quality of the synthesized speech is negligible.
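To make the idea of compressing concatenation costs with vector quantization and tuples more concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a toy inventory with random pairwise costs, a plain 1-D Lloyd (k-means) quantizer, and a 16-entry codebook. All names (`lloyd_1d`, `quantize`, `n_units`, `compressed`) are hypothetical and chosen for illustration only.

```python
import random


def lloyd_1d(values, k, iters=20, seed=0):
    """Plain 1-D Lloyd (k-means) quantizer; returns a sorted codebook of k centroids."""
    rng = random.Random(seed)
    codebook = sorted(rng.sample(values, k))  # initialise from k distinct samples
    for _ in range(iters):
        buckets = [[] for _ in codebook]
        for v in values:
            nearest = min(range(len(codebook)), key=lambda i: abs(v - codebook[i]))
            buckets[nearest].append(v)
        # Move each centroid to the mean of its bucket; keep it if the bucket is empty.
        codebook = sorted(
            sum(b) / len(b) if b else c for b, c in zip(buckets, codebook)
        )
    return codebook


def quantize(value, codebook):
    """Return the index of the codebook entry closest to value."""
    return min(range(len(codebook)), key=lambda i: abs(value - codebook[i]))


# Toy data (assumption): random costs for every ordered pair of distinct units.
rng = random.Random(1)
n_units = 30
costs = {
    (i, j): rng.random()
    for i in range(n_units)
    for j in range(n_units)
    if i != j
}

# Learn a small codebook, then store each cost as a (left, right, index) tuple.
codebook = lloyd_1d(list(costs.values()), k=16)
compressed = [(i, j, quantize(c, codebook)) for (i, j), c in costs.items()]

# Largest reconstruction error introduced by replacing costs with codebook entries.
max_err = max(abs(costs[(i, j)] - codebook[q]) for i, j, q in compressed)
print(f"{len(compressed)} costs, {len(codebook)} codebook entries, max error {max_err:.4f}")
```

With a 16-entry codebook, each stored cost needs only a 4-bit index instead of a full floating-point value, at the price of a bounded reconstruction error. How such an error affects the unit selection cost function and the perceived speech quality is the question the abstract addresses; this sketch does not attempt to answer it.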

Cite

Rojc, M., & Mlakar, I. (2019). A New Unit Selection Optimisation Algorithm for Corpus-Based TTS Systems Using the RBF-Based Data Compression Technique. IEEE Access, 7, 108035–108048. https://doi.org/10.1109/ACCESS.2019.2932750
