A New Unit Selection Optimisation Algorithm for Corpus-Based TTS Systems Using the RBF-Based Data Compression Technique

Abstract

A major drawback of corpus-based speech synthesis systems is their reliance on large acoustic inventories, and one of the main current challenges is the optimal representation of the concatenation costs associated with the units in the acoustic inventory. These concatenation costs are used to evaluate spectral mismatches between the acoustic units to be concatenated. The combinatorics of costs grows exponentially with the size of the acoustic inventories and can result in hundreds of millions or even billions of concatenation costs to be processed. Therefore, in this paper, we present a novel unit selection optimization algorithm, which minimizes the size of the concatenation costs through the RBF-based compression technique and tuple structures. Furthermore, the proposed optimization algorithm is designed to serve as an objective measure for optimizing the performance of the unit selection cost function with respect to the quality of the speech output, and for evaluating the effect of the RBF-based compression technique on that performance. The results show that even when data compression is above 50%, the effect on the quality of the synthesized speech is negligible.

Cite

Rojc, M., & Mlakar, I. (2019). A New Unit Selection Optimisation Algorithm for Corpus-Based TTS Systems Using the RBF-Based Data Compression Technique. IEEE Access, 7, 108035–108048. https://doi.org/10.1109/ACCESS.2019.2932750
