Improvements on automatic speech segmentation at the phonetic level

Jon Ander Gómez; Marcos Calvo

Conference ProceedingsOPEN ACCESS

Improvements on automatic speech segmentation at the phonetic level

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 7042 LNCS 557-564

DOI: 10.1007/978-3-642-25085-9_66

8Citations

3Readers

Abstract

In this paper, we present some recent improvements in our automatic speech segmentation system, which only needs the speech signal and the phonetic sequence of each sentence of a corpus to be trained. It estimates a GMM by using all the sentences of the training subcorpus, where each Gaussian distribution represents an acoustic class, which probability densities are combined with a set of conditional probabilities in order to estimate the probability densities of the states of each phonetic unit. The initial values of the conditional probabilities are obtained by using a segmentation of each sentence assigning the same number of frames to each phonetic unit. A DTW algorithm fixes the phonetic boundaries using the known phonetic sequence. This DTW is a step inside an iterative process which aims to segment the corpus and re-estimate the conditional probabilities. The results presented here demonstrate that the system has a good capacity to learn how to identify the phonetic boundaries. © 2011 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Gómez, J. A., & Calvo, M. (2011). Improvements on automatic speech segmentation at the phonetic level. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7042 LNCS, pp. 557–564). https://doi.org/10.1007/978-3-642-25085-9_66

Improvements on automatic speech segmentation at the phonetic level

Abstract

Author supplied keywords

Cite

Register to see more suggestions