A method for automatic vietnamese speech segmentation

1Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In speech synthesis and recognition, the segmentation is an important step. The result of further steps depend completely on this process. There are several effective segmentation method in the literature, but for Vietnamese speech, researchers usually base on their experience to set the length while using sliding window. It causes an inefficient segmentation; and they need to try with the other value (length of voice). In this paper, we propose a method supporting in segmentation for Vietnamese speech and automatically determine the suitable length of voices and silent pause. We firstly estimate, by experimenting, the min and average length of a voice and a silent pause for Vietnamese speech in three main type speaking (slow, normal and fast). Then, based on these values, we start to segment the voice and pause by sliding window with proposed algorithm. Experiment results show that the proposed method can be used to effectively segment the Vietnamese speech.

Cite

CITATION STYLE

APA

Anh, T. T., Huu, M. N., & Trong, K. N. (2019). A method for automatic vietnamese speech segmentation. International Journal of Innovative Technology and Exploring Engineering, 8(11), 2887–2892. https://doi.org/10.35940/ijitee.K2427.0981119

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free