Speech Segmentation Algorithm Based on an Analysis of the Normalized Power Spectral Density

Dzmitry Pekar; Siarhei Tsikhanenka

Journal ArticleOPEN ACCESS

Speech Segmentation Algorithm Based on an Analysis of the Normalized Power Spectral Density

Pekar D
Tsikhanenka S

Journal of Telecommunications and Information Technology (2010) (4) 44-49

DOI: 10.26636/jtit.2010.4.1095

N/ACitations

8Readers

Abstract

This article demonstrates a new approach to speaker independent phoneme detection. The core of the algorithm is to measure the distance between normalized power spectral densities in adjacent, short-time segments and verify it based on velocity of changes of values of short-time signal energy analysis. The results of experiment analysis indicate that proposed algorithm allows revealing a phoneme structure of pronounced speech with high probability. The advantages of this algorithm are absence of any prior information on a signal or model of phonemes and speakers that allows the algorithm to be speaker independent and have a low computation complexity.

Cite

CITATION STYLE

APA

Pekar, D., & Tsikhanenka, S. (2010). Speech Segmentation Algorithm Based on an Analysis of the Normalized Power Spectral Density. Journal of Telecommunications and Information Technology, (4), 44–49. https://doi.org/10.26636/jtit.2010.4.1095

Speech Segmentation Algorithm Based on an Analysis of the Normalized Power Spectral Density

Abstract

Cite

Register to see more suggestions