Speech Segmentation Algorithm Based on an Analysis of the Normalized Power Spectral Density

  • Pekar D
  • Tsikhanenka S
N/ACitations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

This article demonstrates a new approach to speaker independent phoneme detection. The core of the algorithm is to measure the distance between normalized power spectral densities in adjacent, short-time segments and verify it based on velocity of changes of values of short-time signal energy analysis. The results of experiment analysis indicate that proposed algorithm allows revealing a phoneme structure of pronounced speech with high probability. The advantages of this algorithm are absence of any prior information on a signal or model of phonemes and speakers that allows the algorithm to be speaker independent and have a low computation complexity.

Cite

CITATION STYLE

APA

Pekar, D., & Tsikhanenka, S. (2010). Speech Segmentation Algorithm Based on an Analysis of the Normalized Power Spectral Density. Journal of Telecommunications and Information Technology, (4), 44–49. https://doi.org/10.26636/jtit.2010.4.1095

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free