Tied Mixture Modeling in Hindi Speech Recognition System

0Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The goal of automatic speech recognition (ASR) is to accurately and efficiently convert a speech signal into a text message independent of the device, speaker or environment. In ASR, the speech signal is captured and parameterized at front-end and evaluated at back-end using the Gaussian mixture hidden Markov model (HMM). In statistical modeling, to handle the large number of HMM state parameters and to minimize the computation overhead, similar states are tied. In this paper we present a scheme to find the degree of mixture tying that is best suited for the small amount of training data, usually available for Indian languages. In our proposed approach, perceptual linear prediction (PLP) combined with Heteroscedastic linear discriminant analysis (HLDA) was used for feature extraction. All the experiments were conducted in general field conditions and in context of Indian languages, specifically Hindi, and for Indian speaking style. © Springer-Verlag Berlin Heidelberg 2010.

Cite

CITATION STYLE

APA

Aggarwal, R. K., & Dave, M. (2010). Tied Mixture Modeling in Hindi Speech Recognition System. In Communications in Computer and Information Science (Vol. 101, pp. 514–519). https://doi.org/10.1007/978-3-642-15766-0_86

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free