Tied Mixture Modeling in Hindi Speech Recognition System

R. K. Aggarwal; M. Dave

Conference Proceedings

Tied Mixture Modeling in Hindi Speech Recognition System

Communications in Computer and Information Science (2010) 101 514-519

DOI: 10.1007/978-3-642-15766-0_86

0Citations

4Readers

Get full text

Abstract

The goal of automatic speech recognition (ASR) is to accurately and efficiently convert a speech signal into a text message independent of the device, speaker or environment. In ASR, the speech signal is captured and parameterized at front-end and evaluated at back-end using the Gaussian mixture hidden Markov model (HMM). In statistical modeling, to handle the large number of HMM state parameters and to minimize the computation overhead, similar states are tied. In this paper we present a scheme to find the degree of mixture tying that is best suited for the small amount of training data, usually available for Indian languages. In our proposed approach, perceptual linear prediction (PLP) combined with Heteroscedastic linear discriminant analysis (HLDA) was used for feature extraction. All the experiments were conducted in general field conditions and in context of Indian languages, specifically Hindi, and for Indian speaking style. © Springer-Verlag Berlin Heidelberg 2010.

Author supplied keywords

Cite

CITATION STYLE

APA

Aggarwal, R. K., & Dave, M. (2010). Tied Mixture Modeling in Hindi Speech Recognition System. In Communications in Computer and Information Science (Vol. 101, pp. 514–519). https://doi.org/10.1007/978-3-642-15766-0_86

Tied Mixture Modeling in Hindi Speech Recognition System

Abstract

Author supplied keywords

Cite

Register to see more suggestions