Using Gaussian mixtures on triphone acoustic modelling-based Punjabi continuous speech recognition

Abstract

The performance of continuous speech recognition for any language depends on two major components: acoustic modelling and language modelling. The Gaussian mixture model-hidden Markov model (GMM–HMM) is a part of acoustic modelling. These components are applied at the back end of an automatic speech recognition (ASR) system to convert a continuous speech signal to the corresponding text accurately and efficiently. Triphone-based acoustic modelling makes use of two different context-dependent triphone models: word-internal and cross-word models. Despite active research on automatic speech recognition for a number of Indian and foreign languages, only a few attempts have been made for the Punjabi language, especially in the area of continuous speech recognition. This research paper analyses the impact of the GMM–HMM-based acoustic model on Punjabi speaker-independent continuous speech recognition. Recognition accuracy has been determined at the word and sentence levels with perceptual linear prediction (PLP) and mel-frequency cepstral coefficient (MFCC) features by varying the number of Gaussian mixtures from 2 to 32.
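
To illustrate the two context-dependent triphone models named in the abstract, here is a minimal Python sketch. It is not the authors' implementation. The `left-phone+right` label notation follows a common toolkit convention and is an assumption here, and the phone sequences are hypothetical placeholders, not entries from a real Punjabi lexicon.

```python
def word_internal_triphones(words):
    """Expand each word on its own: phone context never crosses a word boundary.

    `words` is a list of words, each given as a list of phone symbols.
    Phones at a word edge get a one-sided (biphone) label.
    """
    labels = []
    for phones in words:
        for i, phone in enumerate(phones):
            left = phones[i - 1] + "-" if i > 0 else ""
            right = "+" + phones[i + 1] if i < len(phones) - 1 else ""
            labels.append(f"{left}{phone}{right}")
    return labels


def cross_word_triphones(words):
    """Expand the whole utterance: context extends across word boundaries."""
    # Flattening the utterance into one phone string lets the same
    # expansion rule see neighbours from adjacent words.
    flat = [phone for phones in words for phone in phones]
    return word_internal_triphones([flat])


# Hypothetical two-word utterance (illustrative phone symbols only).
utterance = [["k", "a", "l"], ["s", "u"]]
print(word_internal_triphones(utterance))
print(cross_word_triphones(utterance))
```

The two outputs differ only at the word boundary: the word-internal expansion ends the first word with the biphone `a-l` and starts the second with `s+u`, while the cross-word expansion produces the full triphones `a-l+s` and `l-s+u` there.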

Cite

Ghai, W., Kumar, S., & Athavale, V. A. (2021). Using Gaussian mixtures on triphone acoustic modelling-based Punjabi continuous speech recognition. In Advances in Intelligent Systems and Computing (Vol. 1086, pp. 395–406). Springer. https://doi.org/10.1007/978-981-15-1275-9_32
