Decision tree-based acoustic models for speech recognition

Masami Akamine; Jitendra Ajmera

Journal ArticleOPEN ACCESS

Decision tree-based acoustic models for speech recognition

Eurasip Journal on Audio, Speech, and Music Processing (2012) 2012(1)

DOI: 10.1186/1687-4722-2012-10

15Citations

19Readers

Abstract

This article proposes a new acoustic model using decision trees (DTs) as replacements for Gaussian mixture models (GMM) to compute the observation likelihoods for a given hidden Markov model state in a speech recognition system. DTs have a number of advantageous properties, such as that they do not impose restrictions on the number or types of features, and that they automatically perform feature selection. This article explores and exploits DTs for the purpose of large vocabulary speech recognition. Equal and decoding questions have newly been introduced into DTs to directly model gender- and context-dependent acoustic space. Experimental results for the 5k ARPA wall-street-journal task show that context information significantly improves the performance of DT-based acoustic models as expected. Context-dependent DT-based models are highly compact compared to conventional GMM-based acoustic models. This means that the proposed models have effective data-sharing across various context classes. © 2012 Akamine and Ajmera; licensee Springer.

Author supplied keywords

Cite

CITATION STYLE

APA

Akamine, M., & Ajmera, J. (2012). Decision tree-based acoustic models for speech recognition. Eurasip Journal on Audio, Speech, and Music Processing, 2012(1). https://doi.org/10.1186/1687-4722-2012-10

Decision tree-based acoustic models for speech recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions