Tandem connectionist feature extraction for conversational speech recognition

Qifeng Zhu; Barry Chen; Nelson Morgan; Andreas Stolcke

Conference Proceedings

Tandem connectionist feature extraction for conversational speech recognition

Lecture Notes in Computer Science (2005) 3361 223-231

DOI: 10.1007/978-3-540-30568-2_19

19Citations

10Readers

Get full text

Abstract

Multi-Layer Perceptrons (MLPs) can be used in automatic speech recognition in many ways. A particular application of this tool over the last few years has been the Tandem approach, as described in [7] and other more recent publications. Here we discuss the characteristics of the MLP-based features used for the Tandem approach, and conclude with a report on their application to conversational speech recognition. The paper shows that MLP transformations yield variables that have regular distributions, which can be further modified by using logarithm to make the distribution easier to model by a Gaussian-HMM. Two or more vectors of these features can easily be combined without increasing the feature dimension. We also report recognition results that show that MLP features can significantly improve recognition performance for the NIST 2001 Hub-5 evaluation set with models trained on the Switchboard Corpus, even for complex systems incorporating MMIE training and other enhancements. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Zhu, Q., Chen, B., Morgan, N., & Stolcke, A. (2005). Tandem connectionist feature extraction for conversational speech recognition. In Lecture Notes in Computer Science (Vol. 3361, pp. 223–231). Springer Verlag. https://doi.org/10.1007/978-3-540-30568-2_19

Tandem connectionist feature extraction for conversational speech recognition

Abstract

Cite

Register to see more suggestions