In this article, we describe a speaker adaptation method based on the probabilistic 2-mode analysis of training models. Probabilistic 2-mode analysis is a probabilistic extension of multilinear analysis. We apply probabilistic 2-mode analysis to speaker adaptation by representing each of the hidden Markov model mean vectors of training speakers as a matrix, and derive the speaker adaptation equation in the maximum a posteriori (MAP) framework. The adaptation equation becomes similar to the speaker adaptation equation using the MAP linear regression adaptation. In the experiments, the adapted models based on probabilistic 2-mode analysis showed performance improvement over the adapted models based on Tucker decomposition, which is a representative multilinear decomposition technique, for small amounts of adaptation data while maintaining good performance for large amounts of adaptation data. © 2013 Jeong; licensee Springer.
CITATION STYLE
Jeong, Y. (2013). Speaker adaptation in the maximum a posteriori framework based on the probabilistic 2-mode analysis of training models. Eurasip Journal on Audio, Speech, and Music Processing, 2013(1). https://doi.org/10.1186/1687-4722-2013-7
Mendeley helps you to discover research relevant for your work.