In this paper wavelet decomposition is used to decompose speech signal into five levels. Low-frequency part of the speech signal was reconstructed. Because different frequencies of the speech signal have different influence on the performance of the system, the acoustic model of each level was trained and tested. The experimental results show that the acoustic model of level 1 is the best for clean speech and the acoustic model of level 2 is the best for noisy speech.It is proved that the frequency band of A1 makes a lot of contribution on the performance of clean speech and the frequency band of A2 makes a lot of contribution on the performance of noisy speech. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Yan, L., Liu, G., & Guo, J. (2005). A study on robustness of large vocabulary mandarin Chinese continuous speech recognition system based on wavelet analysis. In Lecture Notes in Computer Science (Vol. 3686, pp. 497–504). Springer Verlag. https://doi.org/10.1007/11551188_54
Mendeley helps you to discover research relevant for your work.