In this paper we propose a method for estimating speaking rate by means of Deep Neural Networks (DNN). The proposed approach is used for speaking rate adaptation of an automatic speech recognition system. The adaptation is performed by changing step in front-end feature processing according to the estimations of speaking rate. Experiments show that adaptation results using the proposed DNN-based speaking rate estimator are better than the results of adaptation using the speaking rate estimator based on the recognition results.
CITATION STYLE
Tomashenko, N., & Khokhlov, Y. (2014). Speaking rate estimation based on deep neural networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 418–424). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_52
Mendeley helps you to discover research relevant for your work.