At ATR, a next-generation speech translation system is under development towards natural trans-language communication. To cope with the various requirements to speech recognition technology for the new system, further research efforts should emphasize the robustness for large vocabulary, speaking variations often found in fast spontaneous speech and speaker variances. These are key problems to be solved not only for speech translation but also for the general use of speech recognition in real environments. In this paper, three large speech databases are designed to cope with these problems in speech recognition and the current status of data collection is reported.
CITATION STYLE
Nakamura, A., Matsunaga, S., Shimizu, T., Tonomura, M., & Sagisaka, Y. (1996). Japanese speech databases for robust speech recognition. In International Conference on Spoken Language Processing, ICSLP, Proceedings (Vol. 4, pp. 2199–2202). IEEE. https://doi.org/10.21437/icslp.1996-557
Mendeley helps you to discover research relevant for your work.