Abstract
Recently, the DARPA community in the United States started a new data collection initiative in the Wall Street Journal (WSJ) domain to support research and development of very large vocabulary continuous speech recognition (CSR) systems. Since August 1991, our group has actively participated in the development of the WSJ-CSR corpus. The purpose of this paper is to document our involvement in this process, from recording and transcription to analyses and distribution. We will also present the results of an experiment investigating the preprocessing of the prompt text.
Cite
CITATION STYLE
Phillips, M., Glass, J., Polifroni, J., & Zue, V. (1992). Collection and Analyses of WSJ-CSR Corpus at MIT. In 2nd International Conference on Spoken Language Processing, ICSLP 1992 (pp. 907–910). The International Society for Computers and Their Applications (ISCA). https://doi.org/10.21437/icslp.1992-279
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.