Collection and Analyses of WSJ-CSR Corpus at MIT

Abstract

Recently, the DARPA community in the United States started a new data collection initiative in the Wall Street Journal (WSJ) domain to support research and development of very large vocabulary continuous speech recognition (CSR) systems. Since August 1991, our group has actively participated in the development of the WSJ-CSR corpus. The purpose of this paper is to document our involvement in this process, from recording and transcription to analyses and distribution. We will also present the results of an experiment investigating the preprocessing of the prompt text.

Cite

Phillips, M., Glass, J., Polifroni, J., & Zue, V. (1992). Collection and Analyses of WSJ-CSR Corpus at MIT. In 2nd International Conference on Spoken Language Processing, ICSLP 1992 (pp. 907–910). International Speech Communication Association (ISCA). https://doi.org/10.21437/icslp.1992-279
