Methods for transcription and forced alignment of a legacy speech corpus

Rachel M. Olsen; Michael L. Olsen; Joseph A. Stanley; Margaret E.L. Renwick; William Kretzschmar

Conference ProceedingsOPEN ACCESS

Methods for transcription and forced alignment of a legacy speech corpus

Proceedings of Meetings on Acoustics (2017) 30(1)

DOI: 10.1121/2.0000559

12Citations

5Readers

Abstract

This paper describes the transcription and forced alignment of the Digital Archive of Southern Speech (DASS), a subset of the Linguistic Atlas of the Gulf States comprising 372 hours of recordings (64 interviews) conducted across eight southern U.S. states from 1968 to 1983. This project provides a large corpus of historical, semi-spontaneous Southern speech, time-Aligned to the audio for acoustic analysis. Manual orthographic transcription of full DASS interviews is carried out according to in-house guidelines that ensure consistency across files and transcribers. Separate codes are used for the interviewee, interviewer, nonspeech, overlapping and unintelligible speech. Transcriber output is converted to Praat TextGrids using scripts from LaBB-CAT, a tool for maintaining large speech corpora. TextGrids containing only the interviewee's speech are generated, and subjected to forced alignment by DARLA, which accommodates the levels of variation and noise in the DASS files with high degrees of success. Toward acoustic analysis, four methods for vowel formant extraction are evaluated: The native output of DARLA, FAVE, a local implemen-Tation of FAVE-Extract, and a Praat-based extractor that incorporates separate formant tracks for different regions of the vowel space. The workflow of transcription and analysis is presented to benefit other projects of similar size and scope.

Cite

CITATION STYLE

APA

Olsen, R. M., Olsen, M. L., Stanley, J. A., Renwick, M. E. L., & Kretzschmar, W. (2017). Methods for transcription and forced alignment of a legacy speech corpus. In Proceedings of Meetings on Acoustics (Vol. 30). Acoustical Society of America. https://doi.org/10.1121/2.0000559

Methods for transcription and forced alignment of a legacy speech corpus

Abstract

Cite

Register to see more suggestions