Recognizing Structure in Report Transcripts

Jeremy Jancsary

Thesis

Recognizing Structure in Report Transcripts

Jancsary J

N/ACitations

1Readers

Abstract

Typically, the output of Automatic Speech Recognition (ASR) is a mere sequence of words. This view may be sufﬁcient for some tasks, whereas others require a more structured approach. This thesis presents a framework that allows for identiﬁcation of deep, underlying structure in report dictations. Identiﬁcation of structural elements, such as headings, sections and enumerations is an important step towards automatic post-processing of dictated speech. The contributions of this thesis include a generic approach that can be integrated seamlessly with existing ASR solutions and provides structured output, as well as a freely available Conditional Random Field (CRF) toolkit that forms the basis of the aforementioned approach and may also be applicable to numerous other problems.

Recognizing Structure in Report Transcripts

Abstract

Author supplied keywords

Cite

Recognizing Structure in Report Transcripts

Abstract

Author supplied keywords

Cite

Register to see more suggestions