Disfluency detection with a semi-markov model and prosodic features

James Ferguson; Greg Durrett; Dan Klein

Conference ProceedingsOPEN ACCESS

Disfluency detection with a semi-markov model and prosodic features

NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (2015) 257-262

DOI: 10.3115/v1/n15-1029

53Citations

95Readers

Abstract

We present a discriminative model for detecting disfluencies in spoken language transcripts. Structurally, our model is a semi-Markov conditional random field with features targeting characteristics unique to speech repairs. This gives a significant performance improvement over standard chain-structured CRFs that have been employed in past work. We then incorporate prosodic features over silences and relative word duration into our semi-CRF model, resulting in further performance gains; moreover, these features are not easily replaced by discrete prosodic indicators such as ToBI breaks. Our final system, the semi-CRF with prosodic information, achieves an F-score of 85.4, which is 1.3 F1 better than the best prior reported F-score on this dataset.

Cite

CITATION STYLE

APA

Ferguson, J., Durrett, G., & Klein, D. (2015). Disfluency detection with a semi-markov model and prosodic features. In NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 257–262). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/n15-1029

Disfluency detection with a semi-markov model and prosodic features

Abstract

Cite

Register to see more suggestions