Deep Learning for Punctuation Restoration in Medical Reports

Wael Salloum; Greg Finley; Erik Edwards; Mark Miller; David Suendermann-Oeft

Conference ProceedingsOPEN ACCESS

Deep Learning for Punctuation Restoration in Medical Reports

BioNLP 2017 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 16th BioNLP Workshop (2017) 159-164

DOI: 10.18653/v1/w17-2319

25Citations

96Readers

Abstract

In clinical dictation, speakers try to be as concise as possible to save time, often resulting in utterances without explicit punctuation commands. Since the end product of a dictated report, e.g. an out-patient letter, does require correct orthography, including exact punctuation, the latter need to be restored, preferably by automated means. This paper describes a method for punctuation restoration based on a state-of-the-art stack of NLP and machine learning techniques including B-RNNs with an attention mechanism and late fusion, as well as a feature extraction technique tailored to the processing of medical terminology using a novel vocabulary reduction model. To the best of our knowledge, the resulting performance is superior to that reported in prior art on similar tasks.

Cite

CITATION STYLE

APA

Salloum, W., Finley, G., Edwards, E., Miller, M., & Suendermann-Oeft, D. (2017). Deep Learning for Punctuation Restoration in Medical Reports. In BioNLP 2017 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 16th BioNLP Workshop (pp. 159–164). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w17-2319

Deep Learning for Punctuation Restoration in Medical Reports

Abstract

Cite

Register to see more suggestions