Chunking Clinical Text Containing Non-Canonical Language

2Citations
Citations of this article
80Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Free text notes typed by primary care physicians during patient consultations typically contain highly non-canonical language. Shallow syntactic analysis of free text notes can help to reveal valuable information for the study of disease and treatment. We present an exploratory study into chunking such text using off-the-shelf language processing tools and pre-trained statistical models. We evaluate chunking accuracy with respect to part-of-speech tagging quality, choice of chunk representation, and breadth of context features. Our results indicate that narrow context feature windows give the best results, but that chunk representation and minor differences in tagging quality do not have a significant impact on chunking accuracy.

Cite

CITATION STYLE

APA

Savkov, A., Carroll, J., & Cassell, J. (2014). Chunking Clinical Text Containing Non-Canonical Language. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 77–82). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-3411

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free