Unstructured clinical notes are rich sources for valuable patient information. Information extraction techniques can be employed to extract this valuable information, which in turn can be used to discover new knowledge. Named entity recognition and normalization are the basic tasks involved in information extraction. In this paper, identification of disorder named entities and the mapping of identified disorder entities to SNOMED-CT terminology using UMLS Metathesaurus is presented. A supervised linear chain conditional random field model based on sets of features to predict disorder mentions is used in conjunction with MetaMap to identify and normalize disorders. Error analysis conclude that recall of the developed system can be significantly increased by adding more features during model development and also by using a frame based approach for handling disjoint entities.
CITATION STYLE
Jonnagaddala, J., Liaw, S. T., Ray, P., Kumar, M., & Dai, H. J. (2015). TMUNSW: Identification of disorders and normalization to SNOMED-CT terminology in unstructured clinical notes. In SemEval 2015 - 9th International Workshop on Semantic Evaluation, co-located with the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2015 - Proceedings (pp. 394–398). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s15-2068
Mendeley helps you to discover research relevant for your work.