Natural language processing to extract medical problems from electronic clinical documents: Performance evaluation

  • Meystre S
  • Haug P
  • 144


    Mendeley users who have this article in their library.
  • 83


    Citations of this article.


In this study, we evaluate the performance of a Natural Language Processing (NLP) application designed to extract medical problems from narrative text clinical documents. The documents come from a patient's electronic medical record and medical problems are proposed for inclusion in the patient's electronic problem list. This application has been developed to help maintain the problem list and make it more accurate, complete, and up-to-date. The NLP part of this system-analyzed in this study-uses the UMLS MetaMap Transfer (MMTx) application and a negation detection algorithm called NegEx to extract 80 different medical problems selected for their frequency of use in our institution. When using MMTx with its default data set, we measured a recall of 0.74 and a precision of 0.756. A custom data subset for MMTx was created, making it faster and significantly improving the recall to 0.896 with a non-significant reduction in precision. © 2005 Elsevier Inc. All rights reserved.

Author-supplied keywords

  • Medical Records
  • MetaMap Transfer
  • Natural Language Processing
  • NegEx
  • Problem oriented
  • Program evaluation

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document

Get full text


  • Stéphane Meystre

  • Peter J. Haug

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free