Improving term extraction with linguistic analysis in the biomedical domain

  • Golik W
  • Bossy R
  • Ratkovic Z
  • et al.
N/ACitations
Citations of this article
15Readers
Mendeley users who have this article in their library.

Abstract

This paper presents a linguistic-based approach to term extraction from corpora in the biomedical domain. The method is based on an analysis of terms and their context that verify linguistic constraints. It focuses on participles and prepositional complements. The purpose of our approach is to obtain terms that are relevant for knowledge acquisition applications, such as the creation and en- richment of terminologies and ontologies. We report on the evaluations we conducted by applying two complementary strategies, using a reference termi- nology and a manual validation. They were applied to two corpora of differing genres and Life Science domains, namely pharmacology patents and animal physiology scientific articles. Our work shows that the linguistic analysis-based developments significantly improve the extraction results. The method is espe- cially efficient when dealing withgerunds and to prepositionalmodifiers.

Cite

CITATION STYLE

APA

Golik, W., Bossy, R., Ratkovic, Z., & Nédellec, C. (2013). Improving term extraction with linguistic analysis in the biomedical domain. Research in Computing Science, 70(1), 157–172. https://doi.org/10.13053/rcs-70-1-12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free