This paper presents a linguistic-based approach to term extraction from corpora in the biomedical domain. The method is based on an analysis of terms and their context that verify linguistic constraints. It focuses on participles and prepositional complements. The purpose of our approach is to obtain terms that are relevant for knowledge acquisition applications, such as the creation and en- richment of terminologies and ontologies. We report on the evaluations we conducted by applying two complementary strategies, using a reference termi- nology and a manual validation. They were applied to two corpora of differing genres and Life Science domains, namely pharmacology patents and animal physiology scientific articles. Our work shows that the linguistic analysis-based developments significantly improve the extraction results. The method is espe- cially efficient when dealing withgerunds and to prepositionalmodifiers.
CITATION STYLE
Golik, W., Bossy, R., Ratkovic, Z., & Nédellec, C. (2013). Improving term extraction with linguistic analysis in the biomedical domain. Research in Computing Science, 70(1), 157–172. https://doi.org/10.13053/rcs-70-1-12
Mendeley helps you to discover research relevant for your work.