Enhancing multilingual biomedical terminologies via machine translation from parallel corpora

Abstract

Creating and maintaining terminologies by human experts is known to be a resource-expensive task. Here we report on efforts to support this process computationally by treating term acquisition as a machine translation-guided classification problem that capitalizes on parallel multilingual corpora. Experiments are described for the French, German, Spanish and Dutch parts of a multilingual biomedical terminology, for which we generated 18k, 23k, 19k and 12k new terms and synonyms, respectively; about one half relate to concepts that had not been lexically labeled before. Based on expert assessment of a sample of the novel German segment, about 80% of these newly acquired terms were judged to be linguistically correct and biomedically reasonable additions to the terminology. © Springer International Publishing Switzerland 2014.

Cite

Hellrich, J., & Hahn, U. (2014). Enhancing multilingual biomedical terminologies via machine translation from parallel corpora. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8455 LNCS, pp. 9–20). Springer Verlag. https://doi.org/10.1007/978-3-319-07983-7_2
