TermEnsembler:An ensemble learning approach to bilingual term extraction and alignment

  • Repar A
  • Podpečan V
  • Vavpetič A
  • et al.
N/ACitations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

This paper describes TermEnsembler, a bilingual term extraction and alignment system utilizing a novel ensemble learning approach to bilingual term alignment. In the proposed system, the processing starts with monolingual term extraction from a language industry standard file type containing aligned English and Slovenian texts. The two separate term lists are then automatically aligned using an ensemble of seven bilingual alignment methods, which are first executed separately and then merged using the weights learned with an evolutionary algorithm. In the experiments, the weights were learned on one domain and tested on two other domains. When evaluated on the top 400 aligned term pairs, the precision of term alignment is over 96%, while the number of correctly aligned multi-word unit terms exceeds 30% when evaluated on the top 400 term pairs. © 2019 Elsevier B.V., All rights reserved.

Cite

CITATION STYLE

APA

Repar, A., Podpečan, V., Vavpetič, A., Lavrač, N., & Pollak, S. (2019). TermEnsembler:An ensemble learning approach to bilingual term extraction and alignment. Terminology, 25(1), 93–120. Retrieved from https://www.scopus.com/inward/record.uri?eid=2-s2.0-85070381504&doi=10.1075%2Fterm.00029.rep&partnerID=40&md5=b28906688d46b9d9419b789bf9162d30

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free