Parallel corpora for wordnet construction: Machine translation vs. automatic sense tagging

Antoni Oliver; Salvador Climent

Conference Proceedings

Parallel corpora for wordnet construction: Machine translation vs. automatic sense tagging

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7182 LNCS(PART 2) 110-121

DOI: 10.1007/978-3-642-28601-8_10

11Citations

12Readers

Get full text

Abstract

In this paper we present a methodology for WordNet construction based on the exploitation of parallel corpora with semantic annotation of the English source text. We are using this methodology for the enlargement of the Spanish and Catalan versions of WordNet 3.0, but the methodology can also be used for other languages. As big parallel corpora with semantic annotation are not usually available, we explore two strategies to overcome this problem: to use monolingual sense tagged corpora and machine translation, on the one hand; and to use parallel corpora and automatic sense tagging on the source text, on the other. With these resources, the problem of acquiring a WordNet from parallel corpora can be seen as a word alignment task. Fortunately, this task is well known, and some aligning algorithms are freely available. © 2012 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Oliver, A., & Climent, S. (2012). Parallel corpora for wordnet construction: Machine translation vs. automatic sense tagging. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7182 LNCS, pp. 110–121). https://doi.org/10.1007/978-3-642-28601-8_10

Parallel corpora for wordnet construction: Machine translation vs. automatic sense tagging

Abstract

Author supplied keywords

Cite

Register to see more suggestions