This paper describes design and creation of a multilingual parallel corpus for South African languages. One of the applications of the corpus, namely, the induction of a part-of-spccch tagger for Afrikaans from the data, is presented in the paper. Development of the Afrikaans part-of-spccch tagger is based on a modified method for induction of linguistic tools from parallel corpora originally proposed by Yarowsky and Ngai (2001).
CITATION STYLE
Trushkina, J. (2006). Development of a multilingual parallel corpus and a part-of-speech tagger for Afrikaans. IFIP International Federation for Information Processing, 228, 453–462. https://doi.org/10.1007/978-0-387-44641-7_47
Mendeley helps you to discover research relevant for your work.