Using wiktionary to build an italian part-of-speech tagger

Tom De Smedt; Fabio Marfia; Matteo Matteucci; Walter Daelemans

Conference Proceedings

Using wiktionary to build an italian part-of-speech tagger

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8455 LNCS 1-8

DOI: 10.1007/978-3-319-07983-7_1

3Citations

6Readers

Get full text

Abstract

While there has been a lot of progress in Natural Language Processing (NLP), many basic resources are still missing for many languages, including Italian, especially resources that are free for both research and commercial use. One of these basic resources is a Part-of-Speech tagger, a first processing step in many NLP applications. We describe a weakly-supervised, fast, free and reasonably accurate part-of-speech tagger for the Italian language, created by mining words and their part-of-speech tags from Wiktionary. We have integrated the tagger in Pattern, a freely available Python toolkit. We believe that our approach is general enough to be applied to other languages as well. © Springer International Publishing Switzerland 2014.

Author supplied keywords

Cite

CITATION STYLE

APA

De Smedt, T., Marfia, F., Matteucci, M., & Daelemans, W. (2014). Using wiktionary to build an italian part-of-speech tagger. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8455 LNCS, pp. 1–8). Springer Verlag. https://doi.org/10.1007/978-3-319-07983-7_1

Using wiktionary to build an italian part-of-speech tagger

Abstract

Author supplied keywords

Cite

Register to see more suggestions