Compilation of a Swiss German Dialect Corpus and its Application to PoS Tagging

27Citations
Citations of this article
74Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Swiss German is a dialect continuum whose dialects are very different from Standard German, the official language of the German part of Switzerland. However, dealing with Swiss German in natural language processing, usually the detour through Standard German is taken. As writing in Swiss German has become more and more popular in recent years, we would like to provide data to serve as a stepping stone to automatically process the dialects. We compiled NOAH s Corpus of Swiss German Dialects consisting of various text genres, manually annotated with Part-of-Speech tags. Furthermore, we applied this corpus as training set to a statistical Part-of-Speech tagger and achieved an accuracy of 90.62%.

Cite

CITATION STYLE

APA

Hollenstein, N., & Aepli, N. (2014). Compilation of a Swiss German Dialect Corpus and its Application to PoS Tagging. In 1st Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects, VarDial 2014 at the 25th International Conference on Computational Linguistics: System Demonstrations, COLING 2014 - Proceedings (pp. 85–94). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-5310

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free