When considering a social media corpus, we often have access to structural information about how messages are flowing between people or organizations. This information is particularly useful when the linguistic evidence is sparse, incomplete, or of dubious quality. In this paper we construct a simple model to leverage the structure of Twitter data to help determine the set of languages each user is fluent in. Our results demonstrate that imposing several intuitive constraints leads to improvements in performance and stability. We release the first annotated data set for exploring this task, and discuss how our approach may be extended to other applications.
CITATION STYLE
Lippincott, T., & Van Durme, B. (2016). Fluency detection on communication networks. In EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 1025–1029). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d16-1107
Mendeley helps you to discover research relevant for your work.