Identifying cognate sets across dictionaries of related languages

Abstract

We present a system for identifying cognate sets across dictionaries of related languages. The likelihood of a cognate relationship is calculated on the basis of a rich set of features that capture both phonetic and semantic similarity, as well as the presence of regular sound correspondences. The similarity scores are used to cluster words from different languages that may originate from a common proto-word. When tested on the Algonquian language family, our system detects 63% of cognate sets while maintaining cluster purity of 70%.
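The scoring-then-clustering pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the normalized edit-distance `similarity` function is an assumed stand-in for the paper's feature-based likelihood, the single-link merging rule and the `threshold` value are assumptions, and the word forms and language labels `A`, `B`, `C` are invented for illustration rather than taken from any Algonquian dictionary. The `purity` function follows the standard definition of cluster purity, which may differ in detail from the paper's evaluation.

```python
from collections import Counter
from itertools import combinations


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via a rolling one-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Toy score in [0, 1]; an assumed stand-in for a learned cognate likelihood."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - edit_distance(a, b) / longest


def cluster(entries, threshold=0.6):
    """Single-link clustering: merge cross-language pairs scoring >= threshold."""
    parent = list(range(len(entries)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(entries)), 2):
        (lang_i, word_i), (lang_j, word_j) = entries[i], entries[j]
        # Only words from different languages are candidate cognates here.
        if lang_i != lang_j and similarity(word_i, word_j) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i, entry in enumerate(entries):
        groups.setdefault(find(i), []).append(entry)
    return list(groups.values())


def purity(clusters, gold):
    """Fraction of items that carry the majority gold label of their cluster."""
    total = sum(len(c) for c in clusters)
    majority = sum(max(Counter(gold[e] for e in c).values()) for c in clusters)
    return majority / total


# Invented (language, word) entries with hypothetical gold cognate-set labels.
gold = {
    ("A", "makwa"): 1, ("B", "maskwa"): 1, ("C", "mahkwa"): 1,
    ("A", "nipi"): 2, ("B", "nipiy"): 2,
    ("C", "nipa"): 3,  # a look-alike that belongs to a different gold set
}
entries = list(gold)
clusters = cluster(entries)
print(len(clusters), purity(clusters, gold))
```

In this toy data the look-alike form is absorbed into a cluster it does not belong to, which is the kind of error that lowers purity. A surface-string score cannot separate such cases, which is one motivation for combining phonetic similarity with semantic similarity and regular sound correspondences as the abstract describes.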

Citation (APA)

St Arnaud, A., Beck, D., & Kondrak, G. (2017). Identifying cognate sets across dictionaries of related languages. In EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 2519–2528). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d17-1267
