Evaluation of Several Phonetic Similarity Algorithms on the Task of Cognate Identification

38Citations
Citations of this article
101Readers
Mendeley users who have this article in their library.

Abstract

We investigate the problem of measuring phonetic similarity, focusing on the identification of cognates, words of the same origin in different languages. We compare representatives of two principal approaches to computing phonetic similarity: manually-designed metrics, and learning algorithms. In particular, we consider a stochastic transducer, a Pair HMM, several DBN models, and two constructed schemes. We test those approaches on the task of identifying cognates among Indoeuropean languages, both in the supervised and unsupervised context. Our results suggest that the averaged context DBN model and the Pair HMM achieve the highest accuracy given a large training set of positive examples.

Cite

CITATION STYLE

APA

Kondrak, G., & Sherif, T. (2006). Evaluation of Several Phonetic Similarity Algorithms on the Task of Cognate Identification. In COLING ACL 2006 - Linguistic Distances, Proceedings of the Workshop (pp. 43–50). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1641976.1641983

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free