In this paper we present new methods for language classification which put to good use both syntax and fuzzy tools, and are capable of dealing with irrelevant linguistic features (i.e. features which should not contribute to the classification) and even inconsistent features (which do not make sense for specific languages). We introduce a metric distance, based on the generalized Steinhaus transform, which allows one to deal jointly with irrelevance and inconsistency. To evaluate our methods, we test them on a syntactic data set, due to the linguist G. Longobardi and his school. We obtain phylogenetic trees which sometimes outperform the ones obtained by Atkinson and Gray (Gray and Atkinson, 2003; Bouckaert et al., 2012).
CITATION STYLE
Franzoi, L., Sgarro, A., Dinu, A., & Dinu, L. P. (2019). Linguistic classification: Dealing jointly with irrelevance and inconsistency. In International Conference Recent Advances in Natural Language Processing, RANLP (Vol. 2019-September, pp. 345–352). Incoma Ltd. https://doi.org/10.26615/978-954-452-056-4_040
Mendeley helps you to discover research relevant for your work.