Native-like Expression Identification by Contrasting Native and Proficient Second Language Speakers

Oleksandr Harust; Yugo Murawaki; Sadao Kurohashi

Conference ProceedingsOPEN ACCESS

Native-like Expression Identification by Contrasting Native and Proficient Second Language Speakers

COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (2020) 5843-5854

DOI: 10.18653/v1/2020.coling-main.514

2Citations

62Readers

Abstract

We propose a novel task of native-like expression identification by contrasting texts written by native speakers and those by proficient second language speakers. This task is highly challenging mainly because 1) the combinatorial nature of expressions prevents us from choosing candidate expressions a priori and 2) the distributions of the two types of texts overlap considerably. Our solution to the first problem is to combine a powerful neural network-based classifier of sentence-level nativeness with an explainability method that measures an approximate contribution of a given expression to the classifier’s prediction. To address the second problem, we introduce a special label neutral and reformulate the classification task as complementary-label learning. Our crowdsourcing-based evaluation and in-depth analysis suggest that our method successfully uncovers linguistically interesting usages distinctive of native speech.

Cite

CITATION STYLE

APA

Harust, O., Murawaki, Y., & Kurohashi, S. (2020). Native-like Expression Identification by Contrasting Native and Proficient Second Language Speakers. In COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp. 5843–5854). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.coling-main.514

Native-like Expression Identification by Contrasting Native and Proficient Second Language Speakers

Abstract

Cite

Register to see more suggestions