Stacked sentence-document classifier approach for improving native language identification

Andrea Cimino; Felice Dell'Orletta

Conference ProceedingsOPEN ACCESS

Stacked sentence-document classifier approach for improving native language identification

EMNLP 2017 - 12th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2017 - Proceedings of the Workshop (2017) 430-437

DOI: 10.18653/v1/w17-5049

16Citations

64Readers

Abstract

In this paper, we describe the approach of the ItaliaNLP Lab team to native language identification and discuss the results we submitted as participants to the essay track of NLI Shared Task 2017. We introduce for the first time a 2-stacked sentencedocument architecture for native language identification that is able to exploit both local sentence information and a wide set of general-purpose features qualifying the lexical and grammatical structure of the whole document. When evaluated on the official test set, our sentence-document stacked architecture obtained the best result among all the participants of the essay track with an F1 score of 0.8818.

Cite

CITATION STYLE

APA

Cimino, A., & Dell’Orletta, F. (2017). Stacked sentence-document classifier approach for improving native language identification. In EMNLP 2017 - 12th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2017 - Proceedings of the Workshop (pp. 430–437). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w17-5049

Stacked sentence-document classifier approach for improving native language identification

Abstract

Cite

Register to see more suggestions