AmritaCEN at SemEval-2016 task 11: Complex word identification using word embedding

S. P. Sanjay; Anand M. Kumar; K. P. Soman

Conference ProceedingsOPEN ACCESS

AmritaCEN at SemEval-2016 task 11: Complex word identification using word embedding

SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings (2016) 1022-1027

DOI: 10.18653/v1/s16-1159

10Citations

80Readers

Abstract

Complex word identification task focuses on identifying the difficult word from English sentence for a Non-Native speakers. Non-Native speakers are those who don't have English as their native language. It is a subtask for lexical simplification. We have experimented with word embedding features, orthographic word features, similarity features and POS tag features which improves the performance of the classification. In addition to the SemEval 2016 results we have evaluated the training data by varying the vector dimension size and obtained the best possible size for producing better performance. The SVM learning algorithm will attains constant and maximum accuracy through linear classifier. We achieve a G-score of 0.43 and 0.54 on computing complex words for two systems.

Cite

CITATION STYLE

APA

Sanjay, S. P., Kumar, A. M., & Soman, K. P. (2016). AmritaCEN at SemEval-2016 task 11: Complex word identification using word embedding. In SemEval 2016 - 10th International Workshop on Semantic Evaluation, Proceedings (pp. 1022–1027). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s16-1159

AmritaCEN at SemEval-2016 task 11: Complex word identification using word embedding

Abstract

Cite

Register to see more suggestions