hub at SemEval-2021 Task 1: Fusion of Sentence and Word Frequency to Predict Lexical Complexity

Bo Huang; Yang Bai; Xiaobing Zhou

Conference ProceedingsOPEN ACCESS

hub at SemEval-2021 Task 1: Fusion of Sentence and Word Frequency to Predict Lexical Complexity

SemEval 2021 - 15th International Workshop on Semantic Evaluation, Proceedings of the Workshop (2021) 598-602

DOI: 10.18653/v1/2021.semeval-1.75

4Citations

42Readers

Abstract

In this paper, we propose a method of fusing sentence information and word frequency information for the SemEval 2021 Task 1-Lexical Complexity Prediction (LCP) shared task. In our system, the sentence information comes from the RoBERTa model, and the word frequency information comes from the Tf-Idf algorithm. Use Inception block as a shared layer to learn sentence and word frequency information. We described the implementation of our best system and discussed our methods and experiments in the task. The shared task is divided into two subtasks. The goal of the two subtasks is to predict the complexity of a predetermined word. The evaluation index of the task is the Pearson correlation coefficient. Our best performance system has Pearson correlation coefficients of 0.7434 and 0.8000 in the single-token subtask test set and the multi-token subtask test set, respectively.

Cite

CITATION STYLE

APA

Huang, B., Bai, Y., & Zhou, X. (2021). hub at SemEval-2021 Task 1: Fusion of Sentence and Word Frequency to Predict Lexical Complexity. In SemEval 2021 - 15th International Workshop on Semantic Evaluation, Proceedings of the Workshop (pp. 598–602). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.semeval-1.75

hub at SemEval-2021 Task 1: Fusion of Sentence and Word Frequency to Predict Lexical Complexity

Abstract

Cite

Register to see more suggestions