TCU at SemEval-2022 Task 8: A Stacking Ensemble Transformer Model for Multilingual News Article Similarity

Xiang Luo; Yanqing Niu; Boer Zhu

Conference Proceedings, Open Access

SemEval 2022 - 16th International Workshop on Semantic Evaluation, Proceedings of the Workshop (2022) 1202-1207

DOI: 10.18653/v1/2022.semeval-1.170

Abstract

Previous studies measured text similarity mainly with traditional machine learning methods such as Support Vector Regression (SVR). This paper describes our Transformer-based contribution to SemEval-2022 Task 8, Multilingual News Article Similarity. The task requires predicting a similarity score for pairs of multilingual articles as a regression problem, rather than classifying whether two texts are similar. We describe the model architecture, how we tuned its parameters in our experiments, and how we strengthened its generalization ability. We built several transformer-based models and combined them using ensemble learning. To avoid overfitting, we focused on parameter tuning and on improving generalization. In our final submission, we achieved a score of 0.715 and ranked 21st.
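As a rough illustration of the stacking idea named in the title, the sketch below combines the outputs of several base regressors with a small meta-regressor. It is not the authors' implementation: the base "models" are simulated as noisy copies of synthetic gold scores, the ridge meta-learner, the function names (`fit_ridge_meta`, `predict_meta`, `pearson`), and the use of Pearson correlation as the metric are all assumptions made for the example.

```python
import numpy as np


def fit_ridge_meta(base_preds, targets, alpha=1.0):
    """Fit a ridge meta-regressor on base-model predictions (hypothetical helper)."""
    # Append a bias column so the meta-learner can shift the combined score.
    X = np.hstack([base_preds, np.ones((base_preds.shape[0], 1))])
    reg = alpha * np.eye(X.shape[1])
    reg[-1, -1] = 0.0  # leave the bias term unpenalized
    return np.linalg.solve(X.T @ X + reg, X.T @ targets)


def predict_meta(weights, base_preds):
    """Combine base-model predictions with the learned meta weights."""
    X = np.hstack([base_preds, np.ones((base_preds.shape[0], 1))])
    return X @ weights


def pearson(a, b):
    """Pearson correlation, used here only as an illustrative metric."""
    return float(np.corrcoef(a, b)[0, 1])


rng = np.random.default_rng(0)

# Synthetic similarity scores; the range is arbitrary, not taken from the task.
gold = rng.uniform(1.0, 4.0, size=400)

# Stand-ins for three transformer regressors: gold scores plus independent noise.
noise_levels = (0.5, 0.7, 0.9)
base = np.column_stack(
    [gold + rng.normal(0.0, s, size=gold.size) for s in noise_levels]
)

# Fit the meta-learner on one split and evaluate on a held-out split.
train, test = slice(0, 300), slice(300, 400)
weights = fit_ridge_meta(base[train], gold[train])
stacked = predict_meta(weights, base[test])

for i, s in enumerate(noise_levels):
    print(f"base model {i} (noise {s}): r = {pearson(base[test][:, i], gold[test]):.3f}")
print(f"stacked ensemble: r = {pearson(stacked, gold[test]):.3f}")
```

In a real system the columns of `base` would be held-out (for example, out-of-fold) predictions from fine-tuned transformer models, so that the meta-learner is not fit on predictions the base models made for their own training data.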

Cite

Luo, X., Niu, Y., & Zhu, B. (2022). TCU at SemEval-2022 Task 8: A Stacking Ensemble Transformer Model for Multilingual News Article Similarity. In SemEval 2022 - 16th International Workshop on Semantic Evaluation, Proceedings of the Workshop (pp. 1202–1207). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.semeval-1.170
