DeepBlueAI at SemEval-2021 Task 7: Detecting and Rating Humor and Offense with Stacking Diverse Language Model-Based Methods

Bingyan Song; Chunguang Pan; Shengguang Wang; Zhipeng Luo

Conference Proceedings, Open Access

SemEval 2021 - 15th International Workshop on Semantic Evaluation, Proceedings of the Workshop (2021) 1130-1134

DOI: 10.18653/v1/2021.semeval-1.158

Abstract

This paper describes the winning system for SemEval-2021 Task 7: Detecting and Rating Humor and Offense. Our strategy is to stack diverse pre-trained language models (PLMs) such as RoBERTa and ALBERT. We first fine-tune these two PLMs with various hyperparameters and different training strategies. An effective stacking mechanism is then applied on top of the fine-tuned PLMs to produce the final prediction. Experimental results on the dataset released by the task organizers show the validity of our method: we won first place in subtask 2 and third place in subtask 1a.
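The abstract does not say how the stacking mechanism is built, so the following is only a generic sketch of the idea, not the authors' implementation. It assumes each fine-tuned PLM has already produced an out-of-fold probability for every training example, and it fits a small logistic-regression meta-learner on those probabilities. The function names (`fit_stacker`, `stack_predict`), the choice of meta-learner, and the toy numbers are all illustrative assumptions.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_stacker(oof_preds, labels, lr=0.5, epochs=2000):
    """Fit a logistic-regression meta-learner by batch gradient descent.

    oof_preds: one row per example; each row holds the out-of-fold
               probabilities produced by the base models (assumed given).
    labels:    0/1 gold labels, one per example.
    """
    n_models = len(oof_preds[0])
    n = len(labels)
    weights = [0.0] * n_models
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_models
        grad_b = 0.0
        for row, y in zip(oof_preds, labels):
            p = sigmoid(bias + sum(w * x for w, x in zip(weights, row)))
            err = p - y  # gradient of the log loss w.r.t. the logit
            for j, x in enumerate(row):
                grad_w[j] += err * x
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def stack_predict(weights, bias, row):
    """Combine the base-model probabilities for one example."""
    return sigmoid(bias + sum(w * x for w, x in zip(weights, row)))


# Toy out-of-fold probabilities from two hypothetical base models
# (for example, one RoBERTa run and one ALBERT run); made-up values.
oof_preds = [
    [0.9, 0.8],
    [0.8, 0.6],
    [0.7, 0.9],
    [0.2, 0.3],
    [0.1, 0.4],
    [0.3, 0.1],
]
labels = [1, 1, 1, 0, 0, 0]

weights, bias = fit_stacker(oof_preds, labels)
```

Calling `stack_predict(weights, bias, [0.9, 0.9])` then gives the stacked probability for a new example on which both base models are confident. Training the meta-learner on out-of-fold rather than in-sample predictions is the usual way to keep it from simply trusting whichever base model overfit the training data most.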

Cite

Song, B., Pan, C., Wang, S., & Luo, Z. (2021). DeepBlueAI at SemEval-2021 Task 7: Detecting and Rating Humor and Offense with Stacking Diverse Language Model-Based Methods. In SemEval 2021 - 15th International Workshop on Semantic Evaluation, Proceedings of the Workshop (pp. 1130–1134). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.semeval-1.158
