Classification of Censored Tweets in Chinese Language using XLNet

Shaikh Sahil Ahmed; M. Anand Kumar

Conference ProceedingsOPEN ACCESS

Classification of Censored Tweets in Chinese Language using XLNet

NLP4IF 2021 - NLP for Internet Freedom: Censorship, Disinformation, and Propaganda, Proceedings of the 4th Workshop (2021) 136-139

DOI: 10.18653/v1/2021.nlp4if-1.21

3Citations

53Readers

Abstract

In the growth of today’s world and advanced technology, social media networks play a significant role in impacting human lives. Censorship is the overthrowing of speech, public transmission, or other details that play a vast role in social media. The content may be considered harmful, sensitive, or inconvenient. Authorities like institutes, governments, and other organizations conduct Censorship. This paper has implemented a model that helps classify censored and uncensored tweets as a binary classification. The paper describes submission to the Censorship shared task of the NLP4IF 2021 workshop. We used various transformer-based pre-trained models, and XLNet outputs a better accuracy among all. We fine-tuned the model for better performance and achieved a reasonable accuracy, and calculated other performance metrics.

Cite

CITATION STYLE

APA

Ahmed, S. S., & Anand Kumar, M. (2021). Classification of Censored Tweets in Chinese Language using XLNet. In NLP4IF 2021 - NLP for Internet Freedom: Censorship, Disinformation, and Propaganda, Proceedings of the 4th Workshop (pp. 136–139). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.nlp4if-1.21

Classification of Censored Tweets in Chinese Language using XLNet

Abstract

Cite

Register to see more suggestions