NoHateBrazil: A Brazilian Portuguese Text Offensiveness Analysis System

0Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

Hate speech is a surely relevant problem in Brazil. Nevertheless, its regulation is not effective due to the difficulty to identify, quantify and classify offensive comments. Here, we introduce a novel system for offensive comment analysis in Brazilian Portuguese. The system titled NoHateBrazil1 recognizes explicit and implicit offensiveness in context at a fine-grained level. Specifically, we propose a framework for data collection, human annotation and machine learning models that were used to build the system. In addition, we assess the potential of our system to reflect stereotypical beliefs against marginalized groups by contrasting them with counter-stereotypes. As a result, a friendly web application was implemented, which besides presenting relevant performance, showed promising results towards mitigation of the risk of reinforcing social stereotypes. Lastly, new measures were proposed to improve the explainability of offensiveness classification and reliability of the model's predictions.

Cite

CITATION STYLE

APA

Vargas, F., Carvalho, I., Schmeisser-Nieto, W. S., Benevenuto, F., & Pardo, T. A. S. (2023). NoHateBrazil: A Brazilian Portuguese Text Offensiveness Analysis System. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 1180–1186). Incoma Ltd. https://doi.org/10.26615/978-954-452-092-2_125

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free