Hate speech is a surely relevant problem in Brazil. Nevertheless, its regulation is not effective due to the difficulty to identify, quantify and classify offensive comments. Here, we introduce a novel system for offensive comment analysis in Brazilian Portuguese. The system titled NoHateBrazil1 recognizes explicit and implicit offensiveness in context at a fine-grained level. Specifically, we propose a framework for data collection, human annotation and machine learning models that were used to build the system. In addition, we assess the potential of our system to reflect stereotypical beliefs against marginalized groups by contrasting them with counter-stereotypes. As a result, a friendly web application was implemented, which besides presenting relevant performance, showed promising results towards mitigation of the risk of reinforcing social stereotypes. Lastly, new measures were proposed to improve the explainability of offensiveness classification and reliability of the model's predictions.
CITATION STYLE
Vargas, F., Carvalho, I., Schmeisser-Nieto, W. S., Benevenuto, F., & Pardo, T. A. S. (2023). NoHateBrazil: A Brazilian Portuguese Text Offensiveness Analysis System. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 1180–1186). Incoma Ltd. https://doi.org/10.26615/978-954-452-092-2_125
Mendeley helps you to discover research relevant for your work.