Estimating aggressiveness of russian texts by means of machine learning

Dmitriy Levonevskiy; Dmitrii Malov; Irina Vatamaniuk

Conference Proceedings

Estimating aggressiveness of russian texts by means of machine learning

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11658 LNAI 270-279

DOI: 10.1007/978-3-030-26061-3_28

3Citations

5Readers

Get full text

Abstract

This paper considers emotional assessment of texts in Russian using machine learning on the example of aggression detection. It summarizes the related work, methods, models and datasets, describes actual problems, proposes a text processing pipeline and a software system for training neural networks on heterogeneous datasets. The experiments show that neural networks trained on the annotated corpora both in Russian and English, allow to determine whether a text item in Russian contains an aggressive message. Authors thoroughly compare different assessment methods, particularly corpus-based approaches, machine learning solutions and hybrid variants. Results, obtained here, can be used to estimate the aggressiveness probability, for example, to rank messages for subsequent manual verification. They also enable feasibility studies on the possibilities of detecting a particular type of emotion in a text using corpora in other languages. The paper highlights further research directions, where different Python toolkits (NLTK, Keras) could be used for better model performance.

Author supplied keywords

Cite

CITATION STYLE

APA

Levonevskiy, D., Malov, D., & Vatamaniuk, I. (2019). Estimating aggressiveness of russian texts by means of machine learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11658 LNAI, pp. 270–279). Springer Verlag. https://doi.org/10.1007/978-3-030-26061-3_28

Estimating aggressiveness of russian texts by means of machine learning

Abstract

Author supplied keywords

Cite

Register to see more suggestions