Utilizing Machine Learning for Detecting Harmful Situations by Audio and Text

3Citations
Citations of this article
41Readers
Mendeley users who have this article in their library.

Abstract

Children with special needs may struggle to identify uncomfortable and unsafe situations. In this study, we aimed at developing an automated system that can detect such situations based on audio and text cues to encourage children’s safety and prevent situations of violence toward them. We composed a text and audio database with over 1891 sentences extracted from videos presenting real-world situations, and categorized them into three classes: neutral sentences, insulting sentences, and sentences indicating unsafe conditions. We compared insulting and unsafe sentence-detection abilities of various machine-learning methods. In particular, we found that a deep neural network that accepts the text embedding vectors of bidirectional encoder representations from transformers (BERT) and audio embedding vectors of Wav2Vec as input attains the highest accuracy in detecting unsafe and insulting situations. Our results indicate that it may be applicable to build an automated agent that can detect unsafe and unpleasant situations that children with special needs may encounter, given the dialogue contexts conducted with these children.

Cite

CITATION STYLE

APA

Allouch, M., Mansbach, N., Azaria, A., & Azoulay, R. (2023). Utilizing Machine Learning for Detecting Harmful Situations by Audio and Text. Applied Sciences (Switzerland), 13(6). https://doi.org/10.3390/app13063927

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free