Utilizing Machine Learning for Detecting Harmful Situations by Audio and Text

Merav Allouch, Noa Mansbach, Amos Azaria, Rina Azoulay

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Children with special needs may struggle to identify uncomfortable and unsafe situations. In this study, we aimed at developing an automated system that can detect such situations based on audio and text cues to encourage children’s safety and prevent situations of violence toward them. We composed a text and audio database with over 1891 sentences extracted from videos presenting real-world situations, and categorized them into three classes: neutral sentences, insulting sentences, and sentences indicating unsafe conditions. We compared insulting and unsafe sentence-detection abilities of various machine-learning methods. In particular, we found that a deep neural network that accepts the text embedding vectors of bidirectional encoder representations from transformers (BERT) and audio embedding vectors of Wav2Vec as input attains the highest accuracy in detecting unsafe and insulting situations. Our results indicate that it may be applicable to build an automated agent that can detect unsafe and unpleasant situations that children with special needs may encounter, given the dialogue contexts conducted with these children.

Original languageEnglish
Article number3927
JournalApplied Sciences (Switzerland)
Volume13
Issue number6
DOIs
StatePublished - Mar 2023

Keywords

  • assistive technologies for persons with disabilities
  • audio classification
  • bulling
  • children’s safety
  • machine learning
  • pretrained models
  • text classification

Fingerprint

Dive into the research topics of 'Utilizing Machine Learning for Detecting Harmful Situations by Audio and Text'. Together they form a unique fingerprint.

Cite this