YouTube videos are one of the most effective platforms for disseminating creative material and ideas, and they appeal to a diverse audience. Along with adults and older children, young children are avid consumers of YouTube materials. Children often lack means to evaluate if a given content is appropriate for their age, and parents have very limited options to enforce content restrictions on YouTube. Young children can thus become exposed to inappropriate content, such as violent, scary or disturbing videos on YouTube. Previous studies demonstrated that YouTube videos can be classified into appropriate or inappropriate for young viewers using video metadata, such as video thumbnails, title, comments, etc. Metadata-based approaches achieve high accuracy, but still have significant misclassifications, due to the reliability of input features. In this paper, we propose a fusion model, called Samba, which uses both metadata and video subtitles for content classification. Using subtitles in the model helps better infer the true nature of a video improving classification accuracy. On a large-scale, comprehensive dataset of 70K videos, we show that Samba achieves 95% accuracy, outperforming other state-of-the-art classifiers by at least 7%. We also publicly release our dataset.
CITATION STYLE
Binh, L., Tandon, R., Oinar, C., Liu, J., Durairaj, U., Guo, J., … Mirkovic, J. (2022). Samba: Identifying Inappropriate Videos for Young Children on YouTube. In International Conference on Information and Knowledge Management, Proceedings (pp. 88–97). Association for Computing Machinery. https://doi.org/10.1145/3511808.3557442
Mendeley helps you to discover research relevant for your work.