The South Korean government operates human-based lyrics-rating systems to reduce adolescents’ exposure to inappropriate songs. In this study, we developed lyrics classification models for an automated lyrics-rating system for adolescents. There are two kinds of inappropriate lyrics for adolescents: (1) lyrics with inappropriate words and (2) lyrics with inappropriate content based on the semantic context. To tackle the first issue, we propose logCDα as a method for generating a lexicon of inappropriate words. It attained the highest performance among the lexicon-based filtering methods examined. Further, to deal with the second issue, we propose a hybrid classification model that combines logCDα with an RNN based model. The hybrid model composed of a ‘lexicon-checking model’ and a ‘context-checking model’ achieved the highest performance among all of the models examined, highlighting the effectiveness of combining the models to specifically target each of the two types of inappropriate lyrics.
CITATION STYLE
Kim, J., & Yi, M. Y. (2019). A hybrid modeling approach for an automated lyrics-rating system for adolescents. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11437 LNCS, pp. 779–786). Springer Verlag. https://doi.org/10.1007/978-3-030-15712-8_53
Mendeley helps you to discover research relevant for your work.