Abstract
Anorexia Nervosa (AN) is a serious mental disorder that has been proved to be traceable on social media through the analysis of users’ written posts. Here we present an approach to generate word embeddings enhanced for a classification task dedicated to the detection of Reddit users with AN. Our method extends Word2vec’s objective function in order to put closer domain-specific and semantically related words. The approach is evaluated through the calculation of an average similarity measure, and via the usage of the embeddings generated as features for the AN screening task. The results show that our method outperforms the usage of fine-tuned pre-learned word embeddings, related methods dedicated to generate domain adapted embeddings, as well as representations learned on the training set using Word2vec. This method can potentially be applied and evaluated on similar tasks that can be formalized as document categorization problems. Regarding our use case, we believe that this approach can contribute to the development of proper automated detection tools to alert and assist clinicians.
Author supplied keywords
Cite
CITATION STYLE
Ramírez-Cifuentes, D., Largeron, C., Tissier, J., Freire, A., & Baeza-Yates, R. (2020). Enhanced Word Embeddings for Anorexia Nervosa Detection on Social Media. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12080 LNCS, pp. 404–417). Springer. https://doi.org/10.1007/978-3-030-44584-3_32
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.