Multi-view unsupervised user feature embedding for social media-based substance use prediction

29Citations
Citations of this article
106Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we demonstrate how the state-of-the-art machine learning and text mining techniques can be used to build effective social media-based substance use detection systems. Since a substance use ground truth is difficult to obtain on a large scale, to maximize system performance, we explore different unsupervised feature learning methods to take advantage of a large amount of unsupervised social media data. We also demonstrate the benefit of using multi-view unsupervised feature learning to combine heterogeneous user information such as Facebook “likes” and “status updates” to enhance system performance. Based on our evaluation, our best models achieved 86% AUC for predicting tobacco use, 81% for alcohol use and 84% for illicit drug use, all of which significantly outperformed existing methods. Our investigation has also uncovered interesting relations between a user’s social media behavior (e.g., word usage) and substance use.

Cite

CITATION STYLE

APA

Ding, T., Bickel, W. K., & Pan, S. (2017). Multi-view unsupervised user feature embedding for social media-based substance use prediction. In EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 2275–2284). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d17-1241

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free