This work explores the detection of individuals' risk of type 2 diabetes mellitus (T2DM) directly from their social media (Twitter) activity. Our approach extends a deep learning architecture with several contributions: following previous observations that language use differs by gender, it captures and uses gender information through domain adaptation; it captures recency of posts under the hypothesis that more recent posts are more representative of an individual's current risk status; and, lastly, it demonstrates that in this scenario where activity factors are sparsely represented in the data, a bag-of-word neural network model using custom dictionaries of food and activity words performs better than other neural sequence models. Our best model, which incorporates all these contributions, achieves a risk-detection F1 of 41.9, considerably higher than the baseline rate (36.9).
CITATION STYLE
Bell, D., Laparra, E., Kousik, A., Ishihara, T., Surdeanu, M., & Kobourov, S. (2018). Detecting Diabetes Risk from Social Media Activity. In EMNLP 2018 - 9th International Workshop on Health Text Mining and Information Analysis, LOUHI 2018 - Proceedings of the Workshop (pp. 1–11). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-5601
Mendeley helps you to discover research relevant for your work.