Journal article

Forecasting the onset and course of mental illness with Twitter data

Reece A, Reagan A, Lix K, Dodds P, Danforth C, Langer E

Abstract

We developed computational models to predict the emergence of depression and Post-Traumatic Stress Disorder in Twitter users. Twitter data and details of depression history were collected from 204 individuals (105 depressed, 99 healthy). We extracted predictive features measuring affect, linguistic style, and context from participant tweets (N = 279,951) and built models using these features with supervised learning algorithms. Resulting models successfully discriminated between depressed and healthy content, and compared favorably to general practitioners' average success rates in diagnosing depression. Results held even when the analysis was restricted to content posted before first depression diagnosis. State-space temporal analysis suggests that onset of depression may be detectable from Twitter data several months prior to diagnosis. Predictive results were replicated with a separate sample of individuals diagnosed with PTSD (N users = 174, N tweets = 243,775). A state-space time series model revealed indicators of PTSD almost immediately post-trauma, often many months prior to clinical diagnosis. These methods suggest a data-driven, predictive approach for early screening and detection of mental illness.

Authors

  • Andrew G. Reece

  • Andrew J. Reagan

  • Katharina L. M. Lix

  • Peter Sheridan Dodds

  • Christopher M. Danforth

  • Ellen J. Langer
