Validating machine learning algorithms for twitter data against established measures of suicidality

Scott R. Braithwaite; Christophe Giraud-Carrier; Josh West; Michael D. Barnes; Carl Lee Hanson

Journal ArticleOPEN ACCESS

Validating machine learning algorithms for twitter data against established measures of suicidality

JMIR Mental Health (2016) 3(2)

DOI: 10.2196/mental.4822

160Citations

203Readers

Get full text

Abstract

Background: One of the leading causes of death in the United States (US) is suicide and new methods of assessment are needed to track its risk in real time. Objective: Our objective is to validate the use of machine learning algorithms for Twitter data against empirically validated measures of suicidality in the US population. Methods: Using a machine learning algorithm, the Twitter feeds of 135 Mechanical Turk (MTurk) participants were compared with validated, self-report measures of suicide risk. Results: Our findings show that people who are at high suicidal risk can be easily differentiated from those who are not by machine learning algorithms, which accurately identify the clinically significant suicidal rate in 92% of cases (sensitivity: 53%, specificity: 97%, positive predictive value: 75%, negative predictive value: 93%). Conclusions: Machine learning algorithms are efficient in differentiating people who are at a suicidal risk from those who are not. Evidence for suicidality can be measured in nonclinical populations using social media data.

Author supplied keywords

Cite

CITATION STYLE

APA

Braithwaite, S. R., Giraud-Carrier, C., West, J., Barnes, M. D., & Hanson, C. L. (2016). Validating machine learning algorithms for twitter data against established measures of suicidality. JMIR Mental Health, 3(2). https://doi.org/10.2196/mental.4822

Validating machine learning algorithms for twitter data against established measures of suicidality

Abstract

Author supplied keywords

Cite

Register to see more suggestions