Fine-tuning BERT to Classify COVID19 Tweets Containing Symptoms

1Citations
Citations of this article
43Readers
Mendeley users who have this article in their library.

Abstract

Twitter provides a source of patient-generated data that has been used in various population health studies. The first step in many of these studies is to identify and capture Twitter messages (tweets) containing medication mentions. Identifying personal mentions of COVID19 symptoms requires distinguishing personal mentions from other mentions such as symptoms reported by others and references to news articles or other sources. In this article, we describe our submission to Task 6 of the Social Media Mining for Health Applications (SMM4H) Shared Task 2021. This task challenged participants to classify tweets where the target classes are - (1) self-reports, (2) non-personal reports, and (3) literature/news mentions. Our system uses a handcrafted preprocessing and word embeddings from BERT encoder model. We achieve. F1 score of 93%.

Cite

CITATION STYLE

APA

Roychoudhury, R., & Naskar, S. K. (2021). Fine-tuning BERT to Classify COVID19 Tweets Containing Symptoms. In Social Media Mining for Health, SMM4H 2021 - Proceedings of the 6th Workshop and Shared Tasks (pp. 138–140). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.smm4h-1.30

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free