BERT Prescriptions to avoid unwanted headaches: A comparison of transformer architectures for adverse drug event detection

23Citations
Citations of this article
70Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Pretrained transformer-based models, such as BERT and its variants, have become a common choice to obtain state-of-the-art performances in NLP tasks. In the identification of Adverse Drug Events (ADE) from social media texts, for example, BERT architectures rank first in the leaderboard. However, a systematic comparison between these models has not yet been done. In this paper, we aim at shedding light on the differences between their performance analyzing the results of 12 models, tested on two standard benchmarks. SpanBERT and PubMedBERT emerged as the best models in our evaluation: this result clearly shows that span-based pretraining gives a decisive advantage in the precise recognition of ADEs, and that in-domain language pretraining is particularly useful when the transformer model is trained just on biomedical text from scratch.

Cite

CITATION STYLE

APA

Portelli, B., Lenzi, E., Chersoni, E., Serra, G., & Santus, E. (2021). BERT Prescriptions to avoid unwanted headaches: A comparison of transformer architectures for adverse drug event detection. In EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference (pp. 1740–1747). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.eacl-main.149

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free