UoB at ProfNER 2021: Data Augmentation for Classification Using Machine Translation

2Citations
Citations of this article
49Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper describes the participation of the UoB-NLP team in the ProfNER-ST shared subtask 7a. The task was aimed at detecting the mention of professions in social media text. Our team experimented with two methods of improving the performance of pre-trained models: Specifically, we experimented with data augmentation through translation and the merging of multiple language inputs to meet the objective of the task. While the best performing model on the test data consisted of mBERT fine-tuned on augmented data using back-translation, the improvement is minor possibly because multi-lingual pre-trained models such as mBERT already have access to the kind of information provided through back-translation and bilingual data.

Cite

CITATION STYLE

APA

De Leon, F. L., Madabushi, H. T., & Lee, M. (2021). UoB at ProfNER 2021: Data Augmentation for Classification Using Machine Translation. In Social Media Mining for Health, SMM4H 2021 - Proceedings of the 6th Workshop and Shared Tasks (pp. 115–117). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.smm4h-1.23

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free