Is GPT-3 a Good Data Annotator?

Abstract

Data annotation is the process of labeling data that could be used to train machine learning models. Having high-quality annotation is crucial, as it allows the model to learn the relationship between the input data and the desired output. GPT-3, a large-scale language model developed by OpenAI, has demonstrated impressive zero- and few-shot performance on a wide range of NLP tasks. It is therefore natural to wonder whether it can be used to effectively annotate data for NLP tasks. In this paper, we evaluate the performance of GPT-3 as a data annotator by comparing it with traditional data annotation methods and analyzing its output on a range of tasks. Through this analysis, we aim to provide insight into the potential of GPT-3 as a general-purpose data annotator in NLP.

Cite

Ding, B., Qin, C., Liu, L., Chia, Y. K., Li, B., Joty, S., & Bing, L. (2023). Is GPT-3 a Good Data Annotator? In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 11173–11195). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.626
