Can Large Language Models Capture Dissenting Human Voices?

13 citations · 17 Mendeley readers

Abstract

Large language models (LLMs) have shown impressive achievements in solving a broad range of tasks. Augmented by instruction fine-tuning, LLMs have also been shown to generalize in zero-shot settings. However, whether LLMs closely align with the human disagreement distribution has not been well studied, especially within the scope of natural language inference (NLI). In this paper, we evaluate the performance of LLMs and the alignment of their label distributions with human judgments, using two different techniques to estimate the multinomial distribution: Monte Carlo Estimation (MCE) and Log Probability Estimation (LPE). As a result, we show that LLMs exhibit limited ability in solving NLI tasks and simultaneously fail to capture the human disagreement distribution. Inference and human-alignment performance drop even further on data samples with high human disagreement, raising concerns about the models' natural language understanding (NLU) ability and their representativeness of a larger human population.
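
For intuition, the sketch below shows one way the two estimators named in the abstract could be realized for a three-way NLI label set. It is a minimal illustration, not the paper's implementation: `sample_label` and `label_logprobs` are hypothetical stand-ins for model calls, and the paper's actual prompting, sampling budget, and normalization details may differ.

    from collections import Counter
    import math

    LABELS = ["entailment", "neutral", "contradiction"]

    def monte_carlo_estimate(sample_label, prompt, n=100):
        # MCE: query the model n times and use the empirical label
        # frequencies as the estimate of the multinomial distribution.
        # `sample_label(prompt) -> str` is a hypothetical model-call stub.
        counts = Counter(sample_label(prompt) for _ in range(n))
        return {lab: counts[lab] / n for lab in LABELS}

    def log_prob_estimate(label_logprobs, prompt):
        # LPE: read the model's log probability for each candidate label
        # token and renormalize over the label set (a softmax over the
        # label log-probs). `label_logprobs(prompt) -> dict[str, float]`
        # is a hypothetical stub.
        lps = label_logprobs(prompt)
        z = sum(math.exp(lps[lab]) for lab in LABELS)
        return {lab: math.exp(lps[lab]) / z for lab in LABELS}

Either estimate can then be compared against the human annotation distribution with a standard divergence measure (e.g., Jensen–Shannon distance) to quantify alignment.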

Citation (APA)

Lee, N., An, N. M., & Thorne, J. (2023). Can large language models capture dissenting human voices? In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (pp. 4569–4585). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.emnlp-main.278
