The Turing test of online reviews: Can we tell the difference between human-written and GPT-4-written online reviews?

Balázs Kovács

Journal Article

The Turing test of online reviews: Can we tell the difference between human-written and GPT-4-written online reviews?

Kovács B

Marketing Letters (2024) 35(4) 651-666

DOI: 10.1007/s11002-024-09729-3

33Citations

116Readers

Get full text

Abstract

Online reviews serve as a guide for consumer choice. With advancements in large language models (LLMs) and generative AI, the fast and inexpensive creation of human-like text may threaten the feedback function of online reviews if neither readers nor platforms can differentiate between human-written and AI-generated content. In two experiments, we found that humans cannot recognize AI-written reviews. Even with monetary incentives for accuracy, both Type I and Type II errors were common: human reviews were often mistaken for AI-generated reviews, and even more frequently, AI-generated reviews were mistaken for human reviews. This held true across various ratings, emotional tones, review lengths, and participants’ genders, education levels, and AI expertise. Younger participants were somewhat better at distinguishing between human and AI reviews. An additional study revealed that current AI detectors were also fooled by AI-generated reviews. We discuss the implications of our findings on trust erosion, manipulation, regulation, consumer behavior, AI detection, market structure, innovation, and review platforms.

Author supplied keywords

Cite

CITATION STYLE

APA

Kovács, B. (2024). The Turing test of online reviews: Can we tell the difference between human-written and GPT-4-written online reviews? Marketing Letters, 35(4), 651–666. https://doi.org/10.1007/s11002-024-09729-3

The Turing test of online reviews: Can we tell the difference between human-written and GPT-4-written online reviews?

Abstract

Author supplied keywords

Cite

Register to see more suggestions