Know what you don't know: Modeling a pragmatic speaker that refers to objects of unknown categories

Sina Zarrieß; David Schlangen

Conference Proceedings, Open Access

ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (2020) 654-659

DOI: 10.18653/v1/p19-1063

9 Citations

121 Readers

Abstract

Zero-shot learning in Language & Vision (L&V) is the task of correctly labelling (or naming) objects of novel categories. Another strand of work in L&V aims at pragmatically informative rather than “correct” object descriptions, e.g. in reference games. We combine these lines of research and model zero-shot reference games, where a speaker needs to successfully refer to a novel object in an image. Inspired by models of “rational speech acts”, we extend a neural generator to become a pragmatic speaker that reasons about uncertain object categories. As a result of this reasoning, the generator produces fewer nouns and fewer names of distractor categories than a literal speaker. We show that this conversational strategy for dealing with novel objects often improves communicative success, measured as the resolution accuracy of an automatic listener.
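
The contrast between a literal and a pragmatic speaker mentioned in the abstract can be illustrated with a toy "rational speech acts" computation. This is only a minimal sketch of the general idea, not the authors' neural generator: the two-object scene, the three-word vocabulary, the applicability scores, and the rationality parameter `alpha` are all invented for illustration.

```python
from typing import Dict, List

# Hypothetical word-to-object applicability scores in [0, 1].
# Index 0 is the target (uncertain category), index 1 is the distractor.
# All numbers are invented for this sketch.
APPLICABILITY: Dict[str, List[float]] = {
    "cat": [0.4, 0.9],     # uncertain name for the target, good name for the distractor
    "animal": [0.8, 0.8],  # fits both objects equally well
    "left": [0.9, 0.1],    # attribute that mostly fits the target
}


def normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale non-negative scores so that they sum to 1."""
    total = sum(scores.values())
    return {word: value / total for word, value in scores.items()}


def literal_listener(word: str) -> List[float]:
    """Distribution over objects given a word, proportional to applicability."""
    row = APPLICABILITY[word]
    total = sum(row)
    return [value / total for value in row]


def literal_speaker(obj: int) -> Dict[str, float]:
    """Distribution over words that only considers how well each word fits obj."""
    return normalize({word: row[obj] for word, row in APPLICABILITY.items()})


def pragmatic_speaker(obj: int, alpha: float = 2.0) -> Dict[str, float]:
    """Distribution over words that favors words a literal listener resolves to obj."""
    return normalize(
        {word: literal_listener(word)[obj] ** alpha for word in APPLICABILITY}
    )


TARGET = 0
s0 = literal_speaker(TARGET)
s1 = pragmatic_speaker(TARGET)
```

Under these assumed scores, the pragmatic speaker shifts probability away from "cat", which would point a listener to the distractor, and towards "left", which is the qualitative behavior the abstract describes.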

Citation (APA)

Zarrieß, S., & Schlangen, D. (2020). Know what you don’t know: Modeling a pragmatic speaker that refers to objects of unknown categories. In ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 654–659). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p19-1063

Readers' Seniority

PhD / Post grad / Masters / Doc: 38 (68%)
Researcher: 12 (21%)
Professor / Associate Prof.: 3
Lecturer / Post doc: 3

Readers' Discipline

Computer Science: 51 (80%)
Linguistics: 8 (13%)
Neuroscience: 3
Engineering: 2
