The lexical gap: An improved measure of automated image description quality

1Citations
Citations of this article
67Readers
Mendeley users who have this article in their library.

Abstract

The challenge of automatically describing images and videos has stimulated much research in Computer Vision and Natural Language Processing. In order to test the semantic abilities of new algorithms, we need reliable and objective ways of measuring progress. Using our dataset of 2K human and machine descriptions, we find that standard evaluation measures alone do not adequately measure the semantic richness of a description. We introduce and test a new measure of semantic ability based on relative lexical diversity. We show how our measure can work alongside existing measures to achieve state of the art correlation with human judgement of quality.

Cite

CITATION STYLE

APA

Kershaw, A., & Bober, M. (2019). The lexical gap: An improved measure of automated image description quality. In IWCS 2019 - Proceedings of the 13th International Conference on Computational Semantics - Student Papers (pp. 15–23). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-0603

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free