Current evaluation metrics for image description may be too coarse. We therefore propose a series of binary forced-choice tasks, each of which focuses on a different aspect of the captions. We evaluate a number of off-the-shelf image description systems. Our results indicate strengths and shortcomings of both generation-based and ranking-based approaches.
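As a rough illustration of how a binary forced-choice task can be scored, the sketch below assumes each test item pairs an image with one correct caption and one distractor, and that a system exposes a scoring function over (image, caption) pairs. The item layout, the names `forced_choice_accuracy` and `toy_score`, and the tie-handling rule are all assumptions made for this example, not details taken from the paper.

```python
from typing import Callable, Iterable, Tuple

# Hypothetical item layout: (image_id, correct_caption, distractor_caption).
Item = Tuple[str, str, str]


def forced_choice_accuracy(
    items: Iterable[Item],
    score: Callable[[str, str], float],
) -> float:
    """Fraction of items where the correct caption scores strictly higher.

    Ties are counted as errors here; that is an assumption of this sketch.
    """
    items = list(items)
    if not items:
        raise ValueError("at least one item is required")
    correct = sum(
        1
        for image_id, gold, distractor in items
        if score(image_id, gold) > score(image_id, distractor)
    )
    return correct / len(items)


# Toy stand-in for a real system: counts caption words that match
# hand-written keywords for each image (purely illustrative data).
toy_keywords = {"img1": {"dog", "grass"}, "img2": {"woman", "bike"}}


def toy_score(image_id: str, caption: str) -> float:
    return float(len(toy_keywords[image_id] & set(caption.split())))


toy_items = [
    ("img1", "a dog runs on grass", "a cat runs on grass"),
    ("img2", "a man rides a bike", "a woman rides a bike"),
]

accuracy = forced_choice_accuracy(toy_items, toy_score)
```

Because each item has exactly two candidates, chance performance under this setup is 50%, which makes the accuracy on each task easy to interpret on its own.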
Hodosh, M., & Hockenmaier, J. (2016). Focused evaluation for image description with binary forced-choice tasks. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 19–28). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-3203