Attention Beam: An Image Captioning Approach (Student Abstract)

Abstract

The aim of image captioning is to generate a textual description of a given image. Though seemingly easy for humans, the task is challenging for machines because it requires the ability to comprehend the image (computer vision) and then generate a human-like description of it (natural language understanding). Recently, encoder-decoder architectures have achieved state-of-the-art results in image captioning. Here, we present a beam search heuristic applied on top of an encoder-decoder architecture that yields better-quality captions on three benchmark datasets: Flickr8k, Flickr30k and MS COCO.
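
For readers unfamiliar with the decoding step the abstract mentions, the following is a minimal sketch of plain beam search over a decoder's next-token probabilities. It is not the authors' heuristic, which the abstract does not describe. The names `beam_search`, `toy_step`, and `TOY`, and the bigram table standing in for a trained captioning decoder, are all assumptions made for illustration.

```python
import math
from typing import Callable, Dict, List, Tuple

Hypothesis = Tuple[List[str], float]  # (tokens so far, summed log-probability)


def beam_search(
    step_fn: Callable[[List[str]], Dict[str, float]],
    start_token: str,
    end_token: str,
    beam_width: int = 3,
    max_len: int = 10,
) -> Hypothesis:
    """Generic beam search; step_fn maps a token prefix to next-token probabilities."""
    beams: List[Hypothesis] = [([start_token], 0.0)]
    finished: List[Hypothesis] = []
    for _ in range(max_len):
        # Expand every live hypothesis by every possible next token.
        candidates: List[Hypothesis] = []
        for tokens, score in beams:
            for tok, p in step_fn(tokens).items():
                if p > 0.0:
                    candidates.append((tokens + [tok], score + math.log(p)))
        if not candidates:
            break
        # Keep only the beam_width highest-scoring expansions.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for tokens, score in candidates[:beam_width]:
            if tokens[-1] == end_token:
                finished.append((tokens, score))
            else:
                beams.append((tokens, score))
        if not beams:
            break
    # Prefer completed captions; fall back to the best unfinished one.
    pool = finished or beams
    return max(pool, key=lambda c: c[1])


# Hypothetical bigram "decoder" standing in for a trained captioning model.
TOY: Dict[str, Dict[str, float]] = {
    "<s>": {"a": 0.6, "the": 0.4},
    "a": {"dog": 0.55, "cat": 0.45},
    "the": {"dog": 0.9, "cat": 0.1},
    "dog": {"</s>": 1.0},
    "cat": {"</s>": 1.0},
}


def toy_step(tokens: List[str]) -> Dict[str, float]:
    return TOY[tokens[-1]]


greedy = beam_search(toy_step, "<s>", "</s>", beam_width=1)
wider = beam_search(toy_step, "<s>", "</s>", beam_width=2)
print(greedy[0], math.exp(greedy[1]))
print(wider[0], math.exp(wider[1]))
```

In this toy table, a beam width of 1 (greedy decoding) commits to "a" at the first step and ends with a lower-probability sequence than a beam width of 2, which keeps "the" alive long enough to find the more probable caption.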

Cite

Shrimal, A., & Chakraborty, T. (2021). Attention Beam: An Image Captioning Approach (Student Abstract). In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 (Vol. 18, pp. 15887–15888). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v35i18.17940
