Automatic caption generation for visual content has recently emerged as a challenging research field due to its broad impact on areas such as computer vision, information retrieval, autonomous vehicles, and natural language processing. Traditional models mainly focus on a single aspect of the visual features when generating descriptions. The proposed model combines spatial information about salient objects, capturing detailed characteristics, with a scene category that encodes the general image setting. These extracted features are processed by a topic-aware, attention-based language model to generate human-like captions. The performance of the proposed model is compared with state-of-the-art research through evaluation on benchmark image captioning datasets. The experimental results show that the proposed model performs competitively against captioning models reported in recent literature.
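The abstract describes attending over salient-object features under the guidance of a topic signal. The sketch below is a minimal NumPy illustration of one plausible form such topic-guided attention could take; the weight matrices, shapes, and fusion scheme are illustrative assumptions, not the paper's actual parameterization.

```python
import numpy as np

def softmax(x):
    # numerically stable softmax over a 1-D score vector
    e = np.exp(x - x.max())
    return e / e.sum()

def topic_guided_attention(region_feats, topic_vec, hidden, W_r, W_t, W_h):
    """Attend over salient-object region features, biased by a topic vector.

    region_feats: (k, d) features of k detected salient regions
    topic_vec:    (t,)   topic embedding of the image (assumed given)
    hidden:       (h,)   decoder hidden state at the current time step
    W_r, W_t, W_h: hypothetical projection matrices (illustrative only)
    """
    # query combines the decoder state and the topic signal
    query = W_h @ hidden + W_t @ topic_vec          # (d,)
    scores = region_feats @ (W_r @ query)           # (k,) one score per region
    alpha = softmax(scores)                         # attention weights, sum to 1
    context = alpha @ region_feats                  # (d,) attended spatial context
    return context, alpha

# Tiny shape-check demo with random values.
rng = np.random.default_rng(0)
k, d, h, t = 5, 8, 6, 4
context, alpha = topic_guided_attention(
    rng.normal(size=(k, d)),   # region features
    rng.normal(size=t),        # topic vector
    rng.normal(size=h),        # decoder hidden state
    rng.normal(size=(d, d)),   # W_r
    rng.normal(size=(d, t)),   # W_t
    rng.normal(size=(d, h)),   # W_h
)
```

In a full captioning decoder, the attended context would typically be concatenated with a scene-category embedding and the previous word embedding before predicting the next token; that fusion step is omitted here.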
CITATION STYLE
Zia, U., Riaz, M. M., & Ghafoor, A. (2022). Topic Guided Image Captioning with Scene and Spatial Features. In Lecture Notes in Networks and Systems (Vol. 450 LNNS, pp. 180–191). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-99587-4_16