Generating descriptions for sequential images with local-object attention and global semantic context modelling

ArXiv: 2012.01295
2Citations
Citations of this article
67Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we propose an end-to-end CNN-LSTM model for generating descriptions for sequential images with a local-object attention mechanism. To generate coherent descriptions, we capture global semantic context using a multilayer perceptron, which learns the dependencies between sequential images. A paralleled LSTM network is exploited for decoding the sequence descriptions. Experimental results show that our model outperforms the baseline across three different evaluation metrics on the datasets published by Microsoft.

References Powered by Scopus

Long Short-Term Memory

76931Citations
N/AReaders
Get full text

Show and tell: A neural image caption generator

4682Citations
N/AReaders
Get full text

Deep visual-semantic alignments for generating image descriptions

3803Citations
N/AReaders
Get full text

Cited by Powered by Scopus

BERT-hLSTMs: BERT and hierarchical LSTMs for visual storytelling

27Citations
N/AReaders
Get full text

Analysis decision-making system for aquaculture water quality based on deep learning

9Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Su, J., Lin, C., Zhou, M., Dai, Q., & Lv, H. (2018). Generating descriptions for sequential images with local-object attention and global semantic context modelling. In 2IS and NLG 2018 - Workshop on Intelligent Interactive Systems and Language Generation, Proceedings of the Workshop (pp. 3–8). Association for Computational Linguistics (ACL).

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 19

68%

Researcher 6

21%

Lecturer / Post doc 2

7%

Professor / Associate Prof. 1

4%

Readers' Discipline

Tooltip

Computer Science 25

76%

Linguistics 5

15%

Engineering 2

6%

Neuroscience 1

3%

Save time finding and organizing research with Mendeley

Sign up for free