Image captioning aims at analyzing the content of an image and generating a textual description that verbally expresses its important aspects. Although automatic image description is not bound to the English language, recent advances have focused mostly on English descriptions. Collecting captions for images is an expensive process in both time and labor. In this paper, we introduce a novel active learning framework with a human in the loop for image captioning corpus creation, using a translated version of existing datasets. We implemented this framework to create a new dataset called ArabicFlickr1K, which contains 1,095 images, each associated with three to five descriptions. We also propose a neural network architecture, based on an encoder-decoder framework, that automatically generates Arabic captions for images. Our model scored 47% on BLEU-1.
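The abstract reports the model's performance as 47% BLEU-1. As a reference point, the sketch below shows how a sentence-level BLEU-1 score can be computed, assuming the standard definition (clipped unigram precision multiplied by a brevity penalty); function and variable names are illustrative, not taken from the paper.

```python
import math
from collections import Counter

def bleu_1(candidate, references):
    """Sentence-level BLEU-1: clipped unigram precision x brevity penalty.

    candidate  -- list of tokens produced by the captioning model
    references -- list of reference captions, each a list of tokens
    """
    if not candidate:
        return 0.0
    cand_counts = Counter(candidate)
    # For each token, the maximum count observed in any single reference.
    max_ref = Counter()
    for ref in references:
        for tok, n in Counter(ref).items():
            max_ref[tok] = max(max_ref[tok], n)
    # Clip each candidate token's count by its maximum reference count.
    clipped = sum(min(n, max_ref[tok]) for tok, n in cand_counts.items())
    precision = clipped / len(candidate)
    # Brevity penalty uses the reference length closest to the candidate's.
    c = len(candidate)
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * precision

# Example: a 5-token candidate against a 6-token reference.
score = bleu_1(["a", "cat", "on", "the", "mat"],
               [["a", "cat", "sat", "on", "the", "mat"]])
```

Here every candidate unigram appears in the reference, so precision is 1.0, and the brevity penalty exp(1 - 6/5) reduces the score to about 0.82.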
CITATION STYLE
Cheikh, M., & Zrigui, M. (2020). Active learning based framework for image captioning corpus creation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12096 LNCS, pp. 128–142). Springer. https://doi.org/10.1007/978-3-030-53552-0_14