Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques

undefined; undefined; undefined; undefined; undefined; Kota Akshith Reddy; Satish C J; Jahnavi Polsani; Teja Naveen Chintapalli; Gangapatnam Sai Ananya

Journal ArticleOPEN ACCESS

Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques

et al.

International Journal of Recent Technology and Engineering (IJRTE) (2021) 10(3) 131-139

DOI: 10.35940/ijrte.c6439.0910321

N/ACitations

2Readers

Abstract

Automatic Image Caption Generation is one of the core problems in the field of Deep Learning. Data Augmentation is a technique which helps in increasing the amount of data at hand and this is done by augmenting the training data using various techniques like flipping, rotating, Zooming, Brightening, etc. In this work, we create an Image Captioning model and check its robustness on all the major types of Image Augmentation techniques. The results show the fuzziness of the model while working with the same image but a different augmentation technique and because of this, a different caption is produced every time a different data augmentation technique is employed. We also show the change in the performance of the model after applying these augmentation techniques. Flickr8k dataset is used for this study along with BLEU score as the evaluation metric for the image captioning model.

Cite

CITATION STYLE

APA

Reddy, K. A. … Ananya, G. S. (2021). Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques. International Journal of Recent Technology and Engineering (IJRTE), 10(3), 131–139. https://doi.org/10.35940/ijrte.c6439.0910321

Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques

Abstract

Cite

Register to see more suggestions