Abstract
Advances in multimedia technology have made image and video data abundant and widely used, and indexing and retrieving such data requires efficient techniques. Automatic keyword generation for images has therefore become an active research topic. Conventional automatic annotation methods generally perform worse than deep learning methods, and deep learning models recast annotation as captioning. In this paper, we propose a new model, CSL Net (CSLN), which combines convoluted squeeze-and-excitation (SE) blocks with Bi-LSTM blocks to predict tags for images. The proposed model is evaluated on the benchmark datasets CIFAR10, Corel5K, ESPGame and IAPRTC12. It yields better results than existing methods in terms of precision, recall and accuracy.
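To make the precision and recall criteria concrete, the following is a minimal sketch of how tag predictions could be scored. It assumes one common convention for annotation benchmarks (per-label precision and recall averaged over all labels); the abstract does not state which averaging the authors use, and the function name and sample tags are hypothetical.

```python
def mean_label_precision_recall(predicted, actual):
    """Average per-label precision and recall over all labels.

    predicted, actual: lists of tag sets, one set per image, in the same order.
    Assumption: a label that is never predicted scores precision 0, and a
    label absent from the ground truth scores recall 0.
    """
    labels = set().union(*predicted, *actual)
    if not labels:
        return 0.0, 0.0

    precisions, recalls = [], []
    for label in labels:
        # Count images where the label was predicted, is correct, or both.
        n_pred = sum(1 for p in predicted if label in p)
        n_true = sum(1 for a in actual if label in a)
        n_hit = sum(1 for p, a in zip(predicted, actual)
                    if label in p and label in a)
        precisions.append(n_hit / n_pred if n_pred else 0.0)
        recalls.append(n_hit / n_true if n_true else 0.0)

    return sum(precisions) / len(labels), sum(recalls) / len(labels)


# Hypothetical predictions and ground-truth tags for two images.
predicted_tags = [{"sky", "sea"}, {"tiger"}]
true_tags = [{"sky"}, {"tiger", "grass"}]
print(mean_label_precision_recall(predicted_tags, true_tags))  # (0.5, 0.5)
```

Here "sky" and "tiger" are predicted correctly, "sea" is a false positive and "grass" is missed, so both averages come to 0.5.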
Cite
A, Vijayarani., & G., L. P. G. (2019). CSL Net: Convoluted SE and LSTM Blocks Based Network for Automatic Image Annotation. International Journal of Engineering and Advanced Technology, 9(2), 47–54. https://doi.org/10.35940/ijeat.b3276.129219