Abstract
Convolutional neural networks (CNN) have proven to be highly effective in large-scale object detection and image classification, as well as in serving as feature extractors for content-based image retrieval. While CNN models are typically trained with category label supervision and softmax loss for product image retrieval, we propose a different approach for feature extraction using the squared-hinge loss, an alternative multiclass classification loss function. First, transfer learning is performed on a pre-trained model, followed by fine-tuning the model. Then, image features are extracted based on the fine-tuned model and indexed using the nearest-neighbor indexing technique. Experiments are conducted on VGG19, InceptionV3, MobileNetV2, and ResNet18 CNN models. The model training results indicate that training the models with squared-hinge loss reduces the loss values in each epoch and reaches stability in less epoch than softmax loss. Retrieval results show that using features from squared-hinge trained models improves the retrieval accuracy by up to 3.7% compared to features from softmax-trained models. Moreover, the squared-hinge trained MobileNetV2 features outperformed others, while the ResNet18 feature gives the advantage of having the lowest dimensionality with competitive accuracy.
Author supplied keywords
Cite
CITATION STYLE
Rahman, A., Winarko, E., & Mustofa, K. (2023). Content-based product image retrieval using squared-hinge loss trained convolutional neural networks. International Journal of Electrical and Computer Engineering, 13(5), 5804–5812. https://doi.org/10.11591/ijece.v13i5.pp5804-5812
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.