Content-based product image retrieval using squared-hinge loss trained convolutional neural networks
Abstract
Convolutional neural networks (CNN) have proven to be highly effective in large-scale object detection and image classification, as well as in serving as feature extractors for content-based image retrieval. While CNN models are typically trained with category label supervision and softmax loss for product image retrieval, we propose a different approach for feature extraction using the squared-hinge loss, an alternative multiclass classification loss function. First, transfer learning is performed on a pre-trained model, followed by fine-tuning the model. Then, image features are extracted based on the fine-tuned model and indexed using the nearest-neighbor indexing technique. Experiments are conducted on VGG19, InceptionV3, MobileNetV2, and ResNet18 CNN models. The model training results indicate that training the models with squared-hinge loss reduces the loss values in each epoch and reaches stability in less epoch than softmax loss. Retrieval results show that using features from squared-hinge trained models improves the retrieval accuracy by up to 3.7% compared to features from softmax-trained models. Moreover, the squared-hinge trained MobileNetV2 features outperformed others, while the ResNet18 feature gives the advantage of having the lowest dimensionality with competitive accuracy.
Keywords
Full Text:
PDFDOI: http://doi.org/10.11591/ijece.v13i5.pp5804-5812
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).