Mask R-CNN with New Data Augmentation Features for Smart Detection of Retail Products

22Citations
Citations of this article
48Readers
Mendeley users who have this article in their library.

Abstract

Human–computer interactions (HCIs) use computer technology to manage the interfaces between users and computers. Object detection systems that use convolutional neural networks (CNNs) have been repeatedly improved. Computer vision is also widely applied to multiple spe-cialties. However, self-checkouts operating with a faster region-based convolutional neural network (faster R-CNN) image detection system still feature overlapping and cannot distinguish between the color of objects, so detection is inhibited. This study uses a mask R-CNN with data augmentation (DA) and a discrete wavelet transform (DWT) in lieu of a faster R-CNN to prevent trivial details in images from hindering feature extraction and detection for deep learning (DL). The experiment results show that the proposed algorithm allows more accurate and efficient detection of overlapping and similarly colored objects than a faster R-CNN with ResNet 101, but allows excellent resolution and real-time processing for smart retail stores.

Cite

CITATION STYLE

APA

Hsia, C. H., Chang, T. H. W., Chiang, C. Y., & Chan, H. T. (2022). Mask R-CNN with New Data Augmentation Features for Smart Detection of Retail Products. Applied Sciences (Switzerland), 12(6). https://doi.org/10.3390/app12062902

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free