Object detection for automatic visual checkout in self-service vending machines is attracting significant attention in the retail industry. However, several critical challenges have not received enough attention. First, large-scale, high-quality retail image datasets are urgently demanded to train and evaluate the detection models. Second, the trained models should be able to cope with the frequently added new goods at low cost, while most cutting-edge models cannot. In this paper, we propose a new hierarchical large-scale object detection dataset, called Take Goods from Shelves (TGFS), containing 38K images of 24 fine-grained and 3 coarse classes. A preliminary method for solving the goods-adding problem, called Faster R-CNN Class-incremental Object Detector (FCIOD), is also described and evaluated. In addition, several popular methods are benchmarked on the TGFS dataset.
CITATION STYLE
Hao, Y., Fu, Y., & Jiang, Y. G. (2019). Take Goods from Shelves: A dataset for class-incremental object detection. In ICMR 2019 - Proceedings of the 2019 ACM International Conference on Multimedia Retrieval (pp. 271–278). Association for Computing Machinery, Inc. https://doi.org/10.1145/3323873.3325033
Mendeley helps you to discover research relevant for your work.