We propose a CNN-based food image segmentation which requires no pixel-wise annotation. The proposed method consists of food region proposals by selective search and bounding box clustering, back propagation based saliency map estimation with the CNN model fine-tuned with the UEC-FOOD100 dataset, GrabCut guided by the estimated saliency maps and region integration by non-maximum suppression. In the experiments, the proposed method outperformed RCNN regarding food region detection as well as the PASCAL VOC detection task.
Mendeley helps you to discover research relevant for your work.
CITATION STYLE
Shimoda, W., & Yanai, K. (2015). CNN-based food image segmentation without pixel-wise annotation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9281, pp. 449–457). Springer Verlag. https://doi.org/10.1007/978-3-319-23222-5_55