Saliency detection via combining region-level and pixel-level predictions with CNNS

62Citations
Citations of this article
55Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

This paper proposes a novel saliency detection method by combining region-level saliency estimation and pixel-level saliency prediction with CNNs (denoted as CRPSD). For pixel-level saliency prediction, a fully convolutional neural network (called pixel-level CNN) is constructed by modifying the VGGNet architecture to perform multiscale feature learning, based on which an image-to-image prediction is conducted to accomplish the pixel-level saliency detection. For regionlevel saliency estimation, an adaptive superpixel based region generation technique is first designed to partition an image into regions, based on which the region-level saliency is estimated by using a CNN model (called region-level CNN). The pixel-level and region-level saliencies are fused to form the final salient map by using another CNN (called fusion CNN). And the pixel-level CNN and fusion CNN are jointly learned. Extensive quantitative and qualitative experiments on four public benchmark datasets demonstrate that the proposed method greatly outperforms the state-of-the-art saliency detection approaches.

Cite

CITATION STYLE

APA

Tang, Y., & Wu, X. (2016). Saliency detection via combining region-level and pixel-level predictions with CNNS. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9912 LNCS, pp. 809–825). Springer Verlag. https://doi.org/10.1007/978-3-319-46484-8_49

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free