A New Semantic Segmentation Model for Supplementing More Spatial Information

8Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Semantic segmentation, aiming to assign semantic labels to each pixel, is broadly applied into many fields, such as video surveillance, medical image analysis, and autonomous driving. However, there are two challenges in semantic segmentation task: 1) the deficiency of rich contextual information; and 2) the lack of sufficient spatial information, all of which affect segmentation performance seriously. To solve these two challenges, the global feature capturing module (GFCM) and Conv Block are proposed in this paper to build a new model to improve segmentation performance. Specifically, GFCM, made of the global encoding module (GEM) and spatial attention module (SAM), is designed to extract adequate global contextual information and build global spatial dependencies. Composed of three convolution layers, Conv Block is proposed to preserve rich spatial information. Based on GFCM and Conv Block, a new model is designed, where a data-dependent upsampling operator (DUpsampling) is exploited to recover the pixel-wise prediction effectively. The extensive experiments have been made to prove the effectiveness of the design, and the new model achieves 73.69% mIoU on Cityscapes test set and 80.05% mIoU on PASCAL VOC 2012 test set without any post-processing.

Cite

CITATION STYLE

APA

Han, H. H., & Fan, L. (2019). A New Semantic Segmentation Model for Supplementing More Spatial Information. IEEE Access, 7, 86979–86988. https://doi.org/10.1109/ACCESS.2019.2915088

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free