Semantic segmentation of urban buildings using a high-resolution network (Hrnet) with channel and spatial attention gates

79Citations
Citations of this article
33Readers
Mendeley users who have this article in their library.

Abstract

In this study, building extraction in aerial images was performed using csAG-HRNet by applying HRNet-v2 in combination with channel and spatial attention gates. HRNet-v2 consists of transition and fusion processes based on subnetworks according to various resolutions. The channel and spatial attention gates were applied in the network to efficiently learn important features. A channel attention gate assigns weights in accordance with the importance of each channel, and a spatial attention gate assigns weights in accordance with the importance of each pixel position for the entire channel. In csAG-HRNet, csAG modules consisting of a channel attention gate and a spatial attention gate were applied to each subnetwork of stage and fusion modules in the HRNet-v2 network. In experiments using two datasets, it was confirmed that csAG-HRNet could minimize false detections based on the shapes of large buildings and small nonbuilding objects compared to existing deep learning models.

Cite

CITATION STYLE

APA

Seong, S., & Choi, J. (2021). Semantic segmentation of urban buildings using a high-resolution network (Hrnet) with channel and spatial attention gates. Remote Sensing, 13(16). https://doi.org/10.3390/rs13163087

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free