Abstract
In this study, building extraction in aerial images was performed using csAG-HRNet by applying HRNet-v2 in combination with channel and spatial attention gates. HRNet-v2 consists of transition and fusion processes based on subnetworks according to various resolutions. The channel and spatial attention gates were applied in the network to efficiently learn important features. A channel attention gate assigns weights in accordance with the importance of each channel, and a spatial attention gate assigns weights in accordance with the importance of each pixel position for the entire channel. In csAG-HRNet, csAG modules consisting of a channel attention gate and a spatial attention gate were applied to each subnetwork of stage and fusion modules in the HRNet-v2 network. In experiments using two datasets, it was confirmed that csAG-HRNet could minimize false detections based on the shapes of large buildings and small nonbuilding objects compared to existing deep learning models.
Author supplied keywords
Cite
CITATION STYLE
Seong, S., & Choi, J. (2021). Semantic segmentation of urban buildings using a high-resolution network (Hrnet) with channel and spatial attention gates. Remote Sensing, 13(16). https://doi.org/10.3390/rs13163087
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.