Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction

Ali Jamali; Swalpa Kumar; Jonathan Li; Pedram Ghamisi

Journal Article

Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction

IEEE Geoscience and Remote Sensing Letters (2024) 21

DOI: 10.1109/LGRS.2024.3354560

14Citations

15Readers

Get full text

Abstract

In the domain of remote sensing image interpretation, road extraction from high-resolution aerial imagery has already been a hot research topic. Although deep CNNs have presented excellent results for semantic segmentation, the efficiency and capabilities of vision transformers (ViTs) are yet to be fully researched. As such, for accurate road extraction, a deep semantic segmentation neural network that utilizes the abilities of residual learning, heterogeneous convolutions (HetConvs), UNet, and ViTs, which is called ResUNetFormer, is proposed in this letter. The developed ResUNetFormer is evaluated on various cutting-edge deep learning-based road extraction techniques on the public Massachusetts road dataset. Statistical and visual results demonstrate the superiority of the ResUNetFormer over the state-of-the-art CNNs and ViTs for segmentation. The code will be made available publicly at https://github.com/aj1365/ResUNetFormer.

Author supplied keywords

Cite

CITATION STYLE

APA

Jamali, A., Kumar, S., Li, J., & Ghamisi, P. (2024). Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction. IEEE Geoscience and Remote Sensing Letters, 21. https://doi.org/10.1109/LGRS.2024.3354560

Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction

Abstract

Author supplied keywords

Cite

Register to see more suggestions