LiteST-Net: A Hybrid Model of Lite Swin Transformer and Convolution for Building Extraction from Remote Sensing Image

Wei Yuan; Xiaobo Zhang; Jibao Shi; Jin Wang

Journal ArticleOPEN ACCESS

LiteST-Net: A Hybrid Model of Lite Swin Transformer and Convolution for Building Extraction from Remote Sensing Image

Remote Sensing (2023) 15(8)

DOI: 10.3390/rs15081996

14Citations

11Readers

Abstract

Extracting building data from remote sensing images is an efficient way to obtain geographic information data, especially following the emergence of deep learning technology, which results in the automatic extraction of building data from remote sensing images becoming increasingly accurate. A CNN (convolution neural network) is a successful structure after a fully connected network. It has the characteristics of saving computation and translation invariance with improved local features, but it has difficulty obtaining global features. Transformers can compensate for the shortcomings of CNNs and more effectively obtain global features. However, the calculation number of transformers is excessive. To solve this problem, a Lite Swin transformer is proposed. The three matrices Q, K, and V of the transformer are simplified to only a V matrix, and the v of the pixel is then replaced by the v with the largest projection value on the pixel feature vector. In order to better integrate global features and local features, we propose the LiteST-Net model, in which the features extracted by the Lite Swin transformer and the CNN are added together and then sampled up step by step to fully utilize the global feature acquisition ability of the transformer and the local feature acquisition ability of the CNN. The comparison experiments on two open datasets are carried out using our proposed LiteST-Net and some classical image segmentation models. The results show that compared with other networks, all metrics of LiteST-Net are the best, and the predicted image is closer to the label.

Author supplied keywords

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Yuan, W., Zhang, X., Shi, J., & Wang, J. (2023). LiteST-Net: A Hybrid Model of Lite Swin Transformer and Convolution for Building Extraction from Remote Sensing Image. Remote Sensing, 15(8). https://doi.org/10.3390/rs15081996

Readers over time

Readers' Seniority

PhD / Post grad / Masters / Doc 3

100%

Readers' Discipline

Earth and Planetary Sciences 2

100%

Article Metrics

Mentions

Blog Mentions: 1

News Mentions: 1

View details >

LiteST-Net: A Hybrid Model of Lite Swin Transformer and Convolution for Building Extraction from Remote Sensing Image

Abstract

Author supplied keywords

References Powered by Scopus

U-net: Convolutional networks for biomedical image segmentation

Gradient-based learning applied to document recognition

Fully convolutional networks for semantic segmentation

Cited by Powered by Scopus

BEMRF-Net: Boundary Enhancement and Multiscale Refinement Fusion for Building Extraction From Remote Sensing Imagery

A Hybrid Algorithm with Swin Transformer and Convolution for Cloud Detection

Evaluation and Interpretation of Runoff Forecasting Models Based on Hybrid Deep Neural Networks

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline

Article Metrics