Compact cloud detection with bidirectional self-attention knowledge distillation

Yajie Chai; Kun Fu; Xian Sun; Wenhui Diao; Zhiyuan Yan; Yingchao Feng; Lei Wang

Journal Article, Open Access

Remote Sensing (2020) 12(17)

DOI: 10.3390/RS12172770

17 Citations

8 Readers

Abstract

Deep convolutional neural networks have made significant progress in cloud detection. However, balancing model compactness against accuracy remains a challenge in cloud detection for large-scale remote sensing imagery. A promising approach to this problem is knowledge distillation, which usually lets the compact model mimic the cumbersome model's output to generalize better. However, vanilla knowledge distillation methods cannot properly distill the characteristics of clouds in remote sensing images. In this paper, we propose a novel self-attention knowledge distillation approach for compact and accurate cloud detection, named Bidirectional Self-Attention Distillation (Bi-SAD). Bi-SAD lets a model learn from itself without additional parameters or supervision. With bidirectional layer-wise feature learning, the model obtains a better representation of the cloud's textural and semantic information, so that cloud boundaries become more detailed and predictions become more reliable. Experiments on a dataset acquired by the GaoFen-1 satellite show that Bi-SAD strikes a good balance between compactness and accuracy and outperforms vanilla distillation methods. Compared with state-of-the-art cloud detection models, the parameter size and FLOPs are reduced by factors of 100 and 400, respectively, with a small drop in accuracy.
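To make the vanilla baseline mentioned in the abstract concrete, the following is a minimal sketch of a soft-target distillation loss, in which a compact "student" mimics the output of a larger "teacher". It is not the Bi-SAD method. It assumes the common temperature-scaled formulation (KL divergence between softened outputs, scaled by the squared temperature). The function names, the temperature value, and the two-class per-pixel logits are illustrative assumptions, not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def distillation_loss(student_logits, teacher_logits, temperature=4.0):
    """Soft-target loss: T^2 * KL(teacher || student) at temperature T.

    The temperature default is an assumed, illustrative value.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    return (temperature ** 2) * kl_divergence(p_teacher, p_student)


# Hypothetical per-pixel logits for two classes (clear, cloud).
teacher = [0.5, 3.0]
student_far = [2.0, 0.1]    # disagrees with the teacher
student_near = [0.4, 2.8]   # nearly matches the teacher

print(distillation_loss(student_far, teacher))
print(distillation_loss(student_near, teacher))
```

The loss shrinks as the student's softened output approaches the teacher's and is zero when the two sets of logits coincide. According to the abstract, Bi-SAD differs in that the model learns from its own layer-wise features and needs no separate teacher output.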

Cite

Chai, Y., Fu, K., Sun, X., Diao, W., Yan, Z., Feng, Y., & Wang, L. (2020). Compact cloud detection with bidirectional self-attention knowledge distillation. Remote Sensing, 12(17). https://doi.org/10.3390/RS12172770
