The multisource remote sensing classification task has two main challenges. 1) How to capture hyperspectral image (HSI) and light detection and ranging (LiDAR) features cooperatively to fully mine the complementary information between data. 2) How to adaptively fuse multisource features, which should not only overcome the imbalance between HSI and LiDAR data but also avoid the generation of redundant information. The local information interaction transformer (LIIT) model proposed herein can effectively address these above issues. Specifically, multibranch feature embedding is first performed to help in the fine-grained serialization of multisource features; subsequently, a local-based multisource feature interactor (L-MSFI) is designed to explore HSI and LiDAR features together. This structure provides an information transmission environment for multibranch features and further alleviates the homogenization processing mode of the self-Attention process. More importantly, a multisource feature selection module (MSTSM) is developed to dynamically fuse HSI and LiDAR features to solve the problem of insufficient fusion. Experiments were carried out on three multisource remote-sensing classification datasets, the results of which show that LIIT has more performance advantages than the state-of-The-Art CNN and transformer methods.
CITATION STYLE
Zhang, Y., Peng, Y., Tu, B., & Liu, Y. (2023). Local Information Interaction Transformer for Hyperspectral and LiDAR Data Classification. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 16, 1130–1143. https://doi.org/10.1109/JSTARS.2022.3232995
Mendeley helps you to discover research relevant for your work.