What question did this study set out to answer?

The aim is to enhance remote sensing scene classification by integrating lightweight deep learning methods with multi-scale features.

February 26, 2026Open Access

Towards Lightweight and Multi-Scale Scene Classification: A Lie Group-Guided Deep Learning Network with Collaborative Attention

Key Points

The aim is to enhance remote sensing scene classification by integrating lightweight deep learning methods with multi-scale features.
Proposed LGLMNet combining Lie Group covariance features for shallow and deep feature extraction.
Utilized a dual-branch architecture for efficient processing of high-level semantics and low-level details.
Implemented a parallel depthwise separable convolution block for multi-scale perception and used collaborative attention for enhanced modeling.
Developed a cross-layer feature fusion block to merge outputs from both branches.
LGLMNet achieved accuracy improvements of 2.14%, 2.32%, and 1.12% on UCM-21, AID, and NWPU-45 datasets, respectively.
Maintained a lightweight model with only 2.6 million parameters, ensuring efficiency.

Abstract

Remote sensing scene classification (RSSC) plays a crucial role in Earth observation. Current deep learning methods, while accurate, tend to focus on high-level semantic features and overlook complementary shallow details such as edges and textures. Moreover, conventional CNNs are limited by fixed receptive fields, whereas transformers incur high computational costs. To address these limitations, we propose the Lie Group lightweight multi-scale network (LGLMNet), a lightweight multi-scale network that integrates Lie Group covariance features. It employs a dual-branch architecture combining Lie Group machine learning (LGML) for shallow feature extraction and a deep learning branch for high-level semantics. In the deep branch, we design a parallel depthwise separable convolution block (PDSCB) for multi-scale perception and a spatial-channel collaborative attention mechanism (SCCA) for efficient global–local modeling. A cross-layer feature fusion block (CLFFB) effectively merges the two branches. Compared with state-of-the-art methods, the proposed LGLMNet achieves accuracy improvements of 2.14%, 2.32%, and 1.12% on UCM-21, AID, and NWPU-45 datasets, respectively, while maintaining a lightweight structure with only 2.6 M parameters.

Mark Helpful

Bookmark

Relay

View Full Paper