What question did this study set out to answer?

The study aims to enhance the accuracy of land use and land cover classification by effectively fusing hyperspectral and LiDAR data.

March 22, 2026

MCIB: Multi-modal Complementary Information Bottleneck for Hyperspectral and LiDAR Classification

Key Points

The study aims to enhance the accuracy of land use and land cover classification by effectively fusing hyperspectral and LiDAR data.
Developed the multi-modal complementary information bottleneck framework.
Formalized the MCIB objective with structured priors for information-theoretic bounds.
Designed an end-to-end variational optimization strategy using supervised conditional InfoNCE.
Conducted extensive experiments on benchmark HSI-LiDAR datasets.
The MCIB framework achieved superior classification performance compared to existing methods.
Demonstrated effective reduction of data redundancy and enhanced cross-modal complementarity.
Provided a principled solution addressing theoretical gaps in multi-modal representation learning.

Abstract

The effective fusion of multi-modal remote sensing images, particularly hyperspectral imagery (HSI) and light detection and ranging (LiDAR) data, is pivotal for accurate land use and land cover (LULC) classification. However, this process is hindered by two inherent challenges: pervasive data redundancy and the underutilization of cross-modal complementarity, largely due to the lack of a unifying theoretical framework. To address these limitations, we propose the multi-modal complementary information bottleneck (MCIB) framework, which extends the IB principle to learn compact, sufficient, and complementary representations for multi-modal scenes. From a theoretical perspective, we formalize the MCIB objective and introduce structured priors to derive tractable information-theoretic bounds, providing a principled and computationally feasible approach to reduce redundancy and enhance complementarity simultaneously. Building on the obtained theoretical insights, we design an end-to-end variational optimization strategy with a novel supervised conditional InfoNCE (SCInfoNCE). Efficiently reusing existing model components, this new supervised contrastive method optimizes the conditional mutual information terms crucial for synergy. Extensive experiments on benchmark HSI-LiDAR datasets demonstrate superior classification performance of MCIB. This work not only fills a theoretical gap in multi-modal representation learning, but offers a robust and principled solution for LULC classification using complex heterogeneous remote sensing images.

Ask AI

Helpful

Bookmark