Modern multimodal learning often requires handling heterogeneous data types whose structures and information densities differ substantially. To address this challenge in the context of metabolic dysfunction-associated fatty liver disease (MAFLD) prediction, we propose an information-density-aware multimodal framework (NID-Net). Instead of relying on simple concatenation or shallow fusion, the model processes each modality using methods that align with its structural characteristics. Structured indicators with high information density are first processed by an XGBoost module optimized via Lagrange remainder correction, which enhances the nonlinearity of the loss landscape and improves robustness to data sparsity and imbalance. Meanwhile, tongue images with relatively low information density are encoded using a region-enhanced Swin Transformer, where adaptive regional biases guide the model toward informative local representations. The resulting modality-specific embeddings are fused within a Mixture-of-Experts (MoE) architecture, enabling selective specialization and nonlinear decision boundaries across modalities. Extensive experiments on real-world medical datasets demonstrate that NID-Net not only surpasses existing multimodal fusion approaches in predictive performance but also provides interpretable insights into cross-modal feature interactions. This work highlights the fundamental role of nonlinear design in achieving efficient, balanced, and explainable multimodal prediction systems.
Building similarity graph...
Analyzing shared references across papers
Loading...
Xiaohua Hu
Xiang Zhu
Jia Shi
Chaos Solitons & Fractals
Second Military Medical University
Nanjing Forestry University
Changhai Hospital
Building similarity graph...
Analyzing shared references across papers
Loading...
Hu et al. (Sat,) studied this question.
synapsesocial.com/papers/69a75ed6c6e9836116a29cf2 — DOI: https://doi.org/10.1016/j.chaos.2026.117978
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: