Neural language models have achieved remarkable progress in semantic representation learning. However, cross-domain representation learning still suffers from prominent semantic noise propagation issues. Existing methods still face challenges in cross-domain semantic modeling, including limited robustness across different semantic granularities, difficulty in separating transferable semantics from task-irrelevant semantic interference, and insufficient adaptability to specialized scenarios. These issues may reduce feature discriminability in fine-grained semantic tasks and complex application settings. To address these problems, we propose the Diffusion-Coded Attention Network (DCANet), a novel cross-domain representation learning architecture with three synergistic core modules: a multi-granular parallel diffusion masking mechanism for cross-scale context fusion via stochastic path activation, an implicit semantic encoder that distills domain-invariant patterns into adaptive bias codes via shared latent manifolds, and a self-correcting attention topology realizing dynamic semantic purification via closed-loop interactions between local features and global bias states. Extensive evaluations are conducted on nine well-recognized benchmark datasets to verify DCANet’s effectiveness and reliability. Experimental results show that DCANet attains state-of-the-art results on the majority of the benchmark datasets, with significant accuracy improvements on text classification and sentiment analysis tasks.
Han et al. (Thu,) studied this question.