This paper presents a Chinese-English neural machine-translation model that hybridizes self-attention with a lightweight gating module to better capture the distinct morphological and syntactic characteristics of the two languages. Conventional transformers treat source and target sequences homogeneously, ignoring the fact that Chinese relies on analytical structure while English is more morphologically marked. We therefore split the encoder into two parallel branches: a multi-head self-attention stack that learns long-range dependencies and a convolution-guided gate that dynamically emphasizes character-level features such as boundary and sub-word information. The two branches interact through a learnable fusion gate whose parameters are updated by the overall translation loss, allowing the network to softly switch between global and local views at each layer. A mirrored gating sub-layer is further inserted into the decoder to prevent the model from over-attending to either language-specific signals. Experiments on multiple Chinese-English corpora show that the proposed architecture substantially improves translation adequacy and fluency without extra parameters or latency, confirming the effectiveness of integrating language-aware inductive bias into mainstream attention-based frameworks.
Building similarity graph...
Analyzing shared references across papers
Loading...
Qian Chen
SHILAP Revista de lepidopterología
Building similarity graph...
Analyzing shared references across papers
Loading...
Qian Chen (Sun,) studied this question.
www.synapsesocial.com/papers/698c1c53267fb587c655ea88 — DOI: https://doi.org/10.6180/jase.202607_30.035