Amid globalisation and growing cross-language information needs, machine translation is crucial for overcoming language barriers.Deep learning has advanced it, but transformer faces limitations: insufficient efficiency in capturing long-range dependencies and poor performance in low-resource translation.To address these, this study proposes three core solutions: 1) a hybrid LSTM-transformer architecture fusing LSTM's gating mechanism (long-sequence modelling) and transformer's self-attention (global context capture); 2) an adaptive gradient clipping (AGC) strategy for training stability; 3) dynamic weight sharing with adversarial domain adaptation to enhance cross-language transfer.Experiments on WMT14 English-German/French corpora show the model's BLEU value is 2.8 higher than benchmark Transformer, with 18% faster convergence; in English Romanian low-resource scenarios, the transfer mechanism boosts BLEU by 5.3.This study validates the hybrid architecture and optimisation strategies, offering new ideas for efficient gradient optimisation and low-resource translation models.
Building similarity graph...
Analyzing shared references across papers
Loading...
Meizhen Zou
International Journal of Information and Communication Technology
ZheJiang Academy of Agricultural Sciences
Shaoxing University
Building similarity graph...
Analyzing shared references across papers
Loading...
Meizhen Zou (Thu,) studied this question.
www.synapsesocial.com/papers/69c772058bbfbc51511e228e — DOI: https://doi.org/10.1504/ijict.2026.152530