What question did this study set out to answer?

The aim is to enhance Japanese translation quality by optimizing user feedback utilization in machine translation systems.

June 1, 2026Open Access

A Reinforcement Learning-Based Interactive Machine Translation Feedback Optimization System

Puntos clave

The aim is to enhance Japanese translation quality by optimizing user feedback utilization in machine translation systems.
Developed an interactive interface to gather user evaluations and corrections for translations.
Implemented a reward function to quantify feedback consistency and updated model parameters using policy gradient methods.
Incorporated a reward resampling mechanism and dynamic weight adjustment strategy to manage training instability.
Reduced Translation Edit Rate (TER) to 39.6%.
Minimized average interaction turns to 1.21.
Decreased average user edit distance to 5.48.

Resumen

Despite the widespread application of Neural Machine Translation (NMT) technology, existing systems still face challenges such as insufficient utilization of user feedback and limited improvement in translation quality for Japanese translation tasks. This paper proposes a Reinforcement Learning (RL)-based interactive machine translation feedback optimization system aimed at continuously enhancing Japanese translation quality through user interaction. The system first introduces a human-computer interaction interface to collect user evaluations and correction suggestions for initial translations. A reward function quantifies the consistency between the translation model’s output and user feedback. Subsequently, a policy gradient method updates the NMT model parameters, enabling the model to efficiently absorb user feedback. Furthermore, the paper innovatively combines a reward resampling mechanism to mitigate training instability caused by sparse feedback and introduces a dynamic weight adjustment strategy to enhance the effectiveness of diverse user feedback. Experiments conducted on the Japanese Patent Corpus and the WAT 2023 dataset show the system reduces the Translation Edit Rate (TER) to 39.6%, lowers the average interaction turns to a minimum of 1.21, and decreases the average user edit distance to 5.48. The RL-based interactive feedback optimization provides a novel approach to improving Japanese machine translation quality.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo

Cite This Study

Cui et al. (Thu,) studied this question.

synapsesocial.com/papers/6a1d22bb02fbce9130638666 https://doi.org/https://doi.org/10.1016/j.procs.2026.03.271

Me gusta

Guardar

Ver artículo completo