What question did this study set out to answer?

This study aims to assess the effectiveness of AI-driven translation systems compared to human translators, focusing on their workflows.

June 3, 2026Open Access

Workflow matters

Key Points

This study aims to assess the effectiveness of AI-driven translation systems compared to human translators, focusing on their workflows.
Compared two AI systems utilizing large language models for translation tasks.
One system mimicked traditional human translation methods; another was tailored to LLM strengths.
Evaluated translations through expert assessments and blind evaluations.
Both AI systems achieved accuracy comparable to professional human translations.
The LLM-driven system excelled in stylistic quality but occasionally included unnecessary content.
Translations from AI systems were often preferred for fluency in blind evaluations.

Abstract

Abstract Large language models (LLMs) have shown significant potential in translation tasks but often struggle with literary texts. This study compares professional human translations with translations produced by two AI-driven systems that coordinate multiple LLM-based agents. The first system mimics professional human translation practice, with distinct drafting and revision phases. The second redesigns the process specifically for LLMs’ capabilities, breaking translation into granular steps with specialized AI agents handling strategic planning, stylistic refinement, and coherence checking. Expert evaluations revealed that both AI systems achieved accuracy comparable to professional human translators. The LLM-capability-driven system produced translations with superior stylistic qualities and poetic language, though it occasionally added extraneous content. Meanwhile, the practice-derived system delivered concise translations but sometimes lacked cohesive flow. Blind evaluations showed that the translations from both AI systems were frequently preferred over human translations, particularly in terms of fluency. This study demonstrates that rethinking translation workflows around LLM capabilities can yield exceptional results, sometimes surpassing human performance in certain aspects.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Lulu Wang

Hong Kong Polytechnic University

Sanjun Sun

Beijing Foreign Studies University

Xing Wang

Tencent (China)

Journals

Target International Journal of Translation Studies

Actions

Institutions

Hong Kong Polytechnic University

Tencent (China)

Beijing Foreign Studies University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Workflow matters

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider