Abstract Large language models (LLMs) have shown significant potential in translation tasks but often struggle with literary texts. This study compares professional human translations with translations produced by two AI-driven systems that coordinate multiple LLM-based agents. The first system mimics professional human translation practice, with distinct drafting and revision phases. The second redesigns the process specifically for LLMs’ capabilities, breaking translation into granular steps with specialized AI agents handling strategic planning, stylistic refinement, and coherence checking. Expert evaluations revealed that both AI systems achieved accuracy comparable to professional human translators. The LLM-capability-driven system produced translations with superior stylistic qualities and poetic language, though it occasionally added extraneous content. Meanwhile, the practice-derived system delivered concise translations but sometimes lacked cohesive flow. Blind evaluations showed that the translations from both AI systems were frequently preferred over human translations, particularly in terms of fluency. This study demonstrates that rethinking translation workflows around LLM capabilities can yield exceptional results, sometimes surpassing human performance in certain aspects.
Building similarity graph...
Analyzing shared references across papers
Loading...
Lulu Wang
Hong Kong Polytechnic University
Sanjun Sun
Beijing Foreign Studies University
Xing Wang
Tencent (China)
Target International Journal of Translation Studies
Hong Kong Polytechnic University
Tencent (China)
Beijing Foreign Studies University
Building similarity graph...
Analyzing shared references across papers
Loading...
Wang et al. (Mon,) studied this question.
synapsesocial.com/papers/6a1fc58bdee9eb8c0dce6f0c — DOI: https://doi.org/10.1075/target.25081.wan
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: