What question did this study set out to answer?

The aim is to investigate how AI translation tools affect the identification of genetic conditions across multiple languages.

May 8, 2026

Disparate language and model effects on AI-based translation and recognition of genetic conditions

Key Points

The aim is to investigate how AI translation tools affect the identification of genetic conditions across multiple languages.
Translated descriptions of 40 genetic conditions using Neural Machine Translation and Large Language Models into 191 and 93 languages respectively.
Assessed the identification accuracy of conditions using 3 proprietary and 3 open-weight language models.
Analyzed the influence of factors like literature prevalence, language attributes, and translation methods on model accuracy.
Significant variances in condition identification were observed based on translation method, language, and AI model used.
Some models' accuracy was notably influenced by factors such as condition prevalence in literature and language-specific attributes.
Adaptive translation did not enhance translation or diagnostic accuracy, though smaller language models showed some improvement.

Abstract

Abstract Introduction Artificial intelligence (AI) is increasingly prevalent. Patients and clinicians may use AI-based tools in many different languages. Objective To investigate AI translation tools for descriptions of genetic conditions and how AI identification of genetic conditions is affected by translations. Materials and Methods We used Neural machine translation (NMT) and large language-model (LLM) translation to translate descriptions of 40 genetic conditions into 191 and 93 languages, respectively. Excluding translations retaining English medical terms verbatim, we respectively focused on 139 and 70 languages. After assessing translations, we assessed the ability of 3 proprietary and 3 open-weight general LLMs to identify conditions in the translations. We analyzed how accuracy was affected by the conditions’ prevalence in the literature, and attributes of the languages (the script, language family, and prevalence of the language in training sources). We also investigated adaptive translation for select languages. Results We found significant differences in condition identification based on the translation method, condition, language, and prediction model. The accuracy of some models was more affected than others by factors like the conditions’ literature prevalence, language script, family, and language prevalence. Adaptive translation for select languages did not improve translations or diagnostic accuracy with the 3 tested LLMs. However, further analysis with 1 language showed that this approach was more effective with smaller LLMs. Conclusions AI-based translation has variable performance, which can affect the ability of AI models to recognize genetic conditions. These findings should inform safe medical AI use to support consistent performance in different languages.

Bookmark

Cite This Study

Duong et al. (Wed,) studied this question.

synapsesocial.com/papers/69fd7f65bfa21ec5bbf07ea0 https://doi.org/https://doi.org/10.1093/jamia/ocag067

Bookmark