What question did this study set out to answer?

This study aims to evaluate the efficacy of Large Language Models in the Swahili language.

April 26, 2026Open Access

Employing Large Language Models in Swahili, a low-resource language

Key Points

This study aims to evaluate the efficacy of Large Language Models in the Swahili language.
Participants from Tanzanian Swahili-speaking communities assessed LLM outputs through usability tests.
Focus on the performance and accuracy of models in a low-resource language context.
Inaccuracies and unclear outputs of LLMs in Swahili reveal architectural limitations and data scarcity.
Participants noted frequent distortions in Swahili compared to French, underscoring the need for multilingual training.

Abstract

The article is potentially destined to examine the efficacy of Large Language Models (LLMs) in Swahili (standard dialect spoken in Tanzania), a relatively less privileged and Low-resource Language (LRL) that, to some extent, remains underrepresented in AI communication technologies. Despite the rapid growth in LLM use, Swahili users in Tanzania often encounter inaccuracies and unclear outputs, highlighting persistent challenges in model performance. The inaccuracies of LLMs in Swahili undoubtedly demonstrate the challenges associated with their use and effectiveness in such a language. In this study, participants from Tanzanian Swahili-speaking communities evaluated the models' outputs through usability tests. Findings reveal that apart from architectural limitations, data scarcity drives the ineffectiveness of the models. Frequent distortions, mostly in Swahili than in French confirms the need for broader multilingual inclusion in LLM training. The study highlights the imperative for inclusive AI development that empowers low-resource languages.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Dietram Efrem Mgeni - (Wed,) studied this question.

synapsesocial.com/papers/69edaafc4a46254e215b346c https://doi.org/https://doi.org/10.1016/j.nlp.2026.100209

Bookmark

View Full Paper