May 4, 2026 · Open Access

Scaling on-device spoken language understanding to new languages with large language models

Key Points

The study aims to extend on-device spoken language understanding (SLU) to new languages by using a large language model (LLM).
The authors developed a data-centric pipeline that machine-translates slot-annotated English training data into the target languages.
They fine-tuned the LLM to preserve slot annotations during translation, using an HTML tag-based slot marking strategy.
They evaluated the approach on the MultiATIS++ benchmark, a multilingual SLU dataset covering eight languages.
In the on-device setting, overall accuracy reached 62.18%, up from 55.11% for the best prior method, HC²L.
In an edge scenario with a tiny SLU model (5 MB, no pre-training), the translated data raised overall accuracy from 5.31% to 22.06%.
The LLM-based translation requires no changes to the SLU model architecture and is slot-type independent, so no manual slot descriptions are needed.
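
The key points report "overall accuracy" without defining it. A minimal sketch of how such a figure is commonly computed on ATIS-style benchmarks follows, assuming the usual convention that an utterance counts as correct only when the predicted intent and every slot label match the reference. That convention, the function name `overall_accuracy`, and the toy data are assumptions for illustration, not details taken from the paper.

```python
from typing import List, Sequence, Tuple

# One example = (intent label, per-token slot labels). Hypothetical data layout.
Example = Tuple[str, Sequence[str]]


def overall_accuracy(gold: List[Example], pred: List[Example]) -> float:
    """Fraction of utterances whose intent AND all slot labels are correct."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must contain the same number of utterances")
    if not gold:
        return 0.0
    correct = 0
    for (gold_intent, gold_slots), (pred_intent, pred_slots) in zip(gold, pred):
        # A single wrong slot label makes the whole utterance count as wrong.
        if gold_intent == pred_intent and list(gold_slots) == list(pred_slots):
            correct += 1
    return correct / len(gold)


# Toy data: the second utterance has one wrong slot label.
gold = [("flight", ["O", "B-city"]), ("airfare", ["O"]), ("flight", ["B-city"])]
pred = [("flight", ["O", "B-city"]), ("airfare", ["B-city"]), ("flight", ["B-city"])]
score = overall_accuracy(gold, pred)  # 2 of 3 utterances fully correct
```

Under this definition the metric is strict: a model can have high token-level slot accuracy and still score low overall, which is one plausible reason the reported numbers are well below 100%.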

Abstract

Spoken language understanding (SLU) models are a core component of voice assistants (e.g., Alexa, Bixby, Google Assistant), but collecting extensive labeled data for target languages is challenging. In this paper, we introduce a data-centric pipeline to expand on-device SLU to new languages by leveraging a large language model (LLM) for machine translation of slot-annotated English training data. The LLM is fine-tuned to preserve slot annotations during translation using an HTML tag-based slot marking strategy. Our approach is evaluated on the MultiATIS++ benchmark, a multilingual SLU dataset covering eight languages. In an on-device setting, we achieve a new state-of-the-art overall accuracy of 62.18%, up from 55.11% achieved by the best prior method, HC²L. In an edge scenario with a tiny SLU model (5 MB, no pre-training), our translated data boosts overall accuracy from a baseline of 5.31% to 22.06%. In contrast to these baselines, our LLM-based translation requires no changes to the SLU model architecture and is slot-type independent, requiring no manual slot descriptions. This work demonstrates that state-of-the-art LLMs can serve as effective "slot translators", providing a scalable path to multilingual SLU without costly SLU data collection or architecture overhaul.
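
The abstract mentions an HTML tag-based slot marking strategy but does not give its exact format. The sketch below shows one way such a round trip could work: slot spans in a BIO-labeled English utterance are wrapped in generic tags before translation, and labels are recovered from the tagged translation afterwards. The tag names (`s1`, `s2`, ...), the helpers `mark_slots` and `unmark_slots`, and the Spanish sentence standing in for LLM output are all hypothetical. Generic tag names are used so that the text shown to the translator carries no slot-type information, which is one plausible reading of "slot-type independent".

```python
import re
from typing import Dict, List, Tuple


def mark_slots(tokens: List[str], labels: List[str]) -> Tuple[str, Dict[str, str]]:
    """Wrap each BIO slot span in a generic tag and remember tag -> slot type."""
    parts: List[str] = []
    tag_to_slot: Dict[str, str] = {}
    i = 0
    while i < len(tokens):
        if labels[i].startswith("B-"):
            slot = labels[i][2:]
            j = i + 1
            # Extend the span over the following I- labels of the same slot.
            while j < len(tokens) and labels[j] == "I-" + slot:
                j += 1
            tag = f"s{len(tag_to_slot) + 1}"
            tag_to_slot[tag] = slot
            parts.append(f"<{tag}>{' '.join(tokens[i:j])}</{tag}>")
            i = j
        else:
            # "O" tokens (and stray I- labels) are emitted untagged.
            parts.append(tokens[i])
            i += 1
    return " ".join(parts), tag_to_slot


# Either a tagged span like <s1>...</s1> or a plain whitespace-delimited token.
_TOKEN_RE = re.compile(r"<(s\d+)>(.*?)</\1>|(\S+)")


def unmark_slots(tagged: str, tag_to_slot: Dict[str, str]) -> Tuple[List[str], List[str]]:
    """Recover tokens and BIO labels from a tagged (e.g. translated) sentence."""
    tokens: List[str] = []
    labels: List[str] = []
    for match in _TOKEN_RE.finditer(tagged):
        tag, span, plain = match.group(1), match.group(2), match.group(3)
        if tag is None:
            tokens.append(plain)
            labels.append("O")
            continue
        slot = tag_to_slot.get(tag)  # unknown tags fall back to "O"
        for k, word in enumerate(span.split()):
            tokens.append(word)
            labels.append("O" if slot is None else ("B-" if k == 0 else "I-") + slot)
    return tokens, labels


en_tokens = ["flights", "from", "boston", "to", "new", "york"]
en_labels = ["O", "O", "B-fromloc.city_name", "O",
             "B-toloc.city_name", "I-toloc.city_name"]
tagged_en, mapping = mark_slots(en_tokens, en_labels)

# Hypothetical translator output; a real pipeline would call the fine-tuned LLM here.
tagged_es = "vuelos de <s1>boston</s1> a <s2>nueva york</s2>"
es_tokens, es_labels = unmark_slots(tagged_es, mapping)
```

Because the labels are read back from the tags rather than aligned word by word, the translation is free to reorder or expand slot values, as long as the translator keeps each tag pair intact.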

Authors

Jakub Hościłowicz

Pawel Pawlowski

Marcin Skorupa

Journals

Poznań Studies in Contemporary Linguistics

Institutions

Warsaw University of Technology

Samsung (Poland)
