April 2, 2024Open Access

On the Multilingual Ability of Decoder-based Pre-trained Language Models: Finding and Controlling Language-Specific Neurons

Key Points

Key points are not available for this paper at this time.

Abstract

Current decoder-based pre-trained language models (PLMs) successfully demonstrate multilingual capabilities. However, it is unclear how these models handle multilingualism. We analyze the neuron-level internal behavior of multilingual decoder-based PLMs, Specifically examining the existence of neurons that fire ``uniquely for each language'' within decoder-only multilingual PLMs. We analyze six languages: English, German, French, Spanish, Chinese, and Japanese, and show that language-specific neurons are unique, with a slight overlap (< 5%) between languages. These neurons are mainly distributed in the models' first and last few layers. This trend remains consistent across languages and models. Additionally, we tamper with less than 1% of the total neurons in each model during inference and demonstrate that tampering with a few language-specific neurons drastically changes the probability of target language occurrence in text generation.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Kojima et al. (Tue,) studied this question.

www.synapsesocial.com/papers/68e70d86b6db643587686898 — DOI: https://doi.org/10.48550/arxiv.2404.02431

Authors

Takeshi Kojima

Itsuki Okimura

Yusuke Iwasawa

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

On the Multilingual Ability of Decoder-based Pre-trained Language Models: Finding and Controlling Language-Specific Neurons

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion