What question did this study set out to answer?

The aim is to understand why RNNs generate phonological patterns not commonly found in human languages.

March 8, 2026

(R)NNs too expressive?

Key Points

The aim is to understand why RNNs generate phonological patterns not commonly found in human languages.
Compared RNNs and CNNs on string recognition tasks.
Analyzed the expressivity of each model's architecture.
Investigated the relationship between model expressivity and performance.
RNNs are found to generate unattested phonological patterns.
Model expressivity does not predict the success in recognizing string classes.
CNNs perform better due to their position-invariant biases.

Abstract

Abstract This work investigates why recurrent neural networks (RNNs) tend to learn phonological patterns that are unattested or dispreferred by humans. Specifically, we explore the hypothesis that their over-generation is caused by their excess expressive capacity – they are beyond the limited complexity class that contains the set of attested phonological patterns. We compared these over-expressive RNNs against the weaker convolutional neural networks (CNNs) on a battery of string recognition tasks. We find that the expressivity of a model’s architecture does not predict the string classes that it excels at recognizing. Instead, we suggest that CNNs’ position-invariant biases better explain their successes in our experiment.

Bookmark

Cite This Study

Li et al. (Sat,) studied this question.

synapsesocial.com/papers/69ada90bbc08abd80d5bc650 https://doi.org/https://doi.org/10.1515/lingvan-2024-0218

Bookmark