In this article, we present a first approach on simulating the change of gender from Latin to Old Occitan. The reduction of genders in the transition to the Romance languages is one of the most important developments in the nominal system. In order to simulate this language change, we used varied data, taken from the most exhaustive Old Occitan dictionary (DOM), from an edition of a juridical text as well as from the transcriptions of two manuscripts taken from COMETA (Wiedner 2025). The nouns with information on their etymon and the gender in Latin and Old Occitan were then used as input for the simulation. Building on previous simulation-based approaches (Polinsky and Van Everbroeck 2003), our model provides a form-based, data-driven perspective on how gender marking might evolve under the interaction of morphological cues. Despite its inherent simplifications, the model achieved good predictive accuracy and yielded interpretable tendencies in the distribution of attention, suggesting that even at the level of orthographic form, systematic regularities relevant to gender can be captured computationally.
Wiedner et al. (Mon,) studied this question.