What question did this study set out to answer?

This study asks which aspects of literary style LLM embeddings encode, and how much of their authorship attribution performance on poetry is already explained by traditional stylometric features.

May 7, 2026 · Open Access

Stylometry or Embeddings? Authorship Attribution for Russian and Italian Poetry

Key points

The study analyzes two poetry corpora: 5,800 Russian poems by 29 authors and 10,400 Italian poems by 52 authors, the latter spanning seven centuries.
A progressive residualization analysis (a "waterfall") removes feature sets from the embedding representations step by step.
Interpretable stylometric features and high-dimensional lexical controls, such as character n-grams and word bigrams, are subtracted in turn to quantify how much each contributes to attribution accuracy.
For Russian poetry, residual attribution accuracy collapses to near chance (1.1 times chance) once character n-grams and word bigrams are accounted for.
For Italian poetry, a reduced but significant residual persists (4.6 times chance), consistent with diachronic or dialectal variation that standard features do not fully capture.
Embeddings and stylometry rely on overlapping signals but differ in how they weight lexical, semantic, and historical variation.

Abstract

Large Language Model (LLM) embeddings achieve strong performance in authorship attribution, yet it remains unclear which aspects of literary style they encode. We address this question through a residualization analysis of two poetry corpora: 5,800 Russian poems (29 authors) and 10,400 Italian poems spanning seven centuries (52 authors). Using a progressive residualization waterfall, we subtract interpretable stylometric features and high-dimensional lexical controls from embedding representations to quantify their contribution to attribution accuracy. For Russian poetry, residual signal collapses to near chance (1.1 times chance) after accounting for character n-grams and word bigrams, indicating that embeddings largely compress orthographic and lexical distributions already exploited in classical stylometry. For Italian poetry, a reduced but significant residual persists (4.6 times chance), consistent with diachronic or dialectal variation not fully captured by standard features. We conclude that embeddings and stylometry rely on overlapping signals but differ in how they weight lexical, semantic, and historical variation.
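
To make the idea of a residualization waterfall concrete, the following is a minimal sketch and not the authors' implementation. It assumes ridge regression as the residualizer, a nearest-centroid classifier for attribution, and balanced classes, so that "times chance" equals accuracy multiplied by the number of authors. None of these choices is stated in the abstract. The names `fit_residualizer`, `nearest_centroid_accuracy`, `times_chance`, and `waterfall` are hypothetical.

```python
import numpy as np

def fit_residualizer(F_train, E_train, ridge=1e-3):
    """Fit a ridge map from features F to embeddings E on the training split.

    Returns a function that removes the linearly predictable part of E.
    Ridge regression is an assumption of this sketch.
    """
    mu_F, mu_E = F_train.mean(axis=0), E_train.mean(axis=0)
    Fc = F_train - mu_F
    # Closed-form ridge solution: B = (Fc'Fc + ridge * I)^-1 Fc'(E - mu_E)
    gram = Fc.T @ Fc + ridge * np.eye(Fc.shape[1])
    B = np.linalg.solve(gram, Fc.T @ (E_train - mu_E))
    return lambda F, E: (E - mu_E) - (F - mu_F) @ B

def nearest_centroid_accuracy(X_train, y_train, X_test, y_test):
    """Attribute each test poem to the author with the closest mean vector."""
    classes = np.unique(y_train)
    centroids = np.stack([X_train[y_train == c].mean(axis=0) for c in classes])
    dists = ((X_test[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=-1)
    return float((classes[dists.argmin(axis=1)] == y_test).mean())

def times_chance(accuracy, n_authors):
    """Express accuracy as a multiple of chance (1 / n_authors, balanced classes)."""
    return accuracy * n_authors

def waterfall(E_tr, y_tr, E_te, y_te, blocks_tr, blocks_te, n_authors):
    """Subtract feature blocks cumulatively and report the remaining signal.

    blocks_tr and blocks_te map a block name to a feature matrix with one row
    per poem, in the same order as the embeddings.
    """
    acc = nearest_centroid_accuracy(E_tr, y_tr, E_te, y_te)
    results = [("raw embeddings", times_chance(acc, n_authors))]
    used_tr, used_te = [], []
    for name in blocks_tr:
        used_tr.append(blocks_tr[name])
        used_te.append(blocks_te[name])
        F_tr, F_te = np.hstack(used_tr), np.hstack(used_te)
        residual = fit_residualizer(F_tr, E_tr)  # fitted on training data only
        acc = nearest_centroid_accuracy(
            residual(F_tr, E_tr), y_tr, residual(F_te, E_te), y_te
        )
        results.append((f"after {name}", times_chance(acc, n_authors)))
    return results
```

Under the balanced-class assumption, chance is 1/29 (about 3.4%) for the Russian corpus and 1/52 (about 1.9%) for the Italian one, so the reported residuals of 1.1 and 4.6 times chance would correspond to roughly 3.8% and 8.8% accuracy. Both corpora average 200 poems per author, but whether each author contributes exactly 200 is not stated.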
