This study tests the hypothesis that there is a statistically measurable association between planetary categories and their associated symbolism in Western astrology and lexical patterns in biographical texts. The hypothesis is evaluated by examining whether biographical texts form statistically distinguishable aggregated lexical structures corresponding to planetary categories. The analysis is based on a corpus of 26,000 biographies of individuals with precisely documented birth times (Rodden rating AA). For each individual, planetary azimuths at the moment of birth were computed, after which biographies were grouped by planet using an operational criterion of a ‘dominant’ planet, defined via the Local Space method through the concentration of geocoded life events within a specified angular sector. For each of the 10 planets, word clouds were built from the biographical texts and their separability was assessed using a classification procedure with train-validate-holdout splits. The classification procedure was used not as an applied predictive model, but as an instrument for assessing the separability and reproducibility of the identified lexical structures. The observed classification correctly identified up to 10 out of 10 planets (train-validate data) and up to 7 out of 10 planets (train-holdout data). The results substantially outperformed randomised control models: in an aggregated permutation test across different sample splits, the probability of obtaining comparable results was approximately p ≈ .002–.004, with a large effect size (r ≈ .84–.91). Variation of the Local Space parameters predictably affected classification performance. An additional comparison with a corpus of traditional astrological texts revealed statistically significant agreement between biographical and astrological lexical profiles, exceeding the level expected by chance (p ≈ .002–.007, r ≈ .77–.89; permutation test). Overall, the results indicate statistically significant separability of planetary lexical clouds constructed from biographies and suggest that text corpora grouped by ‘dominant’ planet exhibit robust differences in lexical patterns consistent with astrological interpretations.
Dmitriy Koritskiy (Thu,) studied this question.