Background Endometriosis (EMs) affects approximately 10% of reproductive‐age women worldwide, yet its pathogenesis remains incompletely understood. Abnormal cell differentiation and somatic mutations in the ectopic endometrial microenvironment play critical roles in disease progression and treatment response heterogeneity. This study is aimed at elucidating the molecular mechanisms underlying ectopic endometrial cell differentiation using machine learning (ML) approaches and single‐cell RNA sequencing (scRNA‐seq), and identifying novel prognostic biomarkers and therapeutic targets, with particular attention to mutation‐driven transcriptional alterations. Methods We analyzed comprehensive transcriptomic data from the Gene Expression Omnibus (GEO) and Human Endometrium Database (HED), including scRNA‐seq data from 162,485 cells across 46 EMs patients. Through systematic comparative analysis, we identified 298 genes associated with ectopic endometrial cell differentiation, including genes harboring recurrent somatic mutations reported in endometriotic lesions. We evaluated 10 distinct ML algorithms and 101 hybrid combinations to develop predictive models for patient stratification. Unsupervised clustering analysis identified distinct patient phenotypes. Functional enrichment analysis, pathway analysis, and cell–cell communication networks were constructed to characterize the ectopic microenvironment. Four key genes (HOXA10, ESR1, MMP9, and SPP1) were validated by quantitative real‐time PCR in normal endometrial stromal cells (NESCs) and ectopic endometrial stromal cells (EESCs). Results Unsupervised clustering revealed two distinct patient subgroups characterized as high‐ and low‐invasive ectopic endometrium phenotypes with significantly different disease progression trajectories. Single‐cell analysis unveiled extensive cellular heterogeneity within the ectopic endometrial microenvironment, identifying multiple cell types including epithelial, stromal, endothelial, and immune cells. Gene ontology and pathway enrichment analyses demonstrated significant activation of extracellular matrix organization, cell adhesion, cell migration, and angiogenesis pathways. Cell–cell communication analysis revealed macrophages as central mediators forming extensive connections with stromal, epithelial, and endothelial cells, with SPP1 emerging as a key signaling molecule. Trajectory analysis of stromal cells identified at least two major differentiation branches, indicating divergent differentiation programs. Notably, several of the 298 differentiation‐associated genes overlapped with loci frequently mutated in ectopic lesions, suggesting that somatic mutations may contribute to aberrant gene regulation. qRT‐PCR validation confirmed significant differential expression of key genes: HOXA10 showed 62% downregulation (0.38 ± 0.06 vs. 1.00 ± 0.09, p < 0.001), whereas ESR1, MMP9, and SPP1 demonstrated 108%, 252%, and 189% upregulation, respectively, in EESCs compared with NESCs (all p < 0.001). Conclusions This study provides a comprehensive molecular characterization of ectopic endometrial cell differentiation through integrative ML‐based analysis and single‐cell sequencing. The identification of distinct patient phenotypes, key regulatory genes, and macrophage‐centric communication networks advances our understanding of EMs pathogenesis. HOXA10, ESR1, MMP9, and SPP1 represent potential diagnostic biomarkers and therapeutic targets for personalized treatment strategies. The convergence of mutation‐associated transcriptional changes and differentiation abnormalities underscores the need for mutation‐aware therapeutic strategies. These findings pave the way for developing more effective and targeted interventions to improve patient outcomes in EMs management.
Zhang et al. (Thu,) studied this question.