What question did this study set out to answer?

The aim is to enhance the alignment of biomedical knowledge graphs by improving the construction of similarity features using advanced optimization techniques.

May 15, 2026

Biomedical Knowledge Graph Alignment with GPT-Augmented Similarity Feature Construction via Tree-based Particle Swarm Optimization and Adaptive Fitness Optimization

Key Points

Implemented a GPT-based semantic feature construction method to extract context-aware similarity features.
Developed a tree-based symbolic representation within a particle swarm optimization framework.
Introduced an adaptive fitness landscape optimization mechanism to improve convergence stability during evolution.
T-PSO-AFO significantly outperforms state-of-the-art biomedical entity matching approaches on OAEI's LargeBio and Disease and Phenotype datasets.
Demonstrated robustness and effectiveness in aligning heterogeneous biomedical knowledge graphs.

Abstract

Accurate biomedical Knowledge Graph (KG) alignment is critical for integrating heterogeneous biomedical information and enabling reliable downstream applications, such as clinical decision support, biomedical search, and personalized healthcare. However, variations in terminologies, KG structures, and semantic granularity across biomedical sources introduce substantial heterogeneity, making the construction of high-quality Similarity Features (SFs) a major challenge. Although the Generative Pre-trained Transformer (GPT) has shown strong potential in capturing nuanced biomedical semantics, each GPT-augmented SF typically reflects only one semantic perspective, and combining multiple SFs in a meaningful manner remains non-trivial because of the expanded symbolic search space and potential feature conflicts. To address these challenges, we propose a novel Tree-based Particle Swarm Optimization with Adaptive Fitness Optimization (T-PSO-AFO) framework for GPT-augmented SF construction. First, a GPT-based semantic feature construction method is introduced to extract expressive and context-aware SFs that better capture biomedical entity equivalence. Then, a tree-based symbolic representation within a PSO framework is developed to explore diverse and complex SF combinations more effectively. Finally, an adaptive fitness landscape optimization mechanism is proposed to dynamically reshape the symbolic search space during evolution, improving the convergence stability and alignment performance. Experiments on OAEI's LargeBio and Disease and Phenotype datasets demonstrate that T-PSO-AFO significantly outperforms state-of-the-art biomedical entity matching approaches, validating its robustness, effectiveness, and scalability in aligning heterogeneous biomedical KGs.
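The abstract does not specify the tree encoding, the operator set, or the fitness function, so the following is only a minimal sketch of the general idea of combining several SFs through an expression tree and scoring the result against a reference alignment. The operators (`max`, `min`, `avg`), the feature names (`lexical`, `gpt`), the 0.5 threshold, and the use of F-measure as fitness are all assumptions made for illustration, not details taken from the paper. The PSO update and the adaptive fitness mechanism are not shown.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Set, Tuple, Union

# Hypothetical operator set; the paper's actual function set is not stated in the abstract.
OPS: Dict[str, Callable[[float, float], float]] = {
    "max": max,
    "min": min,
    "avg": lambda a, b: (a + b) / 2.0,
}


@dataclass(frozen=True)
class Leaf:
    feature: str  # name of one base similarity feature (assumed naming)


@dataclass(frozen=True)
class Node:
    op: str  # key into OPS
    left: "Tree"
    right: "Tree"


Tree = Union[Leaf, Node]
Pair = Tuple[str, str]


def evaluate(tree: Tree, scores: Dict[str, float]) -> float:
    """Recursively compute the combined similarity for one candidate entity pair."""
    if isinstance(tree, Leaf):
        return scores[tree.feature]
    return OPS[tree.op](evaluate(tree.left, scores), evaluate(tree.right, scores))


def f_measure(tree: Tree,
              candidates: Dict[Pair, Dict[str, float]],
              reference: Set[Pair],
              threshold: float = 0.5) -> float:
    """Assumed fitness: F-measure of the pairs whose combined score reaches the threshold."""
    predicted = {pair for pair, scores in candidates.items()
                 if evaluate(tree, scores) >= threshold}
    true_positives = len(predicted & reference)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(reference)
    return 2 * precision * recall / (precision + recall)


# Toy data, invented purely for illustration.
candidates = {
    ("a1", "b1"): {"lexical": 0.9, "gpt": 0.8},
    ("a2", "b2"): {"lexical": 0.2, "gpt": 0.9},
    ("a3", "b9"): {"lexical": 0.6, "gpt": 0.1},
}
reference = {("a1", "b1"), ("a2", "b2")}

single = Leaf("lexical")
combined = Node("avg", Leaf("lexical"), Leaf("gpt"))

print(f_measure(single, candidates, reference))
print(f_measure(combined, candidates, reference))
```

On this toy data the averaged tree recovers both reference pairs, while the single lexical feature misses one and admits a false match. This illustrates why a search over feature combinations can be worthwhile, though it says nothing about how the paper's method performs.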


Cite This Study

Xue et al. studied this question.

synapsesocial.com/papers/6a06b7eae7dec685947aa856 https://doi.org/10.1109/jbhi.2026.3692696
