What question did this study set out to answer?

The research aims to enhance molecular generation methods for drug design by addressing the dynamics of molecular interactions.

April 17, 2026Open Access

Steering semi-flexible molecular diffusion model for structure-based drug design with reinforcement learning

Key Points

The research aims to enhance molecular generation methods for drug design by addressing the dynamics of molecular interactions.
Developed a diffusion framework using reinforcement learning for ligand modeling.
Defined the denoising process as a Markov decision process.
Incorporated multiple molecular properties to guide generation towards drug-like characteristics.
Implemented a fast sampling strategy to improve efficiency during training and sampling.
Conducted self-supervised training on both target-free and target-specific molecules.
Achieved a Vina score of -7.23 kcal/mol.
Demonstrated an 11.53% success rate in molecular generation.
Generated molecules maintained known interaction patterns and identified new binding chemotypes.

Abstract

Current structure-based molecular generation faces a fundamental dilemma: While static ligand modeling dominates computational approaches, real-world molecular interactions are inherently dynamic. Inspired by the conformational changes ligands undergo during semi-flexible docking, we propose a reinforcement learning (RL)–steered diffusion framework for semi-flexible molecular generation in protein pockets. By defining the denoising process as a Markov decision process, RL dynamically adjusts molecular structures through iterative exploration. Simultaneously, we incorporate multiple molecular properties as conditions to constrain the denoising policy to drug-like regions and perform self-supervised rigid training on both target-free and target-specific molecules. In addition, we propose a fast sampling strategy that accelerates sampling by 20 times, thereby improving the efficiency of training and sampling. Experiments demonstrate that our method outperforms state-of-the-art methods with a Vina score of −7.23 kcal/mol and an 11.53% success rate. Targeting unseen real-world proteins, the generated molecules preserve canonical interaction patterns while discovering previously unknown binding chemotypes.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Zhang et al. (Wed,) studied this question.

synapsesocial.com/papers/69e1cf375cdc762e9d858201 https://doi.org/https://doi.org/10.1126/sciadv.ady9955

Bookmark

View Full Paper