Abstract
Although generative models hold promise for discovering molecules with optimized desired properties, they often fail to suggest synthesizable molecules that improve upon the properties of the structures represented in the training distribution. We find that this limitation arises not only from the molecule generation process itself, but also from the poor generalization capabilities of molecular property predictors. We address this challenge by creating a closed-loop molecule generation pipeline with iterative retraining on new quantum chemical simulation data. Compared against static, single-pass generative modeling approaches, only our closed-loop iterative workflow generates molecules with properties extending beyond the training distribution (up to 0.44 standard deviations beyond the original range) and achieves a 79% improvement in out-of-distribution molecule classification accuracy. Furthermore, by conditioning molecular generation on thermodynamic stability data obtained during the iterative loop, the proportion of stable and hence potentially synthesizable molecules generated is 3.5x higher than the next-best model.
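The closed-loop idea described in the abstract (generate candidates, rank them with a property predictor, label the most promising ones by simulation, retrain, repeat) can be illustrated with a toy sketch. This is not the authors' pipeline. Everything here is an assumption made for illustration: molecules are reduced to a single numeric descriptor, `simulate` is a hypothetical stand-in for a quantum chemical calculation, the predictor is a least-squares line, and the generator simply perturbs known descriptors. The final metric, the gain over the original training maximum divided by the original standard deviation, is one plausible way to express "standard deviations beyond the original range"; the paper may define it differently.

```python
import random
import statistics


def simulate(x):
    """Hypothetical stand-in for an expensive quantum chemical simulation."""
    return -(x - 3.0) ** 2 + 9.0  # toy property, peaked at x = 3


def fit_linear(data):
    """Least-squares line through (x, y) pairs; plays the property predictor."""
    xs = [x for x, _ in data]
    ys = [y for _, y in data]
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    var_x = sum((x - mean_x) ** 2 for x in xs)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in data)
    slope = cov_xy / var_x if var_x else 0.0
    return lambda x: mean_y + slope * (x - mean_x)


def generate(data, n, rng, spread=0.5):
    """Toy generator: perturb the descriptors of already-known molecules."""
    return [rng.choice(data)[0] + rng.gauss(0.0, spread) for _ in range(n)]


def closed_loop(rounds=5, n_candidates=50, top_k=5, seed=0):
    rng = random.Random(seed)
    # Initial training set is confined to descriptors in [0, 1].
    data = [(x, simulate(x)) for x in (rng.uniform(0.0, 1.0) for _ in range(20))]
    initial = [y for _, y in data]
    for _ in range(rounds):
        predict = fit_linear(data)                      # retrain on all labels so far
        candidates = generate(data, n_candidates, rng)  # propose new molecules
        best = sorted(candidates, key=predict, reverse=True)[:top_k]
        data.extend([(x, simulate(x)) for x in best])   # label top picks, grow the set
    # Gain over the original maximum, in units of the original standard deviation.
    extension = (max(y for _, y in data) - max(initial)) / statistics.stdev(initial)
    return data, extension


if __name__ == "__main__":
    labelled, extension = closed_loop()
    print(f"{len(labelled)} labelled points, extension = {extension:.2f} std")
```

A static, single-pass baseline corresponds to `rounds=1` in this sketch: the predictor is never refit on newly simulated candidates, so its ranking relies entirely on the initial training range.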
Evan R. Antoniuk
Peggy Li
Nathan Keilbart
npj Computational Materials
Lawrence Livermore National Laboratory
Antoniuk et al. studied this question.
www.synapsesocial.com/papers/698d6de45be6419ac0d53238 — DOI: https://doi.org/10.1038/s41524-025-01924-8