March 18, 2024Open Access

Will we ever be able to accurately predict solubility?

Key Points

Key points are not available for this paper at this time.

Abstract

Abstract Accurate prediction of thermodynamic solubility by machine learning remains a challenge. Recent models often display good performances, but their reliability may be deceiving when used prospectively. This study investigates the origins of these discrepancies, following three directions: a historical perspective, an analysis of the aqueous solubility dataverse and data quality. We investigated over 20 years of published solubility datasets and models, highlighting overlooked datasets and the overlaps between popular sets. We benchmarked recently published models on a novel curated solubility dataset and report poor performances. We also propose a workflow to cure aqueous solubility data aiming at producing useful models for bench chemist. Our results demonstrate that some state-of-the-art models are not ready for public usage because they lack a well-defined applicability domain and overlook historical data sources. We report the impact of factors influencing the utility of the models: interlaboratory standard deviation, ionic state of the solute and data sources. The herein obtained models, and quality-assessed datasets are publicly available.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Llompart et al. (Mon,) studied this question.

synapsesocial.com/papers/68e7375cb6db6435876b0ab6 — DOI: https://doi.org/10.1038/s41597-024-03105-6

Authors

Pierre Llompart

Sanofi (France)

Claire Minoletti

Sanofi (France)

Shamkhal Baybekov

Université de Strasbourg

Journals

Scientific Data

Actions

Institutions

Université de Strasbourg

Sanofi (France)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Will we ever be able to accurately predict solubility?

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion