Molecule parametrization is an essential requirement to guarantee the accuracy of docking calculations. Parametrization includes a proper perception of chemical properties such as bonds, formal charges and protonation states. This includes large biological macromolecules, such as proteins and nucleic acids, and small molecules, such as ligands and cofactors. The structures of proteins and nucleic acids are challenging due to omission of several atoms from the structural model, and from the lack of connectivity and bond order information in the PDB and mmCIF file formats. For small molecules, the very large chemical diversity poses challenges for both validating correctness and providing accurate parameters. These challenges affect various modeling approaches like molecular docking and molecular dynamics. Moreover, several specialized methods (particularly in molecular docking) leverage specific chemical properties to add custom potentials, pseudoatoms, or manipulate atomic connectivity. To address these challenges, we developed Meeko, a molecular parametrization Python package that leverages the widely used RDKit cheminformatics library for a chemically accurate description of the molecular representation. Small molecules are modeled as single RDKit molecules, and biological macromolecules as multiple RDKit molecules, one for each residue. Meeko is highly customizable and designed to be easily scriptable for high-throughput processing, replacing MGLTools for receptor and ligand preparation.
Santos‐Martins et al. (Thu,) studied this question.