Key points are not available for this paper at this time.
A new fingerprint design concept is introduced that transforms molecular property descriptors into two-state descriptors and thus permits binary encoding. This transformation is based on the calculation of statistical medians of descriptor distributions in large compound collections and alleviates the need for value range encoding of these descriptors. For binary encoded property descriptors, bit positions that are set off capture as much information as bit positions that are set on, different from conventional fingerprint representations. Accordingly, a variant of the Tanimoto coefficient has been defined for comparison of these fingerprints. Following our design idea, a prototypic fingerprint termed MP-MFP was implemented by combining 61 binary encoded property descriptors with 110 structural fragment-type descriptors. The performance of this fingerprint was evaluated in systematic similarity search calculations in a database containing 549 molecules belonging to 38 different activity classes and 5000 background molecules. In these calculations, MP-MFP correctly recognized approximately 34% of all similarity relationships, with only 0.04% false positives, and performed better than previous designs and MACCS keys. The results suggest that combinations of simplified two-state property descriptors have predictive value in the analysis of molecular similarity.
Building similarity graph...
Analyzing shared references across papers
Loading...
Ling Xue
Hebei Medical University
Jeffrey W. Godden
Center for Information Technology
Florence L. Stahura
Albany Molecular Research (United States)
Journal of Chemical Information and Computer Sciences
University of Washington
Albany Molecular Research (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Xue et al. (Tue,) studied this question.
synapsesocial.com/papers/6a0fe5909e54838161fd5b37 — DOI: https://doi.org/10.1021/ci030285+