Exploring activity landscapes with extended similarity: is Tanimoto enough?

Timothy Dunn; Edgar Lopez-Lopez; Taewon Kim; Jose Luis Medina Franco; Ramon Miranda-Quintana

doi:10.26434/chemrxiv-2023-ncl51

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Exploring activity landscapes with extended similarity: is Tanimoto enough?

25 January 2023, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Understanding structure-activity landscapes is essential in drug discovery. Similarly, it has been shown that the presence of activity cliffs in compound data sets can have a substantial impact not only on the design progress but also can influence the predictive ability of machine learning models. With the continued expansion of the chemical space and the currently available large and ultra-large libraries, it is imperative to implement efficient tools to analyze the activity landscape of compound data sets rapidly. The goal of this study is to show the applicability of the n-ary indices to quantify the structure-activity landscapes of large compound data sets using different types of structural representation rapidly and efficiently. We also discuss how a recently introduced medoid algorithm provides the foundation to finding optimum correlations between similarity measures and structure-activity rankings. The applicability of the n-ary indices and the medoid algorithm is shown by analyzing the activity landscape of 10 compound data sets with pharmaceutical relevance using three fingerprints of different designs, 16 extended similarity indices, and 11 coincidence thresholds.

Keywords

structure-activity relationships

molecular fingerprints

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

Exploring activity landscapes with extended similarity: is Tanimoto enough?

Timothy B. Dunn, Edgar López‐López, Taewon David Kim, José L. Medina‐Franco, Ramón Alain Miranda‐Quintana journal article

Molecular Informatics , Volume 42, Issue 7

Online publication date: Jun 07, 2023

Version History

Jan 25, 2023 Version 1

Metrics

1,735

658

Views

Downloads

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2023-ncl51

Funding

University of Florida

UFII Seed grant

Programa de Apoyo a Proyectos de Investigación e Innovación Tecnológica (PAPIIT)

IN201321

Consejo Nacional de Ciencia y Tecnología (CONACyT)

762342 (No. CVU: 894234)

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Exploring activity landscapes with extended similarity: is Tanimoto enough?

Authors

Abstract

Keywords

Comments

Now Published

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share