Machine learning has become a crucial tool in drug discovery and chemistry at large, e.g., to predict molecular properties, such as bioactivity, with high accuracy. However, activity cliffs – pairs of molecules that are highly similar in their structure but exhibit large differences in potency – have been underinvestigated for their effect on model performance. Not only are these edge cases informative for molecule discovery and optimization, but models that are well-equipped to accurately predict the potency of activity cliffs have an increased potential for prospective applications. Our work aims to fill the current knowledge gap on best-practice machine learning methods in the presence of activity cliffs. We benchmarked a total of 720 machine and deep learning models on curated bioactivity data from 30 macromolecular targets for their performance on activity cliff compounds. While all methods struggled in the presence of activity cliffs, machine learning approaches based on molecular descriptors outperformed more complex deep learning methods. Our findings highlight large case-by-case differences in performance, advocating for (a) the inclusion of dedicated “activity-cliff-centered” metrics during model development and evaluation, and (b) the development of novel algorithms to better predict the properties of activity cliffs. To this end, the methods, metrics, and results of this study have been encapsulated into an open-access benchmarking platform named MoleculeACE (Activity Cliff Estimation, available on GitHub at: https://github.com/molML/MoleculeACE). MoleculeACE is designed to steer the community towards addressing the pressing but overlooked limitation of molecular machine learning models posed by activity cliffs.
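The definition above — a pair of molecules with high structural similarity but a large potency gap — can be sketched in code. The following is a minimal illustration, assuming set-based binary fingerprints and illustrative cutoffs (Tanimoto similarity ≥ 0.9, ≥ 10-fold potency difference); it is not the study's exact cliff-detection protocol.

```python
import math

# Hedged sketch: flag an "activity cliff" pair from precomputed
# binary fingerprints (represented as sets of "on" bit indices)
# and potencies. Thresholds are illustrative assumptions.

def tanimoto(fp_a: set, fp_b: set) -> float:
    """Tanimoto (Jaccard) similarity between two bit-index sets."""
    if not fp_a and not fp_b:
        return 1.0
    inter = len(fp_a & fp_b)
    return inter / (len(fp_a) + len(fp_b) - inter)

def is_activity_cliff(fp_a, fp_b, pki_a, pki_b,
                      sim_cutoff=0.9, fold_cutoff=10.0) -> bool:
    """High structural similarity + large potency gap = activity cliff.
    pKi is -log10(Ki), so a 1-unit gap is a 10-fold potency difference."""
    similar = tanimoto(fp_a, fp_b) >= sim_cutoff
    large_gap = abs(pki_a - pki_b) >= math.log10(fold_cutoff)
    return similar and large_gap

# Toy example: near-identical fingerprints, 100-fold potency gap.
fp1 = set(range(20))
fp2 = set(range(19)) | {25}   # 19 of ~20 bits shared
print(tanimoto(fp1, fp2))                      # high similarity (19/21)
print(is_activity_cliff(fp1, fp2, 6.0, 8.0))   # True
```

With real molecules, the bit sets would come from a fingerprinting library rather than being hand-written, but the similarity and potency-gap logic is the same.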
We elucidated the reasoning behind our chosen splitting approach in depth and described its effects on (1) the data distributions, (2) the presence of activity cliffs in the training and test sets, and (3) the occurrence and consequences of all activity cliff 'partners' ending up in the test set. We included a supporting table. Second, to assess any bias of our splitting approach in favor of ECFP descriptors (which were used for splitting), we compared train-test similarities for different molecular descriptors and found no significant differences. We also fixed several typos.
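The descriptor-bias check described above can be sketched as a mean nearest-neighbor train-test similarity, computed once per descriptor type and then compared across descriptors. The set-based Tanimoto similarity and the toy fingerprints below are assumptions for illustration, not the study's exact procedure.

```python
# Hedged sketch of the train-test similarity check: for each test
# molecule, find its most similar training molecule and average these
# nearest-neighbor similarities over the test set. Repeating this for
# each descriptor (ECFP, MACCS, ...) shows whether the split
# systematically favors one representation.

def tanimoto(fp_a: set, fp_b: set) -> float:
    """Tanimoto (Jaccard) similarity between two bit-index sets."""
    inter = len(fp_a & fp_b)
    union = len(fp_a) + len(fp_b) - inter
    return inter / union if union else 1.0

def mean_nn_similarity(test_fps, train_fps):
    """Average, over the test set, of each test molecule's similarity
    to its nearest neighbor in the training set."""
    return sum(max(tanimoto(t, tr) for tr in train_fps)
               for t in test_fps) / len(test_fps)

# Toy fingerprints standing in for one descriptor type.
train = [{1, 2, 3, 4}, {2, 3, 5, 6}]
test = [{1, 2, 3, 7}, {5, 6, 8}]
print(round(mean_nn_similarity(test, train), 3))  # 0.5
```

Similar mean nearest-neighbor values across descriptor types would indicate that the ECFP-based split does not leave other descriptors at a disadvantage.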