Chemical Space Exploration with Active Learning and Alchemical Free Energies

Yuriy Khalak; Gary Tresadern; David F. Hahn; Bert L. de Groot; Vytautas Gapsys

doi:10.26434/chemrxiv-2022-q9033

Theoretical and Computational Chemistry

13 July 2022, Version 1

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Drug discovery can be thought of as a search for a needle in a haystack. From finding the initial active hit molecules, through the optimal decoration of lead molecule analogues, to the final selection of a clinical candidate, the process is an ongoing trade-off between applying the best methods and the cost of assessing the large available chemical space. Computational techniques can help by narrowing the search space, but some preferred methods, such as binding affinity calculations, can still only be performed on a small fraction of the possible molecules. For that reason, machine learning (ML) strategies are being developed to complement experimentation and computationally more expensive approaches in navigating and triaging large chemical libraries. In the current study, we explore how an active learning protocol can be combined with first-principles-based alchemical free energy calculations to identify high-affinity phosphodiesterase 2 (PDE2) inhibitors. First, we calibrate the procedure using a large set of experimentally characterised PDE2 binders. The optimized protocol is then used prospectively on a large chemical library to navigate towards potent inhibitors. In the active learning cycle, at every iteration a small fraction of compounds is probed by alchemical calculations and the obtained affinities are used to train ML models. With successive rounds, high-affinity binders are identified by explicitly evaluating only a small subset of compounds in a large chemical library, thus providing an efficient protocol that robustly identifies a large fraction of true positives.
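The active learning cycle described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the `oracle` function is a synthetic stand-in for an alchemical free energy calculation, the library is a set of random two-dimensional descriptor vectors, the surrogate is a simple nearest-neighbour average rather than the ML models used by the authors, and the greedy selection rule (pick the compounds with the best predicted affinity) is only one of several possible strategies.

```python
import random


def make_library(n, seed=0):
    """Build a toy 'chemical library': n compounds, each a 2-D descriptor vector."""
    rng = random.Random(seed)
    return [(rng.random(), rng.random()) for _ in range(n)]


def oracle(x):
    """Hypothetical stand-in for one alchemical free energy calculation.

    Returns a synthetic binding free energy (lower means tighter binding).
    """
    return (x[0] - 0.7) ** 2 + (x[1] - 0.2) ** 2


def knn_predict(train, x, k=3):
    """Predict an affinity as the mean label of the k nearest evaluated compounds."""
    nearest = sorted(
        train, key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], x))
    )[:k]
    return sum(y for _, y in nearest) / len(nearest)


def active_learning(library, n_rounds=4, batch_size=10, seed=0):
    """Run a greedy active learning loop; return {compound index: computed affinity}."""
    rng = random.Random(seed)
    unlabelled = set(range(len(library)))
    labelled = {}
    # The first batch is drawn at random because no training data exists yet.
    batch = rng.sample(sorted(unlabelled), batch_size)
    for _ in range(n_rounds):
        # "Expensive" step: evaluate only the selected batch with the oracle.
        for i in batch:
            labelled[i] = oracle(library[i])
            unlabelled.discard(i)
        # "Cheap" step: rebuild the training set from everything evaluated so far
        # and rank the remaining compounds by predicted affinity (lowest first).
        train = [(library[i], y) for i, y in labelled.items()]
        ranked = sorted(unlabelled, key=lambda i: knn_predict(train, library[i]))
        batch = ranked[:batch_size]
    return labelled


if __name__ == "__main__":
    library = make_library(200, seed=1)
    evaluated = active_learning(library)
    best = min(evaluated, key=evaluated.get)
    print(f"evaluated {len(evaluated)} of {len(library)} compounds")
    print(f"best compound index: {best}, synthetic affinity: {evaluated[best]:.4f}")
```

With the default settings the loop evaluates 40 of the 200 toy compounds explicitly, which mirrors the idea of probing only a small subset of the library while the surrogate model ranks the rest.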

Keywords

active learning

machine learning

free energy calculations

computational alchemy

molecular dynamics

Supplementary materials

Title

Supplementary Information: Chemical Space Exploration with Active Learning and Alchemical Free Energies

Description

Supplementary figures and tables

Now Published

Chemical Space Exploration with Active Learning and Alchemical Free Energies

Yuriy Khalak, Gary Tresadern, David F. Hahn, Bert L. de Groot, Vytautas Gapsys

Journal article

Journal of Chemical Theory and Computation, Volume 18, Issue 10

Online publication date: Sep 23, 2022

Version History

Jul 13, 2022 Version 1

Metrics

Views: 1,279

Downloads: 587

License

The content is available under CC BY 4.0

Funding

Vlaams Agentschap Innoveren & Ondernemen (VLAIO)

HBC.2018.2295, "Dynamics for Molecular Design (DynaMoDe)"

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
