Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation

Morgan Thomas; Noel M. O'Boyle; Andreas Bender; Chris de Graaf

doi:10.26434/chemrxiv-2022-prz2r

Theoretical and Computational Chemistry

15 April 2022, Version 1

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Many AI-based techniques now exist for de novo molecule generation that can devise molecules conditioned towards a particular endpoint in the context of drug design. One popular approach uses reinforcement learning to update a recurrent neural network or other language-based de novo molecule generator. However, reinforcement learning can be inefficient, sometimes requiring up to 10^5 molecules to be sampled to optimize more complex objectives, which is a limitation when using computationally expensive scoring functions such as docking or computer-aided synthesis planning models. In this work, we propose a reinforcement learning strategy called Augmented Hill-Climb (AHC), a simple, hypothesis-driven hybrid between REINVENT and Hill-Climb that improves sample-efficiency by addressing the limitations of both currently used strategies. We compare its ability to optimize several docking tasks with that of REINVENT, and benchmark it against other commonly used reinforcement learning strategies including REINFORCE, REINVENT (version 1 & 2), Hill-Climb and best agent reminder. We find that optimization ability is improved ~1.5-fold and sample-efficiency is improved ~45-fold compared to REINVENT, while still delivering appealing chemistry as output. Diversity filters were used, and their parameters were tuned to overcome observed failure modes that take advantage of certain diversity filter configurations. Lastly, we find that AHC outperforms the other reinforcement learning strategies on six tasks, especially in the early stages of training or for more difficult objectives. Overall, we show that AHC improves sample-efficiency for language-based de novo molecule generation conditioned via reinforcement learning, compared to the current state-of-the-art. This makes more computationally expensive scoring functions, such as docking, more accessible on a relevant timescale.
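
The abstract does not spell out the update rule, so the following is only a minimal sketch of one plausible reading of a "hybrid between REINVENT and Hill-Climb": a REINVENT-style squared difference between an augmented likelihood and the agent likelihood, computed only on the best-scoring fraction of each sampled batch. The function name, the parameters `sigma` and `topk`, and the toy numbers are illustrative assumptions and are not taken from this page or from the authors' SMILES-RNN code.

```python
import numpy as np


def augmented_hill_climb_loss(prior_logp, agent_logp, scores, sigma, topk):
    """Hypothetical sketch: REINVENT-style loss restricted to the top-k fraction.

    prior_logp, agent_logp: per-molecule sequence log-likelihoods (assumed inputs).
    scores: per-molecule rewards, assumed to lie in [0, 1].
    sigma: scaling of the reward in the augmented likelihood (assumed parameter).
    topk: fraction of the batch to keep, e.g. 0.5 (assumed parameter).
    """
    prior_logp = np.asarray(prior_logp, dtype=float)
    agent_logp = np.asarray(agent_logp, dtype=float)
    scores = np.asarray(scores, dtype=float)

    # Hill-Climb part: keep only the best-scoring molecules of the batch.
    n_keep = max(1, int(topk * len(scores)))
    keep = np.argsort(-scores, kind="stable")[:n_keep]

    # REINVENT part: augmented likelihood = prior log-likelihood + sigma * score.
    augmented = prior_logp[keep] + sigma * scores[keep]

    # Mean squared difference between augmented and agent log-likelihoods.
    loss = float(np.mean((augmented - agent_logp[keep]) ** 2))
    return loss, keep


# Toy batch of four molecules (made-up numbers, for illustration only).
loss, kept = augmented_hill_climb_loss(
    prior_logp=[-10.0, -12.0, -8.0, -9.0],
    agent_logp=[-10.0, -11.0, -8.0, -9.0],
    scores=[0.1, 0.9, 0.5, 0.2],
    sigma=10.0,
    topk=0.5,
)
print(loss, kept)
```

In an actual training loop the loss would be computed on differentiable tensors and backpropagated through the agent network; NumPy is used here only to keep the arithmetic easy to follow.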

Keywords

Artificial Intelligence

Structure-based drug design

SBDD

Generative models

De novo molecule generation

Recurrent neural network

REINVENT

Reinforcement learning

Supplementary materials

Supporting information: Supplementary tables and figures

Supplementary weblinks

Supporting data: Supporting data including the prior training datasets, trained priors and results.

SMILES-RNN: GitHub repository containing code used to generate the results.

Now Published

Augmented Hill-Climb increases reinforcement learning efficiency for language-based de novo molecule generation

Morgan Thomas, Noel M. O’Boyle, Andreas Bender, Chris de Graaf

Journal article

Journal of Cheminformatics, Volume 14, Issue 1

Online publication date: Oct 03, 2022

Version History

Apr 15, 2022 Version 1

Metrics

Views: 1,803

Downloads: 591

License

The content is available under CC BY 4.0

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
