Optimization of binding affinities in chemical space with generative pretrained transformer and deep reinforcement learning

Xiaopeng Xu; Juexiao Zhou; Chen  Zhu; Qing Zhan; Zhongxiao Li; Ruochi Zhang; Yu Wang; Xingyu Liao; Xin Gao

doi:10.26434/chemrxiv-2023-7v4sw

Biological and Medicinal Chemistry

Search within Biological and Medicinal Chemistry

Optimization of binding affinities in chemical space with generative pretrained transformer and deep reinforcement learning

03 April 2023, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Background: The key challenge in drug discovery is to discover novel compounds with desirable properties. Among the properties, binding affinity to a target is one of the prerequisites and usually evaluated by molecular docking or quantitative structure activity relationship (QSAR) models. Methods: In this study, we developed Simplified molecular input line entry system Generative Pretrained Transformer with Reinforcement Learning (SGPT-RL), which uses a transformer decoder as the policy network of the reinforcement learning agent to optimize the binding affinity to a target. SGPT-RL was evaluated on the Moses distribution learning benchmark and two goal-directed generation tasks, with Dopamine Receptor D2 (DRD2) and Angiotensin-Converting Enzyme 2 (ACE2) as the targets. Both QSAR model and molecular docking were implemented as the optimization goals in the tasks. The popular Reinvent method was used as the baseline for comparison. Results: The results on Moses benchmark showed that SGPT-RL learned good property distributions and generated molecules with high validity and novelty. On the two goal-directed generation tasks, both SGPT-RL and Reinvent were able to generate valid molecules with improved target scores. The SGPT-RL method achieved better results than Reinvent on the ACE2 task, where molecular docking was used as the optimization goal. Further analysis shows that SGPT-RL learned conserved scaffold patterns during exploration. Conclusions: The superior performance of SGPT-RL in the ACE2 task indicates that it can be applied to the virtual screening process where molecular docking is widely used as the criteria. Besides, the scaffold patterns learned by SGPT-RL during the exploration process can assist chemists to better design and discover novel lead candidates.

Keywords

Drug design

Transformer

Reinforcement learning

Molecular docking

Hit discovery

Supplementary materials

Title

Description

Actions

Title

Supplementary tables and figures

Description

Supplementary tables and figures

Actions

Supplementary weblinks

Title

Description

Actions

Title

Source code of SGPT-RL

Description

Source code of SGPT-RL in GitHub.

Actions

View

Title

Source data of SGPT-RL

Description

Source data of SGPT-RL in Zenodo.

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

This manuscript is now published on F1000Research under this link: https://f1000research.com/articles/12-757/v2.

Now Published

Optimization of binding affinities in chemical space with generative pre-trained transformer and deep reinforcement learning

Xiaopeng Xu, Juexiao Zhou, Chen Zhu, Qing Zhan, Zhongxiao Li, Ruochi Zhang, Yu Wang, Xingyu Liao, Xin Gao journal article

F1000Research , Volume 12

Online publication date: Jun 28, 2023

Version History

Apr 03, 2023 Version 1

Metrics

954

427

Views

Downloads

Citations

License

The content is available under CC BY NC 4.0

DOI

10.26434/chemrxiv-2023-7v4sw

Funding

King Abdullah University of Science and Technology

FCC/1/1976-44-01, FCC/1/1976-45-01, URF/1/4663-01-01, REI/1/5202-01-01, REI/1/4940-01-01, and RGC/3/4816-01-01.

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Optimization of binding affinities in chemical space with generative pretrained transformer and deep reinforcement learning

Authors

Abstract

Keywords

Supplementary materials

Supplementary weblinks

Comments

Now Published

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share