LigGPT: Molecular Generation using a Transformer-Decoder Model

Viraj Bagal; Rishal Aggarwal; P. K. Vinod; U. Deva Priyakumar

doi:10.26434/chemrxiv.14561901.v1

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

LigGPT: Molecular Generation using a Transformer-Decoder Model

11 May 2021, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Application of deep learning techniques for the de novo generation of molecules, termed as inverse molecular design, has been gaining enormous traction in drug design. The representation of molecules in SMILES notation as a string of characters enables the usage of state of the art models in Natural Language Processing, such as the Transformers, for molecular design in general. Inspired by Generative Pre-Training (GPT) model that have been shown to be successful in generating meaningful text, we train a Transformer-Decoder on the next token prediction task using masked self-attention for the generation of druglike molecules in this study. We show that our model, LigGPT, outperforms other previously proposed modern machine learning frameworks for molecular generation in terms of generating valid, unique and novel molecules. Furthermore, we demonstrate that the model can be trained conditionally to optimize multiple properties of the generated molecules. We also show that the model can be used to generate molecules with desired scaffolds as well as desired molecular properties, by passing these structures as conditions, which has potential applications in lead optimization in addition to de novo molecular design. Using saliency maps, we highlight the interpretability of the generative process of the model.

Keywords

Generative Modeling

Artificial Intelligence

Transformer-Decoder

Generative Pre-Training

Drug Design

Lead Optimization

Supplementary materials

Title

Description

Actions

Title

liggpt si

Description

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

MolGPT: Molecular Generation Using a Transformer-Decoder Model

Viraj Bagal, Rishal Aggarwal, P. K. Vinod, U. Deva Priyakumar journal article

Journal of Chemical Information and Modeling , Volume 62, Issue 9

Online publication date: Oct 25, 2021

Version History

May 11, 2021 Version 1

Metrics

5,936

4,534

Views

Downloads

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv.14561901.v1

Funding

IHub-Data, IIIT Hyderabad

Author’s competing interest statement

No Conflict of Interest

LigGPT: Molecular Generation using a Transformer-Decoder Model

Authors

Abstract

Keywords

Supplementary materials

Comments

Now Published

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Share