ChemRxiv
These are preliminary reports that have not been peer-reviewed. They should not be regarded as conclusive, guide clinical practice/health-related behavior, or be reported in news media as established information. For more information, please see our FAQs.
ICLR_2021_workshop_camera_ready.pdf (352.85 kB)

Transformer Neural Network for Structure Constrained Molecular Optimization

preprint
submitted on 14.04.2021, 18:19 and posted on 15.04.2021, 10:25 by Jiazhen He, Felix Mattsson, Marcus Forsberg, Esben Jannik Bjerrum, Ola Engkvist, eva nittinger, Christian Tyrchan, Werngard Czechtizky
Finding molecules with a desirable balance of multiple properties is a main challenge in drug discovery. Here, we focus on the task of molecular optimization, where a starting molecule with promising properties needs to be further optimized towards the desirable properties. Typically, chemists would apply chemical transformations to the starting molecule based on their intuition. A widely used strategy is the concept of matched molecular pairs where two molecules differ by a single transformation. In particular, a chemist would be interested in keeping one part of the starting molecule (core) constant, while substituting the other part (R-group), to optimize the starting molecule towards desirable properties. Motivated by this, we train a Transformer model, Transformer-R, to generate R-groups given the starting molecule (with its core and R-group specified) and the specified desirable properties. The generated R-groups will be attached to the core to form the final molecules, which are guaranteed to keep the core of interest and are expected to satisfy the desirable properties in the input. Our model could accelerate the process of optimizing antiviral drug candidates in terms of various properties of interest, e.g. pharmacokinetics.

History

Email Address of Submitting Author

jiazhen.he@astrazeneca.com

Institution

AstraZeneca

Country

Sweden

ORCID For Submitting Author

0000-0001-5848-8318

Declaration of Conflict of Interest

No conflict of interest

Version Notes

Accepted by ICLR 2021 Workshop: Machine Learning for Preventing and Combating Pandemics

Exports