Abstract
In the present work, two reaction-based generative models for molecular design are presented: Growing Optimizer and Linking Optimizer. These models are designed to emulate real-life chemical synthesis by sequentially selecting building blocks and simulating the reactions between them to form new compounds. By focusing on the feasibility of the generated molecules, Growing Optimizer and Linking Optimizer offer several advantages, including the ability to restrict chemistry to specific building blocks, reaction types, and synthesis pathways, a crucial requirement in drug design. Unlike text-based models, which construct molecules by iteratively forming a textual representation of the molecular structure, and graph-based models, which assemble molecules atom by atom or fragment by fragment, our approach incorporates a more comprehensive understanding of chemical knowledge, making it relevant for drug discovery projects. Comparative analysis with REINVENT 4, a state-of-the-art molecular generative model, shows that Growing Optimizer and Linking Optimizer are more likely to produce synthetically accessible molecules while reaching molecules of interest with the desired properties.
Supplementary materials
Title
Supplementary Information
Description
A. Chemical Space Estimation
B. Pseudo-algorithm sampling
C. Setup for Docking Simulations
D. Pipeline of reaction and template preparation
E. Experiments
Actions