Bayesian Sequential Stacking Algorithm for Concurrently Designing Molecules and Synthetic Reaction Networks

Qi Zhang; Chang Liu; Stephen Wu; Ryo Yoshida

doi:10.26434/chemrxiv-2022-5qpv9

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Bayesian Sequential Stacking Algorithm for Concurrently Designing Molecules and Synthetic Reaction Networks

04 April 2022, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

In the last few years, de novo molecular design using machine learning has made great technical progress but its practical deployment has not been as successful. This is mostly owing to the cost and technical difficulty of synthesizing such computationally designed molecules. To overcome such barriers, various methods for synthetic route design using deep neural networks have been studied intensively in recent years. However, little progress has been made in designing molecules and their synthetic routes simultaneously. Here, we formulate the problem of simultaneously designing molecules with the desired set of properties and their synthetic routes within the framework of Bayesian inference. The design variables consist of a set of reactants in a reaction network and its network topology. The design space is extremely large because it consists of all combinations of purchasable reactants, often in the order of millions or more. In addition, the designed reaction networks can adopt any topology beyond simple multistep linear reaction routes. To solve this hard combinatorial problem, we present a powerful sequential Monte Carlo algorithm that recursively designs a synthetic reaction network by sequentially building up single-step reactions. In a case study of designing drug-like molecules based on commercially available compounds, compared with heuristic combinatorial search methods, the proposed method shows overwhelming performance in terms of computational efficiency and coverage and novelty with respect to existing compounds.

Keywords

Molecular design

synthetic reaction network

machine learning

Bayesian inference

recurrent algorithm

Supplementary materials

Title

Description

Actions

Title

Examples of designed products

Description

Products and their synthetic pathway networks were designed using Seq-Stack Reaction.

Actions

Supplementary weblinks

Title

Description

Actions

Title

Seq-Stack-Reaction

Description

software for "Bayesian sequential stacking algorithm for concurrently designing molecules and synthetic reaction networks"

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

A Bayesian method for concurrently designing molecules and synthetic reaction networks

Qi Zhang, Chang Liu, Stephen Wu, Yoshihiro Hayashi, Ryo Yoshida journal article

Science and Technology of Advanced Materials: Methods , Volume 3, Issue 1

Online publication date: May 17, 2023

Version History

Apr 04, 2022 Version 1

Metrics

819

239

Views

Downloads

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2022-5qpv9

Funding

MEXT

PMXP1020210314

JST CREST

JPMJCR19I3

JSPS Grant-in-Aid for Scientific Research (A)

19H01132

MEXT KAKENHI Grant-in-Aid

19H05820

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Bayesian Sequential Stacking Algorithm for Concurrently Designing Molecules and Synthetic Reaction Networks

Authors

Abstract

Keywords

Supplementary materials

Supplementary weblinks

Comments

Now Published

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share