Prediction of Compound Synthesis Accessibility Based on Reaction Knowledge Graph

21 May 2021, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

With the increasing application of deep learning based generative models for de novo molecule design, quantitative estimation of molecular synthetic accessibility becomes a crucial factor for prioritizing the structures generated from generative models. On the other hand, it is also useful for helping prioritization of hit/lead compounds and guiding retro-synthesis analysis. In current study, based on the USPTO and Pistachio reaction datasets, we created a chemical reaction network, in which a depth-first search was performed for identification of the reaction paths of product compounds. This reaction dataset was then used to build predictive model for distinguishing the organic compounds either as easy synthesize (ES) or hard-to synthesize (HS) classes. Three synthesis accessibility (SA) models were built using deep learning/machine learning algorithms. The comparison between our three SA scoring functions with other existing synthesis accessibility scoring schemes, such as SYBA, SCScore, SAScore were also carried out. and the graph based deep learning model outperforms those existing SA scores. Our results show that prediction models based on historical reaction knowledge could be a useful tool for measuring molecule complexity and estimating molecule SA.

Keywords

network of chemistry
deep learning
synthesis accessibility

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.