AutoDesigner - Core Design, a De Novo Design Algorithm for Chemical Scaffolds: Application to the Design and Synthesis of Novel Selective Wee1 Inhibitors

14 June 2024, Version 2
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The hit identification stage of a drug discovery program generally involves the design of novel chemical scaffolds with desired biological activity against the target(s) of interest. One common approach is scaffold hopping, which is the manual design of novel scaffolds based on known chemical matter. One major limitation of this approach is narrow chemical space exploration, which can lead to difficulties in maintaining or improving biological activity, selectivity, and favorable property space. Another limitation is the lack of preliminary structure-activity relationship (SAR) data around these designs, which could lead to selecting suboptimal scaffolds to advance lead optimization. To address these limitations, we propose AutoDesigner - Core Design (CoreDesign), a de novo scaffold design algorithm. Our approach is a cloud-integrated, de novo design algorithm for systematically exploring and refining chemical scaffolds against biological targets of interest. The algorithm designs, evaluates, and optimizes a vast range - from millions to billions - of molecules in silico, following defined project parameters encompassing structural novelty, physicochemical attributes, potency, and selectivity. In this manner, CoreDesign can generate novel scaffolds and also explore preliminary SAR around each scaffold using FEP+ potency predictions. CoreDesign requires only a single ligand with quantifiable binding affinity and an initial binding hypothesis, making it especially suited for the hit-identification stage where experimental data is often limited. To validate CoreDesign in a real-world drug discovery setting, we applied it to the design of novel, potent Wee1 inhibitors with improved selectivity over PLK1. Starting from a single known ligand, CoreDesign rapidly explored over 23 billion molecules to identify 1,342 novel chemical series with a mean of 4 compounds per scaffold. Importantly, all chemical series met the predefined property space requirements. To rapidly analyze this large amount of data and prioritize chemical scaffolds for synthesis, we utilize t-Distributed Stochastic Neighbor Embedding (t-SNE) plots of in silico properties. The chemical space projections allowed us to rapidly identify a structurally novel 5-5 fused core meeting all the hit-identification requirements. Several compounds were synthesized and assayed from the scaffold, displaying good potency against Wee1 and excellent PLK1 selectivity. Our results suggest that CoreDesign can significantly speed up the hit-identification process and increase the probability of success of drug discovery campaigns by allowing teams to bring forward high-quality chemical scaffolds de-risked by the availability of preliminary SAR.

Keywords

scaffold hopping
drug design
hit identification
de novo design
chemical space exploration
virtual screening
drug discovery

Supplementary materials

Title
Description
Actions
Title
AutoDesigner - Core Design, a De Novo Design Algorithm for Chemical Scaffolds: Application to the Design and Synthesis of Novel Selective Wee1 Inhibitors
Description
Supporting information including computational and experimental method details
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.