Abstract
Computer-Assisted Synthesis Programs are increasingly employed by organic chemists. Often, these tools combine neural networks for policy prediction with heuristic search algorithms. We propose two novel enhancements, which we call eUCT and dUCT, to the Monte Carlo tree search (MCTS) algorithm. The enhancements were deployed in AiZynthFinder and have been integrated into the open-source electronic lab notebook, AI4Green, available at https://ai4green.app. A memory-efficient stock file was used to reduce the computational carbon footprint. Both enhancements significantly reduced, by up to 50%, the computational clock-time to solve 1,500 heavy (500 to 800 Da) molecules. The dUCT enhancement increased the number of routes found per molecule for the 1,500 heavy molecules and a 50,000-molecule set from ChEMBL. eUCT and dUCT-v2 solved between 600 and 900 more molecules than the unenhanced MCTS algorithm across the 50,000 molecules. When limited to a 150 second time constraint, dUCT-v1 solved ~5 million more routes to the 50,000 targets than the unenhanced algorithm.
Supplementary weblinks
Title
AI4Green ELN
Description
AI4Green is open-source and released under the AGPL-3.0 license. Full source code, installation instructions and links to our video tutorials and user guides can be found at the associated URL.
Actions
View Title
AI4Green-retrosynthesis
Description
Code and associated data for the MCTS enhancements presented in the paper.
Actions
View