Synthesis Route Identification and Prioritization in Reaction Knowledge Graphs

06 June 2025, Version 1

Abstract

Identifying synthesis routes from knowledge graphs poses challenges beyond retrosynthesis, including path–finding artifacts and data issues. We introduce “SynGPS”, a novel algorithm that overcomes these limitations by identifying viable routes even with common artifacts. SynGPS can resolve nonsensical cycles, disconnect misleading links to starting materials, and remove ambiguous reactions, relying solely on topological heuristics for flexible scoring. We also present the Backtracking–Oriented Yield Aggregation (BOYA) algorithm, a new molar ratio–based method for calculating synthesis yield that addresses the molecular weight biases of existing weight-based approaches. Case studies demonstrate the effectiveness of SynGPS and BOYA algorithms, and we provide a rigorous theoretical framework that can facilitate the comparison of existing and future methods in the field of computer—aided synthesis planning (CASP).

Keywords

SynGPS
synthesis route
search
CASP
graph theory
BOYA
aggregated yield
chemistry automation
knowledge graph

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.