Abstract
Organic synthesis, central to modern chemistry, relies heavily on precise understanding of reaction kinetics, where accurate transition state structures and energies are essential. While density functional theory (DFT) remains the mainstream method for transition state searches, inherent trade-offs between accuracy and computational cost persist. To bridge this gap, DeePEST-OS—a generic machine learning potential integrating Δ-learning with a high-order equivariant message passing neural network—is developed to enable rapid and precise transition state searches for organic synthesis. The challenge of data scarcity is addressed through the establishment of a novel reaction database containing ~75,000 DFT-calculated transition states for model training. DeePEST-OS rapidly predicts potential energy surfaces along intrinsic reaction coordinate pathways, achieving speeds nearly three orders of magnitude faster than rigorous DFT computations. High accuracy is simultaneously maintained, exhibiting a root mean square deviation of 0.14 Å for transition states geometries and a mean absolute error of 0.64 kcal/mol for reaction barriers across 1,000 external test reactions—representing significant improvement over semi-empirical quantum chemistry methods. Comparative analysis against the state-of-the-art React-OT model further highlights the superior precision and computational efficiency of DeePEST-OS. A case study involving the retrosynthesis of the drug Zatosetron is also presented to demonstrate the practical utility of DeePEST-OS in accelerating exploration of complex reaction networks.
Supplementary materials
Title
Supporting materials
Description
The folder has three subfolders and a Supporting Information file. Supporting Information includes supplemental figures and tables. Cross-dataset validation of DeePEST-OS contains all geometries from that section. Conformational isomer and multi-step organic reactions include geometries from their respective sections in the main text.
Actions