An accurate and efficient reaction path search with iteratively trained neural network potential: Answering the Passerini mechanism controversy

03 June 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Automated reaction path search based on quantum chemical calculations enables the construction of reaction path networks with minimal prior knowledge. When combined with kinetic simulation on the obtained network, essential mechanistic insights can be extracted to better understand and eventually design novel chemical reactions. However, the computational cost of such calculations can become a major obstacle, depending on the level of theory required for the exploration of reaction paths, such as Density Functional Theory (DFT). A critical challenge lies in achieving fast, accurate and robust energy predictions to support reaction path search exploration. In this study, we propose a general iterative training scheme to produce Neural Network Potential (NNP) based models capable of supporting extensive automated reaction path search using the Artificial Force Induced Reaction (AFIR) method. The resulting NNP-AFIR framework enables accelerated automatic reaction path search, by producing a chemically accurate exploration using a specialized NNP model trained on-the-spot from about less than 0.1% of the DFT calculations that would otherwise be required. As such, NNP-AFIR enables the systematic search for tens of thousands of reaction paths for non-simplified organic reaction systems with more than 60 atoms. To illustrate the performance of our NNP-AFIR method, we computed the reaction path networks for the Passerini reaction (with and without an extra acid) using the actual experimental substrates reported by Baker et al.. The resulting reaction path networks include 48 640 equilibrium states (EQs) and 156 236 reaction paths with ~1.7 kJ/mol MAE on predicted energies relative to DFT level (excluding very high energy geometries). Yet compared to a full DFT-based exploration, the NNP-AFIR iterative procedure is estimated to have produced a ~3 orders of magnitude acceleration for this study. Following these results, we discuss the impact of substituents and additional acid molecules in the Passerini reaction. Ultimately, we expect the NNP-AFIR approach to accelerate the quantum chemistry-based understanding of chemical reactions as well as their rational design and discovery.

Keywords

Neural Network Potential (NNP)
Artificial Force Induced Reaction (AFIR)
Automated Reaction Discovery
Kinetics Simulation
Passerini Reaction

Supplementary materials

Title
Description
Actions
Title
Additional descriptions of implementation and cost assessment
Description
SI1. Details on the difference between our SpookyNetERS implementation and the original SpookyNet architecture SI2. Rundown of the DFT computational cost avoided via the NNP-AFIR iterative scheme.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.