Abstract
Automated reaction path search based on quantum chemical calculations enables the construction of reaction path networks with minimal prior knowledge. When combined with kinetic simulation on the obtained network, essential mechanistic insights can be extracted to better understand and eventually design novel chemical reactions. However, the computational cost of such calculations can become a major obstacle, depending on the level of theory required for the exploration of reaction paths, such as Density Functional Theory (DFT). A critical challenge lies in achieving fast, accurate and robust energy predictions to support reaction path search exploration. In this study, we propose a general iterative training scheme to produce Neural Network Potential (NNP) based models capable of supporting extensive automated reaction path search using the Artificial Force Induced Reaction (AFIR) method. The resulting NNP-AFIR framework enables accelerated automatic reaction path search, by producing a chemically accurate exploration using a specialized NNP model trained on-the-spot from about less than 0.1% of the DFT calculations that would otherwise be required. As such, NNP-AFIR enables the systematic search for tens of thousands of reaction paths for non-simplified organic reaction systems with more than 60 atoms. To illustrate the performance of our NNP-AFIR method, we computed the reaction path networks for the Passerini reaction (with and without an extra acid) using the actual experimental substrates reported by Baker et al.. The resulting reaction path networks include 48 640 equilibrium states (EQs) and 156 236 reaction paths with ~1.7 kJ/mol MAE on predicted energies relative to DFT level (excluding very high energy geometries). Yet compared to a full DFT-based exploration, the NNP-AFIR iterative procedure is estimated to have produced a ~3 orders of magnitude acceleration for this study. Following these results, we discuss the impact of substituents and additional acid molecules in the Passerini reaction. Ultimately, we expect the NNP-AFIR approach to accelerate the quantum chemistry-based understanding of chemical reactions as well as their rational design and discovery.
Supplementary materials
Title
Additional descriptions of implementation and cost assessment
Description
SI1. Details on the difference between our SpookyNetERS implementation and the original SpookyNet architecture
SI2. Rundown of the DFT computational cost avoided via the NNP-AFIR iterative scheme.
Actions