Employing the active learning strategy to construct full-dimensional  intermolecular potential energy surfaces within spectroscopic accuracy

You Li; Xiao-Long Zhang; Hui Li

doi:10.26434/chemrxiv-2024-shw8x

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Employing the active learning strategy to construct full-dimensional intermolecular potential energy surfaces within spectroscopic accuracy

18 December 2024, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

In this work, we employed an uncertainty-driven active learning strategy to achieve highly efficient point sampling for full-dimension potential energy surface constructions. The model uncertainty is defined as the weighted square energy difference between two neural network (NN) models trained with the same dataset, and the local maximums of uncertainty would be added into the training set by two criteria. A two-step sampling procedure was introduced to reduce the computational costs of expansive double-precision neural network training. The 6-D H$_2$O-He system was chosen as the test system. A reference PES was constructed firstly by the newly developed MLRNet model with a weighted RMSE of 0.028 cm$^{-1}$, where the full-dimension long-range function was fitted by a pruned basis expansion method. Our tests demonstrate that it is also reliable for the long-range switched fundamental invariant neural network (LS-FI-NN) to construct spectroscopically accurate PES, however, it is less inefficient for the newly developed MLRNet model. For the first single-precision sampling, the LS-FI-NN only requires 472 fitting points to achieve a weighted-RMSE of 0.3253 cm$^{-1}$ for 47945 test points. In comparison, the MLRNet requires 652 points to reach a similar accuracy. Notably, the MLRNet demonstrated lower training errors across all sampling cycles and lower test errors in the first few cycles with less trainable parameters, which indicates its potential with an appropriate sampling procedure. For the second double-precision sampling, the LS-FI-NN achieved a test RMSD of 0.0710 cm$^{-1}$ with only 613 points, while the MLRNet can't converge to a given threshold for tens of iterations. The spectroscopic calculations were performed to further validate the accuracy of these PESs. The energy levels of the double precision LS-FI-NN showed great agreement with the reference PES's results, with only 0.0161 cm$^{-1}$ and 0.0044 cm$^{-1}$ average errors for vibrational levels and the band origin shifts.

Keywords

Active learning

potential energy surfaces

point sampling

vdW dimer

ro-vibrational spectroscopy

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Dec 18, 2024 Version 1

Metrics

311

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2024-shw8x

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content