Using physical property surrogate models to perform multi-fidelity global optimization of force field parameters

Owen Madin; Michael Shirts

doi:10.26434/chemrxiv-2022-7bmzv

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Using physical property surrogate models to perform multi-fidelity global optimization of force field parameters

22 September 2022, Version 1

This is not the most recent version. There is a

newer version

of this content available

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Dispersion-repulsion interactions, commonly represented in atomistic force fields by the Lennard-Jones (LJ) potential, play an important role in the accuracy of molecular simulations. Training the force field parameters used in the LJ potential is challenging, generally requiring adjustment based on simulations of macroscopic physical properties. The computational expense of these simulations limits the size of training data set and number of optimization steps that can be taken, often requiring modelers to perform optimizations within a local parameter region. To allow for global LJ parameter optimization against large training sets, we introduce a multi-fidelity optimization technique which uses Gaussian process surrogate modeling to build inexpensive models of physical properties as a function of LJ parameters. This allows for fast evaluation of objective functions, greatly accelerating searches over parameter space. We use an iterative framework which performs global optimization at the surrogate level, followed by validation at the simulation level and surrogate refinement. Using this technique on two previously studied training sets, containing up to 195 physical property targets, we refit a subset of the LJ parameters for the OpenFF 1.0.0 ``Parsley'' force field. We demonstrate that this multi-fidelity technique can find improved parameter sets compared to a purely simulation-based optimization by searching more broadly and escaping local minima. In most cases, these parameter sets are transferable to other similar molecules in a test set. This multi-fidelity technique provides a platform for fast optimization against physical properties that can be refined and applied in multiple ways to the development of molecular models.

Keywords

Supplementary materials

Title

Description

Actions

Title

Supplementary Information for "Using physical property surrogate models to perform multi-fidelity global optimization of force field parameters"

Description

Full description of training sets, performance metrics on training sets, performance metrics on test set.

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Dec 09, 2022 Version 2

Sep 22, 2022 Version 1

Metrics

1,197

437

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2022-7bmzv

Funding

National Institutes of Health

R01GM132386

Author’s competing interest statement

MRS is an Open Science Fellow for Roivant Sciences.

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Using physical property surrogate models to perform multi-fidelity global optimization of force field parameters

Authors

Abstract

Keywords

Supplementary materials

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share