Assessing the accuracy and efficiency of free energy differences obtained from reweighted flow-based probabilistic generative models

Edgar Olehnovics; Yifei Michelle Liu; Nada Mehio; Ahmad Y Sheikh; Michael Shirts; Matteo Salvalaglio

doi:10.26434/chemrxiv-2024-z9g39

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Assessing the accuracy and efficiency of free energy differences obtained from reweighted flow-based probabilistic generative models

22 April 2024, Version 1

This is not the most recent version. There is a

newer version

of this content available

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Computing free energy differences between metastable states characterized by non-overlapping Boltzmann distributions is often a computationally intensive endeavour, usually requiring chains of intermediate states to connect these metastable states. Targeted free energy perturbation (TFEP) can significantly lower the computational cost of FEP calculations by choosing a set of invertible maps used to directly transform the distributions of interest, achieving the necessary statistically significant overlaps without sampling any intermediate states. Probabilistic generative models (PGMs) based on normalising-flow architectures can make it much easier via machine learning to train invertible maps needed for TFEP. However, the accuracy and applicability of approaches based on empirically learned maps depend crucially on the choice of reweighting method adopted to estimate the free energy differences. In this work, we assess the accuracy, rate of convergence, and data efficiency of different free energy estimators, including exponential averaging, BAR, and MBAR, in reweighting PGMs trained by maximum likelihood on limited amounts of molecular dynamics data sampled only from end-states of interest. We carry out the comparisons on a set of simple but representative case studies, including conformational ensembles of alanine dipeptide and ibuprofen. Our results indicate that BAR and MBAR are both data efficient and robust, even in the presence of significant model overfitting in the generation of invertible maps. This analysis can serve as a stepping stone for the deployment of efficient and quantitatively accurate ML-based FE calculation methods in complex systems.

Keywords

Free energy calculations

probabilistic generative models

Bennet Acceptance Ratio

MBAR

Targeted FEP

Supplementary materials

Title

Description

Actions

Title

Supplementary Materials

Description

Additional figures S1-S5

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Jun 25, 2024 Version 3

Jun 13, 2024 Version 2

Apr 22, 2024 Version 1

Metrics

1,200

662

Views

Downloads

Citations

License

The content is available under CC BY NC 4.0

DOI

10.26434/chemrxiv-2024-z9g39

Funding

UK Research and Innovation

EP/X033139/1

Engineering and Physical Sciences Research Council

EP/R018820/1

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Assessing the accuracy and efficiency of free energy differences obtained from reweighted flow-based probabilistic generative models

Authors

Abstract

Keywords

Supplementary materials

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share