Expanded ensemble predictions of toluene--water partition coefficients in the SAMPL9 LogP challenge

23 January 2025, Version 2
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The logarithm of the partition coefficient (logP) between water and a nonpolar solvent is useful for characterizing a small molecule's hydrophobicity. For example, the water-octanol logP is often used as a predictor of a drug’s lipophilicity and/or membrane permeability, good indicators of its bioavailability. Existing computational predictors of water-octanol logP are generally very accurate due to the wealth of experimental measurements, but may be less so for other non-polar solvents such as toluene. In this work, we participate in a Statistical Assessment of the Modeling of Proteins and Ligands (SAMPL) logP challenge to examine the accuracy of a molecular simulation-based absolute free energy approach to predict water-toluene logP in a blind test for sixteen drug-like compounds with acid-base properties. Our simulation workflow used the OpenFF 2.0.0 force field, and an expanded ensemble (EE) method for free energy estimation, which enables efficient parallelization over multiple distributed computing clients for enhanced sampling. The EE method uses Wang-Landau flat-histogram sampling to estimate the free energy of decoupling in each solvent, and can be performed in a single simulation. Our protocol also includes a step to optimize the schedule of alchemical intermediates in each decoupling. The results show that our EE workflow is able to accurately predict free energies of transfer, achieving an RMSD of 2.26 kcal/mol (1.65 logP units), and $R^2$ of 0.80. An examination of outliers suggests that improved force field parameters could achieve better accuracy. Overall, our results suggest that expanded ensemble free energy calculations provide reasonably accurate logP prediction for a general-purpose force field.

Keywords

free energy methods
computational drug discovery

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.