Abstract
Acid dissociation constants (pKa) are widely measured and studied, most typically in water. Comparatively few datasets and models for non-aqueous pKa values exist. In this work, we demonstrate how the pKa in one solvent can be accurately determined using reference data in another solvent, corrected by solvation energy calculations from the COSMO-RS method. We benchmark this approach in ten different solvents, and find that pKa values calculated in six solvents deviate from experimental data on average by less than 1 pKa unit. We observe comparable performance on a more diverse test set including amino acids and drug molecules, with higher error for large molecules. The model performance in four other solvents is worse, with some MAEs exceeding 3 pKa units; we discuss how such errors arise due to both model error and inconsistency in obtaining experimental data. Finally, we demonstrate how this technique can be used to estimate the proton transfer energy between different solvents, and use this to report a value of the proton’s solvation energy in formamide, a quantity that does not have a consensus value in literature.
Supplementary materials
Title
Supporting Information
Description
.pdf describing the data curation and use of reference ensembling for calculating pKas.
Actions