Taming Tautomerism in Organic Crystal Structure Prediction

19 May 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Tautomerism influences the solubility, stability, efficacy, and even toxicity of drug formulations. Computational solid-form screening is increasingly being used to help identify effective drug formulations and de-risk against undesirable crystal forms. Here, however, we demonstrate that widely-used density functionals predict tautomeric crystal polymorph energetics poorly, largely due to density-driven delocalization error. We first show how these DFT models incorrectly identify the preferred tautomeric polymorphs of 2-thiobarituric acid and the antihelmintic drug mebendazole, and that some functionals substantially underestimate the barrier to solid-state tautomerization. Surveying 19 additional examples, we find that tautomeric polymorph energy differences defy conventional wisdom by regularly and substantially exceeding the 10 kJ/mol energy window associated with crystal polymorphism. Moreover, the DFT errors in the polymorph energy differences are strikingly large: they can reach tens of kJ/mol, and they produce the wrong qualitative stability ordering in ~20-30% of the cases. We address the challenge of predicting tautomeric polymorphs through a general solution that combines periodic hybrid DFT with intramolecular coupled-cluster theory corrections to substantially improve agreement with experiment. Most notably, our refined models reveal that the experimentally reported crystal structure of mebendazole form B features the incorrect tautomer--a prediction that is confirmed via solid-state NMR analysis.

Keywords

Crystal Structure Prediction
Polymorphism
Pharmaceuticals
Desmotropy
Delocalization Error

Supplementary materials

Title
Description
Actions
Title
Supporting Information - Computational Methods and Additional Data (PDF)
Description
Computational details and additional figures and data tables.
Actions
Title
Crystal structures of 2-thiobarbituric acid (CIF)
Description
DFT-predicted crystal structures for the crystal energy landscape of 2-thiobarbituric acid.
Actions
Title
Crystal structures of mebendazole (CIF)
Description
DFT-optimized crystal structures of the mebendazole polymorphs.
Actions
Title
Crystal structures of from the survey of tautomeric polymorphs (CIF)
Description
DFT-optimized crystal structures for the 19 sets of tautomeric polymorphs surveyed.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.