Detecting polymorphism in molecular crystals from single-molecule properties

26 May 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Understanding the crystallization of molecules, and specifically the appearance of polymorphs, is a great challenge to modern chemistry, with both fundamental and practical aspects. Here, motivated by the proven ability of Machine-Learning (ML) algorithms to perform classification tasks, we harness ML-based tools and existing chemical datasets to ask the following question: can the existence of polymorphs of a molecular crystal be predicted based solely on properties of the single molecule. We find that our algorithm can predict the existence of polymorphism with an average accuracy of 65% at best, not enough to generate a reliable “polymorph predicting” engine. Moreover, our results imply that the number of polymorphs is much larger than that reported in the literature. We suggest two possible reasons for the poor performance, both providing important insights. The first is that the data is inherently biased towards mono-morphs, as many polymorphs are reported as mono-morphs, not because they indeed have only one stable crystal structure, but rather because only on crystal form has been observed in experiments. The second reason is even more profound, suggesting that the many-body nature of crystallization limits the possibility of predicting crystal properties based solely on single-molecule characteristics.

Keywords

Polymorphs
Crystallography
Classification
Machine Learning

Supplementary materials

Title
Description
Actions
Title
Supporting information
Description
Methods and supplementary plots
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.