Abstract
Crystal polymorphism is an important and fascinating aspect of solid state chemistry with far reaching implications in the pharmaceuticals, agrisciences, nutraceuticals, battery and aviation industries. Late appearing more stable polymorphs have caused numerous issues in the pharmaceutical industry. Experimental polymorph screening can be very expensive and time consuming, and sometimes may miss important low energy polymorphs due to an inability to exhaust all crystallization conditions. In this paper, we report a crystal structure prediction (CSP) method with state of the art accuracy and efficiency, validated on a large and diverse dataset including 65 molecules with 135 experimentally found polymorphic forms. The method combines a novel systematic crystal packing search algorithm and the use of machine learning force fields in a hierarchical crystal energy ranking. Our method not only reproduces all the experimentally known polymorphs, but also suggests new low energy polymorphs yet to be discovered by experiment that might pose potential risks to development of the currently known forms of these compounds. In addition, we also report the prediction results of a blinded study and demonstrate, in two prospective drug development projects, how the method was used to accelerate clinical formulation design and derisk downstream processing. The high accuracy, reliability, and efficiency of our method with large scale validations, a blinded study, and prospective studies position it for routine molecular crystal structure prediction in drug development.
Supplementary materials
Title
Supplementary Information for A Robust Crystal Structure Prediction Method to Support Small Molecule Drug Development with Large Scale Validation and Prospective Studies
Description
Further validation of conformational generation, additional details on the machine learning force field, density functional theory calculations, detailed results on systems discussed in the main text and temperature dependent stability.
Actions
Supplementary weblinks
Title
Ranking results on 65 molecular systems
Description
CIF files giving the crystal structures of low lying structures found by CSP for 65 molecular systems with energies from r2SCAN-D3.
Actions
View