ChemRxiv
These are preliminary reports that have not been peer-reviewed. They should not be regarded as conclusive, guide clinical practice/health-related behavior, or be reported in news media as established information. For more information, please see our FAQs.

Multi-fidelity Statistical Machine Learning for Molecular Crystal Structure Prediction

preprint
submitted on 01.06.2020 and posted on 03.06.2020 by Olga Egorova, Roohollah Hafizi, David C. Woods, Graeme Day
The prediction of crystal structures from first principles requires highly accurate energies for large numbers of putative crystal structures. The accuracy of solid state density functional theory (DFT) calculations is often required, but hundreds or more structures can be present in the low energy region of interest, so the associated computational costs are prohibitive. Here, we apply statistical machine learning to predict expensive hybrid functional DFT (PBE0) calculations, using a multi-fidelity approach to re-evaluate the energies of crystal structures predicted with an inexpensive force field. The method uses an autoregressive Gaussian process, making use of less expensive GGA DFT (PBE) calculations to bridge the gap between the force field and PBE0 energies. The method is benchmarked on the crystal structure landscapes of three small, hydrogen bonding organic molecules and is shown to produce accurate predictions of energies and crystal structure ranking using small numbers of the most expensive calculations; the PBE0 energies can be predicted with errors of less than 1 kJ/mol at between 4.2% and 6.8% of the cost of the full calculations. As the model that we have developed is probabilistic, we discuss how the uncertainties in predicted energies affect the assessment of the energetic ranking of crystal structures.
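
As a rough illustration of the multi-fidelity idea in the abstract, a three-level autoregressive Gaussian process in the standard Kennedy-O'Hagan form could be written as below. This is a sketch under stated assumptions and not necessarily the exact model used in the preprint: the scaling factors $\rho_1, \rho_2$, the discrepancy terms $\delta_1, \delta_2$ with means $\mu_i$ and kernels $k_i$, and the structure descriptor $x$ are symbols introduced here for illustration, and the force field energy $E_{\mathrm{FF}}(x)$ is assumed to be available for every structure.

```latex
% Assumed standard autoregressive (Kennedy-O'Hagan) form; all symbols are illustrative.
\begin{align}
E_{\mathrm{PBE}}(x)  &= \rho_1 \, E_{\mathrm{FF}}(x)  + \delta_1(x),
  & \delta_1 &\sim \mathcal{GP}\big(\mu_1,\, k_1(x, x')\big), \\
E_{\mathrm{PBE0}}(x) &= \rho_2 \, E_{\mathrm{PBE}}(x) + \delta_2(x),
  & \delta_2 &\sim \mathcal{GP}\big(\mu_2,\, k_2(x, x')\big).
\end{align}
```

Under this form, each level only has to learn a scaling and a correction to the cheaper level beneath it, which is one way to understand why a small number of PBE0 calculations might suffice, and the Gaussian process posterior supplies the predictive uncertainties mentioned in the abstract.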

Funding

Active Learning for Computational Polymorph Landscape Analysis

Engineering and Physical Sciences Research Council


Email Address of Submitting Author

G.M.Day@soton.ac.uk

Institution

University of Southampton

Country

United Kingdom

ORCID For Submitting Author

0000-0001-8396-2771

Declaration of Conflict of Interest

no conflict of interest
