Machine Learned Potentials by Active Learning from Organic Crystal Structure Prediction Landscapes

Patrick W. V. Butler; Roohollah Hafizi; Graeme Day

doi:10.26434/chemrxiv-2023-97rmb

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Machine Learned Potentials by Active Learning from Organic Crystal Structure Prediction Landscapes

27 October 2023, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

A primary challenge in organic molecular crystal structure prediction (CSP) is accurately ranking the energies of potential structures. While high-level solid-state density functional theory (DFT) methods allow for mostly reliable discrimination of the low energy structures, their high computational cost is problematic because of the need to evaluate tens to hundreds of thousands of trial crystal structures to fully explore typical crystal energy landscapes. Consequently, lower-cost but less accurate empirical force fields are often used, sometimes as the first stage of a hierarchical scheme involving multiple stages of increasingly accurate energy calculations. Machine learned potentials (MLPs), trained to reproduce the results of ab initio methods with computational cost close to that of force fields, can improve the efficiency of CSP by reducing or eliminating the need for costly DFT calculations at the final stages of CSP. Here, we investigate active learning methods for training MLPs with CSP datasets. The combination of active learning with the well-developed sampling methods from CSP yields potentials in a highly automated workflow that are relevant over a wide range of the crystal packing space. To demonstrate these potentials, we illustrate efficiently re-ranking large, diverse crystal structure landscapes to near-DFT accuracy from force field-based CSP, improving the reliability of the final energy ranking. Furthermore, we demonstrate how these potentials can be extended to more accurately model structures far from lattice energy minima through additional on-the-fly training within Monte Carlo simulations.

Keywords

crystal structure prediction

machine learning

active learning

Supplementary materials

Title

Description

Actions

Title

supporting methods and results

Description

Further method descriptions, hyperparameter testing and additional results of active learning.

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Oct 27, 2023 Version 1

Metrics

1,343

705

Views

Downloads

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2023-97rmb

Funding

Engineering and Physical Sciences Research Council

EP/V026887/1

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Machine Learned Potentials by Active Learning from Organic Crystal Structure Prediction Landscapes

Authors

Abstract

Keywords

Supplementary materials

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share