Learning from Failure: Predicting Electronic Structure Calculation Outcomes with Machine Learning Models

Chenru Duan; Jon Paul Janet; Fang Liu; Aditya Nandy; Heather Kulik

doi:10.26434/chemrxiv.7616009.v1

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Learning from Failure: Predicting Electronic Structure Calculation Outcomes with Machine Learning Models

23 January 2019, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

High-throughput computational screening for chemical discovery mandates the automated and unsupervised simulation of thousands of new molecules and materials. In challenging materials spaces, such as open shell transition metal chemistry, characterization requires time-consuming first-principles simulation that often necessitates human intervention. These calculations can frequently lead to a null result, e.g., the calculation does not converge or the molecule does not stay intact during a geometry optimization. To overcome this challenge toward realizing fully automated chemical discovery in transition metal chemistry, we have developed the first machine learning models that predict the likelihood of successful simulation outcomes. We train support vector machine and artificial neural network classifiers to predict simulation outcomes (i.e., geometry optimization result and degree of deviation) for a chosen electronic structure method based on chemical composition. For these static models, we achieve an area under the curve of at least 0.95, minimizing computational time spent on non- productive simulations and therefore enabling efficient chemical space exploration. We introduce a metric of model uncertainty based on the distribution of points in the latent space to systematically improve model prediction confidence. In a complementary approach, we train a convolutional neural network classification model on simulation output electronic and geometric structure time series data. This dynamic model generalizes more readily than the static classifier by becoming more predictive as input simulation length increases. Finally, we describe approaches for using these models to enable autonomous job control in transition metal complex discovery.

Keywords

machine learning

automation

convolutional neural networks

transition metal chemistry

geometry optimizations

Supplementary materials

Title

Description

Actions

Title

SIClassifier v5

Description

Actions

Title

data set

Description

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

Learning from Failure: Predicting Electronic Structure Calculation Outcomes with Machine Learning Models

Chenru Duan, Jon Paul Janet, Fang Liu, Aditya Nandy, Heather J. Kulik journal article

Journal of Chemical Theory and Computation , Volume 15, Issue 4

Online publication date: Mar 12, 2019

Version History

Jan 23, 2019 Version 1

Metrics

4,529

1,378

Views

Downloads

Citations

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv.7616009.v1

Funding

DARPA D18AP00039

Office of Naval Research N00014-17-1-2956

Office of Naval Research N00014-18-1-2434

Author’s competing interest statement

The authors declare no competing financial interest.

Learning from Failure: Predicting Electronic Structure Calculation Outcomes with Machine Learning Models

Authors

Abstract

Keywords

Supplementary materials

Comments

Now Published

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Share