Bayesian optimization of nanoporous materials

Aryan Deshwal; Cory Simon; Janardhan Rao Doppa

doi:10.26434/chemrxiv-2021-4624n-v2

Materials Science

Search within Materials Science

Bayesian optimization of nanoporous materials

12 July 2021, Version 2

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Nanoporous materials (NPMs) could be used to store, capture, and sense many different gases. Given an adsorption task, we often wish to search a library of NPMs for the one with the optimal adsorption property. The high cost of NPM synthesis and gas adsorption measurements, whether these experiments are in the lab or in a simulation, often precludes exhaustive search. We explain, demonstrate, and advocate Bayesian optimization (BO) to actively search for the optimal NPM in a library of NPMs-- and find it using the fewest experiments. The two ingredients of BO are a surrogate model and an acquisition function. The surrogate model is a probabilistic model reflecting our beliefs about the NPM-structure--property relationship based on observations from past experiments. The acquisition function uses the surrogate model to score each NPM according to the utility of picking it for the next experiment. It balances two competing goals: (a) exploitation of our current approximation of the structure-property relationship to pick the highest-performing NPM, and (b) exploration of blind spots in the NPM space to pick an NPM we are uncertain about, to improve our approximation of the structure-property relationship. We demonstrate BO by searching an open database of ~70,000 hypothetical covalent organic frameworks (COFs) for the COF with the highest simulated methane deliverable capacity. BO finds the optimal COF and acquires 30% of the top 100 highest-ranked COFs after evaluating only ~120 COFs. More, BO searches more efficiently than evolutionary and one-shot supervised machine learning approaches.

Keywords

bayesian optimization

COFs

machine learning

Supplementary weblinks

Title

Description

Actions

Title

Github repo with the code

Description

code to reproduce our results.

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

Bayesian optimization of nanoporous materials

Aryan Deshwal, Cory M. Simon, Janardhan Rao Doppa journal article

Molecular Systems Design & Engineering , Volume 6, Issue 12

Online publication date: 2021

Version History

Jul 12, 2021 Version 2

Jun 23, 2021 Version 1

Version Notes

- added new section illustrating exploration/exploitation balance by using EI, max y, and max sigma as acquisition functions - for evolutionary search, when a new acquired point in feature space is asked for, we search for the closest COF in the database *not in the acquired set*. [before, the evolutionary search was picking the same COF over and over, and this was counted towards a COF evaluation]. we also clarify this in the text now. - normalize outputs in BO *only based on the training/already-acquired observations* - random forest: now budget of evaluations is used for 50% explore, 50% exploit. also the proper number of training data is used for the search efficiency plot, so now RF is better than the random search, consistent with intuition. - refactored and commented code in Jupyter Notebooks

Metrics

2,200

1,890

Views

Downloads

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2021-4624n-v2

Funding

National Science Foundation

IIS-1845922

National Science Foundation

OAC-1910213

National Science Foundation

1920945

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Bayesian optimization of nanoporous materials

Authors

Abstract

Keywords

Supplementary weblinks

Comments

Now Published

Version History

Version Notes

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share