DiSCoVeR: a Materials Discovery Screening Tool for High Performance, Unique Chemical Compositions

Sterling Baird; Tran Diep; Taylor Sparks

doi:10.26434/chemrxiv-2021-5l2f8-v3

Materials Science

Search within Materials Science

DiSCoVeR: a Materials Discovery Screening Tool for High Performance, Unique Chemical Compositions

27 October 2021, Version 3

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

We present Descending from Stochastic Clustering Variance Regression (DiSCoVeR), a Python tool for identifying high-performing, chemically unique compositions relative to existing compounds using a combination of a chemical distance metric, density-aware dimensionality reduction, and clustering. We introduce several new metrics for materials discovery and validate DiSCoVeR on Materials Project bulk moduli using compound-wise and cluster-wise validation methods. We visualize these via multiobjective Pareto front plots and assign a weighted score to each composition where this score encompasses the trade-off between performance and density-based chemical uniqueness. We explore an additional uniqueness proxy related to property gradients in chemical space. We demonstrate that DiSCoVeR can successfully screen materials for both performance and uniqueness in order to extrapolate to new chemical spaces.

Keywords

machine learning

uniform manifold approximation and projection

optimization

earth mover's distance

Wasserstein distance

materials informatics

materials discovery

CrabNet

ElMD

Supplementary weblinks

Title

Description

Actions

Title

DiSCoVeR Codebase

Description

A materials discovery algorithm geared towards exploring high performance candidates in new chemical spaces.

Actions

View

Title

Trained Materials Discovery Python Class

Description

Trained materials discovery Python Discover() class for Materials Project elasticity data. For documentation, see the linked GitHub repository.

Actions

View

Title

Interactive DiSCoVeR Pareto Front Figures

Description

Various figures, both interactive and non-interactive, related to the DiSCoVeR algorithm as applied to compounds and clusters. For more details, see the paper.

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

DiSCoVeR: a materials discovery screening tool for high performance, unique chemical compositions

Sterling G. Baird, Tran Q. Diep, Taylor D. Sparks journal article

Digital Discovery

Online publication date: 2022

Version History

Oct 27, 2021 Version 3

Oct 18, 2021 Version 2

Oct 13, 2021 Version 1

Version Notes

The results have changed slightly and align with the new dataset version on figshare. RobustScaler used instead of MinMaxScaler to reduce effect of outliers. A few minor formatting tweaks (e.g. acknowledgement section).

Metrics

1,569

777

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2021-5l2f8-v3

Funding

National Science Foundation

DMR-1651668

National Science Foundation

DMR-1950589

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

DiSCoVeR: a Materials Discovery Screening Tool for High Performance, Unique Chemical Compositions

Authors

Abstract

Keywords

Supplementary weblinks

Comments

Now Published

Version History

Version Notes

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share