An ecosystem for digital reticular chemistry

01 September 2022, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The space of all plausible materials for a given application is so large that it cannot be explored using a brute-force approach. This is, in particular, the case for reticular chemistry which provides materials designers with a practically infinite playground on different length scales. One promising approach to guide the design and discovery of materials is machine learning, which typically involves learning a mapping of structures onto properties from data. While there have been plenty of examples of the use of machine learning for reticular materials, the progress in the field seems to have stagnated. From our perspective, an important reason is that digital reticular chemistry is still more an art than a science in which many parts are only accessible to experienced groups. The lack of standardization across all the steps of the machine learning pipeline makes it practically impossible to directly compare machine learning models and build on top of prior results. To confront these challenges, we present mofdscribe: a software ecosystem that accompanies—seasoned as well as novice—digital reticular chemists on all steps from ideation to model publication. Our package provides reference datasets (including a completely new one), more than 35 reported as well as completely novel featurization strategies, data splitters, and validation helpers which can be used to benchmark new modeling strategies on standard benchmark tasks and to report the results on a public leaderboard. We envision that this ecosystem allows for a more robust, comparable, and productive area of digital reticular chemistry.

Keywords

MOF
reticular chemistry
machine learning
featurization

Supplementary materials

Title
Description
Actions
Title
Supplementary Material
Description
Supplementary Notes, Tables, and Figures.
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.