Data-driven discovery of organic electronic materials enabled by hybrid top-down/bottom-up design

J. Terence Blaskovits; Ruben Laplaza; Sergi Vela; Clémence Corminboeuf

doi:10.26434/chemrxiv-2022-88t32

Materials Chemistry

05 December 2022, Version 1

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

The high-throughput molecular exploration and screening of organic electronic materials often starts with either a 'top-down' mining of existing repositories or the 'bottom-up' assembly of fragments based on predetermined rules and known synthetic templates. In both instances, the datasets used are often produced on a case-by-case basis and require the high-quality computation of electronic properties and extensive user input: curation in the top-down approach, and the construction of a fragment library and introduction of rules for linking the fragments in the bottom-up approach. Both approaches are time-consuming and require significant computational resources. Here, we generate a top-down set of 117K synthesized molecules containing their optimized structures, associated electronic and topological properties and chemical composition, and use these structures as a vast library of molecular building blocks for bottom-up fragment-based materials design. A tool is developed to automate the coupling of these building block units based on their available C(sp2/sp)-H bonds, thus providing a fundamental link between the two philosophies of dataset construction. Statistical models are trained on this dataset and a subset of the resulting hybrid top-down/bottom-up compounds; these models enable on-the-fly prediction of key ground state (frontier molecular orbital gaps) and excited state (S1 and T1 energies) properties from molecular geometries with high accuracy across all known p-block organic compound space. With ab initio-quality optical properties in hand, it is possible to apply this bottom-up pipeline, using existing compounds as molecular building blocks, to any materials design campaign. To illustrate this, we construct and screen over a million molecular candidates for efficient intramolecular singlet fission, the leading candidates of which provide insight into the structural features that may promote this multiexciton-generating process.
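The coupling step described in the abstract, joining building blocks at their available C(sp2/sp)-H bonds, can be illustrated with a small sketch. This is not the authors' crosscoupler tool: the dictionary-based molecular graph, the function names `coupling_sites` and `enumerate_couplings`, and the neighbour-count heuristic for hybridization are all assumptions made for illustration only.

```python
from itertools import product

# Toy molecular graph: atom index -> (element, list of bonded atom indices).
# Hypothetical representation for illustration; not the crosscoupler data model.


def coupling_sites(atoms):
    """Return indices of carbons that look like C(sp2)-H or C(sp)-H sites.

    Heuristic (an assumption): a neutral carbon with three neighbours is
    treated as sp2, one with two neighbours as sp, and the carbon must
    carry at least one hydrogen.
    """
    sites = []
    for idx, (element, neighbours) in atoms.items():
        if element != "C":
            continue
        has_h = any(atoms[n][0] == "H" for n in neighbours)
        if has_h and len(neighbours) in (2, 3):
            sites.append(idx)
    return sorted(sites)


def enumerate_couplings(frag_a, frag_b):
    """List every (site in A, site in B) pair at which a new C-C bond
    could formally replace two C-H bonds."""
    return list(product(coupling_sites(frag_a), coupling_sites(frag_b)))


# Ethylene: both carbons have three neighbours and carry hydrogens.
ethylene = {
    0: ("C", [1, 2, 3]),
    1: ("C", [0, 4, 5]),
    2: ("H", [0]), 3: ("H", [0]), 4: ("H", [1]), 5: ("H", [1]),
}

# Propyne: atom 0 is a terminal alkyne C-H, atom 1 has no hydrogen,
# atom 2 is a four-neighbour (sp3) methyl carbon and is excluded.
propyne = {
    0: ("C", [1, 3]),
    1: ("C", [0, 2]),
    2: ("C", [1, 4, 5, 6]),
    3: ("H", [0]),
    4: ("H", [2]), 5: ("H", [2]), 6: ("H", [2]),
}

pairs = enumerate_couplings(ethylene, propyne)
print(pairs)
```

In practice a cheminformatics toolkit would assign hybridization from a real structure rather than from neighbour counts; the sketch only shows how enumerating eligible C-H sites on two fragments yields the set of candidate coupled products.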

Keywords

donor-acceptor materials

building blocks

high-throughput virtual screening

singlet fission

statistical models

Supplementary materials

Supporting Information: computational details; dataset generation and curation; method benchmarking; details on crosscoupler tool, substructure search, and statistical models; learning curves; correlation plots; validation of property prediction

Supplementary weblinks

Materials Cloud repository: CSVs and interactive plots of data reported in this work.

GitHub repository: crosscoupler tool and scripts for XGBoost models

Now Published

Data‐Driven Discovery of Organic Electronic Materials Enabled by Hybrid Top‐Down/Bottom‐Up Design

J. Terence Blaskovits, Ruben Laplaza, Sergi Vela, Clémence Corminboeuf

Journal article, Advanced Materials

Online publication date: Oct 10, 2023

Version History

Dec 05, 2022 Version 1

Metrics

Views: 1,623

Downloads: 728

License

The content is available under CC BY NC ND 4.0

Funding

École polytechnique fédérale de Lausanne

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
