Multi-fidelity prediction of molecular optical peaks with deep learning

Kevin Greenman; William Green; Rafael Gómez-Bombarelli

doi:10.26434/chemrxiv-2021-6d2bp

Theoretical and Computational Chemistry

15 October 2021, Version 1

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Optical properties are central to molecular design for many applications, including solar cells and biomedical imaging. A variety of ab initio and statistical methods have been developed for their prediction, each with a trade-off between accuracy, generality, and cost. Existing theoretical methods such as time-dependent density functional theory (TD-DFT) are generalizable across chemical space because of their robust physics-based foundations but still exhibit random and systematic errors with respect to experiment despite their high computational cost. Statistical methods can achieve high accuracy at a lower cost, but data sparsity and unoptimized molecule and solvent representations often limit their ability to generalize. Here, we use directed message passing neural networks (D-MPNNs) to represent both dye molecules and solvents for predictions of molecular absorption peaks in solution. Additionally, we demonstrate a multi-fidelity approach based on an auxiliary model trained on over 28,000 TD-DFT calculations that further improves accuracy and generalizability, as shown through rigorous splitting strategies. Combining several openly available experimental datasets, we benchmark these methods against a state-of-the-art regression tree algorithm and compare the D-MPNN solvent representation to several alternatives. Finally, we explore the interpretability of the learned representations using dimensionality reduction and evaluate the use of ensemble variance as an estimator of the epistemic uncertainty in our predictions of molecular peak absorption in solution. The prediction methods proposed herein can be integrated with active learning, generative modeling, and experimental workflows to enable more rapid design of molecules with targeted optical properties.
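
The ensemble-variance idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' UVVisML code: the function name `ensemble_summary`, the use of population variance, and the numbers are all hypothetical, chosen only to show how the spread across independently trained models might serve as a rough proxy for epistemic uncertainty.

```python
from statistics import mean, pvariance


def ensemble_summary(member_predictions):
    """Return per-molecule (means, variances) across ensemble members.

    member_predictions: one inner list per ensemble member, each holding
    predicted absorption peak wavelengths (nm) for the same molecules,
    in the same order.
    """
    if not member_predictions:
        raise ValueError("need at least one ensemble member")
    n_molecules = len(member_predictions[0])
    if any(len(p) != n_molecules for p in member_predictions):
        raise ValueError("all members must predict the same molecules")

    # Transpose so each tuple holds every member's prediction for one molecule.
    per_molecule = list(zip(*member_predictions))
    means = [mean(values) for values in per_molecule]
    # Population variance across members (an assumption of this sketch).
    variances = [pvariance(values) for values in per_molecule]
    return means, variances


# Hypothetical predictions (nm) from three ensemble members
# for two dye/solvent pairs.
preds = [
    [500.0, 410.0],
    [504.0, 410.0],
    [496.0, 410.0],
]
means, variances = ensemble_summary(preds)
```

In this toy case the members disagree on the first pair and agree exactly on the second, so only the first receives a nonzero variance. Whether such variance tracks the true prediction error is the question the abstract says the authors evaluate.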

Keywords

machine learning

absorption spectra

deep learning

photophysical properties

multi-fidelity

Supplementary weblinks

Title: UVVisML Code for Predicting Molecular Optical Spectra

Description: GitHub repo with code and trained models for predicting the optical properties of molecules.

Now Published

Multi-fidelity prediction of molecular optical peaks with deep learning

Kevin P. Greenman, William H. Green, Rafael Gómez-Bombarelli (journal article)

Chemical Science, Volume 13, Issue 4

Online publication date: 2022

Version History

Oct 15, 2021 Version 1

Metrics

Views: 2,259

Downloads: 582

License

The content is available under CC BY 4.0

Funding

National Science Foundation: 1745302

DARPA Accelerated Molecular Discovery: HR00111920025

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
