Marginalized graph kernels have shown competitive performance in molecular machine learning tasks but currently lack measures of interpretability, which are important for improving trust in the models, detecting biases, and informing molecular optimization campaigns. Here, we conceive and implement two interpretability measures for Gaussian process regression using a marginalized graph kernel (GPR-MGK) that quantify (1) the contribution of specific training data to the prediction and (2) the contribution of specific nodes of the graph to the prediction. We demonstrate the applicability of these interpretability measures for molecular property prediction. We compare GPR-MGK to graph neural networks on four logic datasets and find that the atomic attribution of GPR-MGK generally outperforms the atomic attribution of graph neural networks. We also perform a detailed molecular attribution analysis using the FreeSolv dataset, showing how molecules in the training set influence machine learning predictions and why Morgan fingerprints perform poorly on this dataset. This is the first systematic examination of the interpretability of GPR-MGK and thereby an important step in the further maturation of marginalized graph kernel methods for interpretable molecular predictions.
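The first kind of measure, the contribution of individual training molecules, follows naturally from the structure of the GPR mean predictor, which is a weighted sum over training points: y(x*) = Σ_i k(x*, x_i) α_i with α = (K + σ²I)⁻¹ y, so each training point i contributes the term k(x*, x_i) α_i. The sketch below illustrates this decomposition under simplifying assumptions; it uses an RBF kernel on feature vectors as a stand-in for the marginalized graph kernel on molecular graphs, and the function names are hypothetical, not from the paper.

```python
import numpy as np

def rbf_kernel(X, Y, length_scale=1.0):
    """RBF kernel as a placeholder for the marginalized graph kernel."""
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / length_scale**2)

def gpr_training_attribution(X_train, y_train, x_query, noise=1e-2):
    """Decompose the GPR mean prediction into per-training-sample terms.

    The GPR mean is y(x*) = k*^T (K + noise*I)^{-1} y = sum_i k*_i alpha_i,
    so training sample i contributes c_i = k(x*, x_i) * alpha_i, and the
    contributions sum exactly to the prediction.
    """
    K = rbf_kernel(X_train, X_train)
    alpha = np.linalg.solve(K + noise * np.eye(len(X_train)), y_train)
    k_star = rbf_kernel(x_query[None, :], X_train)[0]
    return k_star * alpha  # one contribution per training sample

# Toy usage: which training point dominates the prediction at x* = 1.5?
X = np.array([[0.0], [1.0], [2.0]])
y = np.array([0.0, 1.0, 4.0])
c = gpr_training_attribution(X, y, np.array([1.5]))
prediction = c.sum()  # identical to the standard GPR mean at x*
```

Because the decomposition is exact, ranking the terms c_i directly identifies which training molecules drive a given prediction, which is the kind of molecular attribution analysis the abstract describes for FreeSolv.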
Supplementary Discussion, Tables, and Figures