Abstract
Graph neural networks (GNNs) are a natural choice for representing chemical data because of their inherent ability to handle arbitrary input topologies. They avoid the need to convert molecules into fixed-length molecular fingerprints. However, like most deep learning models, GNNs are not readily interpretable, and common explainability methods fail because of the variable input size. We introduce a novel method to interpret the predictions of GNNs based on Myerson values from cooperative game theory. Myerson values are closely related to Shapley values, which have been adapted to explain the predictions of a wide variety of machine learning models. Applying these approaches to GNNs has, however, proven challenging because of the varying size of the input graphs. Our approach treats a GNN as a coalition game and the nodes of the input graph as players. The Myerson value of a node then quantifies its contribution to the model's prediction, with only connected nodes contributing to coalitions. All Myerson values sum to the predicted value of the model, allowing for a simple and intuitive interpretation of the prediction. Because the exact calculation of Myerson values becomes computationally infeasible for large graphs, we have also implemented a scalable approximation based on Monte Carlo sampling. We developed the technique for applications in cheminformatics and drug discovery, but it can be used in any domain in which GNNs are applied. The effectiveness of our approach is validated through successful applications to two proof-of-concept datasets (logP and molecular weight) as well as a real-world dataset of kinase inhibitors, highlighting its broad applicability and its promise for explaining graph-based cheminformatic models.
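For orientation, the following sketch recalls the standard Myerson value from cooperative game theory; the notation ($N$, $v$, $G$, $v^G$) is chosen here for illustration and need not match the symbols used later in the paper. For a game with player set $N$, characteristic function $v$, and communication graph $G$, the Myerson value of player $i$ is the Shapley value of the graph-restricted game $v^G$:
\[
  v^G(S) \;=\; \sum_{C \,\in\, S/G} v(C),
  \qquad
  \mathrm{MV}_i(v, G) \;=\; \sum_{S \subseteq N \setminus \{i\}}
    \frac{|S|!\,\bigl(|N| - |S| - 1\bigr)!}{|N|!}
    \Bigl( v^G\bigl(S \cup \{i\}\bigr) - v^G(S) \Bigr),
\]
where $S/G$ denotes the set of connected components of $S$ in $G$, so that only connected subsets of nodes contribute as coalitions. For a connected graph the Myerson values satisfy $\sum_{i \in N} \mathrm{MV}_i(v, G) = v(N)$, which is the efficiency property, stated in the abstract, that the node contributions add up to the model's prediction.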