Revealing the Impact of Aggregations in the Graph-based Molecular Machine Learning: Electrostatic Interaction versus Pooling Methods

29 November 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Molecular structures that can be readily represented by graphs comprising constituent atoms (nodes) and their chemical bonds (edges) can also be used as input data for well-known machine learning (ML) models that process this data, such as graph neural networks (GNNs). GNNs showed a reasonable performance in the predicting properties of chemical systems. In typical applications of GNNs to chemistry-related fields, the main objective is to create an optimal molecular representation by aggregating atomic features and pooling features in the graph. In this study, we investigated two different approaches that can possibly generate better molecular representations. First, we created intermolecular edges to predict the photochemical properties of chromophore molecules in the solution. These intermolecular edges were constructed using atomic partial charges, inspired from the fact that electrostatic interaction is the main component of solute-solvent interaction. In the second approach, we investigated the effect of the aggregation and pooling functions. The results showed that intermolecular electrostatic interactions based on ground state charges prevent the GNN model from generating more effective molecular representations. On the contrary, the model demonstrated better performance when the averaging and adding operations were employed in a hybrid manner for aggregation and pooling functions.

Keywords

Graph Neural Network
Aggregation
Pooling

Supplementary materials

Title
Description
Actions
Title
Supporting Information for Revealing the Impact of Aggregations in the Graph-based Molecular Machine Learning: Electrostatic Interaction versus Pooling Methods
Description
Supporting Information which contains set of optimized hyperparameters and additional figures
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.