Encoding Protein-Ligand Interactions: Binding Affinity Prediction with Multigraph-based Modeling and Graph Convolutional Network

26 October 2023, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine learning models are increasingly harnessed to expedite drug discovery by effectively predicting the properties of small molecules, such as pKa, solubility, and binding affinity. While these models offer accelerated compound identification and optimization, the challenge arises when studying properties influenced by the interaction between a ligand and its corresponding protein. In response to this challenge, graph neural networks (GNNs) have emerged as a solution for integrating 3D structural data to enhance our understanding of protein-ligand interactions. Our novel model, InterGraph, introduces a unique approach to modeling protein-ligand interactions as topological multigraphs. This approach provides a comprehensive representation of the intricate spatial organization and connectivity patterns within these systems. By incorporating "interaction spheres" with varying edge densities, we capture the proximity-based influence of interactions, filtering out those occurring beyond 9 Å from the ligand, which are deemed irrelevant. Our model was trained using a ligand binding dataset from PDBbind and achieved an RMSE value of 1.34 when tested on a hold-out dataset. Our results highlight the efficacy of the multigraph in encoding the significance of close interactions, a critical factor in understanding binding affinity. On average, our model accurately predicts binding affinity values for various protein-ligand complexes and demonstrates heightened accuracy for hydrolase, lyase, and families of proteins involved in mediating protein-protein interactions. Furthermore, when compared to a set of complexes that underwent redocking calculations, the InterGraph method displayed sensitivity to the binding mode. In the context of drug discovery and development, protein-ligand interactions are pivotal in numerous biological processes. Understanding how these interactions affect biological processes is central to advancing pharmaceutical research. Binding affinity, a thermodynamic property characterizing the strength of such interactions, offers invaluable insights into these intricate processes.

Keywords

GCN
Binding affinity
Multigraph
Protein-ligand binding

Supplementary materials

Title
Description
Actions
Title
Supporting Informations
Description
The supporting information for the paper "Encoding Protein-Ligand Interactions: Binding Affinity Prediction with Multigraph-based Modeling and Graph Convolutional Network" provides additional details and resources that complement the primary research manuscript. The supporting information comprises key visuals, including K-Fold cross-validation results, probability density distributions, a pie chart depicting successful predictions, and comparison plots. These visuals enhance our understanding of the model's performance and data distribution, adding depth to the research.
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.