Abstract
Machine learning models are increasingly harnessed to expedite drug discovery by effectively predicting the properties of small molecules, such as pKa, solubility, and binding affinity. While these models offer accelerated compound identification and optimization, the challenge arises when studying properties influenced by the interaction between a ligand and its corresponding protein. In response to this challenge, graph neural networks (GNNs) have emerged as a solution for integrating 3D structural data to enhance our understanding of protein-ligand interactions. Our novel model, InterGraph, introduces a unique approach to modeling protein-ligand interactions as topological multigraphs. This approach provides a comprehensive representation of the intricate spatial organization and connectivity patterns within these systems. By incorporating "interaction spheres" with varying edge densities, we capture the proximity-based influence of interactions, filtering out those occurring beyond 9 Å from the ligand, which are deemed irrelevant. Our model was trained using a ligand binding dataset from PDBbind and achieved an RMSE value of 1.34 when tested on a hold-out dataset. Our results highlight the efficacy of the multigraph in encoding the significance of close interactions, a critical factor in understanding binding affinity. On average, our model accurately predicts binding affinity values for various protein-ligand complexes and demonstrates heightened accuracy for hydrolase, lyase, and families of proteins involved in mediating protein-protein interactions. Furthermore, when compared to a set of complexes that underwent redocking calculations, the InterGraph method displayed sensitivity to the binding mode. In the context of drug discovery and development, protein-ligand interactions are pivotal in numerous biological processes. Understanding how these interactions affect biological processes is central to advancing pharmaceutical research. Binding affinity, a thermodynamic property characterizing the strength of such interactions, offers invaluable insights into these intricate processes.
Supplementary materials
Title
Supporting Informations
Description
The supporting information for the paper "Encoding Protein-Ligand Interactions: Binding Affinity Prediction with Multigraph-based Modeling and Graph Convolutional Network" provides additional details and resources that complement the primary research manuscript. The supporting information comprises key visuals, including K-Fold cross-validation results, probability density distributions, a pie chart depicting successful predictions, and comparison plots. These visuals enhance our understanding of the model's performance and data distribution, adding depth to the research.
Actions
Supplementary weblinks
Title
InterGraph
Description
The GitHub page for this project hosts the code and resources used in developing the predictive models. It provides access to the codebase and model implementations used in the research. This resource supports transparency, reproducibility, and collaboration in the field of binding affinity prediction, making the research methodology accessible to a wider audience and encouraging further advancements in the domain
Actions
View