Abstract
Machine learning models are employed to enhance the speed and provide novel insights in drug discovery due to their demonstrated effectiveness in predicting properties of small molecules like pKa, solubility, and binding affinity. These approaches accelerate drug discovery by helping researchers efficiently identify, prioritize, and optimize compounds. Nonetheless, when investigating properties that depend on the interaction between a ligand and its corresponding protein, a compelling need arises to incorporate the protein counterpart information within the models. Recently, graph neural networks (GNNs) have been developed to incorporate 3D structural information to improve our understanding of the underlying protein-ligand interactions. However, incorporating 3D information into GNNs is not always straightforward. To address the challenge, we introduce a model called InterGraph, which models the protein-ligand interaction as topological multigraphs. By leveraging a topological representation, InterGraph offers a comprehensive approach to a graph representation of the intricate spatial organization and connectivity patterns within protein-ligand systems. We introduce interaction spheres that assign varying edge densities, capturing the proximity-based influence of interactions. This approach enables us to capture the characteristics of the interaction network, filtering out the ones that are beyond 9 Å from the ligand since they are not considered relevant or established. Finally, we trained the model using a ligand binding dataset from PDBbind and tested it on a hold-out test set, achieving an RMSE value of 1.34. Our findings have demonstrated the power of the multigraph to encode the importance of close interactions, a factor that is relevant in the context of binding affinity. On average, our model accurately predicts binding affinity values for several protein-ligand complexes and exhibits higher accuracy for hydrolase, lyase, and families of proteins involved in mediating protein-protein interactions. Additionally, the Intergraph method displayed sensitivity to the binding mode when compared to a set of complexes that had undergone redocking calculations
Supplementary materials
Title
Supporting Informations
Description
The supporting information for the paper "Encoding Protein-Ligand Interactions: Binding Affinity Prediction with Multigraph-based Modeling and Graph Convolutional Network" provides additional details and resources that complement the primary research manuscript. The supporting information comprises key visuals, including K-Fold cross-validation results, probability density distributions, a pie chart depicting successful predictions, and comparison plots. These visuals enhance our understanding of the model's performance and data distribution, adding depth to the research.
Actions
Supplementary weblinks
Title
InterGraph
Description
The GitHub page for this project hosts the code and resources used in developing the predictive models. It provides access to the codebase and model implementations used in the research. This resource supports transparency, reproducibility, and collaboration in the field of binding affinity prediction, making the research methodology accessible to a wider audience and encouraging further advancements in the domain
Actions
View