HydraScreen: A Generalizable Structure-Based Deep Learning Approach to Drug Discovery

06 October 2023, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

We propose HydraScreen, a deep-learning approach that aims to provide a framework for more robust machine-learning-accelerated drug discovery. HydraScreen utilizes a state-of-the-art 3D convolutional neural network, designed for the effective representation of molecular structures and interactions in protein-ligand binding. We design an end-to-end pipeline for high-throughput screening and lead optimization, targeting applications in structure-based drug design. We assess our approach using established public benchmarks based on the CASF 2016 core set, achieving top-tier results in affinity and pose prediction (Pearson's r = 0.86, RMSE = 1.15, Top-1 = 0.95). Furthermore, we present a novel interaction profiling approach to identify potential biases in the model and dataset to boost interpretability and support the unbiased nature of our method. Finally, we showcase HydraScreen's capacity to generalize across unseen proteins and ligands, offering directions for future development of robust machine learning scoring functions. HydraScreen, accessible at https://hydrascreen.ro5.ai, provides a user-friendly GUI and a public API, facilitating easy assessment of individual protein–ligand complexes.

Keywords

Strucure-based drug discovery
Machine Learning Scoring Function
Deep Learning
Pose Estimation
Affinity Prediction
SBDD
MLSF
Docking
Computational Chemistry
Virtual Screening

Supplementary materials

Title
Description
Actions
Title
Supporting Information for “HydraScreen: A Generalizable Structure-Based Deep Learning Approach to Drug Discovery"
Description
Supporting information for “HydraScreen: A Generalizable Structure-Based Deep Learning Approach to Drug Discovery"
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.