Integrating Machine Learning-Based Pose Sampling with Established Scoring Functions for Virtual Screening

24 February 2025, Version 2
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Physics-based docking methods have long been the cornerstone of structure-based virtual screening (VS). However, the emergence of machine learning (ML)-based docking approaches has opened up new possibilities for enhancing VS technologies. In this study, we explore the integration of DiffDock-L, a leading ML-based pose sampling method, into VS workflows by combining it with the well-established Vina and Gnina scoring functions. We assess this integrated approach in terms of its VS effectiveness, pose sampling quality, and complementarity to traditional physics-based docking methods, such as AutoDock Vina. Our findings from the DUDEZ benchmark dataset show that DiffDock-L performs competitively in both VS performance and pose sampling in cross-docking settings. In most cases, it generates physically plausible and biologically relevant poses, establishing itself as a viable alternative to physics-based docking algorithms. Additionally, we found that the choice of scoring function significantly influences VS success.

Keywords

docking
machine learning
DiffDock
DiffDock-L
AutoDock Vina
Benchmarking

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Contains additional details on parameters used in executing the docking programs, statistics of the processed molecules, correlation analyses for docking scores, validity and plausibility analyses of docking poses, and statistics on protein-ligand interaction profiles of the docking poses and reference ligands for individual targets (PDF).
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.