Efficient Exploration of Chemical Space with Docking and Deep-Learning

Ying Yang; Kun Yao; Matthew P. Repasky; Karl Leswing; Robert Abel; Brian Shoichet; Steven Jerome

doi:10.26434/chemrxiv.14153819.v1

Theoretical and Computational Chemistry

04 March 2021, Version 1

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

With the advent of make-on-demand commercial libraries, the number of purchasable compounds available for virtual screening and assay has grown explosively in recent years, with several libraries now exceeding one billion compounds. Today’s screening libraries are larger and more diverse, enabling discovery of more potent hit compounds and unlocking new areas of chemical space, represented by new core scaffolds. Applying physics-based in silico screening methods exhaustively, where every molecule in the library must be enumerated and evaluated independently, is increasingly cost-prohibitive. Here, we introduce a protocol for machine-learning-enhanced molecular docking based on active learning that dramatically increases throughput over traditional docking. We leverage a novel selection protocol that balances two objectives: (1) identifying the best-scoring compounds and (2) exploring a large region of chemical space. This protocol outperforms a purely greedy approach. Together with automated redocking of the top compounds, the method captures nearly all the high-scoring scaffolds in the library found by exhaustive docking. We apply the protocol to our recent virtual screening campaigns against the D4 and AMPC targets, which produced dozens of highly potent, novel inhibitors, and to a blinded test against the MT1 target. Our protocol recovers more than 80% of the experimentally confirmed hits with a 14-fold reduction in compute cost, and more than 90% of the hit scaffolds in the top 5% of model predictions, preserving the diversity of the experimentally confirmed hit compounds.
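The abstract does not spell out the selection rule, so the following is only a minimal sketch of the general idea of an active-learning docking loop that mixes greedy picks with exploratory ones. Everything in it is an assumption made for illustration: `toy_dock` stands in for a real docking program, `predict` is a nearest-neighbour surrogate rather than a deep-learning model, and the exploration step is plain random sampling, which is not claimed to be the authors' method. The names `select_batch` and `explore_fraction` are hypothetical.

```python
import random


def toy_dock(x):
    # Hypothetical stand-in for an expensive docking run (lower = better).
    return (x - 0.7) ** 2 - 1.0


def predict(cid, library, docked):
    # Toy surrogate (assumption): reuse the score of the nearest
    # already-docked compound in a 1-D descriptor space.
    nearest = min(docked, key=lambda d: abs(library[d] - library[cid]))
    return docked[nearest]


def select_batch(predicted, batch_size, explore_fraction, rng):
    # Split the batch between the best predicted scores (greedy) and
    # random picks from the remainder (exploration, an assumed choice).
    ranked = sorted(predicted, key=predicted.get)
    n_greedy = batch_size - int(batch_size * explore_fraction)
    greedy = ranked[:n_greedy]
    rest = ranked[n_greedy:]
    explore = rng.sample(rest, min(batch_size - n_greedy, len(rest)))
    return greedy + explore


def active_learning(library, rounds=4, batch_size=20,
                    explore_fraction=0.5, seed=0):
    rng = random.Random(seed)
    # Round 0: dock a random seed batch to give the surrogate some data.
    first = rng.sample(sorted(library), batch_size)
    docked = {cid: toy_dock(library[cid]) for cid in first}
    for _ in range(rounds):
        # Score every undocked compound with the cheap surrogate.
        predicted = {cid: predict(cid, library, docked)
                     for cid in library if cid not in docked}
        if not predicted:
            break
        # Dock only the selected batch with the expensive method.
        for cid in select_batch(predicted, batch_size,
                                explore_fraction, rng):
            docked[cid] = toy_dock(library[cid])
    return docked


# Toy library: compound id -> a single made-up descriptor value.
lib_rng = random.Random(1)
library = {i: lib_rng.random() for i in range(1000)}
docked = active_learning(library)
print(len(docked), "of", len(library), "compounds docked")
# prints: 100 of 1000 compounds docked
```

Setting `explore_fraction=0` reduces the sketch to a purely greedy selection, which makes it easy to compare the two strategies on the same toy library.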

Keywords

Docking approaches

Deep Learning Applications

active learning strategies

Chemical space

Chemical Diversity

Supplementary materials

docking active-learning SI

Now Published

Efficient Exploration of Chemical Space with Docking and Deep Learning

Ying Yang, Kun Yao, Matthew P. Repasky, Karl Leswing, Robert Abel, Brian K. Shoichet, Steven V. Jerome

Journal article

Journal of Chemical Theory and Computation, Volume 17, Issue 11

Online publication date: Sep 30, 2021

Metrics

Views: 8,658

Downloads: 5,087

License

The content is available under CC BY-NC-ND 4.0

Author’s competing interest statement

No conflict of interest
