MEMES: Machine learning framework for Enhanced MolEcular Screening

Sarvesh Mehta; Siddhartha Laghuvarapu; Yashaswi Pathak; Aaftaab Sethi; Mallika Alvala; U. Deva Priyakumar

doi:10.26434/chemrxiv-2021-nr0vn-v2

Theoretical and Computational Chemistry


15 July 2021, Version 2

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

In drug discovery applications, high-throughput virtual screening exercises are routinely performed to determine an initial set of candidate molecules referred to as "hits". In such an experiment, each molecule from a large small-molecule drug library is evaluated for a physical property, such as the docking score against a target receptor. In real-life drug discovery experiments, the drug libraries are extremely large yet still represent only a small fraction of the essentially infinite chemical space, and evaluating the physical property for every molecule in the library is not computationally feasible. In the current study, a novel Machine learning framework for Enhanced MolEcular Screening ("MEMES") based on Bayesian optimization is proposed for efficient sampling of chemical space. The proposed framework is demonstrated to identify 90% of the top-1000 molecules from a molecular library of about 100 million molecules, while calculating the docking score for only about 6% of the complete library. We believe that such a framework would tremendously help to reduce the computational effort not only in drug discovery but also in other areas that require such high-throughput experiments.
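The abstract does not spell out the surrogate model or acquisition function, so the following is only a minimal sketch of a generic Bayesian-optimization screening loop, not the authors' implementation. It assumes a Gaussian-process surrogate with an RBF kernel, an expected-improvement acquisition function, random 2-D vectors standing in for molecular features, and a synthetic function standing in for the docking score (lower is better). All names (`screen`, `score_fn`, `features`) and all parameter values are hypothetical.

```python
import math
import numpy as np

def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential kernel between two sets of feature vectors."""
    sq_dist = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-0.5 * sq_dist / length_scale**2)

def gp_posterior(x_train, y_train, x_query, noise=1e-4):
    """Zero-mean GP posterior mean and standard deviation at x_query."""
    k_train = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    k_cross = rbf_kernel(x_train, x_query)
    weights = np.linalg.solve(k_train, k_cross)
    mean = weights.T @ y_train
    var = 1.0 - np.einsum("ij,ij->j", k_cross, weights)
    return mean, np.sqrt(np.clip(var, 1e-12, None))

_erf = np.vectorize(math.erf, otypes=[float])

def expected_improvement(mean, std, best):
    """Expected improvement when minimising (a lower score is better)."""
    z = (best - mean) / std
    cdf = 0.5 * (1.0 + _erf(z / math.sqrt(2.0)))
    pdf = np.exp(-0.5 * z**2) / math.sqrt(2.0 * math.pi)
    return (best - mean) * cdf + std * pdf

def screen(features, score_fn, n_init=20, batch=10, n_rounds=8, seed=0):
    """Score a random seed set, then repeatedly score the top-EI candidates."""
    rng = np.random.default_rng(seed)
    n = len(features)
    scored = {int(i): score_fn(int(i))
              for i in rng.choice(n, size=n_init, replace=False)}
    for _ in range(n_rounds):
        idx = sorted(scored)
        y = np.array([scored[i] for i in idx])
        y_norm = (y - y.mean()) / (y.std() + 1e-9)  # standardise the targets
        rest = np.array([i for i in range(n) if i not in scored])
        mean, std = gp_posterior(features[idx], y_norm, features[rest])
        ei = expected_improvement(mean, std, y_norm.min())
        for i in rest[np.argsort(-ei)[:batch]]:  # most promising unscored items
            scored[int(i)] = score_fn(int(i))
    return scored

# Toy library: 500 "molecules" with 2-D features and a synthetic score.
rng = np.random.default_rng(1)
features = rng.uniform(-2.0, 2.0, size=(500, 2))
true_scores = ((features - np.array([0.5, -0.5])) ** 2).sum(axis=1) - 10.0

scored = screen(features, lambda i: float(true_scores[i]))
top_k = set(np.argsort(true_scores)[:10].tolist())
recall = len(top_k & set(scored)) / 10
print(f"scored {len(scored)} of {len(features)} items; top-10 recall = {recall:.2f}")
```

With these settings the loop evaluates the expensive score for 100 of the 500 toy items (20 random seeds plus 8 rounds of 10), which is the same kind of budget-versus-recall trade-off the abstract reports at a much larger scale. The recall obtained on this toy problem says nothing about the paper's results.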

Keywords

Chemical space

Artificial Intelligence

Machine learning

Bayesian optimization

virtual screening

high throughput screening

Supplementary materials

Title: Supplementary Material

Description: Tables of performance of ExactMEMES and DeepMEMES, performance comparison of MEMES with Deep Docking, figures of structure of top hits, distribution plots of binding affinities, distributions of molecular clusters, distributions of binding affinities of missed hits, fractions matched against sampled percentage, protein-ligand complexes and protein-ligand interactions, and supplementary discussions and methods.

Now Published

MEMES: Machine learning framework for Enhanced MolEcular Screening

Sarvesh Mehta, Siddhartha Laghuvarapu, Yashaswi Pathak, Aaftaab Sethi, Mallika Alvala, U. Deva Priyakumar

Journal article

Chemical Science, Volume 12, Issue 35

Online publication date: 2021

Version History

Jul 15, 2021 Version 2

Mar 03, 2021 Version 1

Version Notes

Discussions have been improved

Metrics

Views: 2,983

Downloads: 1,025

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2021-nr0vn-v2

Funding

DST-SERB, grant CVD/2020/000343

IHub-Data, IIIT Hyderabad

Intel Corp.

Author’s competing interest statement

International Institute of Information Technology, Hyderabad has filed a provisional patent application for the use of the MEMES framework in high-throughput screening exercises, with U.D.P., S.M., S.L., and Y.P. listed as inventors. Provisional Patent Application No.: 202041050608. Status: Awaiting Complete Specification (Provisional Patent Filed). The funders did not have any role in the design, idea, data collection, analysis, interpretation, writing of the manuscript, or decision to submit it for publication.

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
