Compound2Drug – a Machine/deep Learning Tool for Predicting the Bioactivity of PubChem Compounds

Ben Geoffrey A S; Pavan Preetham Valluri; Akhil Sanker; Rafal Madaj; Host Antony Davidd; Beutline Malgija; Konka Dinesh; Suyash Pant; Shweta Chakrabarti; Sharvani Togata; Bharti Mittal; Manish Upadhyay; Judith Gracia; Adarsh VK; Varun T K

doi:10.26434/chemrxiv.13052951.v1

Biological and Medicinal Chemistry


06 October 2020, Version 1

This is not the most recent version. There is a newer version of this content available.

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Network data consists of nodes and edges. Machine learning and deep learning algorithms have been applied successfully to network data for node classification and link prediction in social networks, where they are used to offer highly customized suggestions to users. Similarly, one can apply such algorithms to biological network data to generate scientifically useful predictions. In the present work, a compound-drug target interaction data set from BindingDB is used to train machine learning/deep learning algorithms, which then predict the drug targets for any PubChem compound queried by the user. The user inputs the PubChem Compound ID (CID) of the compound whose biological activity is to be predicted, and the tool outputs the RCSB PDB IDs of the predicted drug targets. The tool also incorporates a feature that performs automated in silico modelling of the compound and the predicted drug targets to uncover their protein-ligand interaction profiles. The program fetches the structures of the compound and the predicted drug targets, prepares them for molecular docking using the standard AutoDock scripts that are part of MGLTools, performs molecular docking and protein-ligand interaction profiling of the targets and the compound, and stores the visualized results in the user's working folder. The program is hosted, supported and maintained at the following GitHub repository:

https://github.com/bengeof/Compound2Drug
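The structure-fetching step described in the abstract can be illustrated with a minimal sketch. This is not the authors' code from the repository. It assumes the ligand is retrieved as an SDF record through the public PubChem PUG REST interface and each target through the RCSB PDB file download service. The helper names (`pubchem_sdf_url`, `rcsb_pdb_url`, `plan_downloads`, `download`) are hypothetical, and the PDB ID in the usage note is a placeholder rather than a prediction made by the tool.

```python
from urllib.request import urlopen

# Public endpoints; how Compound2Drug itself retrieves files is not stated in
# the abstract, so using these two services here is an assumption.
PUBCHEM_PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"
RCSB_DOWNLOAD = "https://files.rcsb.org/download"


def pubchem_sdf_url(cid: int, three_d: bool = True) -> str:
    """Build the PUG REST URL for the SDF record of a PubChem CID."""
    cid = int(cid)
    if cid <= 0:
        raise ValueError("PubChem CIDs are positive integers")
    url = f"{PUBCHEM_PUG_REST}/compound/cid/{cid}/SDF"
    if three_d:
        # Ask for a 3D conformer, which docking preparation normally needs.
        url += "?record_type=3d"
    return url


def rcsb_pdb_url(pdb_id: str) -> str:
    """Build the download URL for a classic four-character PDB entry."""
    pdb_id = pdb_id.strip().upper()
    if len(pdb_id) != 4 or not pdb_id.isalnum():
        raise ValueError(f"not a four-character PDB ID: {pdb_id!r}")
    return f"{RCSB_DOWNLOAD}/{pdb_id}.pdb"


def plan_downloads(cid: int, predicted_pdb_ids: list[str]) -> list[tuple[str, str]]:
    """Return (local filename, URL) pairs for the ligand and each target."""
    plan = [(f"ligand_{int(cid)}.sdf", pubchem_sdf_url(cid))]
    for pdb_id in predicted_pdb_ids:
        url = rcsb_pdb_url(pdb_id)
        plan.append((url.rsplit("/", 1)[1], url))
    return plan


def download(url: str, path: str) -> None:
    """Fetch one file over HTTPS and write it to the working folder."""
    with urlopen(url, timeout=30) as response, open(path, "wb") as handle:
        handle.write(response.read())
```

For example, `plan_downloads(2244, ["1hsg"])` lists the files needed for CID 2244 (aspirin) and a placeholder target entry; passing each pair to `download` would then place them in the working folder, ready for preparation with the AutoDock scripts.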

Keywords

Machine Learning Methods Enable Predictive Modeling

drug discovery problems

Artificial Intelligence research

RCSB Protein Data Bank

PubChem collection

Small drug molecules
