Using a Genetic Algorithm to Find Molecules with Good Docking Scores

Casper Steinmann; Jan H. Jensen

doi:10.26434/chemrxiv.13525589.v2

Theoretical and Computational Chemistry

29 January 2021, Version 2

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

A graph-based genetic algorithm (GA) is used to identify molecules (ligands) with high absolute docking scores, as estimated by the Glide software, for four different targets: Bacillus subtilis chorismate mutase (CM), the human β₂-adrenergic G protein-coupled receptor (β₂AR), the DDR1 kinase domain (DDR1), and β-cyclodextrin (BCD). By combining functional group filters with a score modifier based on a heuristic synthetic accessibility (SA) score, our approach identifies between ca. 500 and 6000 structurally diverse molecules with scores better than those of known binders by screening a total of 400,000 molecules, starting from 8000 randomly selected molecules from the ZINC database. Conventional screening of 250,000 molecules from the ZINC database identifies significantly more molecules with docking scores better than those of known binders, with the exception of CM, where it identifies only 60 compounds compared to 511 with GA+Filter+SA. In the case of β₂AR and DDR1, the GA+Filter+SA approach finds significantly more molecules with docking scores lower than -9.0 and -10.0. The GA+Filter+SA docking methodology is thus effective in generating a large and diverse set of synthetically accessible molecules with very good docking scores for a particular target. An early incarnation of the GA+Filter+SA approach was used to identify potential binders to the COVID-19 main protease, which were submitted to the early stages of the COVID Moonshot project, a crowd-sourced initiative to accelerate the development of a COVID antiviral.
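To make the idea of a filter plus SA-based score modifier concrete, the following is a minimal sketch of how a GA fitness value might be assembled from a docking score. It is not the authors' implementation: the abstract does not state the functional form of the modifier, so the Gaussian-style decay, the `threshold` and `width` values, and the names `sa_modifier`, `fitness`, and `rank_candidates` are all assumptions made for illustration. Docking and SA scores are passed in as plain numbers; nothing here calls Glide or any cheminformatics library.

```python
import math


def sa_modifier(sa_score, threshold=3.0, width=1.0):
    """Hypothetical modifier in (0, 1].

    Returns 1.0 for molecules at or below the SA threshold (assumed easy
    to make) and decays smoothly for higher, harder-to-synthesize scores.
    """
    if sa_score <= threshold:
        return 1.0
    return math.exp(-0.5 * ((sa_score - threshold) / width) ** 2)


def fitness(docking_score, sa_score, passes_filters):
    """Fitness to maximize in the GA (assumed form).

    Docking scores are negative for good binders, so the sign is flipped.
    Molecules rejected by the functional group filters get zero fitness.
    """
    if not passes_filters:
        return 0.0
    return max(0.0, -docking_score) * sa_modifier(sa_score)


def rank_candidates(candidates):
    """Sort (name, docking_score, sa_score, passes_filters) tuples by fitness."""
    return sorted(
        candidates,
        key=lambda c: fitness(c[1], c[2], c[3]),
        reverse=True,
    )


if __name__ == "__main__":
    # Made-up example values, not data from the paper.
    candidates = [
        ("mol_a", -10.5, 2.1, True),
        ("mol_b", -11.0, 5.0, True),   # good score, but hard to synthesize
        ("mol_c", -12.0, 2.0, False),  # best score, but fails the filters
        ("mol_d", -9.0, 2.5, True),
    ]
    for name, dock, sa, ok in rank_candidates(candidates):
        print(name, round(fitness(dock, sa, ok), 3))
```

In this sketch a molecule with the best raw docking score can still rank last if it fails the filters, and a strong score is discounted when the SA score is high, which is the qualitative behavior the abstract attributes to GA+Filter+SA.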

Keywords

docking

genetic algorithm

Now Published

Using a genetic algorithm to find molecules with good docking scores

Casper Steinmann, Jan H. Jensen (journal article)

PeerJ Physical Chemistry, Volume 3

Online publication date: May 17, 2021

Version History

Jan 29, 2021 Version 2

Jan 07, 2021 Version 1

Version Notes

typo changes

License

The content is available under CC BY 4.0

Author’s competing interest statement

no conflict of interest
