SpaceHASTEN: A structure-based virtual screening tool for non-enumerated virtual chemical libraries

01 October 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Given the size of the drug discovery relevant chemical space, working with fully enumerated compound libraries (especially in 3D) is unfeasible. Non-enumerated virtual chemical spaces are a practical solution to this issue; where compounds are described as building blocks which are then connected by rules. One concrete example of such is the BioSolveIT chemical spaces file format (.space). Tools to search these space-files exist that are using ligand-based methods including, 2D fingerprint similarity, substructure matching, and fuzzier similarity metrics such as FTrees. However, there is no software available that enables the screening of these spaces using molecular docking. Here, a tool, called SpaceHASTEN, was developed on top of SpaceLight, FTrees, LigPrep, and Glide to allow efficient virtual screening of nonenumerated chemical spaces. SpaceHASTEN was validated using three public targets picked from the DUD-E dataset. It was able to retrieve a large number of diverse and novel high scoring compounds (virtual hits) from non-enumerated chemical spaces of billions of molecules, after docking a few million compounds. The software can be freely used and is available from http://github.com/TuomoKalliokoski/SpaceHASTEN. Keywords:

Keywords

structure-based virtual screening
non-enumerated chemical spaces
machine learning

Supplementary materials

Title
Description
Actions
Title
Virtual hits for KIF11
Description
Virtual hits for KIF11.
Actions
Title
Virtual hits for TGFR1.
Description
Virtual hits for TGFR1.
Actions
Title
Virtual hits for PYRD.
Description
Virtual hits for PYRD.
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.