Fragment libraries from large synthetic compounds and natural products: A comparative chemoinformatic analysis

06 February 2025, Version 1

Abstract

We report comprehensive fragment libraries obtained from large natural product databases and compare their chemical space coverage and diversity with synthetic fragment libraries. Specifically, we obtained 2,583,127 fragments derived from the recently updated Collection of Open Natural Products (COCONUT) data set with more than 695,133 non-redundant natural products, and 74,193 fragments derived from the Latin America Natural Product Database (LANaPDB) with 13,578 unique natural products from Latin America. The content, chemical space coverage and chemical diversity of the natural product libraries were compared to the recently developed CRAFT library, which contains 1,214 fragments based on novel heterocyclic scaffolds and natural product-derived chemicals. The fragments libraries herein obtained and curated are freely available at https://github.com/DIFACQUIM/Fragment-libraries-from-large-synthetic-compounds-and-natural-products-collections.git.

Keywords

chemoinformatics
chemical space
drug design
natural products
open science

Supplementary materials

Title
Description
Actions
Title
Supplementary figures
Description
Eight supplementary figures.
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.