From descriptors to intrinsic fish toxicity of chemicals: an alternative approach to chemical prioritization

18 November 2022, Version 4
This content is a preprint and has not undergone peer review at the time of posting.


The European and US chemical agencies have listed approximately 800k chemicals where knowledge on potential risks to human health and the environment are lacking. Filling these data gaps experimentally is impossible so in-silico approaches and prediction are essential. Many existing models are however limited by assumptions (e.g. linearity and continuity) and small training sets. In this study we present a supervised direct classification model that connects molecular descriptors to toxicity. Categories can be either data-driven (using k-means clustering) or regulatory-defined. This was tested via 907 experimentally defined 96h LC50 values for acute fish toxicity. Our classification model explained ~90% of variance in our data for the training set and ~80% for the test set. This strategy gave a 5-fold decrease in the incorrect categorization compared to a QSAR regression model. Our model was subsequently employed to predict the toxicity categories of ~32k chemicals. A comparison between the model-based applicability domain (AD) and the training set AD was performed, suggesting that the training set based AD is a more adequate way to avoid extrapolation when using such models. The better performance of our direct classification model compared to QSAR methods, makes this approach a viable tool for hazard and risk assessment of chemicals.


Data scinece
Toxicity category

Supplementary materials

Supporting Information

Supplementary weblinks


Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.