Machine Learning Modeling and Insights into the Structural Foundations of Polymyxin-like Antimicrobials

22 March 2023, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Antimicrobial resistance (AMR) is a silent pandemic that represents an urgent threat to human health. The antibiotic development pipeline remains slow even as AMR escalates rapidly, particularly among Gram-negative pathogens. Polymyxins, which had fallen out of use because of their toxic side effects, have been revived as a last-line therapeutic option now that other antibiotics are failing. To reduce their toxicity and improve their antimicrobial activity, many studies have generated polymyxin analogues through different, mostly empirical, strategies. Faster and more reliable approaches are therefore still needed to make analogue design efficient enough to tackle AMR in a timely fashion. In silico approaches such as machine learning are a likely route to accelerating the discovery of new drugs because of their time and cost efficiency. In this work, machine learning was applied to Quantitative Structure-Activity Relationship (QSAR) modeling with the objective of providing a working semi-quantitative model capable of predicting the activity of polymyxin-like molecules against a given species. To this end, we applied four different learning algorithms and ten different families of molecular descriptors to our dataset of 408 molecule/microorganism pairs retrieved from PubChem. The AdaBoost model built with the CKP set of descriptors performed best, with good accuracy and very few false negative and false positive predictions. Preliminary exploration of the model's response to systematic changes in the structure of polymyxin B reveals a trend towards increased antimicrobial activity when some of its constituent amino acids are exchanged for more lipophilic ones. Experimental studies based on this model's application are already underway, and we believe it will become a crucial tool for drug development.
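
To make the boosting idea behind the best-performing model concrete, the following is a minimal sketch of discrete AdaBoost with decision stumps, written in plain Python. It is not the authors' code (their software is listed under the supplementary materials), and it does not use the CKP descriptors. The two descriptor columns, their values, the binary active/inactive labels, and all function names are hypothetical and chosen only for illustration.

```python
import math

def train_stump(X, y, w):
    """Return the (error, feature, threshold, polarity) stump with the lowest weighted error."""
    best = None
    for j in range(len(X[0])):
        for thr in sorted({row[j] for row in X}):
            for polarity in (1, -1):
                preds = [polarity if row[j] > thr else -polarity for row in X]
                err = sum(wi for wi, p, yi in zip(w, preds, y) if p != yi)
                if best is None or err < best[0]:
                    best = (err, j, thr, polarity)
    return best

def stump_predict(stump, row):
    """Classify one row (+1 or -1) with a single threshold rule."""
    _, j, thr, polarity = stump
    return polarity if row[j] > thr else -polarity

def adaboost_fit(X, y, n_rounds=5):
    """Fit an ensemble of weighted stumps; labels must be +1 or -1."""
    n = len(X)
    w = [1.0 / n] * n                      # start with uniform sample weights
    ensemble = []
    for _ in range(n_rounds):
        stump = train_stump(X, y, w)
        # Clip the error so the logarithm below stays finite.
        err = min(max(stump[0], 1e-10), 1 - 1e-10)
        alpha = 0.5 * math.log((1 - err) / err)   # vote weight of this stump
        ensemble.append((alpha, stump))
        # Up-weight misclassified samples, down-weight correct ones, renormalize.
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, row))
             for wi, yi, row in zip(w, y, X)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def adaboost_predict(ensemble, row):
    """Weighted vote of all stumps; returns +1 (active) or -1 (inactive)."""
    score = sum(alpha * stump_predict(stump, row) for alpha, stump in ensemble)
    return 1 if score >= 0 else -1

# Hypothetical toy data: [lipophilicity-like score, net charge] per molecule.
X = [[0.2, 5], [0.4, 5], [0.5, 4], [0.9, 5], [1.1, 4], [1.3, 5]]
y = [-1, -1, -1, 1, 1, 1]                  # -1 = inactive, +1 = active (made up)

model = adaboost_fit(X, y)
train_preds = [adaboost_predict(model, row) for row in X]
print(train_preds)
```

In practice a library implementation such as scikit-learn's `AdaBoostClassifier` would normally be preferred over a hand-written loop; the sketch only shows how misclassified samples gain weight between rounds and how the stumps vote.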

Keywords

Polymyxins
Antimicrobial resistance
Drug design
QSAR
Machine Learning

Supplementary materials

Title: Electronic Supporting Information
Description: Software code for using the final model, scores of all tested ML models, optimized hyper-parameters for all random forest and AdaBoost models, and partial dependence plots for the features with less than 10% PI

Title: Data
Description: Collected data set
