Abstract
Antimicrobial peptides (AMPs) have gained significant attention in drug discovery owing to their therapeutic potential in the fight against antimicrobial resistance. Rational design of AMPs is notoriously difficult because of the vast space of possible peptide sequences and their complex structure-activity landscape, making the problem well suited to machine-learning models, which can be trained on available data to predict new sequences with a desired activity profile. Here we investigated the performance of large language models (LLMs) fine-tuned on data from the Database of Antimicrobial Activity and Structure of Peptides (DBAASP) to predict the antimicrobial activity and hemolysis of AMPs from their amino acid sequences. We show that GPT-3-based models perform slightly better than previously reported recurrent neural networks (RNNs) and related architectures on comparable datasets. Furthermore, GPT-3-based models perform remarkably well in the low-data regime. Advantages in terms of training time and cost are also discussed.
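Fine-tuning GPT-3 for a classification task of this kind requires converting the annotated sequences into prompt/completion pairs in JSONL format. The sketch below illustrates that conversion for hypothetical DBAASP-style records; the example sequences, labels, and prompt separator are assumptions for illustration, not the authors' actual data-preparation pipeline (which is available in their GitHub repository).

```python
import json

# Hypothetical records in the spirit of DBAASP entries: a peptide
# sequence annotated with an activity label (labels invented here).
records = [
    {"sequence": "GIGKFLHSAKKFGKAFVGEIMNS", "active": True},
    {"sequence": "AAAAAAAAAA", "active": False},
]

def to_finetune_example(record):
    """Convert one annotated sequence into an OpenAI-style
    prompt/completion pair for classification fine-tuning."""
    # A fixed separator marks the end of the prompt; a leading space
    # in the completion is the convention for GPT-3 fine-tuning.
    prompt = record["sequence"] + "\n\n###\n\n"
    completion = " active" if record["active"] else " inactive"
    return {"prompt": prompt, "completion": completion}

# Each line of the resulting JSONL file is one training example.
lines = [json.dumps(to_finetune_example(r)) for r in records]
print("\n".join(lines))
```

The resulting JSONL file can then be uploaded to the fine-tuning API; at inference time, the model completes an unseen sequence's prompt with the predicted label.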
Supplementary weblinks
GitHub repository: all training data (peptide sequences annotated with activities) and code to access the models, allowing the results to be reproduced.