All that glitters is not gold: Importance of rigorous evaluation of proteochemometric models

Polina Avdiunina; Shamieraah Jamal; Filipp Gusev; Olexandr Isayev

doi:10.26434/chemrxiv-2025-vbmgc

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

All that glitters is not gold: Importance of rigorous evaluation of proteochemometric models

22 January 2025, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Proteochemometric models (PCM) are used in computational drug discovery to leverage both protein and ligand representations for bioactivity prediction. While machine learning (ML) and deep learning (DL) have come to dominate PCMs, often serving as scoring functions, rigorous evaluation standards have not always been consistently applied. In this study, using kinase-ligand bioactivity prediction as a model system, we highlight the critical roles of dataset curation, permutation testing, class imbalances, data splitting strategies, and embedding quality in determining model performance. Our findings indicate that data splitting and class imbalances are the most critical factors affecting PCM performance, emphasizing the challenges in generalizing ability of ML/DL-PCMs. We evaluated various protein-ligand descriptors and embeddings, including those augmented with multiple sequence alignment (MSA) information. However, permutation testing consistently demonstrated that protein embeddings contributed minimally to PCM efficacy. This study advocates for the adoption of stringent evaluation standards to enhance the generalizability of models to out-of-distribution data and improve benchmarking practices.

Keywords

Proteochemometric models

bioactivity prediction

statistical validation

protein embeddings

Supplementary materials

Title

Description

Actions

Title

Supplementary information and figures

Description

Dataset curation Baseline model dataset Hyperparameter tuning Implementation of Convolutional Autoencoder

Actions

Title

Supplementary tables

Description

Raw statistical data analysis and ANOVA results

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Jan 22, 2025 Version 1

Metrics

542

242

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.26434/chemrxiv-2025-vbmgc

Funding

National Science Foundation

CHE-2154447

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

All that glitters is not gold: Importance of rigorous evaluation of proteochemometric models

Authors

Abstract

Keywords

Supplementary materials

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share