Times are changing but order matters: Transferable prediction of small molecule liquid chromatography retention times

Fleming Kretschmer; Eva-Maria Harrieder; Michael Witting; Sebastian Böcker

doi:10.26434/chemrxiv-2024-wd5j8-v2

Analytical Chemistry

28 May 2025, Version 2

Working Paper

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Hundreds of models for the prediction of small molecule retention times have been published over the last decades. Our goal is the transferable prediction of retention times: our method should predict retention times for a target dataset without requiring training data from that chromatographic system. Unfortunately, retention times may change massively, even for nominally identical chromatographic conditions. Retention order is much better retained, yet even the retention order of compounds may change if chromatographic conditions vary. We present a machine learning model that can predict retention order or, more precisely, a retention order index, taking into account chromatographic conditions. We show how to map predicted retention order indices to retention times. Disentangling these two tasks finally enables transferable retention time prediction across chromatographic conditions and compound classes. Our 2-step method outperforms existing methods that were trained on the target dataset. Finally, we systematically study which chromatographic conditions result in notable changes of retention order.
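
The second step described in the abstract, mapping a predicted retention order index to a retention time, can be illustrated with a small sketch. The abstract does not say how the authors do this mapping, so everything below is an assumption made for illustration: it supposes that a few anchor compounds with both a predicted index and a measured retention time are available, and it uses a monotone piecewise-linear map between them. The function names `fit_roi_to_rt_map` and `roi_to_rt` are hypothetical and are not taken from the authors' repository.

```python
from bisect import bisect_right


def fit_roi_to_rt_map(anchor_roi, anchor_rt):
    """Build a monotone lookup table from (index, retention time) anchors.

    Assumption: anchors are compounds with a predicted retention order
    index and a measured retention time. This is an illustrative choice,
    not the mapping used in the paper.
    """
    xs, ys = [], []
    for roi, rt in sorted(zip(anchor_roi, anchor_rt)):
        if xs and roi == xs[-1]:
            continue  # keep only the first anchor at a duplicated index
        # A running maximum forces retention time to be non-decreasing
        # in the index, so the map never reverses the predicted order.
        ys.append(max(rt, ys[-1]) if ys else rt)
        xs.append(roi)
    return xs, ys


def roi_to_rt(roi, xs, ys):
    """Interpolate linearly between anchors; clamp outside their range."""
    if roi <= xs[0]:
        return ys[0]
    if roi >= xs[-1]:
        return ys[-1]
    i = bisect_right(xs, roi)
    x0, x1 = xs[i - 1], xs[i]
    y0, y1 = ys[i - 1], ys[i]
    return y0 + (y1 - y0) * (roi - x0) / (x1 - x0)


# Made-up anchor values, for illustration only.
xs, ys = fit_roi_to_rt_map([0.0, 1.0, 2.0], [1.0, 3.0, 7.0])
rt_mid = roi_to_rt(0.5, xs, ys)
```

Separating the two steps in this way means the order model can be shared across chromatographic systems, while only the lightweight index-to-time map depends on the target system.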

Keywords

Retention Time Prediction

Metabolomics

Liquid Chromatography

Supplementary materials

Title

Supplementary Table 2. List of RepoRT datasets used for retention order statistics and model evaluation

Description

All datasets from RepoRT are listed, detailing in which evaluation scenario each dataset is used. Information on which datasets are missing important metadata (HSM and Tanaka parameters, pH, void volume estimate, column temperature, flow rate) is also provided. Datasets removed from evaluation following manual curation are specified.

Supplementary weblinks

Title

Code for model training, evaluation and application

Description

GitHub repository containing the code to train, evaluate and apply the 2-step retention time prediction models.

Version History

May 28, 2025 Version 2

Dec 23, 2024 Version 1

Version Notes

We have substantially restructured and rewritten the manuscript, added new experiments showing the advantages of our method (including comparisons on new, realistic cross-validation splits and a simulation of performance gain with more training data), and added and extended supplementary notes on potential applications and benefits over existing approaches.

Metrics

Views: 1,258

Downloads: 480

License

The content is available under CC BY NC 4.0

DOI

10.26434/chemrxiv-2024-wd5j8-v2

Funding

Deutsche Forschungsgemeinschaft

BO 1910/23

Deutsche Forschungsgemeinschaft

MW 4382/10-1

Ministry for Economics, Sciences and Digital Society of Thuringia

Framework ProDigital, DigLeben 5575/10-9

Author’s competing interest statement

S.B. is a cofounder of Bright Giant GmbH.

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content
