RT-Tranformer: Retention Time Prediction
for Metabolite Annotation to Assist in
Metabolite Identification

jun xue; Bingyi Wang; Weihua li

doi:10.26434/chemrxiv-2023-pf268-v2

Analytical Chemistry

Search within Analytical Chemistry

RT-Tranformer: Retention Time Prediction for Metabolite Annotation to Assist in Metabolite Identification

27 March 2023, Version 2

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Liquid chromatography retention times (RTs) prediction can assist in metabolite identification, which is a critical task and challenge in non-targeted metabolomics.However, different chromatographic methods (CM) may result in different RTs for the same metabolite. Current RT prediction methods lack sufficient scalability to transfer from one specific chromatographic method to another. Therefore, we present RT-Transformer, a novel deep neural network model coupled with graph attention network (GAT) and 1D-Transformer, which can predict RTs under any chromatographic methods. First, we obtain a pre-trained model by training RT-Transformer on the large small molecule retention time (SMRT) dataset containing 80038 molecules, and then project the resulting model onto different chromatographic methods based on transfer learning. When tested on the METLIN dataset, as other authors did, the average absolute error reached 27.30 after removing samples with retention times fewer than five minutes. Still, it reached 33.41 when no samples were removed. The pre-trained　RT-Transformer was further transferred to 5 datasets corresponding to different chromatographic conditions and fine-tuned. According to the experimental results, RT-Transformer achieves competitive performance compared to state-of-the-art methods.In addition, RT-Transformer was applied to 41 external molecular RT datasets. Extensive evaluations indicate that RT-Transformer has excellent scalability in predicting RTs for liquid chromatography and improves the accuracy of metabolite identification.

Keywords

Rentention Time

Deep Learning

Metabolomics

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Mar 27, 2023 Version 2

Mar 17, 2023 Version 1

Version Notes

Modified some inconsistent data and optimized some statements

Metrics

1,238

652

Views

Downloads

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2023-pf268-v2

Funding

National Natural Science Foundation of China

32060151

Yunnan Provincial Foundation for Leaders of Disciplines in Science and Technology, China

202305AC160014

Innovation Research Foundation for Graduate Students of Yunnan University

KC-22221489

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content