A guide to reverse metabolomics – a framework for big data discovery strategy.

Vincent Charron-Lamoureux; Helena Mannochio-Russo; Santosh Lamichhane; Shipei Xing; Abubaker Patan; Paulo Wender Portal Gomes; Prajit Rajkumar; Victoria Deleray; Andrés Mauricio Caraballo-Rodriguez; Kee Voon Chua; Lye Siang Lee; Zhao Liu; Jianhong Ching; Mingxun Wang; Pieter C. Dorrestein

doi:10.26434/chemrxiv-2024-4cb43-v2

Analytical Chemistry

Search within Analytical Chemistry

A guide to reverse metabolomics – a framework for big data discovery strategy.

20 December 2024, Version 2

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Untargeted metabolomics is evolving into a field of big data science. There is a growing interest within the metabolomics community in mining MS/MS-based data from public repositories. The theme of this protocol, reverse metabolomics, is a data science strategy that differs from the traditional LC-MS/MS-based untargeted metabolomics approach. In traditional untargeted metabolomics, we first collect the samples to address a predefined question and then collect LC-MS/MS data. We then identify metabolites associated with a phenotype (e.g., disease vs. healthy), and elucidate or validate their structural details (e.g., molecular formula, structural classification, substructure, or complete structural annotation or identification). Reverse metabolomics, however, does not necessarily involve collecting new data or requiring the structural characterization of molecules. Instead, we start with MS/MS spectra for known or unknown molecules and discover phenotype-relevant information such as organ/biofluid distribution, disease condition, intervention status (e.g., pre- and post-intervention), organisms (e.g., mammals vs. others), geography, and any other biologically relevant associations available in public repositories. This protocol guides the reader through the step-by-step process of utilizing available MS/MS data and discovering repository-scale associations of the associated MS/MS spectra. As example, we utilize MS/MS spectra from three small molecules: phenylalanine-cholic acid (a microbially conjugated bile acid), phenylalanine-C4:0, and histidine-C4:0 (two N-acyl amides). We leverage the GNPS-based framework to explore the microbial producers of these molecules and their associations with health conditions and organ distributions in humans and rodents.

Keywords

Biological techniques

Metabolomics

Biochemistry

Data processing

Computational biology

Mass spectrometry

Supplementary materials

Title

Description

Actions

Title

Visual Cheatsheet guide

Description

Explication of the script through screenshots.

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Dec 20, 2024 Version 2

Dec 19, 2024 Version 1

Version Notes

Supplementary information was added

Metrics

1,018

780

Views

Downloads

Citations

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv-2024-4cb43-v2

Author’s competing interest statement

PCD is an advisor and holds equity in Cybele, BileOmix and Sirenas and a Scientific co-founder, advisor and holds equity to Ometa, Enveda, and Arome with prior approval by UC-San Diego. PCD also consulted for DSM animal health in 2023. MW is a co-founder of Ometa Labs LLC.

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

A guide to reverse metabolomics – a framework for big data discovery strategy.

Authors

Abstract

Keywords

Supplementary materials

Comments

Version History

Version Notes

Metrics

License

DOI

Author’s competing interest statement

Ethics

Share