A Software Tool for Rapid and Automated Pre-Processing of Large-Scale Serum Metabolomic Data by Multisegment Injection-Capillary Electrophoresis-Mass Spectrometry

14 August 2024, Version 1

Abstract

Mass spectrometry (MS)-based metabolomics often rely on separation techniques when analyzing complex biological specimens to improve method resolution, metabolome coverage, quantitative performance, and/or unknown identification. However, low sample throughput and complicated data pre-processing procedures remain major barriers to affordable metabolomic studies that are also scalable to large populations. Herein, we introduce PeakMeister as a new software tool in the R statistical environment to enable the automated processing of serum metabolomic data acquired by multisegment injection-capillary electrophoresis-mass spectrometry (MSI-CE-MS) under positive ion mode. MSI-CE-MS takes advantage of a multiplexed separation format involving serial injection of 13 serum/plasma filtrate samples, quality controls, calibrants and/or blanks introduced within a single analytical run (< 4 min/sample). We performed a rigorous validation of PeakMeister by analyzing 47 cationic metabolites consistently measured in 5,000 serum samples from the Brazilian National Survey on Child Nutrition (ENANI-2019) comprising a total of 224,983 sample peaks analyzed in three batches over an eight-month period. A migration time index using a panel of internal standards was introduced to correct for large variations in apparent migration times, which allowed for reliable peak picking, peak integration and sample position assignment for serum metabolites having a single co-migrating stable-isotope internal standard or two flanking internal standards. PeakMeister accelerated data pre-processing times by 30-fold compared to manual processing of MSI-CE-MS data by an experienced analyst using vendor software, while also achieving excellent peak annotation fidelity (median accuracy > 99.9%), acceptable intermediate precision (median CV = 16.0 %), consistent metabolite peak integration (mean bias = 2.1%), and good mutual agreement when quantifying 16 plasma metabolites from NIST SRM-1950 (mean bias = -1.3%). We also report reference intervals for 40 serum metabolites in a national nutritional survey of children under 5 years of age (ENANI-2019). MSI-CE-MS in conjunction with PeakMeister allows for rapid and automated data pre-processing of large-scale metabolomic studies while tolerating long-term migration time shifts without the need for complicated dynamic time warping or effective mobility scale transformations.

Keywords

Automation
Data pre-processing
High throughput metabolomics
Migration time index
Multisegment injection-capillary electrophoresis-mass spectrometry
Batch effects
R statistical environment

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Supporting experimental description, and supporting tables and figures referred to in main manuscript (Tables S1-S2, and Figures S1-S10).
Actions
Title
Supporting file
Description
Representative excel file including parameter setting used for automated metabolite analysis of MSI-CE-MS using PeakMeister
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.