An in silico infrared spectral library of molecular ions for metabolite identification

09 March 2023, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Infrared ion spectroscopy (IRIS) continues to see increasing use as an analytical tool for small-molecule identification in conjunction with mass spectrometry (MS). The IR spectrum of an m/z selected population of ions constitutes a unique fingerprint that is specific to the molecular structure. However, direct translation of an IR spectrum to a molecular structure remains challenging, as reference libraries of IR spectra of molecular ions largely do not exist. Quantum-chemically computed spectra can reliably be used as reference, but the challenge of selecting the candidate structures remains. Here we introduce an in silico library of vibrational spectra of common MS adducts of over 4500 compounds found in the human metabolome database (HMDB). In total, the library currently contains more than 75 000 spectra computed at the DFT level that can be queried with an experimental IR spectrum. Moreover, we introduce a database of 189 experimental IRIS spectra, which is employed to validate the automated spectral matching routines. This demonstrates that 75% of metabolites in the experimental dataset is correctly identified, based solely on their exact m/z and IRIS spectrum. Additionally, we demonstrate an approach for specifically identifying substructures by performing a search without m/z constraints to find structural analogues. Such an unsupervised search paves the way towards the de novo identification of unknowns that are absent in spectral libraries. We apply the in silico spectral library to identify an unknown in a plasma sample as 3-hydroyxhexanoic acid, highlighting the potential of the method.

Keywords

infrared ion spectroscopy
metabolite identification
mass spectrometry
untargeted metabolomics
spectral library
HMDB
ketosis
3-hydroxy-hexanoic acid

Supplementary materials

Title
Description
Actions
Title
Experimental spectra vs all computed spectra
Description
For each experimental spectrum the 9 best matching computed spectra in the library are shown, without selection on chemical formula.
Actions
Title
Experimental spectra vs all computed tautomers and conformers
Description
For each experimental spectrum the match with all computed conformers and tautomers are shown.
Actions
Title
Experimental spectra vs all computed isomers
Description
For each experimental spectrum the match with all computed spectra of isomers in the library are shown.
Actions
Title
Main supporting information to the manuscript
Description
LC-MS procedure, optimization of the spectral similarity scoring, additional sample information and tables/figures in support of the results.
Actions
Title
Supporting tables
Description
Excel versions of tables in the supporting information
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.