Glycan Mixture Analysis by Kernel Component Composition for Matrix Factorization

26 November 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

A major challenge in structural glycomics is the presence of isomeric glycan structures, which may not be fully resolved by separation techniques such as liquid chromatography (LC) and ion mobility spectrometry (IMS). Tandem mass spectrometry (MS/MS) can be employed following on-line separation to distinguish unresolved features, because the temporal profile of each fragment ion is a combination of the profiles of the precursor ions that produce it. However, traditional principal component analysis can produce negative signals, which are unrealistic for intensity data, and classic non-negative matrix factorization (NMF) methods may yield factors that mix contributions from multiple components. This paper introduces a new variation of NMF, termed kernel component composition (KCC), which enables users to impose domain-specific prior knowledge about the components in the form of parametric kernels. The kernel parameters are then learned directly from the data. We developed a theoretically guaranteed algorithm based on proximal gradient descent to solve the optimization problem posed by KCC and derived detailed parameter update rules for the case of Gaussian kernels. The effectiveness of the KCC algorithm is demonstrated through simulation tests and through its application to the deconvolution of chemical datasets, including LC-MS/MS and IM-MS/MS analyses of isomeric glycan mixtures.
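
The abstract does not state the KCC objective or its update rules, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a data matrix X (fragment ions by time points) modeled as W H, where each row of H is a unit-height Gaussian with a learnable center and width, and W holds non-negative fragment intensities. The function names (`gaussian_kernels`, `fit_kernel_nmf`), the squared-error loss, the step-size choices, and the backtracking search are all illustrative assumptions. The proximal step for the non-negativity constraint on W reduces to clipping at zero.

```python
import numpy as np


def gaussian_kernels(t, mu, sigma):
    """Rows are unit-height Gaussian profiles evaluated on the time axis t."""
    z = (t[None, :] - mu[:, None]) / sigma[:, None]
    return np.exp(-0.5 * z ** 2)


def loss(X, W, t, mu, sigma):
    """Half squared Frobenius reconstruction error (assumed objective)."""
    R = W @ gaussian_kernels(t, mu, sigma) - X
    return 0.5 * np.sum(R ** 2)


def fit_kernel_nmf(X, t, mu, sigma, n_iter=200, sigma_min=1e-3, seed=0):
    """Alternate a projected gradient step on W with a step on (mu, sigma)."""
    rng = np.random.default_rng(seed)
    mu = np.asarray(mu, dtype=float).copy()
    sigma = np.asarray(sigma, dtype=float).copy()
    W = rng.uniform(0.0, 1.0, size=(X.shape[0], mu.size))
    history = [loss(X, W, t, mu, sigma)]

    for _ in range(n_iter):
        # Proximal gradient step on W: gradient step with size 1/L, then
        # projection onto the non-negative orthant (prox of the constraint).
        H = gaussian_kernels(t, mu, sigma)
        L = max(np.linalg.norm(H @ H.T, 2), 1e-12)  # Lipschitz constant
        W = np.maximum(W - ((W @ H - X) @ H.T) / L, 0.0)

        # Chain rule: gradient of the loss w.r.t. each kernel's parameters.
        D = W.T @ (W @ H - X)  # dLoss/dH, shape (k, n)
        dt = t[None, :] - mu[:, None]
        g_mu = np.sum(D * H * dt / sigma[:, None] ** 2, axis=1)
        g_sigma = np.sum(D * H * dt ** 2 / sigma[:, None] ** 3, axis=1)

        # Backtracking: halve the step until the loss does not increase.
        current = loss(X, W, t, mu, sigma)
        step = 1.0
        for _attempt in range(30):
            mu_new = mu - step * g_mu
            sigma_new = np.maximum(sigma - step * g_sigma, sigma_min)
            if loss(X, W, t, mu_new, sigma_new) <= current:
                mu, sigma = mu_new, sigma_new
                break
            step *= 0.5
        history.append(loss(X, W, t, mu, sigma))

    return W, mu, sigma, history


# Synthetic example: two overlapping components shared by six fragment ions.
t = np.linspace(0.0, 10.0, 200)
W_true = np.random.default_rng(1).uniform(0.1, 1.0, size=(6, 2))
X = W_true @ gaussian_kernels(t, np.array([4.0, 5.5]), np.array([0.6, 0.8]))
W, mu, sigma, history = fit_kernel_nmf(X, t, mu=[3.5, 6.0], sigma=[1.0, 1.0])
```

Constraining each row of H to a parametric peak shape is what keeps a factor from absorbing signal from several components, at the cost of a non-convex problem in the kernel parameters; the result can therefore depend on the initial centers and widths.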

Keywords

LC-MS/MS
IM-MS/MS
Glycans
Isomer Analysis
Kernel Component Composition
Non-negative Matrix Factorization

Supplementary materials

Title: IM-MS/MS data of trisaccharides
Description: Arrival time distributions of CID fragments of maltotriose, isomaltotriose and their mixture, acquired with 95 ms separation time.
