A High-Throughput Workflow to Analyze Sequence-Conformation Relationships and Explore Hydrophobic Patterning in Disordered Peptoids

04 October 2023, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Understanding how a macromolecule’s primary sequence governs its conformational landscape is crucial for elucidating its function, yet these design principles are still emerging for macromolecules with intrinsic disorder. While parameters describing subsets of disordered proteins and synthetic materials have been established, they are often tailored to specific chemical interactions and monomer classes, limiting their broader applicability. To address this gap, we present a high-throughput workflow that implements a versatile colorimetric conformational assay, introduces a semi-automated sequencing protocol using MALDI-MS/MS, and pioneers a data-driven sequence parameterization methodology that integrates into a predictive algorithm. Using a model system consisting of two-component peptidomimetics (20mer peptoids) containing polar glycine and hydrophobic N-butylglycine residues in a one-bead one-compound (OBOC) library, we visually identified nine classifications of conformational disorder. From this library, we identified 122 unique sequences across varied compositions and conformations, and we developed an image analysis tool that ultimately characterized an order of magnitude larger fraction of the complete library. Low-throughput techniques, atomistic simulations and ion mobility spectrometry coupled with liquid chromatography and mass spectrometry separations (LC-IMS-MS) of purified peptoids, yielded quantitative descriptors of the conformational ensembles formed by three compositionally identical sequences selected from the library. Finally, a data-driven technique was developed that exploits ‘motifs’ within the 20mer sequences to inform a gradient-boosted tree machine learning algorithm towards conformation prediction. This multifaceted approach enhances our understanding of sequence-conformation relationships and offers a powerful tool for accelerating the discovery and development of advanced materials with precise conformational control.

Keywords

peptoids
intrinsic disorder
high-throughput characterization
oligomer sequencing
data-driven analysis

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Supporting information includes a PDF containing the experimental details, synthetic procedures, and supplemental figures and tables including LC-MS, MALDI-TOF, NMR, and other characterization.
Actions
Title
High-resolution Images
Description
High-resolution brightfield images of one-bead one-compound library after incubation with Reichardt's dye.
Actions
Title
MALDI-MS/MS Spectra
Description
MALDI-MS/MS spectra used to identify the library 20mers sequences in this work.
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.