Abstract
Understanding how a macromolecule’s primary sequence governs its conformational landscape is crucial for elucidating its function, yet these design principles are still emerging for macromolecules with intrinsic disorder. Herein, we introduce a high-throughput workflow that implements a practical colorimetric conformational assay, introduces a semi-automated sequencing protocol using MALDI-MS/MS, and develops a generalizable sequence-structure algorithm. Using a model system of 20mer peptidomimetics containing polar glycine and hydrophobic N-butylglycine residues, we identified nine classifications of conformational disorder and isolated 122 unique sequences across varied compositions and conformations. Conformational distributions of three compositionally identical library sequences were corroborated through atomistic simulations and ion mobility spectrometry coupled with liquid chromatography. A data-driven strategy was developed using existing sequence variables and data-derived ‘motifs’ to inform a machine learning algorithm towards conformation prediction. This multifaceted approach enhances our understanding of sequence-conformation relationships and offers a powerful tool for accelerating the discovery of materials with conformational control.
Supplementary materials
Title
Supporting Information
Description
Supporting information includes a PDF containing the experimental details, synthetic procedures, and supplemental figures and tables including LC-MS, MALDI-TOF, NMR, and other characterization.
Actions
Title
High-resolution Images
Description
High-resolution brightfield images of one-bead one-compound library after incubation with Reichardt's dye.
Actions
Title
MALDI-MS/MS Spectra
Description
MALDI-MS/MS spectra used to identify the library 20mers sequences in this work.
Actions