Clustering and Analyzing Ensembles of Residue Interaction Networks from Molecular Dynamics Simulations

11 June 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Network methods and molecular dynamics (MD) simulations have become essential tools for studying protein dynamics. However, applying network methods to MD simulations of flexible proteins is a major challenge, since the high conformational heterogeneity in such multi-state systems can lead to vastly different network topolo- gies across an ensemble. To address this, tools that can disentangle conformational ensembles on a network level are needed. Here, we propose a graph-based clustering framework that provides state-specific insight into the residue interactions of flexible proteins. The framework hinges on using the set of graph-theoretic closeness centralities of all amino acid residues as a structural fingerprint and input for unsupervised machine learning algorithms to perform dimensionality reduction and clustering. The resulting clusters - states with shared network topology - are subsequently fed back into the upstream workflow and characterized at every representation level. Based on the example of FAT10 - a protein with intrinsically disordered regions and two folded domains connected by a flexible linker - we demonstrate how this approach can be used to understand the protein’s residue interactions on different, interconnected levels and to characterize its most populated states. Due to the modularity of the framework, it can be easily adapted, which makes it a suitable method to support network-based analyses of MD simulations for a wide variety of proteins.

Keywords

Molecular Dynamics simulations
Network Analysis
Residue Interaction Networks
Multidomain Protein
Conformational Ensemble
FAT10

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.