AlphaFold2-RAVE: Protein Ensemble Generation with Physics-Based Sampling

06 February 2025, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

We introduce AlphaFold2-RAVE (af2rave), an open-source Python package that integrates machine learning-based structure prediction with physics-driven sampling to generate alternative protein conformations efficiently. Protein structures are not static but exist as ensembles of conformations, many of which are functionally relevant yet challenging to resolve experimentally. While deep learning models like AlphaFold2 can predict structural ensembles, they lack explicit physical validation. af2rave addresses this limitation by combining reduced multiple sequence alignment (MSA) AlphaFold2 predictions with molecular dynamics (MD) simulations to efficiently explore local conformational space. A feature selection module identifies key structural degrees of freedom, and the State Predictive Information Bottleneck (SPIB) method uncovers the underlying conformational topology, classifying functionally relevant states. Under the Reweighted Autoencoded Variational Bayes for Enhanced Sampling (RAVE) protocol, either unbiased or biased sampling can be performed to further explore the conformation ensembles. We validate af2rave on multiple systems, including E. coli adenosine kinase (ADK) and human DDR1 kinase, successfully identifying distinct functional states with minimal prior biological knowledge. Furthermore, we demonstrate that af2rave achieves conformational sampling efficiency comparable to long unbiased MD simulations on the SARS-CoV-2 spike protein receptor-binding domain while significantly reducing computational cost. The af2rave package provides a streamlined workflow for researchers to generate and analyze alternative protein conformations, offering an accessible tool for drug discovery and structural biology.

Keywords

Protein conformations
Machine learning
Molecular dynamics
AlphaFold2

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.