Quick-and-Easy Validation of Protein–Ligand Binding Models Using Fragment-Based Semi-Empirical Quantum Chemistry

29 October 2024, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Electronic structure calculations in enzymes converge very slowly with respect to the size of the model region that is described using quantum mechanics (QM), requiring hundreds of atoms to obtain converged results and exhibiting substantial sensitivity (at least in smaller models) to which amino acids are included in the QM region. As such, there is considerable interest in developing automated procedures to construct a QM model region based on well-defined criteria. However, testing such procedures is burdensome due to the cost of large-scale electronic structure calculations. Here, we show that semi-empirical methods can be used as alternatives to density functional theory (DFT) to assess convergence in sequences of models generated by various automated protocols. The cost of these convergence tests is reduced even further by means of a many-body expansion. We use this approach to examine convergence (with respect to model size) of protein–ligand binding energies. Fragment-based semi-empirical calculations afford well-converged interaction energies in a tiny fraction of the cost required for DFT calculations. Two-body interactions between the ligand and single-residue amino acid fragments provide affords a low-cost way to construct a "QM-informed" enzyme model of reduced size, furnishing an automatable active-site model-building procedure. This provides a streamlined, user-friendly approach for constructing ligand binding-site models that requires neither a priori information nor manual adjustments. Extension to model-building for thermochemical calculations should be straightforward.

Keywords

drug discovery
QM cluster models
binding energies
quantum chemistry
fragmentation
intermolecular interactions

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Additional figures and tables
Actions
Title
Model specifications
Description
List of residues included in each QM model
Actions
Title
Protein–ligand structures
Description
PDB files for each system examined
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.