Abstract
Small molecule empirical force fields (FFs), including the CHARMM General Force Field (CGenFF), are designed to have wide coverage of organic molecules and to rapidly assign parameters to molecules not explicitly included in the FF. Assignment of parameters to new molecules in CGenFF is based on a trained bond-angle-dihedral charge increment linear interpolation scheme for the partial atomic charges along with bonded parameters assigned based on analogy using a rules-based penalty score scheme associated with atom types and chemical connectivity. Accordingly, the accuracy of CGenFF is related to the extent of the training set of available parameters. In the present study that training set is extended by 1,390 molecules selected to represent connectivities new to CGenFF training compounds. Quantum mechanical (QM) data for optimized geometries, bond, valence angle, and dihedral angle potential energy scans, interactions with water, molecular dipole moments, and electrostatic potentials were used as target data. The resultant bonded parameters and partial atomic charges were used to train a new version of the CGenFF program, v5.0, which was used to generate parameters for a validation set of molecules, including drug-like molecules approved by the FDA, which were then benchmarked against both experimental and QM data. CGenFF v5.0 shows overall improvements with respect to QM intramolecular geometries, vibrations, dihedral potential energy scans, dipole moments and interactions with water. Tests of pure solvent properties of 216 molecules show small improvements versus the previous release of CGenFF v2.5.1 reflecting the high quality of the Lennard-Jones parameters that were explicitly optimized during the initial optimization of both the CGenFF and the CHARMM36 force field. CGenFF v5.0 represents an improvement that is anticipated to more accurately model intramolecular geometries and strain energies as well as non-covalent interactions of drug-like and other organic molecules.
Supplementary materials
Title
Supporting Information
Description
Additional figures and tables for: water-compound interactions; pure solvent MD simulations; partial charge distribution; penalty and charge correlation; example of equilibrium values for a molecule with optimized bonded terms; 4-membered ring validation molecules and corresponding comparison of MM vs QM internal coordinates; PES scans energy plots; vibrational analysis; additional data for FDA compounds validation and HFE calculations. Also see GitHub repository (Additional figures and tables for: water-compound interactions; pure solvent MD simulations; partial charge distribution; penalty and charge correlation; example of equilibrium values for a molecule with optimized bonded terms; 4-membered ring validation molecules and corresponding comparison of MM vs QM internal coordinates; PES scans energy plots; vibrational analysis; additional data for FDA compounds validation and HFE calculations.
Actions
Supplementary weblinks
Title
CGenFF v5.0 GitHub repository
Description
The GitHub repository (https://github.com/mackerell-lab/cgenff5) contains topology and parameter files for the 1390 molecules for which optimization was performed and developer program; QM and MM coordinates, 2D representations and names for all the molecular datasets; magnitude, and angles of the molecular dipole moment for each molecule, individual empirical and QM water interaction energies and distances for compounds with optimized parameters and the FDA compounds; PES scan plots for all optimized bonded terms; all data points used to generate the figures and tables of the article.
Actions
View Title
CGenFF Online Program
Description
For academic users with an .edu email address, individual molecules may be uploaded online into CGenFF at https://cgenff.silcsbio.com/ and the CGenFF program can be obtained at no charge for academic users from Silcsbio LLC (silcsbio.com).
Actions
View