ChemPer: An Open Source Tool for Automatically Generating SMIRKS Patterns

21 June 2019, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Force fields are used in a variety of research fields including computer-aided drug design, biomaterials, and polymer chemistry. However, force fields also continue to limit the accuracy of predictions of physical properties. Current parameterization of these force fields involves a huge amount of human effort -- often years of work -- and depends heavily on the chemical intuition of those involved. The Open Force Field Initiative is working to replace this tedious process with an automated machinery to learn parameters and chemical perception. Our new SMIRKS-based force field format, SMIRNOFF, allows all parameter types to be defined independently. This allows for easier extension compared to the traditional atom type-based force fields where the chemical perception of all parameter types is intertwined.
We will need to be capable of programmatically learning SMIRKS patterns in order to fully automate force field parameterization. In this work, we present ChemPer -- a new tool for generating SMIRKS patterns based on clustered fragments (i.e. bonds, angles, or torsions) which should be assigned the same force field parameter. We demonstrate the utility of ChemPer by clustering fragments based on a reference force field, and then regenerating those parameters starting with a simple set of alkanes, ethers, and alcohols. Next, we create SMIRKS patterns for a protein SMIRNOFF which match the parameters from AMBER99. We conclude with a discussion of other potential applications and expansions to ChemPer.

Keywords

Force Field Parameterization
force Field
chemical perception

Supplementary materials

Title
Description
Actions
Title
electronic SI
Description
Actions
Title
SI
Description
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.