These are preliminary reports that have not been peer-reviewed. They should not be regarded as conclusive, guide clinical practice/health-related behavior, or be reported in news media as established information. For more information, please see our FAQs.
3 files

ChemPer: An Open Source Tool for Automatically Generating SMIRKS Patterns

submitted on 21.06.2019, 00:49 and posted on 21.06.2019, 16:47 by Caitlin C. Bannan, David Mobley
Force fields are used in a variety of research fields including computer-aided drug design, biomaterials, and polymer chemistry. However, force fields also continue to limit the accuracy of predictions of physical properties. Current parameterization of these force fields involves a huge amount of human effort -- often years of work -- and depends heavily on the chemical intuition of those involved. The Open Force Field Initiative is working to replace this tedious process with an automated machinery to learn parameters and chemical perception. Our new SMIRKS-based force field format, SMIRNOFF, allows all parameter types to be defined independently. This allows for easier extension compared to the traditional atom type-based force fields where the chemical perception of all parameter types is intertwined.
We will need to be capable of programmatically learning SMIRKS patterns in order to fully automate force field parameterization. In this work, we present ChemPer -- a new tool for generating SMIRKS patterns based on clustered fragments (i.e. bonds, angles, or torsions) which should be assigned the same force field parameter. We demonstrate the utility of ChemPer by clustering fragments based on a reference force field, and then regenerating those parameters starting with a simple set of alkanes, ethers, and alcohols. Next, we create SMIRKS patterns for a protein SMIRNOFF which match the parameters from AMBER99. We conclude with a discussion of other potential applications and expansions to ChemPer.


NIH 1R01GM108889-01

NSF ACI-1547580


Email Address of Submitting Author


University of California, Irvine


United States of America

ORCID For Submitting Author


Declaration of Conflict of Interest

DLM serves on the scientific advisory board of OpenEye Scientific Software. As far as we know no conflict of interest exist as this work is free and open source.