Building Machine Learning Force Fields of Proteins with Fragment-Based Approach and Data Transfer

Zheng Cheng; Jiahui Du; Lei Zhang; Jing Ma; Wei Li; Shuhua Li

doi:10.26434/chemrxiv-2021-d3k50-v3

Theoretical and Computational Chemistry


18 June 2021, Version 3

Working Paper


This content is a preprint and has not undergone peer review at the time of posting.

Abstract

We combine our generalized energy-based fragmentation (GEBF) approach with transfer learning to construct machine learning force fields (MLFFs) for proteins using only quantum mechanics (QM) calculations on small subsystems. Using a kernel-based model, the Gaussian Approximation Potential (GAP), our protocol generates training sets automatically and efficiently. To facilitate the construction of training sets for various proteins, we create a protein data library that stores all subsystem data generated from previously trained proteins. With this data library, only the subsystems of a new protein that have new topological types are required to construct the corresponding training set. Using two polypeptides, 4ZNN and a 1XQ8 segment, as examples, we demonstrate that GEBF-MLFFs of full QM quality can be constructed with either kernel methods or neural network methods. The present work therefore provides an efficient and systematic way to build force fields with QM accuracy for biological systems such as proteins.
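The data-library idea in the abstract can be illustrated with a minimal sketch. It assumes that each subsystem is labeled by a string describing its topological type and that the library is a plain dictionary keyed by that label. The function name `split_by_library`, the label strings, and the record layout are hypothetical and are not taken from the authors' implementation.

```python
from typing import Dict, Iterable, List, Tuple


def split_by_library(
    subsystem_types: Iterable[str],
    library: Dict[str, List[dict]],
) -> Tuple[List[str], List[str]]:
    """Split a protein's subsystem types into reusable and missing ones.

    `library` maps a (hypothetical) topological-type label to the stored
    training records for that type. Types already in the library can be
    reused; types that are absent still need new QM data.
    """
    reused: List[str] = []
    missing: List[str] = []
    seen = set()
    for topo in subsystem_types:
        if topo in seen:
            continue  # each topological type is considered once
        seen.add(topo)
        if topo in library:
            reused.append(topo)
        else:
            missing.append(topo)
    return reused, missing


# Hypothetical library built from previously trained proteins.
library = {
    "typeA": [{"energy": -1.0}],
    "typeB": [{"energy": -2.0}],
}

# Hypothetical subsystem types found in a new protein.
new_protein = ["typeA", "typeC", "typeA", "typeB", "typeD"]

reused, missing = split_by_library(new_protein, library)
print("reuse from library:", reused)
print("need new QM data:", missing)
```

In this sketch only the types listed in `missing` would trigger new QM calculations; their results would then be added to the library for use with later proteins.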

Supplementary materials

Supporting Information: The supporting information of the new manuscript.


Now Published

Building quantum mechanics quality force fields of proteins with the generalized energy-based fragmentation approach and machine learning

Zheng Cheng, Jiahui Du, Lei Zhang, Jing Ma, Wei Li, Shuhua Li

Journal article: Physical Chemistry Chemical Physics, Volume 24, Issue 3

Online publication date: 2022

Version History

Jun 18, 2021 Version 3

May 07, 2021 Version 2

Apr 06, 2021 Version 1

Version Notes

In this new manuscript, we added the results for GEBF-MLFFs constructed with a neural network (NN) method.

Metrics

Views: 3,101

Downloads: 1,477

License

The content is available under CC BY-NC-ND 4.0.


Funding

National Natural Science Foundation of China: grants 22073043, 21833002, 22033004, 21873046

Author’s competing interest statement

The authors declare no competing financial interest.

Ethics

The authors have declared that ethics committee/IRB approval is not relevant to this content.
