A General Protocol for the Accurate Predictions of Molecular 13C/1H NMR Chemical Shifts via Machine Learning

Peng Gao; Jun Zhang; Qian Peng; Vassiliki-Alexandra Glezakou

doi:10.26434/chemrxiv.11302295.v1

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

A General Protocol for the Accurate Predictions of Molecular 13C/1H NMR Chemical Shifts via Machine Learning

10 December 2019, Version 1

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Accurate prediction of NMR chemical shifts with affordable computational cost is of great importance for rigorous structural assignments of experimental studies. However, the most popular computational schemes for NMR calculation—based on density functional theory (DFT) and gauge-including atomic orbital (GIAO) methods—still suffer from ambiguities in structural assignments. Using state-of-the-art machine learning (ML) techniques, we have developed a DFT+ML model that is capable of predicting 13C/1H NMR chemical shifts of organic molecules with high accuracy. The input for this generalizable DFT+ML model contains two critical parts: one is a vector providing insights into chemical environments, which can be evaluated without knowing the exact geometry of the molecule; the other one is the DFT-calculated isotropic shielding constant. The DFT+ML model was trained with a dataset containing 476 13C and 270 1H experimental chemical shifts. For the DFT methods used here, the root-mean-square-derivations (RMSDs) for the errors between predicted and experimental 13C/1H chemical shifts are as small as 2.10/0.18 ppm, which is much lower than the typical DFT (5.54/0.25 ppm), or DFT+linear regression (4.77/0.23 ppm) approaches. It also has smaller RMSDs and maximum absolute errors than two previously reported NMR-predicting ML models. We test the robustness of the model on two classes of organic molecules (TIC10 and hyacinthacines), where we unambiguously assigned the correct isomers to the experimental ones. This DFT+ML model is a promising way of predicting NMR chemical shifts and can be easily adapted to calculated shifts for any chemical compound.

Keywords

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

General Protocol for the Accurate Prediction of Molecular 13C/1H NMR Chemical Shifts via Machine Learning Augmented DFT

Peng Gao, Jun Zhang, Qian Peng, Jie Zhang, Vassiliki-Alexandra Glezakou journal article

Journal of Chemical Information and Modeling , Volume 60, Issue 8

Online publication date: Jun 30, 2020

Version History

Dec 10, 2019 Version 1

Metrics

3,770

952

Views

Downloads

Citations

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv.11302295.v1

Author’s competing interest statement

The authors declare no competing financial interests.

A General Protocol for the Accurate Predictions of Molecular 13C/1H NMR Chemical Shifts via Machine Learning

Authors

Abstract

Keywords

Comments

Now Published

Version History

Metrics

License

DOI

Author’s competing interest statement

Share