Using Machine Learning to Predict the pKa of C–H Bonds. Relevance to Catalytic Methane Functionalization

Christopher Zhou; William Grumbles; Thomas Cundari

doi:10.26434/chemrxiv.12646772.v1

Six machine learning models (random forest, neural network, support vector machine, k-nearest neighbors, Bayesian ridge regression, least squares linear regression) were trained on a dataset of 3d transition metal-methyl and -methane complexes to predict pK_a(C–H), a property demonstrated to be important in catalytic activity and selectivity. Results illustrate that the machine learning models are quite promising, with RMSE metrics ranging from 4.6 to 8.8 pK_a units, despite the relatively modest amount of data available to train on. Importantly, the machine learning models agreed that (a) conjugate base properties were more impactful than those of the corresponding conjugate acid, and (b) the energy of the highest occupied molecular orbital conjugate base was the most significant input feature in the prediction of pK_a(C–H). Furthermore, results from additional testing conducted using an external dataset of Sc-methyl complexes demonstrated the robustness of all models, with RMSE metrics ranging from 1.5 to 6.6 pK_a units. In all, this research demonstrates the potential of machine learning models in organometallic catalyst development.

Using Machine Learning to Predict the pKa of C–H Bonds. Relevance to Catalytic Methane Functionalization

Abstract

Keywords

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Share

Using Machine Learning to Predict the pKa of C–H Bonds. Relevance to Catalytic Methane Functionalization

Authors

Abstract

Keywords

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Share