Abstract
Carbon dioxide is a versatile C1 building block in organic synthesis. Understanding its reactivity is crucial for predicting reaction outcomes and identifying suitable substrates for the creation of value-added chemicals and drugs. A recent study [Li et al., J. Am. Chem. Soc., 2020, 142, 8383] estimated the reactivity of CO2 in the form of Mayr's electrophilicity parameter E on the basis of a single carboxylation reaction. The disagreement between experiment (E = –16.3) and computation (E = –11.4) corresponds to a deviation of up to ten orders of magnitude in bimolecular rate constants of carboxylation reactions. Here, we introduce a data-driven approach incorporating supervised learning, quantum chemistry, and uncertainty quantification to resolve this discrepancy. The dataset used for reducing the uncertainty in E(CO2) represents 15 carboxylation reactions in DMSO. However, experimental data is only available for one of these reactions. To ensure reliable predictions, we selected a training set composed of this and 19 additional reactions comprising heteroallenes other than CO2 for which experimental data is available. With the new data-driven protocol, we can narrow down the electrophilicity of carbon dioxide to E = –14.6(5) with 95 % confidence.
Supplementary materials
Title
Supplementary information
Description
Information on benchmarks for computational methods, comparison between isolated reactants and pre-reaction complexes, information on conformer ensembles, information on the training and validation of regression models, overview of tested model parameters, results for the determination of the electrophilicity of electrophiles E1, E2, and E3, additional information on the determination of the electrophilicity of carbon dioxide.
Actions
Supplementary weblinks
Title
Gitlab repository
Description
Example input structures, optimised structures, interactive notebook for (reproductive) data analyses.
Actions
View