Abstract
Understanding complex, multi-step chemical reactions at the molecular level is a major challenge whose solution would greatly benefit the design and optimization of numerous chemical processes. The separation of rare-earth (4f) and actinide (5f) elements is an example where improving our chemical understanding is important for designing and optimizing new chemistries, even with a limited number of observations. In this work, we leverage data-driven artificial intelligence and machine-learning approaches to develop kinetic reaction networks that describe the liquid-liquid extraction mechanism of uranium using N,N-di-2-ethylhexyl-isobutyramide (DEHiBA). Specifically, we compare and contrast the properties of two classes of models: (1) purely data-driven models that are regularized using chemistry-agnostic, L1 regression and (2) chemistry-informed models which are regularized using relative reaction energies provided by quantum mechanical calculations. We observe that purely data-driven models are unbiased, simple, and accurate in their predictions of experimental measurements when provided with sufficient data but are difficult to fully constrain and interpret. In contrast, chemistry-informed models exhibit significantly improved chemical interpretability and consistency, providing a detailed description of the separation process, while achieving high accuracy through ensemble averaging. Overall, the dominant species predicted to be extracted into the organic phase is UO2(NO3)2(DEHiBA)2, agreeing with experimental slope analysis, thermodynamic modeling, EXAFS, and crystal structures. This work demonstrates that leveraging the fundamental structure of the problem can lead to efficient learning schemes that provide both accurate predictions and chemical insights at low computational cost.
Supplementary materials
Title
Supplementary Information for Assessment of Data-Driven Kinetic Reaction Networks for Separation Chemistry
Description
Details on the kinetic reaction network models; Extraction data used for this
work; Regularization of chemistry-informed models; Parity plots at different level of
accuracy (RMSE); Fraction of non-zero parameters of data-driven models; Ensemble
net flux networks; Details of DFT-optimized structures and relative energies
Actions
Title
DFT-optimized structures and relative energies
Description
DFT-optimized structures and relative energies
Actions