Machine learning for the experimental and computational development of heterogeneous catalysis

Carlota Bozal-Ginesta; Sergio Pablo-García; Changhyeok Choi; Albert Tarancón; Alán Aspuru-Guzik

doi:10.26434/chemrxiv-2025-6v1sf

Catalysis

Search within Catalysis

Machine learning for the experimental and computational development of heterogeneous catalysis

06 March 2025, Version 1

Review

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Machine learning techniques have emerged as a useful tool for identifying complex patterns and correlations in large datasets. These techniques could be particularly useful in heterogeneous catalysis research for enabling the correlation of the catalyst performance to its physicochemical properties. So far in the chemistry and material science communities, machine learning models have mostly been built on high-throughput quantum chemistry calculations, and only selected case studies have led to the experimental discovery of improved catalyst materials. The slow pace and limited number of scientific breakthroughs may be attributed to simplistic assumptions about catalyst structure in quantum chemistry calculations and the incomplete experimental data available. Therefore, we believe that the development of high-throughput approaches closely coupled with machine-learning-based approaches could help accelerate experimental catalysis research. To aid the community, we bring together the available body of work applying high-throughput approaches and machine learning to the development of solid heterogeneous catalysis. We offer an objective view of the trends in the field by performing a detailed and systematic comparison of papers based on the (1) the ML method, the features used as model input and output, (3) the material, device or reaction investigated, (4) the dataset size, and (5) the overall achievement. Furthermore, for models reporting unitless R2 values, we quantitatively analyze the model performance as a function of the features used, the reaction type and the dataset size.

Supplementary materials

Title

Description

Actions

Title

Supporting Information

Description

Supporting Information

Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Mar 06, 2025 Version 1

Metrics

924

662

Views

Downloads

Citations

License

The content is available under CC BY NC 4.0

DOI

10.26434/chemrxiv-2025-6v1sf

Funding

Marie Skłodowska Curie Actions Postdoctoral Fellowship

101064374

U.S. Department of Energy, Office of Science

, Subaward by "University of Minnesota, Project title: Development of Machine Learning and Molecular Simulation Approaches to Accelerate the Discovery of Porous Materials for Energy-Relevant Applications under Award Number DE-SC0023454

Generalitat de Catalunya

2021-SGR-00750, NANOEN

Anders G. Frøseth, Acceleration Consortium, the Natural Resources Canada and the Canada 150 Research Chairs

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) declare that they have sought and gained approval from the relevant ethics committee/IRB for this research and its publication.

Machine learning for the experimental and computational development of heterogeneous catalysis

Authors

Abstract

Supplementary materials

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Share