Abstract
The rise of data-driven catalyst design has led to an increasing availability of curated catalyst datasets. These datasets likely contain historical trends that could guide research, but extracting such trends from high-dimensional, high-volume data remains challenging, limiting broader use beyond data scientists. This study proposes a catalyst phylogenetic tree, a pipelined method for visualizing vast catalyst datasets to provide an overview of their evolution. It groups catalysts by distinct elemental combinations, termed catalyst sets, and maps their physicochemical distances onto a phylogenetic tree. Applied to two publicly available datasets on oxidative coupling and dry reforming of methane, this method successfully identified catalyst lineages with similar designs emerging across different eras, as well as the standard catalyst designs for each lineage. This approach can be extended beyond catalyst data to various materials, maximizing the value of literature-based data curation and accelerating research and development across diverse fields.
Supplementary materials
Title
Supporting informatoin
Description
Pairwise similarity of elements; characteristics of catalyst sets for oxidative coupling of methane; phylogenetic trees with different color-coding.
Actions
Supplementary weblinks
Title
Code availability
Description
The code for generating the catalyst phylogenetic tree can be publicly found at https://github.com/TaniikeLaboratory/Catalyst-Phylogenetic-Tree. The data used in this paper are also there as examples to run the code.
Actions
View