Abstract
We present CBR-db, a curated biochemical reaction database that integrates and refines data from the Kyoto Encyclopedia of Genes and Genomes (KEGG) and the ATLAS of Biochemistry databases to enable chemically consistent analyses of biochemical reaction data. The curation addresses key limitations of both KEGG and ATLAS, including malformed chemical representations, inaccurate stoichiometry, and ambiguous or incomplete reaction entries. Resolving these issues enables more chemically realistic analyses of biochemical reactions, hypothesized reactions, and the reaction networks constructed from them, which are essential for applications in research areas ranging from prebiotic chemistry to metabolism and its evolution, synthetic biology, and metabolic engineering. Altogether, CBR-db features 148,673 high-quality reactions and 18,716 compounds; the refinement and curation procedures are detailed in this report. CBR-db is designed to be continuously updated to incorporate the latest releases of the KEGG and ATLAS databases. Furthermore, it provides a rigorous framework through which the reaction list can be extended and remaining issues can be corrected.