Theoretical and Computational Chemistry

Discovery of Novel Chemical Reactions by Deep Generative Recurrent Neural Network


Here, we report an application of Artificial Intelligence techniques to generate novel chemical reactions of the given type. A sequence-to-sequence autoencoder was trained on the USPTO reaction database. Each reaction was converted into a single Condensed Graph of Reaction (CGR), followed by their translation into on-purpose developed SMILES/GGR text strings. The autoencoder latent space was visualized on the two-dimensional generative topographic map, from which some zones populated by Suzuki coupling reactions were targeted. These served for the generation of novel reactions by sampling the latent space points and decoding them to SMILES/CGR.


Thumbnail image of CGR generation-arXiv_P_17.01.2020.pdf
download asset CGR generation-arXiv_P_17.01.2020.pdf 1 MB [opens in a new tab]