These are preliminary reports that have not been peer-reviewed. They should not be regarded as conclusive, guide clinical practice/health-related behavior, or be reported in news media as established information. For more information, please see our FAQs.
CGR generation-arXiv_P_17.01.2020.pdf (1.85 MB)
Discovery of Novel Chemical Reactions by Deep Generative Recurrent Neural Network
Preprints are manuscripts made publicly available before they have been submitted for formal peer review and publication. They might contain new research findings or data. Preprints can be a draft or final version of an author's research but must not have been accepted for publication at the time of submission.
submitted on 17.01.2020 and posted on 22.01.2020by William Bort, Igor I. Baskin, Pavel Sidorov, Gilles Marcou, Dragos Horvath, Timur Madzhidov, Alexandre Varnek, Timur Gimadiev, Ramil Nugmanov, Artem Mukanov
Here, we report an application of Artificial Intelligence techniques to generate novel chemical reactions of the given type. A sequence-to-sequence autoencoder was trained on the USPTO reaction database. Each reaction was converted into a single Condensed Graph of Reaction (CGR), followed by their translation into on-purpose developed SMILES/GGR text strings. The autoencoder latent space was visualized on the two-dimensional generative topographic map, from which some zones populated by Suzuki coupling reactions were targeted. These served for the generation of novel reactions by sampling the latent space points and decoding them to SMILES/CGR.