Mapping the Space of Chemical Reactions using Attention-Based Neural Networks

Philippe Schwaller; Daniel Probst; Alain C. Vaucher; Vishnu H Nair; David Kreutter; Teodoro Laino; Jean-Louis Reymond

doi:10.26434/chemrxiv.9897365.v4

Theoretical and Computational Chemistry

Search within Theoretical and Computational Chemistry

Mapping the Space of Chemical Reactions using Attention-Based Neural Networks

11 December 2020, Version 4

Working Paper

Show author details

This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Organic reactions are usually assigned to classes grouping reactions with similar reagents and mechanisms. Reaction classes facilitate communication of complex concepts and efficient navigation through chemical reaction space. However, the classification process is a tedious task, requiring the identification of the corresponding reaction class template via annotation of the number of molecules in the reactions, the reaction center and the distinction between reactants and reagents. In this work, we show that transformer-based models can infer reaction classes from non-annotated, simple text-based representations of chemical reactions. Our best model reaches a classification accuracy of 98.2%. We also show that the learned representations can be used as reaction fingerprints which capture fine-grained differences between reaction classes better than traditional reaction fingerprints. The unprecedented insights into chemical reaction space enabled by our learned fingerprints is illustrated by an interactive reaction atlas providing visual clustering and similarity searching.

Code: https://github.com/rxn4chemistry/rxnfp

Tutorials: https://rxn4chemistry.github.io/rxnfp/

Interactive reaction atlas: https://rxn4chemistry.github.io/rxnfp//tmaps/tmap_ft_10k.html

Keywords

SMILES-Encoded Molecular Structures

SMILES

SMILES string representation

reaction fingerprints

Supplementary weblinks

Title

Description

Actions

Title

Description

Actions

View

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Now Published

Mapping the space of chemical reactions using attention-based neural networks

Philippe Schwaller, Daniel Probst, Alain C. Vaucher, Vishnu H. Nair, David Kreutter, Teodoro Laino, Jean-Louis Reymond journal article

Nature Machine Intelligence , Volume 3, Issue 2

Online publication date: Jan 28, 2021

Version History

Dec 11, 2020 Version 4

Aug 07, 2020 Version 3

Dec 26, 2019 Version 2

Sep 27, 2019 Version 1

Version Notes

- USPTO 1k TPL data set and experiments - Additional experiments and code - Machine Learning and the Physical Sciences Workshop at the 33rd Conference on Neural Information Processing Systems (NeurIPS)

Metrics

17,664

5,815

Views

Downloads

Citations

License

The content is available under CC BY NC ND 4.0

DOI

10.26434/chemrxiv.9897365.v4

Funding

DP and JLP: NCCR TransCure - From transport physiology to identification of therapeutic targets. Swiss National Science Foundation

Author’s competing interest statement

No conflict of interest

Mapping the Space of Chemical Reactions using Attention-Based Neural Networks

Authors

Abstract

Keywords

Supplementary weblinks

Comments

Now Published

Version History

Version Notes

Metrics

License

DOI

Funding

Author’s competing interest statement

Share