Modern Hopfield Networks for Few- and Zero-Shot Reaction Template Prediction

Finding synthesis routes for molecules of interest is an essential step in the discovery of new drugs and materials. To find such routes, computer-assisted synthesis planning (CASP) methods are employed, which rely on a model of chemical reactivity. In this study, we model single-step retrosynthesis in a template-based approach using modern Hopfield networks (MHNs). We adapt MHNs to associate different modalities, reaction templates and molecules, which allows the model to leverage structural information about reaction templates. This approach significantly improves the performance of template relevance prediction, especially for templates with few or zero training examples. With inference speed several times faster than baseline methods, we improve predictive performance for top-k exact match accuracy for k ≥ 5 in the retrosynthesis benchmark USPTO-50k. Code to reproduce the results will be available at github.com/ml-jku/mhn-react.

[1]  H. L. Morgan The Generation of a Unique Machine Description for Chemical Structures-A Technique Developed at Chemical Abstracts Service. , 1965 .

[2]  E J Corey,et al.  Computer-assisted design of complex organic syntheses. , 1969, Science.

[3]  W G Richards,et al.  Computer-aided molecular design. , 1983, Endeavour.

[4]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[5]  Yann LeCun,et al.  Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[6]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[7]  Daniel M. Lowe Extraction of chemical structures and reactions from the literature , 2012 .

[8]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[9]  Nishanth G. Chemmangattuvalappil,et al.  Challenges and opportunities in computer-aided molecular design , 2015, Comput. Chem. Eng..

[10]  Daniel M. Lowe,et al.  Development of a Novel Fingerprint for Chemical Reactions and Its Application to Large-Scale Reaction Classification and Similarity , 2015, J. Chem. Inf. Model..

[11]  Piotr Dittwald,et al.  Computer-Assisted Synthetic Planning: The End of the Beginning. , 2016, Angewandte Chemie.

[12]  Juno Nam,et al.  Linking the Neural Machine Translation and the Prediction of Organic Chemistry Reactions , 2016, ArXiv.

[13]  Alán Aspuru-Guzik,et al.  Neural Networks for the Prediction of Organic Chemistry Reactions , 2016, ACS central science.

[14]  Geoffrey E. Hinton,et al.  Layer Normalization , 2016, ArXiv.

[15]  Gregory A Landrum,et al.  What's What: The (Nearly) Definitive Guide to Reaction Role Assignment , 2016, J. Chem. Inf. Model..

[16]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[17]  Sepp Hochreiter,et al.  Self-Normalizing Neural Networks , 2017, NIPS.

[18]  Frank Hutter,et al.  Fixing Weight Decay Regularization in Adam , 2017, ArXiv.

[19]  Marwin H. S. Segler,et al.  Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction. , 2017, Chemistry.

[20]  Bowen Liu,et al.  Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models , 2017, ACS central science.

[21]  William H. Green,et al.  Computer-Assisted Retrosynthesis Based on Molecular Similarity , 2017, ACS central science.

[22]  Samuel S. Schoenholz,et al.  Neural Message Passing for Quantum Chemistry , 2017, ICML.

[23]  Stella X. Yu,et al.  Unsupervised Feature Learning via Non-parametric Instance Discrimination , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Connor W. Coley,et al.  Machine Learning in Computer-Aided Synthesis Planning. , 2018, Accounts of chemical research.

[25]  Hugo Ceulemans,et al.  Large-scale comparison of machine learning methods for drug target prediction on ChEMBL† †Electronic supplementary information (ESI) available: Overview, Data Collection and Clustering, Methods, Results, Appendix. See DOI: 10.1039/c8sc00148k , 2018, Chemical science.

[26]  Oriol Vinyals,et al.  Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.

[27]  Mike Preuss,et al.  Planning chemical syntheses with deep neural networks and symbolic AI , 2017, Nature.

[28]  Constantine Bekas,et al.  “Found in Translation”: predicting outcomes of complex organic chemistry reactions using neural sequence-to-sequence models† †Electronic supplementary information (ESI) available: Time-split test set and example predictions, together with attention weights, confidence and token probabilities. See DO , 2017, Chemical science.

[29]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[30]  Klavs F Jensen,et al.  RDChiral: An RDKit Wrapper for Handling Stereochemistry in Retrosynthetic Template Extraction and Application , 2019, J. Chem. Inf. Model..

[31]  Regina Barzilay,et al.  Learning to Make Generalizable and Diverse Predictions for Retrosynthesis , 2019, ArXiv.

[32]  Le Song,et al.  Retrosynthesis Prediction with Conditional Graph Logic Network , 2020, NeurIPS.

[33]  Matt J. Kusner,et al.  A Model to Search for Synthesizable Molecules , 2019, NeurIPS.

[34]  Nicholas A Cilfone,et al.  Enhancing Retrosynthetic Reaction Prediction with Deep Learning Using Multiscale Reaction Classification , 2019, J. Chem. Inf. Model..

[35]  Sepp Hochreiter,et al.  On failure modes in molecule generation and optimization. , 2019, Drug discovery today. Technologies.

[36]  Connor W. Coley,et al.  A graph-convolutional neural network model for the prediction of chemical reactivity† †Electronic supplementary information (ESI) available: Additional model and dataset details, results, discussion, and ref. 38 and 39. See DOI: 10.1039/c8sc04228d , 2018, Chemical science.

[37]  Christopher A. Hunter,et al.  Molecular Transformer: A Model for Uncertainty-Calibrated Chemical Reaction Prediction , 2018, ACS central science.

[38]  Igor V. Tetko,et al.  A Transformer Model for Retrosynthesis , 2019, ICANN.

[39]  Ryosuke Kojima,et al.  Prediction and Interpretable Visualization of Retrosynthetic Reactions Using Graph Convolutional Networks , 2019, J. Chem. Inf. Model..

[40]  Laurens van der Maaten,et al.  Self-Supervised Learning of Pretext-Invariant Representations , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Katsuhiko Ishiguro,et al.  Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis , 2020, ArXiv.

[42]  Emmanuel Noutahi,et al.  Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning , 2020, ACS omega.

[43]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[44]  Jian Tang,et al.  A Graph to Graphs Framework for Retrosynthesis Prediction , 2020, ICML.

[45]  I. Tetko,et al.  State-of-the-art augmented NLP transformer models for direct and single-step retrosynthesis , 2020, Nature Communications.

[46]  O. Engkvist,et al.  Artificial applicability labels for improving policies in retrosynthesis prediction , 2020, Mach. Learn. Sci. Technol..

[47]  Klavs F Jensen,et al.  Data Augmentation and Pretraining for Template-Based Retrosynthetic Prediction in Computer-Aided Synthesis Planning , 2020, J. Chem. Inf. Model..

[48]  Bo Dai,et al.  Energy-based View of Retrosynthesis , 2020, ArXiv.

[49]  Regina Barzilay,et al.  Current and Future Roles of Artificial Intelligence in Medicinal Chemistry Synthesis , 2020, Journal of medicinal chemistry.

[50]  Ross B. Girshick,et al.  Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Stephen Wu,et al.  A Bayesian algorithm for retrosynthesis , 2020, Journal of chemical information and modeling.

[52]  Xianzhen Wang,et al.  Interpretable machine-learning strategy for soft-magnetic property and thermal stability in Fe-based metallic glasses , 2020, npj Computational Materials.

[53]  Regina Barzilay,et al.  Learning Graph Models for Template-Free Retrosynthesis , 2020, ArXiv.

[54]  Jun Xu,et al.  Predicting Retrosynthetic Reactions Using Self-Corrected Transformer Neural Networks , 2019, J. Chem. Inf. Model..

[55]  Junzhou Huang,et al.  Molecular Graph Enhanced Transformer for Retrosynthesis Prediction , 2019, bioRxiv.

[56]  Geir Kjetil Sandve,et al.  Modern Hopfield Networks and Attention for Immune Repertoire Classification , 2020, bioRxiv.

[57]  Yang Yu,et al.  RetroXpert: Decompose Retrosynthesis Prediction like a Chemist , 2020, NeurIPS.

[58]  Yoshua Bengio,et al.  Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning , 2020, ICML.

[59]  Jinwoo Shin,et al.  RetCL: A Selection-based Approach for Retrosynthesis via Contrastive Learning , 2021, IJCAI.

[60]  Venkat Venkatasubramanian,et al.  Retrosynthesis prediction using grammar-based neural machine translation: An information-theoretic approach , 2021, Comput. Chem. Eng..

[61]  Stanislaw Jastrzebski,et al.  Molecule Edit Graph Attention Network: Modeling Chemical Reactions as Sequences of Graph Edits , 2020, J. Chem. Inf. Model..

[62]  Ilya Sutskever,et al.  Learning Transferable Visual Models From Natural Language Supervision , 2021, ICML.

[63]  David P. Kreil,et al.  Hopfield Networks is All You Need , 2020, ICLR.

[64]  Takashi Ishida,et al.  Single-Step Retrosynthesis Prediction Based on the Identification of Potential Disconnection Sites Using Molecular Substructure Fingerprints , 2021, J. Chem. Inf. Model..

[65]  Huanxiang Liu,et al.  RetroPrime: A Diverse, Plausible and Transformer-based Method for Single-Step Retrosynthesis Predictions , 2021 .

[66]  Substructure-based neural machine translation for retrosynthetic prediction , 2021, Journal of Cheminformatics.