PharmaNet: Pharmaceutical discovery with deep recurrent neural networks

The discovery and development of novel pharmaceuticals is an area of active research mainly due to the large investments required and long payback times. As of 2016, the development of a novel drug candidate required up to $ USD 2.6 billion in investment for only 10% rate of approval by the FDA. To help decreasing the costs associated with the process, a number of in silico approaches have been developed with relatively low success due to limited predicting performance. Here, we introduced a machine learning-based algorithm as an alternative for a more accurate search of new pharmacological candidates, which takes advantage of Recurrent Neural Networks (RNN) for active molecule prediction within large databases. Our approach, termed PharmaNet was implemented here to search for ligands against specific cell receptors within 102 targets of the DUD-E database, which contains 22886 active molecules. PharmaNet comprises three main phases. First, a SMILES representation of the molecule is converted into a raw molecular image. Second, a convolutional encoder processes the data to obtain a fingerprint molecular image that is finally analyzed by a Recurrent Neural Network (RNN). This approach enables precise predictions of the molecules’ target on the basis of the feature extraction, the sequence analysis and the relevant information filtered out throughout the process. Molecule Target prediction is a highly unbalanced detection problem and therefore, we propose that an adequate evaluation metric of performance is the area under the Normalized Average Precision (NAP) curve. PharmaNet largely surpasses the previous state-of-the-art method with 95.8% in the Receiver Operating Characteristic curve (ROC-AUC) and 58.9% in the NAP curve. We obtained a perfect performance for human farnesyl pyrophosphate synthase (FPPS), which is a potential target for antimicrobial and anticancer treatments. We decided to test PharmaNet for activity prediction against FPPS by searching in the CHEMBL data set. We obtained [3] potential inhibitors that were further validated through both molecular docking and in silico toxicity prediction. Most importantly, one of this candidates, CHEMBL2007613, was predicted as a potential antiviral due to its involvement on the PCDH17 pathway, which has been reported to be related to viral infections.

[1]  Sergey Ivanov,et al.  DIGEP-Pred: web service for in silico prediction of drug-induced gene expression profiles based on structural formula , 2013, Bioinform..

[2]  A. Berghuis,et al.  Probing the molecular and structural elements of ligands binding to the active site versus an allosteric pocket of the human farnesyl pyrophosphate synthase. , 2015, Bioorganic & medicinal chemistry letters.

[3]  Manfred Kansy,et al.  High throughput solubility measurement in drug discovery and development. , 2007, Advanced drug delivery reviews.

[4]  Luc Van Gool,et al.  Deep Retinal Image Understanding , 2016, MICCAI.

[5]  C. Humblet,et al.  Idea2Data: Toward a New Paradigm for Drug Discovery. , 2019, ACS medicinal chemistry letters.

[6]  M. Notarnicola,et al.  Deregulated expression and activity of Farnesyl Diphosphate Synthase (FDPS) in Glioblastoma , 2017, Scientific Reports.

[7]  Y. Tsantrizos,et al.  Inhibition of farnesyl pyrophosphate (FPP) and/or geranylgeranyl pyrophosphate (GGPP) biosynthesis and its implication in the treatment of cancers , 2019, Critical reviews in biochemistry and molecular biology.

[8]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[9]  M. Kansy,et al.  'Flexible-acceptor' General Solubility Equation for 'beyond Rule of 5' Drugs. , 2020, Molecular pharmaceutics.

[10]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[11]  Andreas Persidis,et al.  Artificial intelligence for drug design , 1997, Nature Biotechnology.

[12]  Konstantinos Kamnitsas,et al.  Fast Fully Automatic Segmentation of the Human Placenta from Motion Corrupted MRI , 2016, MICCAI.

[13]  Jitendra Malik,et al.  Learning to detect natural image boundaries using local brightness, color, and texture cues , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  L. Kastl,et al.  miRNA-34a is associated with docetaxel resistance in human breast cancer cells , 2011, Breast Cancer Research and Treatment.

[15]  Keith C. C. Chan,et al.  Large-scale prediction of drug-target interactions from deep representations , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[16]  Behzad Baradaran,et al.  The Different Mechanisms of Cancer Drug Resistance: A Brief Review , 2017, Advanced pharmaceutical bulletin.

[17]  B. Vincenzi,et al.  Bisphosphonates as anticancer agents in early breast cancer: preclinical and clinical evidence , 2015, Breast Cancer Research.

[18]  Tyler E. Miller,et al.  MicroRNA-221/222 Confers Tamoxifen Resistance in Breast Cancer by Targeting p27Kip1*♦ , 2008, Journal of Biological Chemistry.

[19]  Derek Hoiem,et al.  Diagnosing Error in Object Detectors , 2012, ECCV.

[20]  Izhar Wallach,et al.  AtomNet: A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery , 2015, ArXiv.

[21]  Ji Hoon Jeon,et al.  Farnesyl diphosphate synthase is important for the maintenance of glioblastoma stemness , 2018, Experimental & Molecular Medicine.

[22]  P. Sanseau,et al.  Drug repurposing: progress, challenges and recommendations , 2018, Nature Reviews Drug Discovery.

[23]  Thierry Kogej,et al.  Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks , 2017, ACS central science.

[24]  Jakub M. Tomczak,et al.  Interaction prediction in structure-based virtual screening using deep learning , 2018, Comput. Biol. Medicine.

[25]  M. Clemons,et al.  Adjuvant bisphosphonate treatment for breast cancer: Where are we heading and can the pre-clinical literature help us get there? , 2012, Journal of bone oncology.

[26]  Emma J. Chory,et al.  A Deep Learning Approach to Antibiotic Discovery , 2020, Cell.

[27]  Sharangdhar S. Phatak,et al.  High-throughput and in silico screenings in drug discovery , 2009, Expert opinion on drug discovery.

[28]  A. Berghuis,et al.  Human isoprenoid synthase enzymes as therapeutic targets , 2014, Front. Chem..

[29]  Jitendra Malik,et al.  A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[30]  Vladimir Poroikov,et al.  QSAR Modelling of Rat Acute Toxicity on the Basis of PASS Prediction , 2011, Molecular informatics.

[31]  Xiuping Liu,et al.  Role of MicroRNA miR-27a and miR-451 in the regulation of MDR1/P-glycoprotein expression in human cancer cells. , 2008, Biochemical pharmacology.

[32]  Abhinav Vishnu,et al.  SMILES2Vec: An Interpretable General-Purpose Deep Neural Network for Predicting Chemical Properties , 2017, ArXiv.

[33]  Pablo Andrés Arbeláez,et al.  ISINet: An Instance-Based Approach for Surgical Instrument Segmentation , 2020, MICCAI.

[34]  Jugal K. Kalita,et al.  Deep Learning applied to NLP , 2017, ArXiv.

[35]  Qing Liu,et al.  miR-29c contribute to glioma cells temozolomide sensitivity by targeting O6-methylguanine-DNA methyltransferases indirectly , 2016, Oncotarget.

[36]  A. Addario,et al.  Role of microRNAs in drug-resistant ovarian cancer cells. , 2008, Gynecologic oncology.

[37]  Gerard D. Wright,et al.  Expanding the soil antibiotic resistome: exploring environmental diversity. , 2007, Current opinion in microbiology.

[38]  J. Choubey,et al.  Insilico Docking Analysis of Nitrogen Containing Bisphosphonate with Human Fernasyl Pyrophosphate Synthase , 2013 .

[39]  Jitendra Malik,et al.  Hypercolumns for object segmentation and fine-grained localization , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[41]  N. Shomron,et al.  Restoration of miR-424 suppresses BCR-ABL activity and sensitizes CML cells to imatinib treatment. , 2015, Cancer letters.

[42]  Edward W. Lowe,et al.  Computational Methods in Drug Discovery , 2014, Pharmacological Reviews.

[43]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[44]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[45]  Raymond R Tice,et al.  Genetic toxicology in silico protocol. , 2019, Regulatory toxicology and pharmacology : RTP.

[46]  Viktor Hornak,et al.  Hidden bias in the DUD-E dataset leads to misleading performance of deep learning in structure-based virtual screening , 2019, PloS one.

[47]  Jakub M. Tomczak,et al.  Learning Deep Architectures for Interaction Prediction in Structure-based Virtual Screening , 2016, ArXiv.

[48]  C. L. Ventola The antibiotic resistance crisis: part 1: causes and threats. , 2015, P & T : a peer-reviewed journal for formulary management.

[49]  Julian Fierrez,et al.  Exploring Recurrent Neural Networks for On-Line Handwritten Signature Biometrics , 2018, IEEE Access.

[50]  W. Ko,et al.  Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and coronavirus disease-2019 (COVID-19): The epidemic and the challenges , 2020, International Journal of Antimicrobial Agents.

[51]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[52]  C. Xiao,et al.  MicroRNA-221 Induces Cell Survival and Cisplatin Resistance through PI3K/Akt Pathway in Human Osteosarcoma , 2013, PloS one.

[53]  Günter Klambauer,et al.  DeepTox: Toxicity Prediction using Deep Learning , 2016, Front. Environ. Sci..

[54]  Peter Cresswell,et al.  The interferon-inducible protein viperin inhibits influenza virus release by perturbing lipid rafts. , 2007, Cell host & microbe.

[55]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[57]  G. Schneider,et al.  Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery. , 2019, Chemical reviews.

[58]  Udo Oppermann,et al.  The inhibition of human farnesyl pyrophosphate synthase by nitrogen-containing bisphosphonates. Elucidating the role of active site threonine 201 and tyrosine 204 residues using enzyme mutants☆ , 2015, Bone.

[59]  Wenpeng Yin,et al.  Comparative Study of CNN and RNN for Natural Language Processing , 2017, ArXiv.

[60]  Lynette M. Smith,et al.  FDPS cooperates with PTEN loss to promote prostate cancer progression through modulation of small GTPases/AKT axis , 2019, Oncogene.