Global multi-output decision trees for interaction prediction

Interaction data are characterized by two sets of objects, each described by their own set of features. They are often modeled as networks and the values of interest are the possible interactions between two instances, represented usually as a matrix. Here, a novel global decision tree learning method is proposed, where multi-output decision trees are constructed over the global interaction setting, addressing the problem of interaction prediction as a multi-label classification task. More specifically, the tree is constructed by splitting the interaction matrix both row-wise and column-wise, incorporating this way both interaction dataset features in the learning procedure. Experiments are conducted across several heterogeneous interaction datasets from the biomedical domain. The experimental results indicate the superiority of the proposed method against other decision tree approaches in terms of predictive accuracy, model size and computational efficiency. The performance is boosted by fully exploiting the multi-output structure of the model. We conclude that the proposed method should be considered in interaction prediction tasks, especially where interpretable models are desired.

[1]  Charles Elkan,et al.  Predicting labels for dyadic data , 2010, Data Mining and Knowledge Discovery.

[2]  Grigorios Tsoumakas,et al.  Introduction to the special issue on learning from multi-label data , 2012, Machine Learning.

[3]  William Stafford Noble,et al.  Kernel methods for predicting protein-protein interactions , 2005, ISMB.

[4]  Yizhou Sun,et al.  Mining Heterogeneous Information Networks: Principles and Methodologies , 2012, Mining Heterogeneous Information Networks: Principles and Methodologies.

[5]  Ting Wang,et al.  An improved map of conserved regulatory sites for Saccharomyces cerevisiae , 2006, BMC Bioinformatics.

[6]  Ying Ju,et al.  Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier , 2016, Scientific Reports.

[7]  Wei Fan,et al.  Mining big data: current status, and forecast to the future , 2013, SKDD.

[8]  Luc De Raedt,et al.  Top-Down Induction of Clustering Trees , 1998, ICML.

[9]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[10]  Lei Tang,et al.  Large scale multi-label classification via metalabeler , 2009, WWW '09.

[11]  E. Marcotte,et al.  A flaw in the typical evaluation scheme for pair-input computational predictions , 2012, Nature Methods.

[12]  Michael I. Jordan,et al.  Machine learning: Trends, perspectives, and prospects , 2015, Science.

[13]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[14]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[15]  Huijun Gao,et al.  Data-Based Techniques Focused on Modern Industry: An Overview , 2015, IEEE Transactions on Industrial Electronics.

[16]  Yong-Yeol Ahn,et al.  Optimizing drug–target interaction prediction based on random walk on heterogeneous networks , 2015, Journal of Cheminformatics.

[17]  Pierre Geurts,et al.  Random Forests with Random Projections of the Output Space for High Dimensional Multi-label Classification , 2014, ECML/PKDD.

[18]  Grigorios Tsoumakas,et al.  Discovering and Exploiting Deterministic Label Relationships in Multi-Label Learning , 2015, KDD.

[19]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[20]  Eyke Hüllermeier,et al.  On label dependence and loss minimization in multi-label classification , 2012, Machine Learning.

[21]  Yizhou Sun,et al.  Mining heterogeneous information networks: a structural analysis approach , 2013, SKDD.

[22]  Weixiong Zhang,et al.  A bi-dimensional regression tree approach to the modeling of gene expression regulation , 2006, Bioinform..

[23]  Pietro Liò,et al.  Warped Matrix Factorisation for Multi-view Data Integration , 2016, ECML/PKDD.

[24]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[25]  Feng Liu,et al.  Predicting drug side effects by multi-label learning and ensemble learning , 2015, BMC Bioinformatics.

[26]  Pierre Geurts,et al.  Supervised learning with decision tree-based methods in computational and systems biology. , 2009, Molecular bioSystems.

[27]  Hsinchun Chen,et al.  Recommendation as link prediction in bipartite graphs: A graph kernel-based machine learning approach , 2013, Decis. Support Syst..

[28]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[29]  Bernard De Baets,et al.  Efficient Pairwise Learning Using Kernel Ridge Regression: an Exact Two-Step Method , 2016, ArXiv.

[30]  Claude Berge,et al.  Graphs and Hypergraphs , 2021, Clustering.

[31]  Lei Huang,et al.  Protein-protein interaction prediction based on multiple kernels and partial network with linear programming , 2016, BMC Systems Biology.

[32]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[33]  Christian von Mering,et al.  STITCH: interaction networks of chemicals and proteins , 2007, Nucleic Acids Res..

[34]  Pierre Geurts,et al.  Classifying pairs with trees for supervised biological network inference† †Electronic supplementary information (ESI) available: Implementation and computational issues, supplementary performance curves, and illustration of interpretability of trees. See DOI: 10.1039/c5mb00174a Click here for additi , 2014, Molecular bioSystems.

[35]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[36]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[37]  Saso Dzeroski,et al.  Decision Trees for Hierarchical Multilabel Classification: A Case Study in Functional Genomics , 2006, PKDD.

[38]  Tao Mei,et al.  Correlative multi-label video annotation , 2007, ACM Multimedia.

[39]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[40]  Jean-Philippe Vert,et al.  Reconstruction of Biological Networks by Supervised Machine Learning Approaches , 2008 .

[41]  Katharine Armstrong,et al.  Big data: a revolution that will transform how we live, work, and think , 2014 .

[42]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[43]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[44]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[45]  Saso Dzeroski,et al.  Tree ensembles for predicting structured outputs , 2013, Pattern Recognit..

[46]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[47]  Michelangelo Ceci,et al.  Network Regression with Predictive Clustering Trees , 2011, ECML/PKDD.

[48]  Jean-Philippe Vert,et al.  Supervised reconstruction of biological networks with local models , 2007, ISMB/ECCB.

[49]  Viktor Mayer-Schnberger,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2013 .

[50]  Cláudia Antunes,et al.  A structured view on pattern mining-based biclustering , 2015, Pattern Recognit..

[51]  Ivan G. Costa,et al.  A multiple kernel learning algorithm for drug-target interaction prediction , 2016, BMC Bioinformatics.

[52]  Pierre Geurts,et al.  On protocols and measures for the validation of supervised methods for the inference of biological networks , 2013, Front. Genet..

[53]  Robert E. Schapire,et al.  Hierarchical multi-label prediction of gene function , 2006, Bioinform..