Paper information - Completion of biological networks: the output kernel trees approach

The inference of biological networks from various sources of experimental data is an important problem in computational biology. In this paper, we propose a new method for the supervised inference of biological networks, based on a kernelization of the output of regression trees. It inherits several features of tree-based methods, such as interpretability, robustness to irrelevant variables, and input scalability. We applied this method to the inference of a protein-protein interaction network, where we obtained results competitive with existing approaches. Furthermore, our method provides relevant insights into the input data regarding their potential relationship with the existence of interactions.

Pierre Geurts | Marie Dutreix | Nizar Touleimat | Florence d'Alché-Buc

[1] William Stafford Noble, et al. Kernel methods for predicting protein-protein interactions, 2005, ISMB.

[2] Leo Breiman, et al. Classification and Regression Trees, 1984.

[3] Pierre Geurts, et al. Kernelizing the output of tree-based methods, 2006, ICML '06.

[4] Yoshihiro Yamanishi, et al. Protein network inference from multiple genomic data: a supervised approach, 2004, ISMB/ECCB.

[5] Pierre Geurts, et al. Inferring biological networks with output kernel trees, 2007, BMC Bioinformatics.

[6] Pierre Geurts, et al. Extremely randomized trees, 2006, Machine Learning.

[7] Tsuyoshi Kato, et al. Selective integration of multiple biological data for supervised network inference, 2005, Bioinform.