Gene Interaction in DNA Microarray Data Is Decomposed by Information Geometric Measure

MOTIVATION Given the vast amount of gene expression data, it is essential to develop a simple and reliable method of investigating the fine structure of gene interaction. We show how an information geometric measure achieves this. RESULTS We introduce an information geometric measure of binary random vectors and show how this measure reveals the fine structure of gene interaction. In particular, we propose an iterative procedure by using this measure (called IPIG). The procedure finds higher-order dependencies which may underlie the interaction between two genes of interest. To demonstrate the method, we investigate the interaction between the two genes of interest in the data from human acute lymphoblastic leukemia cells. The method successfully discovered biologically known findings and also selected other genes as hidden causes that constitute the interaction. AVAILABILITY Softwares are currently not available but are possibly made available in future at http://www.mns.brain.riken.go.jp/~nakahara/DNA_pub.html where all the related information is also linked.

[1]  M. Hemler,et al.  Specific tetraspanin functions , 2001, The Journal of cell biology.

[2]  Nir Friedman,et al.  Inferring subnetworks from perturbed expression profiles , 2001, ISMB.

[3]  P. Holland,et al.  Discrete Multivariate Analysis. , 1976 .

[4]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[5]  Richard F. Gunst,et al.  Applied Regression Analysis , 1999, Technometrics.

[6]  Shun-ichi Amari,et al.  Information-Geometric Decomposition in Spike Analysis , 2001, NIPS.

[7]  Neal N. Iwakoshi,et al.  Plasma cell differentiation requires the transcription factor XBP-1 , 2001, Nature.

[8]  Satoru Miyano,et al.  Inferring qualitative relations in genetic networks and metabolic pathways , 2000, Bioinform..

[9]  J. Griesenbeck,et al.  Regulation of RNA Polymerase II-dependent Transcription by Poly(ADP-ribosyl)ation of Transcription Factors* , 1998, The Journal of Biological Chemistry.

[10]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[11]  Michael I. Jordan Graphical Models , 1998 .

[12]  Shun-ichi Amari,et al.  Information-Geometric Measure for Neural Spikes , 2002, Neural Computation.

[13]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Shun-ichi Amari,et al.  Methods of information geometry , 2000 .

[15]  J. Downing,et al.  Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. , 2002, Cancer cell.