A method for calling gains and losses in array CGH data.

Array CGH (comparative genomic hybridization) is a powerful technique for genomic studies of cancer. It enables genome-wide screening for regions of genetic alteration, such as chromosome gains and losses or localized amplifications and deletions. In this paper, we propose a new algorithm, 'Cluster along chromosomes' (CLAC), for the analysis of array CGH data. CLAC builds hierarchical clustering-style trees along each chromosome arm (or chromosome) and then selects the 'interesting' clusters by controlling the false discovery rate (FDR) at a specified level. In addition, it provides a consensus summary across a set of arrays, together with an estimate of the corresponding FDR. We illustrate the method by applying CLAC to a lung cancer microarray CGH data set and to a BAC array CGH data set of aneuploid cell strains.
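To make the idea of clustering along a chromosome concrete, the following is a minimal Python sketch and not the CLAC implementation itself. It assumes a list of per-probe log-ratios ordered by genomic position on one chromosome arm, merges only neighbouring clusters (so every cluster is a contiguous region), and then picks the largest subtrees whose mean is far from zero and whose values are tightly grouped. The names `Node`, `build_adjacent_tree`, `select_clusters`, and the parameters `min_abs_mean`, `max_spread`, and `min_size` are hypothetical. The fixed thresholds stand in for the FDR-based selection described above, which this sketch does not attempt to reproduce.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Node:
    """A contiguous run of probes, covering indices start..end inclusive."""
    start: int
    end: int
    mean: float                      # mean log-ratio over the covered probes
    lo: float                        # smallest log-ratio in the run
    hi: float                        # largest log-ratio in the run
    left: Optional["Node"] = None
    right: Optional["Node"] = None

    @property
    def size(self) -> int:
        return self.end - self.start + 1


def build_adjacent_tree(log_ratios: List[float]) -> Node:
    """Bottom-up clustering in which only neighbouring clusters may merge."""
    if not log_ratios:
        raise ValueError("need at least one probe")
    clusters = [Node(i, i, x, x, x) for i, x in enumerate(log_ratios)]
    while len(clusters) > 1:
        # Distance between neighbours: absolute difference of cluster means
        # (an assumed linkage, chosen for simplicity).
        gaps = [abs(a.mean - b.mean) for a, b in zip(clusters, clusters[1:])]
        i = gaps.index(min(gaps))
        a, b = clusters[i], clusters[i + 1]
        mean = (a.mean * a.size + b.mean * b.size) / (a.size + b.size)
        merged = Node(a.start, b.end, mean, min(a.lo, b.lo), max(a.hi, b.hi), a, b)
        clusters[i:i + 2] = [merged]
    return clusters[0]


def select_clusters(root: Node, min_abs_mean: float, max_spread: float,
                    min_size: int) -> List[Tuple[int, int, float]]:
    """Return maximal subtrees that look like a homogeneous gain or loss.

    Fixed thresholds are an assumption of this sketch; they replace the
    FDR-controlled selection that the abstract describes.
    """
    selected = []
    stack = [root]
    while stack:
        node = stack.pop()
        homogeneous = (node.hi - node.lo) <= max_spread
        if abs(node.mean) >= min_abs_mean and homogeneous and node.size >= min_size:
            selected.append((node.start, node.end, node.mean))
        elif node.left is not None and node.right is not None:
            stack.extend([node.left, node.right])
    return sorted(selected)


if __name__ == "__main__":
    # Toy profile: a three-probe gain flanked by near-zero probes.
    profile = [0.0, 0.05, -0.05, 0.0, 0.8, 0.9, 0.85, 0.0, 0.02, -0.03]
    tree = build_adjacent_tree(profile)
    for start, end, mean in select_clusters(tree, 0.3, 0.2, 2):
        print(f"probes {start}-{end}: mean log-ratio {mean:.2f}")
```

Restricting merges to neighbours is what distinguishes this from ordinary hierarchical clustering: each subtree corresponds to a genomic interval, so a selected subtree can be read directly as a candidate region of gain (positive mean) or loss (negative mean).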
