Multiclass cancer diagnosis using tumor gene expression signatures

The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a support vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (9%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well-differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.
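A support vector machine is a binary classifier, so applying it to 14 tumor types requires some scheme for combining binary decisions. The abstract does not say which scheme was used; the sketch below assumes a one-vs-rest rule, in which each class has its own classifier and the class with the largest output wins, with that output serving as a confidence value. The class names, decision values, and the zero confidence cutoff are hypothetical placeholders, not data from the study.

```python
from typing import Dict, List, Tuple


def ova_predict(scores: Dict[str, float]) -> Tuple[str, float]:
    """Pick the class whose class-vs-rest classifier gives the largest output.

    The winning output doubles as a confidence value: a clearly positive
    value means one classifier claims the sample, while a value near or
    below zero means no classifier does.
    """
    label = max(scores, key=scores.get)
    return label, scores[label]


def accuracy(predicted: List[str], actual: List[str]) -> float:
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


# Hypothetical decision values for four samples and three placeholder classes.
# Each entry pairs the per-class classifier outputs with the true label.
samples = [
    ({"breast": 1.4, "lung": -0.9, "colon": -1.2}, "breast"),
    ({"breast": -1.1, "lung": 0.8, "colon": -0.7}, "lung"),
    ({"breast": -0.8, "lung": -1.0, "colon": 1.1}, "colon"),
    ({"breast": -0.2, "lung": -0.1, "colon": -0.4}, "breast"),  # no classifier claims it
]

predictions = [ova_predict(scores) for scores, _ in samples]
labels = [label for label, _ in predictions]
truth = [true_label for _, true_label in samples]

overall = accuracy(labels, truth)

# Assumed cutoff: a winning output below zero counts as a low-confidence call.
low_confidence = [i for i, (_, conf) in enumerate(predictions) if conf < 0.0]

print(f"accuracy: {overall:.2f}, low-confidence samples: {low_confidence}")
```

In this toy data the last sample is both misclassified and flagged as low confidence, which mirrors the pattern the abstract reports for poorly differentiated cancers.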
