A tail strength measure for assessing the overall univariate significance in a dataset.

We propose an overall measure of significance for a set of hypothesis tests. The 'tail strength' is a simple function of the p-values computed for each of the tests. This measure is useful, for example, in assessing the overall univariate strength of a large set of features in microarray and other genomic and biomedical studies. It also has a simple relationship to the false discovery rate of the collection of tests. We derive the asymptotic distribution of the tail strength measure, and illustrate its use on a number of real datasets.

[1]  Galen R. Shorack,et al.  Functions of Order Statistics , 1972 .

[2]  J. Kalbfleisch,et al.  The Statistical Analysis of Failure Time Data , 1980 .

[3]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[4]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[5]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[6]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Y. Benjamini,et al.  On the Adaptive Control of the False Discovery Rate in Multiple Testing With Independent Statistics , 2000 .

[8]  J. A. Cuesta-Albertos,et al.  Contributions of empirical and quantile processes to the asymptotic theory of goodness-of-fit tests , 2000 .

[9]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[10]  Christopher R. Genovese,et al.  Operating Characteristics and Extensions of the FDR Procedure , 2001 .

[11]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[12]  John D. Storey,et al.  Empirical Bayes Analysis of a Microarray Experiment , 2001 .

[13]  Bradley Efron,et al.  Microarrays empirical Bayes methods, and false discovery rates , 2001 .

[14]  T. Poggio,et al.  Prediction of central nervous system embryonal tumour outcome based on gene expression , 2002, Nature.

[15]  J. -B. Poline,et al.  Estimating the Delay of the fMRI Response , 2002, NeuroImage.

[16]  L. Staudt,et al.  The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. , 2002, The New England journal of medicine.

[17]  John D. Storey A direct approach to false discovery rates , 2002 .

[18]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[19]  R. Tibshirani,et al.  Diagnosis of multiple cancer types by shrunken centroids of gene expression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[20]  R. Tibshirani,et al.  Empirical bayes methods and false discovery rates for microarrays , 2002, Genetic epidemiology.

[21]  Laurence L. George,et al.  The Statistical Analysis of Failure Time Data , 2003, Technometrics.

[22]  Lori E. Dodd,et al.  Partial AUC Estimation and Regression , 2003, Biometrics.

[23]  S. Dudoit,et al.  Multiple Hypothesis Testing in Microarray Experiments , 2003 .

[24]  John D. Storey,et al.  Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach , 2004 .

[25]  L. Staudt,et al.  Prediction of survival in follicular lymphoma based on molecular features of tumor-infiltrating immune cells. , 2004, The New England journal of medicine.

[26]  D. Donoho,et al.  Higher criticism for detecting sparse heterogeneous mixtures , 2004, math/0410072.

[27]  R. Tibshirani,et al.  Toxicity from radiation therapy associated with abnormal transcriptional responses to DNA damage. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Marcel Dettling,et al.  BagBoosting for tumor classification with gene expression data , 2004, Bioinform..

[29]  Bradley Efron,et al.  Local False Discovery Rates , 2005 .

[30]  B. Wandell,et al.  Children's Reading Performance is Correlated with White Matter Structure Measured by Diffusion Tensor Imaging , 2005, Cortex.

[31]  R. Warnke,et al.  Immune signatures in follicular lymphoma. , 2005, The New England journal of medicine.

[32]  R. Dougherty,et al.  Cross‐subject comparison of principal diffusion direction maps , 2005, Magnetic resonance in medicine.