论文信息 - Efficient quadratic regularization for expression arrays.

Efficient quadratic regularization for expression arrays.

Gene expression arrays typically have 50 to 100 samples and 1000 to 20,000 variables (genes). There have been many attempts to adapt statistical models for regression and classification to these data, and in many cases these attempts have challenged the computational resources. In this article we expose a class of techniques based on quadratic regularization of linear models, including regularized (ridge) regression, logistic and multinomial regression, linear and mixture discriminant analysis, the Cox model and neural networks. For all of these models, we show that dramatic computational savings are possible over naive implementations, using standard transformations in numerical linear algebra.

R. Tibshirani | T. Hastie

[1] David R. Cox,et al. Regression models and life tables (with discussion , 1972 .

[2] Gene H. Golub,et al. Matrix computations , 1983 .

[3] J. Friedman. Regularized Discriminant Analysis , 1989 .

[4] Vladimir Vapnik,et al. The Nature of Statistical Learning , 1995 .

[5] R. Tibshirani,et al. Penalized Discriminant Analysis , 1995 .

[6] R. Tibshirani,et al. Discriminant Analysis by Gaussian Mixtures , 1996 .

[7] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .

[8] Bernhard Schölkopf,et al. GACV for Support Vector Machines , 2000 .

[9] D. Botstein,et al. Singular value decomposition for genome-wide expression data processing and modeling. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[10] A. E. Hoerl,et al. Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[11] Hansong Zhang,et al. Gacv for support vector machines , 2000 .