论文信息 - Penalized Discriminant Analysis

Penalized Discriminant Analysis

Fisher's linear discriminant analysis (LDA) is a popular data-analytic tool for studying the relationship between a set of predictors and a categorical response. In this paper we describe a penalized version of LDA. It is designed for situations in which there are many highly correlated predictors, such as those obtained by discretizing a function, or the grey-scale values of the pixels in a series of images. In cases such as these it is natural, efficient and sometimes essential to impose a spatial smoothness constraint on the coefficients, both for improved prediction performance and interpretability. We cast the classification problem into a regression framework via optimal scoring. Using this, our proposal facilitates the use of any penalized regression technique in the classification setting. The technique is illustrated with examples in speech recognition and handwritten character recognition.

R. Tibshirani | T. Hastie | A. Buja

[1] H. Vinod. Canonical ridge and econometrics of joint production , 1976 .

[2] Pasquale J. Di Pillo. Further applications of bias to discriminant analysis , 1976 .

[3] Forrest W. Young,et al. Additive structure in qualitative data: An alternating least squares method with optimal scaling features , 1976 .

[4] Forrest W. Young,et al. The principal components of mixed measurement level multivariate data: An alternating least squares method with optimal scaling features , 1978 .

[5] N. Campbell. Shrunken Estimators in Discriminant and Canonical Variate Analysis , 1980 .

[6] A. Morineau,et al. Multivariate descriptive statistical analysis , 1984 .

[7] B. Yandell,et al. Semi-Parametric Generalized Linear Models. , 1985 .

[8] J. Friedman. Regularized Discriminant Analysis , 1989 .

[9] Lawrence D. Jackel,et al. Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[10] G. Wahba. Spline models for observational data , 1990 .

[11] M. Hill,et al. NONLINEAR MULTIVARIATE ANALYSIS , 1990 .

[12] J. Ramsay,et al. Some Tools for Functional Data Analysis , 1991 .

[13] F. O’Sullivan. Discretized Laplacian Smoothing by Fourier Methods , 1991 .

[14] H. Kiiveri. Canonical variate analysis of high-dimensional spectral data , 1992 .

[15] B. Silverman,et al. Canonical correlation analysis when the data are curves. , 1993 .

[16] R. Tibshirani,et al. Flexible Discriminant Analysis by Optimal Scoring , 1994 .