Regression Shrinkage and Selection via the Lasso

SUMMARY We propose a new method for estimation in linear models. The 'lasso' minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant. Because of the nature of this constraint it tends to produce some coefficients that are exactly 0 and hence gives interpretable models. Our simulation studies suggest that the lasso enjoys some of the favourable properties of both subset selection and ridge regression. It produces interpretable models like subset selection and exhibits the stability of ridge regression. There is also an interesting relationship with recent work in adaptive function estimation by Donoho and Johnstone. The lasso idea is quite general and can be applied in a variety of statistical models: extensions to generalized regression models and tree-based models are briefly described.

[1]  C. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[2]  B. Efron Bootstrap Methods: Another Look at the Jackknife , 1979 .

[3]  Philip E. Gill,et al.  Practical optimization , 1981 .

[4]  C. Stein Estimation of the Mean of a Multivariate Normal Distribution , 1981 .

[5]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[6]  T. Stamey,et al.  Prostate specific antigen in the diagnosis and treatment of adenocarcinoma of the prostate. II. Radical prostatectomy treated patients. , 1989, The Journal of urology.

[7]  J. Friedman Multivariate adaptive regression splines , 1990 .

[8]  I. Johnstone,et al.  Maximum Entropy and the Nearly Black Object , 1992 .

[9]  L. Breiman,et al.  Submodel selection and evaluation in regression. The X-random case , 1992 .

[10]  T. Hastie,et al.  [A Statistical View of Some Chemometrics Regression Tools]: Discussion , 1993 .

[11]  Ping Zhang Model Selection Via Multifold Cross Validation , 1993 .

[12]  J. Friedman,et al.  A Statistical View of Some Chemometrics Regression Tools , 1993 .

[13]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[14]  J. Shao Linear Model Selection by Cross-validation , 1993 .

[15]  I. Johnstone,et al.  Ideal spatial adaptation by wavelet shrinkage , 1994 .

[16]  D. Donoho,et al.  Basis pursuit , 1994, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers.

[17]  I. Johnstone,et al.  Wavelet Shrinkage: Asymptopia? , 1995 .

[18]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[19]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[20]  R. Tibshirani A proposal for variable selection in the Cox model , 1997 .