
Regression Shrinkage and Selection via the Lasso

R. Tibshirani
Published 1996 · Mathematics

SUMMARY We propose a new method for estimation in linear models. The 'lasso' minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant. Because of the nature of this constraint it tends to produce some coefficients that are exactly 0 and hence gives interpretable models. Our simulation studies suggest that the lasso enjoys some of the favourable properties of both subset selection and ridge regression. It produces interpretable models like subset selection and exhibits the stability of ridge regression. There is also an interesting relationship with recent work in adaptive function estimation by Donoho and Johnstone. The lasso idea is quite general and can be applied in a variety of statistical models: extensions to generalized regression models and tree-based models are briefly described.
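The sparsity effect described in the summary can be illustrated with a small sketch. It is not the algorithm from the paper: it assumes the equivalent penalized (Lagrangian) form, minimizing 0.5 * RSS + lam * sum(|b_j|), and solves it by cyclic coordinate descent with soft-thresholding. The function names, the penalty parameter `lam`, and the toy data are all hypothetical and chosen only for illustration.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam; values in [-lam, lam] become exactly 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimize 0.5 * RSS + lam * sum(|b_j|) by cyclic coordinate descent.

    X is a list of rows and y a list of responses. No intercept is fitted
    and the columns are not standardized (simplifying assumptions).
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)  # equals y - X @ beta while beta is all zeros
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # an all-zero column carries no information
            # Inner product of column j with the residual that ignores b_j
            rho = sum(X[i][j] * (residual[i] + X[i][j] * beta[j])
                      for i in range(n))
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            delta = new_bj - beta[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new_bj
    return beta


# Toy data with orthogonal columns: least squares gives (3.0, 0.2).
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
y = [3.0, 0.2, 3.0, 0.2]
print(lasso_coordinate_descent(X, y, lam=1.0))
```

On this toy data the large coefficient is shrunk from 3.0 to 2.5, while the small one is set exactly to zero rather than merely reduced. That is the behaviour the summary contrasts with ridge regression, which shrinks coefficients but does not zero them.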
This paper references
10.2307/2005340
Solving Least Squares Problems
Ake Bjork (1976)
10.1214/AOS/1176344552
Bootstrap Methods: Another Look at the Jackknife
B. Efron (1979)
10.2307/3616583
Practical optimization
P. Gill (1981)
10.1214/AOS/1176345632
Estimation of the Mean of a Multivariate Normal Distribution
C. Stein (1981)
10.2307/2530946
Classification and Regression Trees
L. Breiman (1984)
10.1016/S0022-5347(17)41175-X
Prostate specific antigen in the diagnosis and treatment of adenocarcinoma of the prostate. II. Radical prostatectomy treated patients.
T. Stamey (1989)
10.2307/2532174
Generalized Additive Models
T. Hastie (1991)
10.1214/AOS/1176347963
Multivariate Adaptive Regression Splines
J. Friedman (1991)
10.1111/J.2517-6161.1992.TB01864.X
Maximum Entropy and the Nearly Black Object
D. L. Donoho (1992)
10.2307/1403680
Submodel selection and evaluation in regression. The X-random case
L. Breiman (1992)
10.1007/978-1-4899-4541-9
An Introduction to the Bootstrap
B. Efron (1993)
10.1214/AOS/1176349027
Model Selection Via Multifold Cross Validation
P. Zhang (1993)
10.1080/00401706.1993.10485033
A Statistical View of Some Chemometrics Regression Tools
Ildiko E. Frank (1993)
10.1080/01621459.1993.10476353
Variable selection via Gibbs sampling
E. George (1993)
10.1080/01621459.1993.10476299
Linear Model Selection by Cross-validation
J. Shao (1993)
Better subset selection using the nonnegative garotte
L. Breiman (1993)
10.1093/BIOMET/81.3.425
Ideal spatial adaptation by wavelet shrinkage
D. Donoho (1994)
10.1109/ACSSC.1994.471413
Basis pursuit
S. Chen (1994)
10.1111/J.2517-6161.1995.TB02032.X
Wavelet Shrinkage: Asymptopia?
D. Donoho (1995)
10.1093/BIOMET/82.4.711
Reversible jump Markov chain Monte Carlo computation and Bayesian model determination
P. Green (1995)
10.1137/1.9781611971217
Solving least squares problems
C. Lawson (1995)
A proposal for variable selection in the Cox model
R. Tibshirani (1997)



This paper is referenced by
10.1016/j.najef.2021.101564
Group penalized logistic regressions predict up and down trends for stock prices
(2022)
10.1016/j.forpol.2021.102645
Oil palm expansion among non-industrial producers in Cameroon: Potentials for synergy between agro-economic gains and ecological safeguards
(2022)
10.1016/J.ENERGY.2021.121634
Identification method of market power abuse of generators based on lasso-logit model in spot market
Bo Sun (2022)
10.1016/j.cam.2021.113819
Almost unbiased Liu-type estimators in gamma regression model
Yasin Asar (2022)
10.1016/j.eswa.2021.115924
An integrated deep learning and stochastic optimization approach for resource management in team-based healthcare systems
Mohammad Hessam Olya (2022)
10.1016/j.sigpro.2021.108300
Solving inverse problems with autoencoders on learnt graphs
(2022)
10.1016/j.eswa.2021.115966
Predicting clinical scores for Alzheimer's disease based on joint and deep learning
Baiying Lei (2022)
10.1016/j.compenvurbsys.2021.101716
Cultivating historical heritage area vitality using urban morphology approach based on big data and machine learning
(2022)
10.1016/J.PATCOG.2021.108260
Bayesian Compression for Dynamically Expandable Networks
Yang Yang (2022)
10.1016/j.jaerosci.2021.105874
Improving quantitative analysis of spark-induced breakdown spectroscopy: Multivariate calibration of metal particles using machine learning
Hanyang Li (2022)
10.1016/J.JSPI.2021.07.010
Incorporating spatial structure into inclusion probabilities for Bayesian variable selection in generalized linear models with the spike-and-slab elastic net
Justin M. Leach (2022)
10.1016/j.energy.2021.121960
Design of a deep inference framework for required power forecasting and predictive control on a hybrid electric mining truck
Qing-dong Yan (2022)
10.1016/j.csda.2021.107348
Dimension reduction for block-missing data based on sparse sliced inverse regression
(2022)
10.1016/j.ces.2021.117184
Estimation of Hansen solubility parameters with regularized regression for biomass conversion products: An application of adaptable group contribution
Evan Terrell (2022)
10.1016/j.aei.2021.101443
Data science and reinforcement learning for price forecasting and raw material procurement in petrochemical industry
Chia-Yen Lee (2022)
10.1016/J.JSPI.2021.07.003
Penalized kernel quantile regression for varying coefficient models
Eun Ryung Lee (2022)
10.1016/j.ijepes.2021.107626
Transient stability assessment in large-scale power systems using sparse logistic classifiers
(2022)
10.1016/j.commatsci.2021.110877
Molecular dynamic characteristic temperatures for predicting metallic glass forming ability
L. Schultz (2021)
10.1016/j.jspi.2021.07.013
Exact model comparisons in the plausibility framework
S. Bohringer (2019)
10.1080/24725854.2020.1856982
A calibration-free method for biosensing in cell manufacturing
Jialei Chen (2020)
10.1590/0103-8478cr20201072
Kennard-Stone method outperforms the Random Sampling in the selection of calibration samples in SNPs and NIR data
Roberta de Amorim Ferreira (2022)
10.4310/21-SII669
Sparsity-restricted estimation for the accelerated failure time model
Xiaoyu Zhang (2022)
10.1016/j.apenergy.2021.117983
Solar and wind power generation forecasts using elastic net in time-varying forecast combinations
(2022)
10.1016/J.EJOR.2021.05.028
Instance-dependent cost-sensitive learning for detecting transfer fraud
Sebastiaan Höppner (2020)
10.1016/j.eswa.2021.115845
Joint sparse principal component regression with robust property
Kai Qi (2022)
10.1016/j.catena.2021.105718
Correlation of banana productivity levels and soil morphological properties using regularized optimal scaling regression
Barlin Orlando Olivares (2022)
10.1016/J.ARTINT.2021.103589
Bayesian feature interaction selection for factorization machines
Yifan Chen (2022)
10.1016/j.supflu.2021.105421
Prediction of partition coefficient in high-pressure carbon dioxide–water systems using machine learning
(2022)
10.1016/j.apnum.2021.09.013
An explicit algorithm for solving monotone variational inequalities
(2022)
10.1016/j.enpol.2021.112595
German efficiency gone wrong: Unintended incentives arising from the gas TSOs’ benchmarking
Paul Waidelich (2022)
10.1145/3483941
A Compact High-Dimensional Yield Analysis Method using Low-Rank Tensor Approximation
Xiao Shi (2022)
10.1016/j.cogpsych.2021.101444
Robust priors for regularized regression
(2020)
Some data provided by Semantic Scholar