Published on 01 January 2015

Using Lasso for Predictor Selection and to Assuage Overfitting: A Method Long Overlooked in Behavioral Sciences

View Dataset
McNeish, Daniel M.

Description

Ordinary least squares and stepwise selection are widespread in behavioral science research; however, these methods are well known to encounter overfitting problems such that R2 and regression coefficients may be inflated while standard errors and p values may be deflated, ultimately reducing both the parsimony of the model and the generalizability of conclusions. More optimal methods for selecting predictors and estimating regression coefficients such as regularization methods (e.g., Lasso) have existed for decades, are widely implemented in other disciplines, and are available in mainstream software, yet, these methods are essentially invisible in the behavioral science literature while the use of sub optimal methods continues to proliferate. This paper discusses potential issues with standard statistical models, provides an introduction to regularization with specific details on both Lasso and its related predecessor ridge regression, provides an example analysis and code for running a Lasso analysis in R and SAS, and discusses limitations and related methods.

Citations (0)

Mentions (0)

Metrics

Dataset Index

2.0

FAIR Score

81%

Citations

0

Mentions

0

Metrics Over Time

Publication Details

DOI

Publisher

Taylor & Francis

Assigned Domain

Subfield

Statistics and Probability

Field

Mathematics

Domain

Physical Sciences

Confidence Score

52%

Source

Scholar Data Model

Keywords

Science PolicyMathematicsFOS: MathematicsBiological SciencesEvolutionary BiologyFOS: Biological sciencesCell Biology

Normalization Factors

FT

13.46

CTw

1.00

MTw

1.00