L1-Penalized Logistic Regression is commonly used for classification of high-dimensional data such as microarrays. These slides present a brief overview of the algorithm.
L1‐Penalized Logistic Regression
Setia Pramana
Medical Epidemiology and Biostatistics Department,
Karolinska Institute, Stockholm
Introduction
• Logistic regression is a supervised method for binary or multi‐class classification.
• Logistic regression models the probability of membership of a class as a transform of a linear combination of the explanatory variables.
Introduction
• In high‐dimensional data (e.g., microarray): there are more variables than observations, so classical logistic regression does not work.
• Other problems: variables are correlated (multicollinearity) and models overfit.
• Solution: introduce a penalty for complexity in the model.
Logistic Regression
• Let $y_i \in \{0,1\}$ be the binary class of the i‐th array (e.g., malaria type) and $x_{ij}$ the expression level of the j‐th protein. We can fit the following logistic model:
$$p_i = \frac{\exp\left(\beta_0 + \sum_{j=1}^{p} \beta_j x_{ij}\right)}{1 + \exp\left(\beta_0 + \sum_{j=1}^{p} \beta_j x_{ij}\right)}$$
• $p_i$: probability that an array with measured expression $x_{i1},\dots,x_{ip}$ represents a class of type $y_i = 1$.
• Maximize the log‐likelihood:
$$\ell(\beta) = \sum_{i=1}^{n} \left[\, y_i \log p_i + (1 - y_i)\log(1 - p_i) \,\right]$$
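The model and its log‐likelihood can be sketched in plain Python; the function names (`predict_prob`, `log_likelihood`) are illustrative, not from the slides:

```python
import math

def predict_prob(beta0, beta, x):
    """p_i = exp(eta) / (1 + exp(eta)) with eta = beta0 + sum_j beta_j * x_ij."""
    eta = beta0 + sum(b * xj for b, xj in zip(beta, x))
    return 1.0 / (1.0 + math.exp(-eta))

def log_likelihood(beta0, beta, X, y):
    """Binomial log-likelihood: sum_i [y_i log p_i + (1 - y_i) log(1 - p_i)]."""
    ll = 0.0
    for xi, yi in zip(X, y):
        p = predict_prob(beta0, beta, xi)
        ll += yi * math.log(p) + (1 - yi) * math.log(1.0 - p)
    return ll
```

With all coefficients at zero, every $p_i = 0.5$, which is a useful sanity check.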
Penalized Logistic Regression
• Penalized log‐likelihood:
$$\ell_\lambda(\beta) = \ell(\beta) - \lambda\, P(\beta)$$
• $\lambda$: tuning parameter
• $P(\beta)$: a penalty function
• $L_1$‐penalization (Lasso):
$$P(\beta) = \sum_{j=1}^{p} |\beta_j|$$
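A minimal Python sketch of the two quantities just defined, assuming the notation above (`l1_penalty` and `penalized_log_likelihood` are hypothetical helper names):

```python
def l1_penalty(beta):
    """Lasso penalty P(beta) = sum_j |beta_j| (intercept excluded)."""
    return sum(abs(b) for b in beta)

def penalized_log_likelihood(ll, beta, lam):
    """l_lambda(beta) = l(beta) - lambda * P(beta), given the log-likelihood ll."""
    return ll - lam * l1_penalty(beta)
```

Larger `lam` penalizes large coefficients more heavily, which is what drives the shrinkage described on the next slide.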
L1‐Penalized Logistic Regression
• Shrinks all regression coefficients ($\beta_j$) toward zero and sets some of them exactly to zero.
• Performs parameter estimation and variable selection at the same time.
• The L1‐penalized log‐likelihood is maximized using a full gradient algorithm (Goeman, 2010).
• The choice of λ is crucial and is made via a cross‐validation procedure.
• The procedure is implemented in an R package called penalized.
K‐fold Cross Validation
• Randomly divide your data into K pieces/folds.
• Treat the 1st fold as the test dataset; fit the model to the other folds (training data).
• Apply the model to the test data and repeat K times.
• Calculate statistics of model accuracy and fit from the test data only.
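The splitting step above can be sketched in Python with the standard library; the function names are illustrative:

```python
import random

def kfold_indices(n, k, seed=0):
    """Randomly partition indices 0..n-1 into k folds of (near-)equal size."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def kfold_splits(n, k, seed=0):
    """Yield (train, test) index lists; each fold serves as the test set once."""
    folds = kfold_indices(n, k, seed)
    for i in range(k):
        test = folds[i]
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, test
```

Note that the partition depends on the random seed, which is exactly the sensitivity the later slide on repeated cross‐validation addresses.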
[Figure: k‐fold cross‐validation; the model is trained on (k − 1) folds and tested on the remaining fold]
Obtaining The Optimal λ
• Plot the cross‐validated likelihood against λ:
• In this example, the optimal λ = 4.11 gives the maximum log‐likelihood.
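Picking the maximizer from such a curve is a one‐liner; here `cv_curve` is a hypothetical list of (λ, cross‐validated log‐likelihood) pairs computed on a grid:

```python
def optimal_lambda(cv_curve):
    """Return the lambda whose cross-validated log-likelihood is largest.

    cv_curve: list of (lam, cvll) pairs, e.g. one entry per grid value of lambda.
    """
    return max(cv_curve, key=lambda pair: pair[1])[0]
```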
Obtaining The Optimal λ
• Note that in the k‐fold CV method, allocation of the samples into k folds is performed randomly.
• Different randomizations may lead to slightly different optimal values λ̂.
• To obtain a more robust λ value, a hundred different cross‐validations were performed.
• Take the average λ̄ of the hundred optimal values λ̂_1, …, λ̂_100.
• Use λ̄ for fitting the final L1‐penalized logistic model.
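The averaging step can be sketched as follows; `select_lambda` stands in for one complete cross‐validation run and is an assumed placeholder, not part of the slides:

```python
import random

def robust_lambda(select_lambda, n_runs=100, seed=0):
    """Average the optimal lambda over n_runs independently randomized CV runs.

    select_lambda(rng): placeholder callable that performs one full
    cross-validation with the given random source and returns its optimal lambda.
    """
    rng = random.Random(seed)
    lams = [select_lambda(rng) for _ in range(n_runs)]
    return sum(lams) / len(lams)
```

Averaging over many randomizations damps the fold‐allocation noise mentioned above, at the cost of running the cross‐validation many times.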
Validation?
• Use another experiment for validation
Thank you for your attention!