
Logistic Regression
September 17, 2015

Questions?

● From previous lecture?

● From HW?

Naive Bayes Recap
● What do you remember about classification with Naive Bayes?
● What statistics do you need to make a classification?

Naive Bayes: Bag of Words

● BoW - Order independent

● Can we add more features to the model?

Naive Bayes: Bag of Words

● Features statistically independent given class

● Examples of non-independent features?

Independence Assumption
● Correlated features -> double counting
● Can hurt classifier accuracy & calibration

Logistic Regression
● (Log) Linear Model - similar to Naive Bayes
● Doesn’t assume features are independent
● Correlated features don’t “double count”

Classification: LogReg (I)
First, we’ll discuss how LogReg works.
Then, why it’s set up the way that it is.
Application: spam filtering

Classification: LogReg (I)
● compute features (x’s):
  x = (count “nigerian”, count “prince”, count “nigerian prince”)
● given weights (betas):
  β = (-1.0, -1.0, 4.0)
● compute the dot product:
  z = β · x = Σᵢ βᵢ xᵢ

Classification: LogReg (II)
● compute the dot product: z = β · x
● compute the logistic function: p(spam | x) = 1 / (1 + e^(−z))
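
A minimal Python sketch of those two steps (the weights are the ones on the slide; predict_spam_prob is a name invented here, not from the deck):

    import math

    # weights (betas) from the slide, one per feature:
    # (count "nigerian", count "prince", count "nigerian prince")
    betas = [-1.0, -1.0, 4.0]

    def predict_spam_prob(x):
        # step 1: dot product of weights and feature values
        z = sum(b * xi for b, xi in zip(betas, x))
        # step 2: logistic function maps z to a probability in (0, 1)
        return 1.0 / (1.0 + math.exp(-z))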

LogReg Exercise
features: (count “nigerian”, count “prince”, count “nigerian prince”)
β = (-1.0, -1.0, 4.0)
x = (1, 1, 1)
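
Worked through (the arithmetic here is mine, not on the slide): z = (-1.0)(1) + (-1.0)(1) + (4.0)(1) = 2.0, and the logistic function gives 1 / (1 + e^(−2)) ≈ 0.88. Using the sketch above:

    print(predict_spam_prob([1, 1, 1]))  # 0.8807..., i.e. classify as spam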

Classification: LogReg
OK, let’s take this step by step...
● Why dot product?
● Why would we use the logistic function?

Classification: Dot Product
Intuition: weighted sum of features
All linear models have this form!

NB as Log-Linear Model
Recall that Naive Bayes is also a linear model...
● What are the features in Naive Bayes?
● What are the weights in Naive Bayes?

In both NB and LogReg we compute the dot product!
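
The deck presents this as a derivation; here is a standard reconstruction (not verbatim from the slides). Taking the log of the Naive Bayes score for a class y turns the product into a weighted sum, i.e., a dot product: the features are the word counts and the weights are the log conditional probabilities, plus a log-prior bias.

    \log\left( P(y) \prod_{w} P(w \mid y)^{\mathrm{count}(w)} \right)
      = \log P(y) + \sum_{w} \mathrm{count}(w)\, \log P(w \mid y)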

Logistic Function
What does this function look like? What properties does it have?
● logistic function: σ(z) = 1 / (1 + e^(−z))
● decision boundary is dot product = 0 (2 class)
● comes from linear log odds
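
“Comes from linear log odds,” written out (a standard derivation, assumed rather than copied from the slides): posit that the log odds are a linear function of the features, then solve for p. Note that β · x = 0 gives p = 1/2, which is why the two-class decision boundary sits at dot product = 0.

    \log\frac{p}{1-p} = \beta \cdot x
    \;\Longrightarrow\;
    p = \frac{1}{1 + e^{-\beta \cdot x}}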

NB vs. LogReg
● Both compute the dot product
● NB: sum of log probs; LogReg: logistic fun.
● NB: learn conditional probabilities separately via counting
● LogReg: learn weights jointly

Learning Weights
● given: a set of feature vectors and labels
● goal: learn the weights
● n examples; x’s - features; y’s - class

We know: p(y = 1 | x) = 1 / (1 + e^(−β · x))
So let’s try to maximize the probability of the entire dataset - maximum likelihood estimation
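
In symbols (standard MLE form, not copied from the deck): choose the weights that maximize the probability, equivalently the log probability, of all n labeled examples.

    \hat{\beta} = \arg\max_{\beta} \prod_{i=1}^{n} p(y_i \mid x_i;\, \beta)
                = \arg\max_{\beta} \sum_{i=1}^{n} \log p(y_i \mid x_i;\, \beta)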

LogReg Exercise
features: (count “nigerian”, count “prince”, count “nigerian prince”)
β = (1.0, -3.0, 2.0)  → 63% accuracy
β = (0.5, -1.0, 3.0)  → 75% accuracy
β = (-1.0, -1.0, 4.0) → 81% accuracy
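
The evaluation data behind those numbers isn’t in the deck, but a sketch of how such accuracies could be computed (accuracy and data are hypothetical names):

    def accuracy(betas, data):
        # data: list of (feature_vector, label) pairs; label 1 = spam, 0 = not spam
        correct = 0
        for x, y in data:
            z = sum(b * xi for b, xi in zip(betas, x))
            pred = 1 if z > 0 else 0  # decision boundary: dot product = 0
            correct += int(pred == y)
        return correct / len(data)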

Pros & Cons
● LogReg doesn’t assume independence
  ○ better calibrated probabilities
● NB is faster to train; less likely to overfit

NB & LogReg
● Both are linear models: score = β · x
● Training is different:
  ○ NB: weights trained independently
  ○ LogReg: weights trained jointly

LogReg: Important Details!
● Overfitting / regularization
● Visualizing decision boundary / bias term
● Multiclass LogReg

You can use scikit-learn (python) to test it out!
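
For instance, a minimal scikit-learn sketch (the toy emails and labels below are invented for illustration):

    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.linear_model import LogisticRegression

    # toy data, invented for illustration: 1 = spam, 0 = not spam
    emails = ["nigerian prince needs help", "prince of persia release date",
              "nigerian prince wire transfer", "lunch with the prince tomorrow"]
    labels = [1, 0, 1, 0]

    # bag-of-words counts; unigrams + bigrams so "nigerian prince" is a feature
    vectorizer = CountVectorizer(ngram_range=(1, 2))
    X = vectorizer.fit_transform(emails)

    clf = LogisticRegression()      # learns the weights jointly (MLE + regularization)
    clf.fit(X, labels)

    test = vectorizer.transform(["nigerian prince requests your account"])
    print(clf.predict_proba(test))  # [P(not spam), P(spam)]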

Bias Term
