27
The Mismeasure of The Mismeasure of Poverty, Poverty, Happiness & Bias, and Happiness & Bias, and Regression Regression

The Mismeasure of Poverty, Happiness & Bias, and Regression

Embed Size (px)

Citation preview

Page 1: The Mismeasure of Poverty, Happiness & Bias, and Regression

The Mismeasure of Poverty, The Mismeasure of Poverty, Happiness & Bias, and Happiness & Bias, and

RegressionRegression

Page 2: The Mismeasure of Poverty, Happiness & Bias, and Regression

In “The Mismeasure of Poverty,” Nicholas Eberstadt notes that the percentage of Americans living under the poverty rate has been stagnant for 30 years, suggesting that

antipoverty efforts have failed, and that life has not improved for the nation's poor.

Page 3: The Mismeasure of Poverty, Happiness & Bias, and Regression

The problem, Eberstadt continues, is that there are anomalies in the official statistics.

Why does the poverty rate tend to go up as unemployment falls? Why hasn't it budged while the amounts of money the government spends on the poor

have more than doubled in constant dollars?

Page 4: The Mismeasure of Poverty, Happiness & Bias, and Regression

Eberstadt goes back and looks at how the poverty rate is calculated, and finds two realities that are masked by our current statistics, one heartening, one disheartening.

The first reality is that people living under the poverty line are materially much better off than they were three decades ago.

- They live in much bigger homes.

- Three-quarters own at least one motor vehicle.

- They spend roughly twice as much as they report as income, and not because they are going into debt. (Networths have not declined.)

In general, poor people today live at about the same standard of living as middle-class people did in the 1960s.

Eberstadt: The Mismeasure of PovertyEberstadt: The Mismeasure of Poverty

Page 5: The Mismeasure of Poverty, Happiness & Bias, and Regression

Eberstadt: The Mismeasure of PovertyOn the other hand, they live with greater insecurity. In fact, relatively

few people live permanently in poverty (power-law distributions).

But nearly a third of the U.S. population dips into poverty from time to time. Eberstadt paints a picture of greater volatility at the bottom end of the income scale -- a different image from the one portrayed by the immobile statistics, with radically different policy implications (such as?).

Page 6: The Mismeasure of Poverty, Happiness & Bias, and Regression

Daniel Gilbert: “I’m O.K., You’re Biased”Daniel Gilbert: “I’m O.K., You’re Biased”Research shows that decision-makers don’t realize how easily Research shows that decision-makers don’t realize how easily

and often their objectivity is compromised. Much of what and often their objectivity is compromised. Much of what happens in the brain is not evident to the brain itself.happens in the brain is not evident to the brain itself.

And yet, if decision-makers are more biased than they realize, they are less biased than the rest of us suspect:

bathroom scalesbathroom scales saliva test strips for dangerous enzyme deficiencysaliva test strips for dangerous enzyme deficiency evaluating students’ intelligence by examining info. one piece at a time; evaluating students’ intelligence by examining info. one piece at a time;

when subjects liked the student, they kept turning cards searching for one when subjects liked the student, they kept turning cards searching for one good piece of info; when they disliked the students, they turned over a few good piece of info; when they disliked the students, they turned over a few cards, shrugged and quitcards, shrugged and quit

The majority of people claim to be less biased than the majority of people:The majority of people claim to be less biased than the majority of people:

84% of medical resident claimed that their colleagues were influenced by 84% of medical resident claimed that their colleagues were influenced by gifts from Rx drug companies, but only 16% thought they were similarly gifts from Rx drug companies, but only 16% thought they were similarly influenced.influenced.

Page 7: The Mismeasure of Poverty, Happiness & Bias, and Regression

Daniel Gilbert: “I’m O.K., You’re Biased”Daniel Gilbert: “I’m O.K., You’re Biased”Because the brain cannot see itself fooling itself, the only

reliable method for avoiding bias is to avoid the situations that produce it.

Doctors should refuse to accept gifts from those who supply drugs to their patients, when justices refuse to hear cases involving those with whom they share familial ties and when chief executives refuse to let their compensation be determined by those beholden to them, then everyone sleeps well.

Professors should grade “blind” as often as possible.

What about college admissions?

Page 8: The Mismeasure of Poverty, Happiness & Bias, and Regression

Stock Market Volatility & Regression to the MeanStock Market Volatility & Regression to the Mean

Page 9: The Mismeasure of Poverty, Happiness & Bias, and Regression

Stock Market Volatility & Regression to the MeanStock Market Volatility & Regression to the Mean

Page 10: The Mismeasure of Poverty, Happiness & Bias, and Regression

RegressionAllows you to go beyond correlation and

to predict how much, say, a dependent variable Y (e.g., income) changes if independent variable X (e.g., years of education) changes by 1 unit.

Page 11: The Mismeasure of Poverty, Happiness & Bias, and Regression

RegressionTwo kinds of regression analysis:

- bivariate (income by years of education)

- multivariate (or multiple regression)

controls for other independent variables

(income by education, age, race, gender, etc)

Page 12: The Mismeasure of Poverty, Happiness & Bias, and Regression

Bivariate Regression

Y = a + bx

Y = dependent variable (what you basically want

to predict/determine)

x = independent variable

a = Y-intercept (the point at which the regression

line intercepts or crosses the Y-axis: anchor)

b = slope of the regression line (the regression

coefficient)

Page 13: The Mismeasure of Poverty, Happiness & Bias, and Regression

Bivariate RegressionExample: Amex Business Service does routine accounting as one of its basic job offerings. Its rate is $20/per hour plus a $25 floppy disk charge.

The total cost to a customer depends, of course, on the number of hours it takes to complete the job.

So the total cost, Y, of a job that takes x hours is . . .

Y = $25 + 20x

Time & Cost: 5 hours ($125), 7.5 hours ($175),

15 hours ($325), 20 hours ($425), 22.5 hours ($475)

Page 14: The Mismeasure of Poverty, Happiness & Bias, and Regression

Bivariate Regression

5.00 10.00 15.00 20.00

Time in Hours

200.00

300.00

400.00

Co

st (

$)

Cost ($) = 25.00 + 20.00 * hoursR-Square = 1.00

Page 15: The Mismeasure of Poverty, Happiness & Bias, and Regression

Bivariate RegressionY = a + bx + e (error term)

Real-life applications in the natural and social sciences are seldom this straight-forward.

Instead, what we usually have are sets of scores for various independent and dependent variables.

We display them on a scatterplot and use inferential statistics to determine both: the [1] slope (the regression coefficient) and, then, [2] predictions for other independent variables.

For example, consider the education levels and income of 32 employees ...

Page 16: The Mismeasure of Poverty, Happiness & Bias, and Regression

Bivariate Regression

5.00 10.00 15.00 20.00

Years

8000.00

12000.00

16000.00

20000.00In

com

e ($

)

Income ($) = 5077.51 + 732.40 * edction

R-Square = 0.56

Page 17: The Mismeasure of Poverty, Happiness & Bias, and Regression

Bivariate Regression & its Measure of Association Remember that Lambda (for nominal variables) and

Kendall’s tau-b/c (for ordinal variables) are PRE measures, which stands for Proportional Reduction in Error.

PRE: When predicting a dependent variable, how much error can you reduce by knowing the independent variable?

Regression has a similar statistic: R-Square or (R2) or “goodness of fit” always between 0 and 1 (0=no reduction in error, 1=error eliminated)

Page 18: The Mismeasure of Poverty, Happiness & Bias, and Regression

R2 = 0.49Vis

its

to C

ard

iolo

gist

s p

er e

nro

llee

Vis

its

to C

ard

iolo

gist

s p

er e

nro

llee

0.00.0

0.50.5

1.01.0

1.51.5

2.02.0

2.52.5

0.00.0 2.52.5 5.05.0 7.57.5 10.010.0 12.512.5 15.015.0

Number of Cardiologists per 100,000 residentsNumber of Cardiologists per 100,000 residents

Association between cardiologists and visits per person to Association between cardiologists and visits per person to cardiologists among Medicare enrollees: 306 HRRscardiologists among Medicare enrollees: 306 HRRs

SourceSource: John Wennberg (2005): John Wennberg (2005)

Page 19: The Mismeasure of Poverty, Happiness & Bias, and Regression

Discharges forHip Fracture

R2 = 0.06

Discharges forall MedicalConditionsR2 = 0.54

00

5050

100100

150150

200200

250250

300300

350350

400400

1.01.0 2.02.0 3.03.0 4.04.0 5.05.0# of Hospital Beds/per 1,000 Residents# of Hospital Beds/per 1,000 Residents

Dis

char

ge R

ate

Dis

char

ge R

ate

Association between # of hospital beds per 1,000 residents and Association between # of hospital beds per 1,000 residents and discharges per 1,000 Medicare enrollees in 306 HRRsdischarges per 1,000 Medicare enrollees in 306 HRRs

SourceSource: John Wennberg (2005): John Wennberg (2005)

Page 20: The Mismeasure of Poverty, Happiness & Bias, and Regression

Bivariate Regression & Measure of Association

5.00 10.00 15.00 20.00

Years

8000.00

12000.00

16000.00

20000.00In

com

e ($

)

Income ($) = 5077.51 + 732.40 * edctionR-Square = 0.56

13866.31

Page 21: The Mismeasure of Poverty, Happiness & Bias, and Regression

5.00 10.00 15.00 20.00

Years

8000.00

12000.00

16000.00

20000.00

Income ($) = 5077.51 + 732.40 * edctionR-Square = 0.56

13866.31

Bivariate Regression & Measure of AssociationX = 6 years of education & $8,000 income

Regression line estimated approx. $9,000 in income for 6 years of education.

Yet the mean score ($13,866) would have left you with $5,866 in estimated error.

With the regression line (coefficient), you reduced the amount of error by $4,866; you are left with $1,000 in estimated error.

In other words, by knowing the independent variable, you have $4,866 of explained variation and $1,000 of unexplained variation.

explained ($4,866)2

explained ($4,866)2 + unexplained ($1,000)2

R-Square = aggregating (summing) this for all of the data points in the scatterplot

Page 22: The Mismeasure of Poverty, Happiness & Bias, and Regression

Correlations

1 .751**

. .000

32 32

.751** 1

.000 .

32 32

Pearson Correlation

Sig. (2-tailed)

N

Pearson Correlation

Sig. (2-tailed)

N

Years

Income ($)

Years Income ($)

Correlation is significant at the 0.01 level (2-tailed).**.

5.00 10.00 15.00 20.00

Years

8000.00

12000.00

16000.00

20000.00

Inco

me

($)

Income ($) = 5077.51 + 732.40 * edctionR-Square = 0.56

13866.31

Bivariate Regression & Measure of Association

.751 x .751 = 0.56r2 = R2

Page 23: The Mismeasure of Poverty, Happiness & Bias, and Regression

Multivariate or Multiple RegressionY = a + b1x1 + b2x2 + b3x3 + e

Allows you to determine the marginal effect of an independent variable on one dependent variable by simultaneously controlling for several other independent variables.

The regression coefficient b1 estimates the average change in Y (dependent variable) for each unit change in independent variable x1, controlling for the other independent variables: x2 and x3, etc.

Same concepts as bivariate regression, but in a three-dimensional + scatterplot (which can’t be graphically illustrated)

The difference is that you obtain partial slopes and partial regression coefficients, which means the “marginal” effect that each independent variable has on the dependent variable.

Page 24: The Mismeasure of Poverty, Happiness & Bias, and Regression

0 to 1,600 Low (4.6%)1,600 to 3,150 Below Average (25.5%)3,150 to 5,150 Average (43.5%)5,150 to 6,750 Above Average (19.6%)6,750 to 8,350 High (4.9%)

grams/per 100,000 Individuals

8,350 to 11,000 Extremely High (1.8%)

Methylphenidate and Amphetamine Methylphenidate and Amphetamine Distribution (DEA data)Distribution (DEA data)

(average = 4,150 grams/100,000 individuals)(average = 4,150 grams/100,000 individuals)

Page 25: The Mismeasure of Poverty, Happiness & Bias, and Regression

Analysis of the Demand for Psychostimulants:Multivariate Regression

Data Sources:

– Dependent variable I: DEA data provides the distribution of methylphenidate and amphetamine in grams down to the 5 digit zip code-level (1998-2001)

– Dependent variable II: Source® Territory Manager by NDC HEALTH provides Rx Sales and Rx Quantity sold by product down to zip/county or HSA level from 1996 onwards; ** Average price to be derived from Total Sales/Total Quantity **

– Independent variables: Area Resource Files (ARF), InterStudy, Census Estimates, County Business Patterns, Dept. of Education’s Common Core of Data

Market Definitions:

– 3,030 U.S. counties

Page 26: The Mismeasure of Poverty, Happiness & Bias, and Regression

Selected Regression Coefficients for Market Model * indicates that the coefficient is significant at the 10% level, ** at the 5% level, *** at the 1% level

Variable RITALIN ADDERALL

Coefficient Std. Error Coefficient Std. Error

Intercept -5.34*** 1.64 8.05*** 1.56

Price of Ritalin -3.76*** .654 .828*** .220

Price of Adderall .436 .477 -3.86*** .410

Price of Dexedrine 1.13*** .284 1.28*** .273

Price of Dextrostat .653* .337 1.53*** .388

Per Capita Income 1.54*** .189 .228 .183

Population (in 100s) .115 .287 1.91*** .280

Page 27: The Mismeasure of Poverty, Happiness & Bias, and Regression

Selected Regression Coefficients for Market Model * indicates that the coefficient is significant at the 10% level, ** at the 5% level, *** at the 1% level

Variable RITALIN ADDERALL

Coefficient Std. Error Coefficient Std. Error

# of kids (<19) .768*** .254 -.860*** .261

HMO penetration -.062** .026 -.083*** .026

Students-to-Teacher -.392** .175 -.359** .180

# of Private School Students .064*** .018 .024 .020

Accountability Index^ .055*** .018 .041** .108

# of MDs (Total Patient Care)

.260*** .059 .085 .062

Young-to-Old MDs .094** .039 .110*** .038

^ Accountability Index: Ranges from 0 to 5 0 = no teacher accountability and 5 = most stringent policies

R2 (Ritalin) = 0.715 R2 (Adderall) = 0.672