20
Section 4.2: How to Look for Differences

Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Embed Size (px)

Citation preview

Page 1: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Section 4.2: How to Look for Differences

Page 2: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Cross-Tabulations

College student binge drinkers experienced many personal and social problems, the researchers said. For instance, 90% of those who binged more than once a week said they experienced a hangover and 63% said they did something they regretted. --- Newsday, 12/94

Page 3: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Making Sense of the Data

Were binge drinkers more likely to do things they regretted than students who did not binge drink?

Defn: Binge drinking – consuming 5 or more dinks in a row for men or four or more in a row for women.

Page 4: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Wechsler Study on American College Students

Nonbinge drinkers (n=6894)

Infrequent binge drinkers (n=4090)

Frequent binge drinkers (n=3291)

Did something you regret

14% 37% 63%

Missed a class

8% 30% 61%

Got behind in schoolwork

6% 21% 46%

Page 5: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Cross-Tab Tables

Categories of one measurement are listed on the top and the categories of the other measurement are listed on the side. Also, list the number of people that appear in each cell.

Page 6: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Wechsler Cross Tab

Nonbinge Infrequent Binge

Frequent Binge

Kept up with schoolwork

6,480 3,231 1,777

Got behind with schoolwork

414 859 1,514

Page 7: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Marginal Totals and Proportions

Cross Tabs will often show the totals (proportions) of each column in the bottom margin under the column and the total (proportions) of each row in the margin to the right of the row.

Page 8: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Cross-Tab with Column Marginals

Nonbinge Infrequent Binge

Frequent Binge

Kept up with schoolwork

6,480 3,231 1,777

Got behind with schoolwork

414 859 1,514

Column Count

6,894 4,090 3,291

Column % 48% 29% 23%

Page 9: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Correlation

A cross-tab is a useful way of checking for correlation between two categorical variables.

Two variables are correlated if knowing an observation’s value for one of the measurements helps you to better estimate the likelihoods of that observation’s value on the other measurement.

Page 10: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Correlation – Another View

If the counts you would expect if there was no correlation are the counts you observed, then there is no correlation.

Page 11: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Computing Expected Counts

Expected counts can be thought of as the count that would be if the row proportion applied equally to all the data in it; that is, if there were no correlation between the observations in the cross-tab.

Page 12: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Correlation in the Sample vs. Population Even though the expected counts may

show a correlation in the sample data, how does one show there is correlation in the population?Perform a hypothesis test.

Page 13: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Step 1: State the Null Hypothesis

H0: No correlation in the population data.

Remember…. H0 is always the hypothesis of “no difference”. In this instance “no difference” means no difference between expected and observed counts – or no correlation.

Page 14: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Step 2: Collect Sample Data

Use the Wechsler data.

Is the sample data statistically significant? That is, assuming the null hypothesis is correct (i.e. there is no correlation between the two variables), how likely is it to get the sample data that we observed?

Page 15: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Step 3: Compute the Appropriate Test Statistic In testing a claim about one sample

proportion, the test statistic is the z-test statistic.

Since we have several sample proportions we’ll need a new test statistic – the chi-squared test statistic.

Page 16: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

To compute the test statistic:

Calculate the expected counts for each cell.

We’ll actually calculate the chi-squared test statistic tomorrow.

Page 17: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Step 4: Find the p-value

A p-value means just what it did earlier…. Assuming the null hypothesis is true (that is, there is no correlation between the two variables), it is the probability of getting sample data as extreme or more extreme than we observed.

Page 18: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Or more simply….

If there is no correlation (association) between the two variables, could the differences in the observed and expected cell counts be attributed to chance?

With this test we’ll use the TI-83 to find the p-value.

Page 19: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Step 4: Reject or retain the null hypothesis

Page 20: Section 4.2: How to Look for Differences. Cross-Tabulations College student binge drinkers experienced many personal and social problems, the researchers

Step 5: State Your Conclusions

There is statistical evidence to suggest that there is (is not) correlation between the two variables…..