13
Chi-Square Tests and the F- Distribution Chapter 10

Chi-Square Tests and the F - Distribution

  • Upload
    arama

  • View
    26

  • Download
    2

Embed Size (px)

DESCRIPTION

Chapter 10. Chi-Square Tests and the F - Distribution. Independence. § 10.2. Age. Gender. 16 – 20. 21 – 30. 31 – 40. 41 – 50. 51 – 60. 61 and older. Male. 32. 51. 52. 43. 28. 10. Female. 13. 22. 33. 21. 10. 6. Contingency Tables. - PowerPoint PPT Presentation

Citation preview

Page 1: Chi-Square Tests and the  F - Distribution

Chi-Square Tests and the F-

Distribution

Chapter 10

Page 2: Chi-Square Tests and the  F - Distribution

§ 10.2Independence

Page 3: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 3

Contingency TablesAn r c contingency table shows the observed frequencies for two variables. The observed frequencies are arranged in r rows and c columns. The intersection of a row and a column is called a cell.

The following contingency table shows a random sample of 321 fatally injured passenger vehicle drivers by age and gender. (Adapted from Insurance Institute for Highway Safety)

61021332213Female

102843525132Male

61 and older51 – 6041 – 5031 – 4021 – 3016 – 20Gender

Age

Page 4: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 4

Expected FrequencyAssuming the two variables are independent, you can use the contingency table to find the expected frequency for each cell.

Finding the Expected Frequency for Contingency Table CellsThe expected frequency for a cell Er,c in a contingency table is

,(Sum of row ) (Sum of column )Expected frequency .

Sample sizer cr cE

Page 5: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 5

Expected Frequency

Example:Find the expected frequency for each “Male” cell in the contingency table for the sample of 321 fatally injured drivers. Assume that the variables, age and gender, are independent.

10561021332213Female

16

10

61 and older

3213864857345Total

2162843525132Male

Total51 – 60

41 – 50

31 – 40

21 – 30

16 – 20

Gender

Age

Continued.

Page 6: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 6

Expected FrequencyExample continued:

10561021332213Female

16

10

61 and older

3213864857345Total

2162843525132Male

Total51 – 6041 – 5031 – 4021 – 3016 – 20GenderAge

,(Sum of row ) (Sum of column )Expected frequency

Sample sizer cr cE

1,2216 73 49.12

321E 1,1

216 45 30.28321

E 1,3216 85 57.20

321E

1,5216 38 25.57

321E 1,4

216 64 43.07321

E 1,6216 16 10.77

321E

Page 7: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 7

Chi-Square Independence Test

A chi-square independence test is used to test the independence of two variables. Using a chi-square test, you can determine whether the occurrence of one variable affects the probability of the occurrence of the other variable.

For the chi-square independence test to be used, the following must be true.

1. The observed frequencies must be obtained by using a random sample.

2. Each expected frequency must be greater than or equal to 5.

Page 8: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 8

Chi-Square Independence Test

The Chi-Square Independence Test If the conditions listed are satisfied, then the sampling distribution for the chi-square independence test is approximated by a chi-square distribution with

(r – 1)(c – 1)degrees of freedom, where r and c are the number of rows and columns, respectively, of a contingency table. The test statistic for the chi-square independence test is

where O represents the observed frequencies and E represents the expected frequencies.

22 ( )O Eχ

E

The test is always a right-tailed test.

Page 9: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 9

Chi-Square Independence Test

1. Identify the claim. State the null and alternative hypotheses.

2. Specify the level of significance.

3. Identify the degrees of freedom.

4. Determine the critical value.

5. Determine the rejection region.Continued.

Performing a Chi-Square Independence Test

In Words In Symbols

State H0 and Ha.

Identify .

Use Table 6 in Appendix B.

d.f. = (r – 1)(c – 1)

Page 10: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 10

Chi-Square Independence Test

Performing a Chi-Square Independence Test

In Words In Symbols

If χ2 is in the rejection region, reject H0. Otherwise, fail to reject H0.

6. Calculate the test statistic.

7. Make a decision to reject or fail to reject the null hypothesis.

8. Interpret the decision in the context of the original claim.

22 ( )O Eχ

E

Page 11: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 11

Chi-Square Independence Test

Example:The following contingency table shows a random sample of 321 fatally injured passenger vehicle drivers by age and gender. The expected frequencies are displayed in parentheses. At = 0.05, can you conclude that the drivers’ ages are related to gender in such accidents?

1056 (5.23)

10 (12.43)

21 (20.93)

33 (27.80)

22 (23.88)

13 (14.72)

Female

16

10 (10.77)

61 and older

3213864857345

21628 (25.57)

43 (43.07)

52 (57.20)

51 (49.12)

32 (30.28)

Male

Total51 – 60

41 – 50

31 – 40

21 – 30

16 – 20

Gender

Age

Page 12: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 12

Chi-Square Independence Test

Example continued:

Continued.

Ha: The drivers’ ages are dependent on gender. (Claim)

H0: The drivers’ ages are independent of gender.

Because each expected frequency is at least 5 and the drivers were randomly selected, the chi-square independence test can be used to test whether the variables are independent.

With d.f. = 5 and = 0.05, the critical value is χ20 =

11.071.

d.f. = (r – 1)(c – 1) = (2 – 1)(6 – 1) = (1)(5) = 5

Page 13: Chi-Square Tests and the  F - Distribution

Larson & Farber, Elementary Statistics: Picturing the World, 3e 13

Chi-Square Independence Test

Example continued:

X2

0.05

Rejection region

χ20 = 11.071

5.2312.4320.9327.8023.8814.7210.7725.5743.0757.2049.1230.28

E

0.772.430.075.2

1.881.720.772.430.075.21.881.72

O – E

0.2012.9584130.05510.5929100.23095.9049280.00010.004943

0.1483.534422

0.472727.04520.0723.5344510.09772.958432

0.59295.90490.004927.04

(O – E)2

0.113460.4751100.0002210.972733

O2( )O E

E

22 ( ) 2.84O Eχ

E

Fail to reject H0.

There is not enough evidence at the 5% level to conclude that age is dependent on gender in such accidents.