14
BPS - 5th Ed. Chapter 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple of its weight. The adhesion of one 4400-horsepower diesel locomotive model varies in actual use according to a Normal distribution with mean μ = 0.34 and standard deviation σ = 0.049 What proportion of adhesions (± 0.001) measured in use are higher than 0.47? Z = (0.47-0.34) / 0.049 = 2.653 Area to the right of z = 2.65 is 0.0040 What proportion of adhesions (± 0.001) are between 0.47 and 0.49? The new z-score is (0.49-0.34)/0.049 = 3.06 To find the area between the two z-scores, we find the difference in the areas to the left of each. Area left of 3.06 = 0.9989 Area left of 2.65 = 0.9960 Area between = 0.9989-0.9960 = 0.0029 What do you do if your z-value is bigger than the table values? Use the last value on the table. We know that the probability to the right of z = 3.939 is smaller than the area to the right for 3.49 (or whatever the last value on the table is). For a z-score of 6.2978, you can safely put down 0 or 1 (whichever side is appropriate) and be correct to within rounding.

BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Embed Size (px)

Citation preview

Page 1: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

BPS - 5th Ed.Chapter 6 1

An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple of its weight. The adhesion of one 4400-horsepower diesel locomotive model varies in actual use according to a Normal distribution with mean μ = 0.34 and standard deviation σ = 0.049 What proportion of adhesions (± 0.001) measured in use are higher than 0.47? Z = (0.47-0.34) / 0.049 = 2.653 Area to the right of z = 2.65 is 0.0040 What proportion of adhesions (± 0.001) are between 0.47 and 0.49? The new z-score is (0.49-0.34)/0.049 = 3.06To find the area between the two z-scores, we find the difference in the areas to the left of each.Area left of 3.06 = 0.9989 Area left of 2.65 = 0.9960

Area between = 0.9989-0.9960 = 0.0029 What do you do if your z-value is bigger than the table values? Use the last value on the table. We know that the probability to the right of z = 3.939 is smaller than the area to the right for 3.49 (or whatever the last value on the table is). For a z-score of 6.2978, you can safely put down 0 or 1 (whichever side is appropriate) and be correct to within rounding.

Page 2: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Chapter 6Two-Way Tables

BPS - 5th Ed.Chapter 62

Page 3: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Categorical Variables

• In this chapter we will study the relationship between two categorical variables (variables whose values fall in groups or categories).

•To analyze categorical data, use the counts or percents of individuals that fall into various categories.

BPS - 5th Ed.Chapter 6 3

Page 4: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Two-Way Table

• When there are two categorical variables, the data are summarized in a two-way table• each row in the table represents a value of the row

variable• each column of the table represents a value of the

column variable

• The number of observations falling into each combination of categories is entered into each cell of the table

BPS - 5th Ed.Chapter 6 4

Page 5: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Marginal Distributions

• A distribution for a categorical variable tells how often each outcome occurred • totaling the values in each row of the table

gives the marginal distribution of the row variable (totals are written in the right margin)

• totaling the values in each column of the table gives the marginal distribution of the column variable (totals are written in the bottom margin)

BPS - 5th Ed.Chapter 6 5

Page 6: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Marginal Distributions

• It is usually more informative to display each marginal distribution in terms of percents rather than counts

• each marginal total is divided by the table total to give the percents

• A bar graph could be used to graphically display marginal distributions for categorical variables

BPS - 5th Ed.Chapter 6 6

Page 7: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Case Study

BPS - 5th Ed.Chapter 6 7

Data from the U.S. Census Bureau for the year 2000 on the level of education reached by Americans

of different ages.

(Statistical Abstract of the United States, 2001)

Age and Education

Page 8: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Case Study

BPS - 5th Ed.Chapter 6 8

Age and Education

Variables

Marginal distributions

Page 9: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Case Study

BPS - 5th Ed.Chapter 6 9

Age and Education

Variables

Marginal distributions

21.6% 46.5% 32.0%

15.9%33.1%25.4%25.6%

Page 10: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Case Study

BPS - 5th Ed.Chapter 6 10

Age and Education

Marginal Distributionfor Education Level

Not HS grad 15.9%

HS grad 33.1%

College 1-3 yrs 25.4%

College ≥4 yrs 25.6%

Page 11: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Conditional Distributions

• Relationships between categorical variables are described by calculating appropriate percents from the counts given in the table• prevents misleading comparisons due to

unequal sample sizes for different groups

BPS - 5th Ed.Chapter 6 11

Page 12: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Case Study

BPS - 5th Ed.Chapter 6 12

Age and EducationCompare the 25-34 age group to the 35-54 age group in terms of success in completing at least 4 years of college:

Data are in thousands, so we have that 11,071,000 persons in the 25-34 age group have completed at least 4 years of college, compared to 23,160,000 persons in the 35-54 age group.

The groups appear greatly different, but look at the group totals.

Page 13: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Case Study

BPS - 5th Ed.Chapter 6 13

Age and Education

Change the counts to percents:

Now, with a fairer comparison using percents, the groups appear very similar.

Compare the 25-34 age group to the 35-54 age group in terms of success in completing at least 4 years of college:

Page 14: BPS - 5TH ED.CHAPTER 6 1 An important measure of the performance of a locomotive is its "adhesion," which is the locomotive's pulling force as a multiple

Case Study

BPS - 5th Ed.Chapter 6 14

Age and Education

If we compute the percent completing at least four years of college for all of the age groups, this would give us the conditional distribution of age, given that the education level is “completed at least 4 years of college”:

Age: 25-34 35-54 55 and over

Percent with≥ 4 yrs college: 29.3% 28.4% 18.9%