CHAPTER 9

CONFIDENCE INTERVAL ESTIMATION

MULTIPLE CHOICE QUESTIONS

In the following multiple-choice questions, please circle the correct answer.

1. The confidence interval for a proportion is based on the assumption of a large sample size. A rule of thumb for checking the validity of this assumption is if

are all greater than what value?

a. 0
b. n
c. 2
d. 3
e. 5
ANSWER: e

2. When the samples we want to compare are paired in some natural way, such as pretest/posttest for each person or husband/wife pairs, a more appropriate form of analysis is to not compare two separate variables, but their ________.

a. difference
b. sum
c. ratio
d. total
e. product
ANSWER: a

3. Confidence intervals are a function of which of the following three things?

a. The population, the sample, and the standard deviation
b. The sample, the variable of interest, and the degrees of freedom
c. The data in the sample, the confidence level, and the sample size
d. The sampling distribution, the confidence level, and the degrees of freedom
e. The mean, the median, and the mode
ANSWER: c

4. The chi-square and F distributions are used primarily to make inferences about population ___________.

a. means
b. variances
c. medians
d. modes
e. proportions
ANSWER: b

5. If you increase the confidence level, the width of the confidence interval ________.

a. decreases
b. increases
c. stays the same
d. may increase or decrease, depending on the sample data
ANSWER: b

6. A random sample allows us to use:

a. the rules of probabilities
b. the rules of large numbers
c. the laws of parameters
d. the laws of distributions
e. the laws of gravity
ANSWER: a

7. Suppose there are 500 accounts in a population. You sample 50 of them and find a sample total of $5,000. What would be your estimate for the population total?

a. $5,000
b. $50,000
c. $250,000
d. $2,500,000
e. None of the above
ANSWER: b

8. Suppose there are 400 accounts in a population. You sample 50 of them and find a sample mean of $500. What would be your estimate for the population total?

a. $5,000
b. $50,000
c. $250,000
d. $2,500,000
e. None of the above
ANSWER: e (the estimate is 400 × $500 = $200,000)
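Questions 7 and 8 both scale a sample result up to the whole population. The following is a minimal Python sketch of that arithmetic, assuming simple random sampling; the function names are illustrative and do not come from the text.

```python
def total_from_sample_mean(population_size, sample_mean):
    """Point estimate of a population total: N times the sample mean."""
    return population_size * sample_mean


def total_from_sample_total(population_size, sample_size, sample_total):
    """Same estimate when only the sample total is known."""
    return total_from_sample_mean(population_size, sample_total / sample_size)


# Question 7: 500 accounts, 50 sampled, sample total of $5,000.
q7_estimate = total_from_sample_total(500, 50, 5000)
# Question 8: 400 accounts, sample mean of $500.
q8_estimate = total_from_sample_mean(400, 500)
print(q7_estimate, q8_estimate)
```

For Question 7 the sample mean is $100, so the estimate is 500 × $100; for Question 8 it is 400 × $500.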

9. When we replace \(\sigma\) with the sample standard deviation (s), we introduce a new source of variability and the sampling distribution becomes the ________.

a. t distribution
b. F distribution
c. chi-square distribution
d. robust distribution
ANSWER: a

10. Another commonly used random mechanism, besides a simple random sample, is called:

a. interval estimation
b. a random hypothesis test
c. a randomized experiment
d. a nuisance sample
ANSWER: c

11. If the odds of a horse winning a race are 2 to 1, then the probability of this horse winning the race is ________.

a. 1/4
b. 1/3
c. 1/2
d. 2/3
e. 2/10
ANSWER: d
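The conversion behind Question 11 can be sketched in a few lines of Python. This assumes the stated odds are odds in favor of the event ("a to b" meaning a favorable chances for every b unfavorable); the function name is illustrative.

```python
from fractions import Fraction


def odds_to_probability(in_favor, against):
    """Convert odds of 'in_favor to against' into a probability."""
    return Fraction(in_favor, in_favor + against)


# Odds of 2 to 1 in favor of the horse winning.
p_win = odds_to_probability(2, 1)
print(p_win)
```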

12. There are, generally speaking, two types of statistical inference. They are:

a. sample estimation and population estimation
b. confidence interval estimation and hypothesis testing
c. interval estimation for a mean and interval estimation for a proportion
d. independent sample estimation and dependent sample estimation
e. none of the above
ANSWER: b

13. The t distribution has ________ degrees of freedom.

a. n
b. 2
c. 10
d. n – 1
e. trillion
ANSWER: d

14. If you are constructing a confidence interval for a single mean, the width of the confidence interval will ________ with an increase in the sample size.

a. decrease
b. increase
c. stay the same
d. increase or decrease, depending on the sample data
ANSWER: a

15. As the sample size increases, the t distribution becomes more similar to the __ distribution.

a. normal
b. exponential
c. F
d. chi-square
e. binomial
ANSWER: a

16. A parameter, such as \(\sigma\), is sometimes referred to as a ________ parameter, because many times we need its value even though it is not the parameter of primary interest.

a. special
b. random
c. nuisance
d. independent
e. dependent
ANSWER: c

17. When you calculate the sample size for a proportion, you use an estimate for the population proportion. A conservative value for n can be obtained by using an estimate equal to ________.

a. 0.0
b. 0.05
c. 0.10
d. 0.50
e. 1.00
ANSWER: d

QUESTIONS 18 THROUGH 23 ARE BASED ON THE FOLLOWING INFORMATION:

The following values have been calculated using the TDIST and TINV functions in Excel. These values come from a t distribution with 24 degrees of freedom.

These values represent the probability to the right of the given positive values.

Value   t probability
1.00    0.1636
1.20    0.1209
1.40    0.0872

These values represent the t value for a given two-tailed probability (the sum of both tails).

Probability   t value
0.20          1.3178
0.10          1.7109
0.05          2.0639

18. What is the probability of a t-value smaller than 1.00?

a. 0.1209
b. 0.1636
c. 0.8364
d. 0.8791
ANSWER: c

19. What is the probability of a t-value larger than 1.20?

a. 0.0872
b. 0.1209
c. 0.1636
d. 0.2000
ANSWER: b

20. What is the probability of a t-value between –1.40 and +1.40?

a. 0.7582
b. 0.8256
c. 0.9128
d. 0.9500
ANSWER: b

21. What would be the t-value where 0.05 of the values are in the upper tail?

a. +1.000
b. +1.318
c. +1.711
d. +2.064
ANSWER: c

22. What would be the t-values where 0.10 of the values are in both tails (sum of both tails)?

a. –1.000, +1.000
b. –1.318, +1.318
c. –1.711, +1.711
d. –2.064, +2.064
ANSWER: c

23. What would be the t-values where 0.95 of the values would fall within this interval?

a. –1.000, +1.000
b. –1.318, +1.318
c. –1.711, +1.711
d. –2.064, +2.064
ANSWER: d
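Questions 18 through 20 only need the symmetry of the t distribution and the right-tail probabilities tabulated above. A small Python sketch of that arithmetic follows; the dictionary simply restates the table, and the function names are illustrative.

```python
def prob_below(right_tail):
    """P(T < a) from the right-tail probability P(T > a)."""
    return 1.0 - right_tail


def prob_between_symmetric(right_tail):
    """P(-a < T < a): remove both tails, which are equal by symmetry."""
    return 1.0 - 2.0 * right_tail


# Right-tail probabilities from the first table above.
right_tail = {1.00: 0.1636, 1.20: 0.1209, 1.40: 0.0872}

p_below_1 = prob_below(right_tail[1.00])                   # Question 18
p_between_140 = prob_between_symmetric(right_tail[1.40])   # Question 20
print(round(p_below_1, 4), round(p_between_140, 4))
```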

QUESTIONS 24 THROUGH 29 ARE BASED ON THE FOLLOWING INFORMATION:

The following values have been calculated using the TDIST and TINV functions in Excel. These values come from a t distribution with 15 degrees of freedom.

These values represent the probability to the right of the given positive values.

Value   t probability
0.95    0.1786
1.15    0.1341
1.20    0.1244

These values represent the t value for a given two-tailed probability (the sum of both tails).

Probability   t value
0.20          1.341
0.15          1.517
0.10          1.753

24. What is the probability of a t-value smaller than 1.20?

a. 0.8756
b. 0.8659
c. 0.1341
d. 0.1244
ANSWER: a

25. What is the probability of a t-value larger than 1.15?

a. 0.1786
b. 0.1341
c. 0.1244
d. 0.1500
ANSWER: b

26. What is the probability of a t-value between –0.95 and +0.95?

a. 0.1786
b. 0.3572
c. 0.6428
d. 0.8214
ANSWER: c

27. What would be the t-value where 0.075 of the values are in the upper tail?

a. +1.000
b. +1.341
c. +1.517
d. +1.753
ANSWER: c

28. What would be the t-values where 0.80 of the values would fall within this interval?

a. –1.000, +1.000
b. –1.341, +1.341
c. –1.517, +1.517
d. –1.753, +1.753
ANSWER: b

29. What would be the t-values where 0.10 of the values are in both tails (sum of both tails)?

a. –1.000, +1.000
b. –1.341, +1.341
c. –1.517, +1.517
d. –1.753, +1.753
ANSWER: d

TEST QUESTIONS

30. You are told that a random sample of 150 people from Iowa has been given cholesterol tests, and 60 of these people had levels over the “safe” count of 200. Construct a 95% confidence interval for the population proportion of people in Iowa with cholesterol levels over 200.

ANSWER:

Lower limit = 0.3216, and upper limit = 0.4784
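The limits in Question 30 follow from the usual large-sample interval for a proportion. The sketch below reproduces them in Python under that assumption; `proportion_ci` is an illustrative name, and the z-multiple is obtained from the standard library's `NormalDist`.

```python
from math import sqrt
from statistics import NormalDist


def proportion_ci(successes, n, confidence=0.95):
    """Large-sample confidence interval for a population proportion."""
    p_hat = successes / n
    # z-multiple leaving (1 - confidence) / 2 in each tail.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin


# Question 30: 60 of 150 people had cholesterol levels over 200.
lower, upper = proportion_ci(60, 150)
print(f"{lower:.4f} {upper:.4f}")
```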

31. You are trying to estimate the average amount a family spends on food during a year. In the past, the standard deviation of the amount a family has spent on food during a year has been approximately $1200. If you want to be 99% sure that you have estimated average family food expenditures within $60, how many families do you need to survey?

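ANSWER: \(\sigma\) = 1200, z-multiple = 2.575, B = 60. The sample size for a mean is given by \(n = (z\text{-multiple} \times \sigma / B)^2 = (2.575 \times 1200 / 60)^2 = 2652.25\), or 2,653.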

32. You have been assigned to determine whether more people prefer Coke to Pepsi. Assume that roughly half the population prefers Coke and half prefers Pepsi.

How large a sample would you need to take to ensure that you could estimate, with 95% confidence, the proportion of people preferring Coke within 3% of the actual value?

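ANSWER: estimated proportion = 0.50, z-multiple = 1.96, B = 0.03. The sample size for a proportion is given by \(n = (z\text{-multiple})^2 \, p(1-p) / B^2 = (1.96)^2 (0.50)(0.50) / (0.03)^2 = 1067.11\), or 1,068.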
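The two sample-size calculations in Questions 31 and 32 can be checked with a short Python sketch. It assumes the standard formulas (z-multiple × σ / B)² for a mean and z² p(1 − p) / B² for a proportion; the function names are illustrative.

```python
from math import ceil


def sample_size_for_mean(z_multiple, sigma, half_width):
    """Sample size so a mean is estimated within +/- half_width."""
    return (z_multiple * sigma / half_width) ** 2


def sample_size_for_proportion(z_multiple, p_est, half_width):
    """Sample size so a proportion is estimated within +/- half_width."""
    return z_multiple ** 2 * p_est * (1 - p_est) / half_width ** 2


n_mean = sample_size_for_mean(2.575, 1200, 60)          # Question 31
n_prop = sample_size_for_proportion(1.96, 0.50, 0.03)   # Question 32
# Round up: a fractional observation cannot be sampled.
print(ceil(n_mean), ceil(n_prop))
```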

QUESTIONS 33 THROUGH 35 ARE BASED ON THE FOLLOWING INFORMATION:

A marketing research consultant hired by Coca-Cola is interested in determining the proportion of customers who favor Coke over other soft drinks. A random sample of 400 consumers was selected from the market under investigation and showed that 53% favored Coca-Cola over other brands.

33. Compute a 95% confidence interval for the true proportion of people who favor Coke. Do the results of this poll convince you that a majority of people favors Coke?

ANSWER: 0.53 ± 0.0489 = (0.4811, 0.5789). Since the confidence interval ranges from 48.1% to 57.9%, it is difficult to conclude that a majority of people favors Coke. The true proportion could be below 50%.

34. Suppose 2,000 (not 400) people were polled and 53% favored Coke. Would you now be convinced that a majority of people favor Coke? Why might your answer be different than in Question 33?

ANSWER: 0.53 ± 0.0219 = (0.5081, 0.5519). In this case the 95% confidence interval is entirely above 50%, so the data are now more convincing than they were previously. The larger sample size produces a narrower interval.

35. How many people would have to be surveyed to be 95% confident that you can estimate the fraction of people who favor Coca-Cola within 1%?

ANSWER:9,569.43 or 9,570.

QUESTIONS 36 AND 37 ARE BASED ON THE FOLLOWING INFORMATION:

The employee benefits manager of a medium size business would like to estimate the proportion of full-time employees who prefer adopting plan A of three available health care plans in the coming annual enrollment period. A reliable frame of the company’s employees and their tentative health care preferences are available. Using Excel, the manager chose a random sample of size 50 from the frame. There were 17 employees in the sample who preferred plan A.

36. Construct a 99% confidence interval for the proportion of company employees who prefer plan A. Assume that the population consists of the preferences of all employees in the frame.

ANSWER:

lower limit = 0.1675, upper limit = 0.5125

37. Interpret the 99% confidence interval constructed in Question 36.

ANSWER: We are 99% confident that the proportion of all employees who prefer plan A is between 0.1675 and 0.5125.

QUESTIONS 38 THROUGH 40 ARE BASED ON THE FOLLOWING INFORMATION:

Q-Mart is interested in comparing its male and female customers. Q-Mart would like to know if its female charge customers spend more money, on average, than its male charge customers. They have collected random samples of 25 female customers and 22 male customers. On average, female charge customers spend $102.23 and male charge customers spend $86.46. Summary information is shown below.

Summary statistics for two samples
                              Female    Male
Sample sizes                  25        22
Sample means                  102.23    86.46
Sample standard deviations    93.393    59.695

Confidence interval for difference between means
Sample mean difference        15.77
Pooled standard deviation     79.466
Std error of difference       23.23

38. Using a t-value of 2.014, calculate a 95% confidence interval for the difference between the average female purchase and the average male purchase. Would you conclude that there is a significant difference between females and males in this case? Explain.

ANSWER: 15.77 ± 46.785 = (-31.015, 62.555). Since the range includes 0, there does not appear to be a significant difference between the means of the two groups.
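The pooled standard deviation, the standard error, and the interval in Question 38 can all be rebuilt from the summary statistics. The following Python sketch assumes equal population variances (the pooled-variance procedure) and takes the t-multiple 2.014 as given in the question; the function name is illustrative.

```python
from math import sqrt


def pooled_two_sample_ci(n1, mean1, sd1, n2, mean2, sd2, t_multiple):
    """Pooled-variance CI for the difference between two means."""
    df = n1 + n2 - 2
    pooled_sd = sqrt(((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df)
    std_error = pooled_sd * sqrt(1 / n1 + 1 / n2)
    diff = mean1 - mean2
    margin = t_multiple * std_error
    return pooled_sd, std_error, (diff - margin, diff + margin)


# Female sample first, male sample second, t-multiple from Question 38.
pooled_sd, std_error, (lower, upper) = pooled_two_sample_ci(
    25, 102.23, 93.393, 22, 86.46, 59.695, 2.014
)
print(f"{pooled_sd:.3f} {std_error:.2f} ({lower:.3f}, {upper:.3f})")
```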

39. What are the degrees of freedom for the t-statistic in this calculation? Explain how you would calculate the degrees of freedom in this case.

ANSWER: n1 + n2 – 2 = 25 + 22 – 2 = 45

40. What is the assumption in this case that allows you to use the pooled standard deviation for this confidence interval?

ANSWER: In order to use the pooled standard deviation for this confidence interval, we must assume that the two population standard deviations are equal (\(\sigma_1 = \sigma_2\)).

QUESTIONS 41 AND 42 ARE BASED ON THE FOLLOWING INFORMATION:

A company employs two shifts of workers. Each shift produces a type of gasket where the thickness is the critical dimension. The average thickness and the standard deviation of thickness for shift 1, based on a random sample of 40 gaskets, are 10.85 mm and 0.16 mm, respectively. The corresponding figures for shift 2, based on a random sample of 30 gaskets, are 10.90 mm and 0.19 mm. Let \(\mu_1 - \mu_2\) be the difference in mean thickness between shifts 1 and 2, and assume that the population variances are equal.

41. Construct a 95% confidence interval for \(\mu_1 - \mu_2\).

ANSWER:

The pooled standard deviation is \(s_p = \sqrt{\frac{(n_1-1)s_1^2 + (n_2-1)s_2^2}{n_1+n_2-2}}\) = 0.1734

Lower limit = -0.1336, and upper limit = 0.0336.

42. Based on your answer to Question 41, are you convinced that the gaskets from shift 2 are, on average, thicker than those from shift 1? Why or why not?

ANSWER: No. The confidence interval extends from a negative number (indicating shift 2 thickness is larger) to a positive number (indicating shift 2 thickness is smaller), so we cannot be sure which mean is greater.

QUESTIONS 43 AND 44 ARE BASED ON THE FOLLOWING INFORMATION:

A sample of 9 production managers with over 15 years of experience has an average salary of $71,000 and a sample standard deviation of $18,000.

43. You can be 95% confident that the mean salary for all production managers with at least 15 years of experience is between what two numbers (the t-statistic with 8 degrees of freedom is 2.306)? What assumption are you making about the distribution of salaries?

ANSWER: $71,000 ± $13,836 = ($57,164, $84,836). The assumption is that the population is normal or near normal. This is particularly important since the sample size is so small (9). However, the t distribution is rather robust to violations of normality.

44. What sample size would be needed to ensure that we could estimate the true mean salary of all production managers with more than 15 years of experience and have only 5 chances in 100 of being off by more than $600?

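ANSWER: n = (2.306 × 18,000 / 600)² = (69.18)² = 4,785.87, or 4,786, using the t-multiple 2.306 from Question 43 and s = $18,000 as the estimate of the standard deviation.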

QUESTIONS 45 THROUGH 50 REQUIRE THE USE OF EXCEL:

45. Compute has a t-distribution with 10 degrees of freedom.

ANSWER:0.74730

46. Compute has a t-distribution with 100 degrees of freedom.

ANSWER:0.77176

47. Compute where Z is a standard normal random variable.

ANSWER:0.77454

48. Compare the result of Question 47 to the results obtained in Questions 45 and 46. How do you explain the difference in these probabilities?

ANSWER:The variance of t with a small degree of freedom is larger than a t with a large degree of freedom, which is larger than for a Z. This explains why the “between” probabilities in Questions 45, 46, and 47 increase.

49. Find the 75th percentile of the t-distribution with 25 degrees of freedom.

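ANSWER: 0.6844. In Excel this is TINV(0.50, 25), because TINV takes a two-tailed probability; TINV(0.75, 25) = 0.32217 is the 62.5th percentile.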

50. Find the 75th percentile of the t-distribution with 5 degrees of freedom.

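ANSWER: 0.7267. In Excel this is TINV(0.50, 5), because TINV takes a two-tailed probability; TINV(0.75, 5) = 0.33672 is the 62.5th percentile.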

QUESTIONS 51 and 52 ARE BASED ON THE FOLLOWING INFORMATION:

A sample of 40 country CD recordings of Willie Nelson has been examined. The average playing time of these recordings is 51.3 minutes, and the standard deviation is 5.8 minutes.

51. Construct a 95% confidence interval for the mean playing time of all Willie Nelson recordings.

ANSWER: n = 40, \(\bar{X}\) = 51.3, s = 5.8

Lower limit = 49.445, and upper limit = 53.155

52. Interpret the confidence interval you constructed.

ANSWER: We are 95% confident that the mean playing time of all Willie Nelson recordings is between 49.445 and 53.155 minutes.

QUESTIONS 53 AND 54 ARE BASED ON THE FOLLOWING INFORMATION:

A department store is interested in the average balance that is carried on its store’s credit card. A sample of 40 accounts reveals an average balance of $1,250 and a standard deviation of $350.

53. Find a 95% confidence interval for the mean account balance on this store’s credit card (the t-statistic with 39 degrees of freedom is 2.02).

ANSWER: $1,250 ± $111.79 = ($1,138.21, $1,361.79).

54. What sample size would be needed to ensure that we could estimate the true mean account balance and have only 5 chances in 100 of being off by more than $100?

ANSWER:49.98 or 50.

QUESTIONS 55 AND 56 ARE BASED ON THE FOLLOWING INFORMATION:

A market research consultant hired by Coke Classic Company is interested in estimating the difference between the proportions of female and male customers who favor Coke Classic over Pepsi Cola in Chicago. A random sample of 200 consumers from the market under investigation showed the following frequency distribution.

         Male   Female   Total
Coke     72     38       110
Pepsi    58     32       90
Total    130    70       200

55. Construct a 95% confidence interval for the difference between the proportions of male and female customers who prefer Coke Classic over Pepsi Cola.

ANSWER:

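Standard error of the difference: \(\sqrt{\hat{p}_1(1-\hat{p}_1)/n_1 + \hat{p}_2(1-\hat{p}_2)/n_2}\) = 0.0738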

Lower limit = -0.1337, and upper limit = 0.1555
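The standard error and limits for Question 55 can be reproduced from the frequency table. The Python sketch below assumes the usual large-sample interval for a difference between two proportions with unpooled standard errors; the function name is illustrative, and the result agrees with the stated limits up to rounding in the last digit.

```python
from math import sqrt
from statistics import NormalDist


def diff_proportions_ci(x1, n1, x2, n2, confidence=0.95):
    """Large-sample CI for p1 - p2 using unpooled standard errors."""
    p1, p2 = x1 / n1, x2 / n2
    std_error = sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    diff = p1 - p2
    return std_error, (diff - z * std_error, diff + z * std_error)


# Males: 72 of 130 prefer Coke; females: 38 of 70 prefer Coke.
std_error, (lower, upper) = diff_proportions_ci(72, 130, 38, 70)
print(f"{std_error:.4f} ({lower:.4f}, {upper:.4f})")
```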

56. Interpret the constructed confidence interval.

ANSWER:

We are 95% confident that the population difference between these proportions is between –13.37% and 15.55%.

QUESTIONS 57 THROUGH 60 ARE BASED ON THE FOLLOWING INFORMATION:

The percent defective for parts produced by a manufacturing process is targeted at 4%. The process is monitored daily by taking samples of sizes n = 160 units. Suppose that today’s sample contains 14 defectives.

57. Determine a 95% confidence interval for the proportion defective for the process today.

ANSWER: 0.0875 ± 0.0438 = (0.0437, 0.1313).

58. Based on your answer to Question 57, is it still reasonable to think the overall proportion defective produced by today’s process is actually the targeted 4%? Explain your reasoning.

ANSWER:No, since 4% falls outside of this range.

59. The confidence interval in Question 57 is based on the assumption of a large sample size. Is this sample size sufficiently large in this example? Explain how you arrived at your answer.

ANSWER:Yes. Because are all greater than 5.0.

60. How many units would have to be sampled to be 95% confident that you can estimate the fraction of defective parts within 2% (using the information from today’s sample)?

ANSWER:766.40 or 767.

QUESTIONS 61 AND 62 ARE BASED ON THE FOLLOWING INFORMATION:

Auditors of Independent Bank are interested in comparing the reported value of all 1,775 customer savings account balances with their own findings regarding the actual value of such assets. Rather than reviewing the records of each savings account at the bank, the auditors randomly selected a sample of 100 savings account balances from the frame. The sample mean and sample standard deviation were $505.75 and $360.95, respectively.

61. Construct a 90% confidence interval for the total value of all savings account balances within this bank. Assume that the population consists of all savings account balances in the frame.

ANSWER:

= ($791,326.70, $1,004,085.80)
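An interval for a population total is the interval for the mean scaled by the population size. The Python sketch below applies that idea to the bank data. Two assumptions are not stated in the text: the t-multiple 1.6604 (90% confidence, 99 degrees of freedom) is taken from a standard table, and no finite population correction is applied. The function name is illustrative.

```python
from math import sqrt


def total_ci(population_size, n, sample_mean, sample_sd, t_multiple):
    """CI for a population total: N times the CI for the mean."""
    point = population_size * sample_mean
    std_error = population_size * sample_sd / sqrt(n)
    margin = t_multiple * std_error
    return point - margin, point + margin


# 1,775 accounts, sample of 100, mean $505.75, standard deviation $360.95.
# Assumed t-multiple for 99 degrees of freedom at 90% confidence.
lower, upper = total_ci(1775, 100, 505.75, 360.95, 1.6604)
print(f"{lower:,.2f} {upper:,.2f}")
```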

62. Interpret the 90% confidence interval constructed in Question 61.

ANSWER: We are 90% confident that the total value of all 1,775 savings account balances within the bank is between $791,327 and $1,004,086.

QUESTIONS 63 AND 64 ARE BASED ON THE FOLLOWING INFORMATION:

A real estate agent has collected a random sample of 40 houses that were recently sold in Grand Rapids, Michigan. She is interested in comparing the appraised value and recent selling price (in thousands of dollars) of the houses in this particular market. The values of these two variables for each of the 40 randomly selected houses are shown below.

House   Value    Price      House   Value    Price
1       140.93   140.24     21      136.57   135.35
2       132.42   129.89     22      130.44   121.54
3       118.30   121.14     23      118.13   132.98
4       122.14   111.23     24      130.98   147.53
5       149.82   145.14     25      131.33   128.49
6       128.91   139.01     26      141.10   141.93
7       134.61   129.34     27      117.87   123.55
8       121.99   113.61     28      160.58   162.03
9       150.50   141.05     29      151.10   157.39
10      142.87   152.90     30      120.15   114.55
11      155.55   157.79     31      133.17   139.54
12      128.50   135.57     32      140.16   149.92
13      143.36   151.99     33      124.56   122.08
14      119.65   120.53     34      127.97   136.51
15      122.57   118.64     35      101.93   109.41
16      145.27   149.51     36      131.47   127.29
17      149.73   146.86     37      121.27   120.45
18      147.70   143.88     38      143.55   151.96
19      117.53   118.52     39      136.89   132.54
20      140.13   146.07     40      106.11   114.33

63. Using the sample data, generate a 95% confidence interval for the mean difference between the appraised values and selling prices of the houses sold in Grand Rapids.

ANSWER: We applied the paired-sample analysis to the differences D, where D = Difference = Appraised value – selling price.

Lower limit = -3.785, and Upper limit = 0.561 (in thousands of dollars)

64. Interpret the constructed confidence interval for the real estate agent.

ANSWER: We are 95% confident that the actual mean difference between the appraised values and selling prices of all the houses sold in Grand Rapids is between -$3,785 and $561.

QUESTIONS 65 THROUGH 69 REQUIRE THE USE OF EXCEL:

65. Compute has a t-distribution with 15 degrees of freedom.

ANSWER:0.03197

66. Compute has a t-distribution with 150 degrees of freedom.

ANSWER:0.02365

67. How do you explain the difference between the results obtained in Questions 65 and 66?

ANSWER:The smaller the degrees of freedom, the higher the variance of t, and so the larger the tail probabilities are.

68. Compute where Z is a standard normal random variable.

ANSWER:0.02275

69. Compare the results of Question 68 to the results obtained in Questions 65 and 66. How do you explain the difference in these probabilities?

ANSWER: First, the variance of t with a small number of degrees of freedom is larger than that of a t with a large number of degrees of freedom, which in turn is larger than that of Z. This explains why the tail probabilities in Questions 65, 66, and 68 decrease. Second, when the sample size is large, the degrees of freedom of t are large, and the t distribution and the standard normal distribution are practically indistinguishable. This explains why the probabilities in Questions 66 and 68 are close.

QUESTIONS 70 THROUGH 72 ARE BASED ON THE FOLLOWING INFORMATION:

Senior management of a consulting services firm is concerned about a growing decline in the firm’s weekly number of billable hours. The firm expects each professional employee to spend at least 40 hours per week on work. In an effort to understand this problem better, management would like to estimate the standard deviation of the number of hours their employees spend on work-related activities in a typical week. Rather than reviewing the records of all the firm’s full-time employees, the management randomly selected a sample of size 50 from the available frame. The sample mean and sample standard deviation were 48.5 and 7.5 hours, respectively.

70. Construct a 99% confidence interval for the standard deviation of the number of hours this firm’s employees spend on work-related activities in a typical week.

ANSWER:

Lower limit = 5.936

Upper limit = 10.057

71. Interpret the 99% confidence interval constructed in Question 70.

ANSWER:We are 99% confident that the population standard deviation is between 5.936 and 10.057.
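The limits in Questions 70 and 71 come from the chi-square distribution with n – 1 = 49 degrees of freedom. A minimal Python sketch follows. The two chi-square quantiles (27.249 and 78.231, cutting off 0.005 in each tail) are assumed from a standard table and are not given in the text; in Excel they would come from a function such as CHIINV. The function name is illustrative.

```python
from math import sqrt


def sd_ci(n, sample_sd, chi2_lower, chi2_upper):
    """CI for a population standard deviation.

    chi2_lower and chi2_upper are the lower- and upper-tail chi-square
    quantiles with n - 1 degrees of freedom.
    """
    sum_sq = (n - 1) * sample_sd ** 2
    # Dividing by the larger quantile gives the lower limit, and vice versa.
    return sqrt(sum_sq / chi2_upper), sqrt(sum_sq / chi2_lower)


# n = 50, s = 7.5 hours; assumed table quantiles for 49 degrees of freedom.
lower, upper = sd_ci(50, 7.5, 27.249, 78.231)
print(f"{lower:.3f} {upper:.3f}")
```

Because the chi-square distribution is right-skewed, the interval is not symmetric around the sample standard deviation of 7.5.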

72. Given the target range of 40 to 60 hours of work per week, should senior management be concerned about the number of hours their employees are currently devoting to work? Explain why or why not.

ANSWER: The best guess for the population mean is 48.5 hours per week, and about 95% of all employees are within 2 standard deviations of this, where we are almost sure (99% sure) that this standard deviation is between 5.9 and 10.1. But even if the standard deviation is only 5.9, then 48.5 ± 2 standard deviations will produce the range 36.7 to 60.3. Maybe management should be concerned.

QUESTIONS 73 THROUGH 75 REQUIRE THE USE OF EXCEL:

73. Compute has a t-distribution with 20 degrees of freedom.

ANSWER: Because of the symmetry of the t distribution, this left-hand tail probability can be calculated exactly like the corresponding right-hand tail probability. The answer is 0.17673.

74. Compute has a t-distribution with 2 degrees of freedom.

ANSWER: Because of the symmetry of the t distribution, this left-hand tail probability can be calculated exactly like the corresponding right-hand tail probability. The answer is 0.22119.

75. How do you explain the difference between the results obtained in Questions 73 and 74?

ANSWER:The larger the degrees of freedom, the lower the variance of t, so the smaller the tail probabilities are. This explains why the probability in Question 73 is smaller than that in Question 74.

QUESTIONS 76 AND 77 ARE BASED ON THE FOLLOWING INFORMATION:

A sample of 10 quality control managers with over 15 years of experience has an average salary of $54,000 and a standard deviation of $15,000.

76. You can be 95% confident that the mean salary for all quality control managers with at least 15 years of experience is between what two numbers? What assumptions are you making about the distribution of salaries?

ANSWER: n = 10, \(\bar{X}\) = 54000, s = 15000

Lower limit = 43,269.443, and upper limit = 64,730.557

We must assume that the population distribution of salaries is normal, especially since the sample size is so small.

77. What size sample would be needed to ensure that we could estimate the true mean salary of all quality control managers with more than 15 years of experience and have only 2 chances in 100 of being off by more than $800?

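ANSWER: \(\sigma\) = 15000, z-multiple = 2.326, B = 800

The approximate sample size required to produce a 98% confidence interval for the mean is given by \(n = (z\text{-multiple} \times \sigma / B)^2 = (2.326 \times 15000 / 800)^2 = 1902.05\), or 1,903.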

QUESTIONS 78 THROUGH 80 ARE BASED ON THE FOLLOWING INFORMATION:

Q-Mart is interested in comparing customers who use its own charge card with those who use other types of credit cards. Q-Mart would like to know if customers who use the Q-Mart card spend more money per visit, on average, than customers who use some other type of credit card. They have collected information on a random sample of 38 charge customers, and the data are presented below. On average, customers using a Q-Mart card spend $192.81 per visit and customers using another type of card spend $104.47 per visit. Use the information below to answer the following questions.

Summary statistics for two samples
                              Q-Mart     Other Charges
Sample sizes                  13         25
Sample means                  192.81     104.47
Sample standard deviations    115.243    71.139

Confidence interval for difference between means
Sample mean difference        88.34
Pooled standard deviation     88.323
Std error of difference       30.201

78. Using a t-value of 2.023, calculate a 95% confidence interval for the difference between the average Q-Mart charge and the average charge on another type of credit card. Would you conclude that there is a significant difference between the two types of customers in this case? Explain.

ANSWER: 88.34 ± 61.0966 = (27.2434, 149.4366). Since the range does not include 0, there appears to be a significant difference between the means of the two groups. In this case, it appears as though the Q-Mart charge card holders spend more money than those who use other types of charge cards.

79. What are the degrees of freedom for the t-statistic in this calculation? Explain how you would calculate the degrees of freedom in this case.

ANSWER: n1 + n2 – 2 = 13 + 25 – 2 = 36

80. What is the assumption in this case that allows you to use the pooled standard deviation for this confidence interval?

ANSWER: In order to use the pooled standard deviation for this confidence interval, we must assume that the two population standard deviations are equal (\(\sigma_1 = \sigma_2\)).

QUESTIONS 81 THROUGH 84 ARE BASED ON THE FOLLOWING INFORMATION:

The average annual household income levels of citizens of selected U.S. cities are shown below.

City    Household     City    Household     City    Household
Index   Income        Index   Income        Index   Income
1       $54,300       21      $53,500       41      $61,500
2       $61,800       22      $45,600       42      $53,000
3       $61,400       23      $70,100       43      $51,000
4       $50,800       24      $108,700      44      $55,600
5       $56,200       25      $46,400       45      $51,600
6       $48,300       26      $56,700       46      $57,200
7       $61,600       27      $59,100       47      $54,300
8       $63,200       28      $46,300       48      $51,500
9       $55,200       29      $52,900       49      $53,500
10      $58,000       30      $56,300       50      $61,800
11      $77,600       31      $67,300       51      $44,800
12      $47,600       32      $63,800       52      $57,400
13      $62,700       33      $70,600       53      $48,100
14      $46,200       34      $49,800       54      $52,700
15      $64,300       35      $51,300       55      $57,400
16      $56,000       36      $56,600       56      $65,500
17      $53,400       37      $49,600       57      $59,600
18      $56,800       38      $67,400       58      $62,000
19      $51,200       39      $53,700       59      $49,700
20      $59,000       40      $48,700       60      $54,400

81. Use Excel to obtain a simple random sample of size 10 from this frame.

ANSWER: I used StatPro’s Generate Random Samples to generate a sample of size 10, then used the VLOOKUP function to get the corresponding incomes. The following sample is obtained:

City Index   Income
50           61,800
14           46,200
4            50,800
56           65,500
48           51,500
49           53,500
8            63,200
11           77,600
38           67,400
52           57,400

82. Using the sample generated in Question 81, construct a 95% confidence interval for the mean average annual household income level of citizens in the selected U.S. cities. Assume that the population consists of all average annual household income levels in the given frame.

ANSWER:

Lower limit = 52,737.0439, upper limit = 66,242.9561
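The interval in Question 82 can be rebuilt directly from the ten sampled incomes. The Python sketch below uses the standard library for the mean and sample standard deviation. The t-multiple 2.262 (95% confidence, 9 degrees of freedom) is assumed from a standard table, and no finite population correction is applied; the function name is illustrative.

```python
from math import sqrt
from statistics import mean, stdev


def mean_ci(data, t_multiple):
    """t-based confidence interval for a population mean."""
    center = mean(data)
    margin = t_multiple * stdev(data) / sqrt(len(data))
    return center - margin, center + margin


# The ten incomes drawn in Question 81.
incomes = [61800, 46200, 50800, 65500, 51500,
           53500, 63200, 77600, 67400, 57400]
# Assumed t-multiple for 9 degrees of freedom at 95% confidence.
lower, upper = mean_ci(incomes, 2.262)
print(f"{lower:,.0f} {upper:,.0f}")
```

A different random sample from the same frame would give a different interval, which is why Question 84 asks about the long-run proportion of intervals that capture the population mean.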

83. Interpret the 95% confidence interval constructed in Question 82.

ANSWER:We are 95% confident that the average of all incomes is between $52,737 and $66,243.

84. Does the 95% confidence interval contain the actual population mean? If not, explain why not. What proportion of many similarly constructed confidence intervals should include the true population mean value?

ANSWER:This confidence interval easily captures the true population mean of $57,043. Approximately 95% of the confidence intervals constructed in this way should contain the true population mean.

QUESTIONS 85 THROUGH 91 ARE BASED ON THE FOLLOWING INFORMATION:

The personnel department of a large corporation wants to estimate the family dental expenses of its employees to determine the feasibility of providing a dental insurance plan. A random sample of 12 employees reveals the following family dental expenses (in dollars) for the year 2001.

115 370 250 93 540 225 177 425 318 182 275 228

85. Construct a 90% confidence interval estimate of the mean family dental expenses for all employees of this corporation.

ANSWER: Lower limit = $199.26, upper limit = $333.74

86. What assumption about the population distribution must be made to answer Question 85?

ANSWER:The population of dental expenses must be approximately normally distributed.

87. Interpret the 90% confidence interval constructed in question 85.

ANSWER:We are 90% confident that the mean family dental expenses for all employees of this corporation is between $199.26 and $333.74.

88. Suppose you used a 95% confidence interval in Question 85. What would be your answer to Question 85?

ANSWER:

89. Suppose the fourth value were 593 instead of 93. What would be your answer to Question 88? What effect does this change have on the confidence interval?

ANSWER:

The additional $500 in dental expenses, divided across the sample of 12, raises the mean by $41.67 and increases the standard deviation by nearly $18.20. The interval width increases over $23 in the process.

90. Construct a 90% confidence interval estimate for the standard deviation of family dental expenses for all employees of this corporation.

ANSWER:

91. Interpret the 90% confidence interval constructed in question 90.

ANSWER:We are 90% confident that the standard deviation for family dental expenses for all employees of this corporation is between 110.61 and 229.38.

QUESTIONS 92 AND 93 ARE BASED ON THE FOLLOWING INFORMATION:

An automobile dealer wants to estimate the proportion of customers who still own the cars they purchased six years ago. A random sample of 200 customers selected from the automobile dealer’s records indicates that 88 still own cars that were purchased six years earlier.

92. Construct a 95% confidence interval estimate of the population proportion of all customers who still own the cars they purchased six years ago.

ANSWER:

$\hat{p} \pm z_{0.025}\sqrt{\hat{p}(1-\hat{p})/n} = 0.44 \pm 0.0688$

Lower limit = 0.3712, and upper limit = 0.5088
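As a check on these limits, the large-sample interval for a proportion can be sketched as follows. The function name `proportion_interval` is illustrative; the z value comes from Python's standard-library `NormalDist`.

```python
import math
from statistics import NormalDist


def proportion_interval(successes, n, confidence):
    """Return (p_hat, half_width) for the large-sample proportion interval."""
    p_hat = successes / n
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat, half_width


# 88 of the 200 sampled customers still own the car bought six years ago.
p_hat, half_width = proportion_interval(88, 200, 0.95)
lower, upper = p_hat - half_width, p_hat + half_width
print(f"{p_hat:.2f} +/- {half_width:.4f}: {lower:.4f} to {upper:.4f}")
```

The normal approximation is reasonable here because both 88 and 112 (the counts of owners and non-owners) are well above 5.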

93. How can the result in Question 92 be used by the automobile dealer to study satisfaction with cars purchased at the dealership?

ANSWER: The dealer can infer that the proportion of all customers who still own the cars they purchased at the dealership 6 years earlier is somewhere between 0.3712 and 0.5088 with a 95% level of confidence.

TRUE / FALSE QUESTIONS

94. The degrees of freedom for the t and chi-square distributions is a numerical parameter of the distribution that defines the precise shape of the distribution.

ANSWER: T

95. When all possible samples of size n are drawn from any population, then the sampling distribution of the sample mean is approximately normal provided that n is reasonably large.

ANSWER: T

96. The mean of the sampling distribution of the sample proportion $\hat{p}$, when the sample size n = 100 and the population proportion p = 0.58, is 58.0.

ANSWER: F

97. The standard error of the sampling distribution of the sample proportion $\hat{p}$, when the sample size n = 50 and the population proportion p = 0.25, is 0.00375.

ANSWER: F
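The reason statement 97 is false can be seen from a short calculation: 0.00375 is the variance p(1 - p)/n, not its square root. A minimal sketch, with `proportion_standard_error` as an illustrative name:

```python
import math


def proportion_standard_error(p, n):
    """Standard error of the sample proportion: sqrt(p * (1 - p) / n)."""
    return math.sqrt(p * (1 - p) / n)


p, n = 0.25, 50
variance = p * (1 - p) / n  # the figure quoted in the statement
std_error = proportion_standard_error(p, n)
print(f"variance = {variance:.5f}, standard error = {std_error:.4f}")
```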

98. In developing a confidence interval for the population standard deviation $\sigma$, we make use of the fact that the sampling distribution of the sample standard deviation S is not the normal distribution or the t distribution, but rather a right-skewed distribution called the chi-square distribution, which (for this procedure) has n – 1 degrees of freedom.

ANSWER: T

99. As a general rule, the normal distribution is used to approximate the sampling distribution of the sample proportion only if the sample size n is greater than 30.

ANSWER: F

100. In general, the paired-sample procedure is appropriate when the samples are naturally paired in some way and there is a reasonably large positive correlation between the pairs. In this case, the paired-sample procedure makes more efficient use of the data and generally results in narrower confidence intervals.

ANSWER: T

101. If the standard error of the sampling distribution of the sample proportion is 0.0324 for samples of size 200, then the population proportion must be 0.30.

ANSWER: F
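Statement 101 is false because the standard error depends on p only through p(1 - p), so p = 0.30 and p = 0.70 give the same value. A brief sketch of that check (the dictionary name `errors` is illustrative):

```python
import math

n = 200

# Standard error sqrt(p * (1 - p) / n) for the two candidate proportions.
errors = {p: math.sqrt(p * (1 - p) / n) for p in (0.30, 0.70)}

for p, se in errors.items():
    print(f"p = {p:.2f}: standard error = {se:.4f}")
```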

102. If a random sample of size 250 is taken from a population, where it is known that the population proportion p = 0.4, then the mean of the sampling distribution of the sample proportion is 0.60.

ANSWER: F

103. If two random samples of size 40 each are selected independently from two populations whose variances are 35 and 45, then the standard error of the sampling distribution of the sample mean difference, $\bar{X}_1 - \bar{X}_2$, equals 1.4142.

ANSWER: T
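The value in statement 103 follows from adding the two variances of the sample means. A minimal sketch, with `se_mean_difference` as an illustrative name:

```python
import math


def se_mean_difference(var1, n1, var2, n2):
    """Standard error of X1-bar minus X2-bar for independent samples."""
    return math.sqrt(var1 / n1 + var2 / n2)


se = se_mean_difference(35, 40, 45, 40)
print(f"standard error = {se:.4f}")
```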

104. If two random samples of sizes 30 and 35 are selected independently from two populations whose means are 85 and 90, then the mean of the sampling distribution of the sample mean difference, $\bar{X}_1 - \bar{X}_2$, equals 5.

ANSWER: F

105. A confidence interval is an interval estimate for which there is a specified degree of certainty that the actual value of the population parameter will fall within the interval.

ANSWER: T

106. The 95% confidence interval for the population mean $\mu$, given that the sample size n = 49 and the population standard deviation $\sigma$ = 7, is $\bar{X} \pm 1.96$.

ANSWER: T

107. In order to construct a confidence interval estimate of the population mean $\mu$, the value of $\sigma$ must be given.

ANSWER: F

108. The interval estimate 18.5 ± 2.5 was developed for a population mean when the sample standard deviation S was 7.5. Had S equaled 15, the interval estimate would be 37 ± 5.0.

ANSWER: F

109. We can form a confidence interval for the population total T by finding a confidence interval for the population mean in the usual way, and then multiplying each end point of the confidence interval by the population size N.

ANSWER: T

110. A 90% confidence interval estimate for a population mean $\mu$ is determined to be 72.8 to 79.6. If the confidence level is reduced to 80%, the confidence interval for $\mu$ becomes narrower.

ANSWER: T

111. In general, increasing the confidence level will narrow the confidence interval, and decreasing the confidence level widens the interval.

ANSWER: F

112. The upper limit of the 90% confidence interval for the population proportion p, given that n = 100 and $\hat{p}$ = 0.20, is 0.2658.

ANSWER: T

113. The lower limit of the 95% confidence interval for the population proportion p, given that n = 300 and $\hat{p}$ = 0.10, is 0.1339.

ANSWER: F

114. The t-distribution and the standard normal distribution are practically indistinguishable as the degrees of freedom increase.

ANSWER: T

115. In determining the sample size n for estimating the population proportion p, a conservative value of n can be obtained by using 0.50 as an estimate of p.

ANSWER: T
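Why 0.50 is the conservative choice in statement 115 can be illustrated numerically: the product p(1 - p) is largest at p = 0.50, so that estimate yields the largest required n. In the sketch below the 95% confidence level and the 0.05 margin of error are hypothetical planning values, not taken from the question, and `required_sample_size` is an illustrative name.

```python
import math
from statistics import NormalDist


def required_sample_size(p, margin, confidence):
    """n = z^2 * p * (1 - p) / margin^2, rounded up to a whole observation."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return math.ceil(z ** 2 * p * (1 - p) / margin ** 2)


# Hypothetical planning values (assumptions): 95% confidence, margin 0.05.
sizes = {
    p: required_sample_size(p, margin=0.05, confidence=0.95)
    for p in (0.1, 0.3, 0.5, 0.7, 0.9)
}

for p, n in sizes.items():
    print(f"estimated p = {p:.1f}: n = {n}")
```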

116. In developing a confidence interval for the difference between two population means using two independent samples, we use the pooled estimate in estimating the standard error of the sampling distribution of the sample mean difference if the populations are normal with equal variances.

ANSWER: T
