SLCC MATH 1040 FINAL EXAM FALL 2016 NAME: Form A INSTRUCTOR: · SLCC MATH 1040 FINAL EXAM FALL 2016 NAME: _____ Form A INSTRUCTOR: ... finds that 27 out of the 300 in his sample are

SLCC MATH 1040 FINAL EXAM FALL 2016 NAME: _________________________

Form A INSTRUCTOR: ________________________

TEST INSTRUCTIONS:

This exam consists of both "multiple choice" and "free response" questions. No partial credit is

available on "multiple choice" problems. For "free response" questions, you must support your

answer with relevant work. Work for these problems must be shown on the exam. Work

carefully and neatly, appropriately justifying your solutions. Partial credit may be awarded for

relevant work on "free response" problems.

You may not use your own notes, books, headphones, phones, nor any device that connects to

the internet. Any calculator may be used and formulas are provided. You have 120 minutes to

complete this exam.

A facilities manager at SLCC believes that his supply of light bulbs has too many defects. The

manager’s null hypothesis is that the supply of light bulbs has a defect rate of 0.07p (the

defect rate stated by the light bulb manufacturer). The manager takes a random sample and

finds that 27 out of the 300 in his sample are defective. Suppose he does a hypothesis test with a

significance level of 0.05 . Symbolically, his null and alternative hypotheses are:

0 : 0.07H p and 1 : 0.07H p .

1. (4 points) Choose the statement that best describes the significance level in the context of

this hypothesis test.

a. The significance level of 0.05 is the defect rate we believe is the true defect rate.

b. The significance level of 0.05 is the test statistic that we will use to compare the observed

outcome to the null hypothesis.

c. The significance level of 0.05 is the probability of concluding that the defect rate is equal

to 0.07 when in fact the defect rate is greater than 0.07.

d. The significance level of 0.05 is the probability of concluding that the defect rate is

higher than 0.07 when in fact the defect rate is equal to 0.07.

2. (4 points) For the hypothesis test, the facilities manager calculates a p-value of 0.087. Choose

the correct interpretation for the p-value.

a. The p-value tells us that the probability of concluding that the defect rate is equal to 0.07,

when in fact the rate is greater than 0.07, is approximately 0.087.

b. The p-value tells us that if the defect rate is 0.07, then the probability that the manager

will have 27 or more defective light bulbs out of 300 is approximately 0.087. At the

significance level of 0.05, this would not be an unusual outcome.

c. The p-value tells us that the true population rate of defective light bulbs is

approximately 0.087.

d. None of these.

MATH 1040 FINAL EXAM FALL 2016 A page 2

3. (4 points) For what kinds of variables would it be appropriate to create a histogram?

a. Only for a single quantitative variable

b. Only for a single categorical variable

c. Either for two quantitative or for two categorical variables together

d. It varies according to the situation

4. (4 points) The monthly salaries of the three people working in a small firm are $3,500,

$4,000, and $4,500. Suppose the firm makes a profit and everyone gets a $100 raise. How, if

at all, would the mean and standard deviation of the three salaries change?

a. Both would stay the same

b. The mean would increase and the standard deviation would stay the same

c. The mean would stay the same and the standard deviation would increase

d. The mean would stay the same and the standard deviation would decrease

e. The mean would increase and the standard deviation would decrease

f. Both would increase

g. Cannot be answered without doing calculations

5. (4 points) Given is a relative frequency histogram for the price of homes from a simple

random sample of homes sold during 2010. Based on this histogram, is a large sample

30 necessary to conduct a hypothesis test about the mean sale price? Why?

a. No; the data appear to be normally distributed

b. Yes; the data do not appear to be normally distributed, but rather skewed left

c. Yes; the data do not appear to be normally distributed, but rather skewed right

d. Yes; the data do not appear to be normally distributed but bimodal


6. (4 points) Suppose a random sample of 50 college students is asked if they regularly eat

breakfast. A 95% confidence interval for the proportion of all students that regularly eat

breakfast is found to be 0.70 to 0.90. If a 99% confidence interval was calculated instead,

how would it differ from the 95% confidence interval?

a. The 99% confidence interval would have the same width as the 95% confidence interval.

b. The 99% confidence interval would be wider.

c. The 99% confidence interval would be narrower.

d. There is no way to tell if the 99% confidence interval would be wider or narrower than

the 95% confidence interval.

7. (4 points) A recent Gallup poll showed the president’s approval rating at 60%. Some friends

use this information (along with the sample size from the poll) and find confidence intervals

for the proportion of all adult Americans that approve of the president’s performance. For

the following confidence intervals, which one could have been done correctly? (The others

were definitely done incorrectly.)

a. (0.57, 0.63)

b. (0.60, 0.66)

c. (0.62, 0.69)

d. (0.47, 0.53)

8. (4 points) The following linear regression model was created to show the association

between the number of massages received per month and self-predicted stress level:

ˆ 10 0.02y x where x is the number of massages per month and y is the predicted stress

level. The coefficient of determination for the model is 2 0.066R or 6.6%. Choose the true

statement regarding this model.

a. The model shows that getting even one massage a month will decrease your stress level

by 0.02 on average.

b. The model shows that there is a negative association between stress level and number of

monthly massages, but the relatively small coefficient of determination 2R suggests

that very little of the variation is explained by the model, so the model is probably not

very good at predicting stress level.

c. The model shows that on average a person getting no massages will have a stress level

of 10.

d. All of these are true statements.


9. (4 points) A General Social Survey (GSS) asks many questions of American adults regularly.

Identify which of the following questions elicit an answer that is coded as a quantitative

variable and which as a categorical variable. (Circle the correct response for each.)

a. Are you currently employed? QUANTITATIVE CATEGORICAL

b. What is your annual income? QUANTITATIVE CATEGORICAL

c. Do you have your driver’s license? QUANTITATIVE CATEGORICAL

d. What is your age? QUANTITATIVE CATEGORICAL

10. (6 points) The medal counts for the top 10 countries at the Rio Olympics are given below,

ranked by total medals (according to rio2016.com 10/17/2016).

Country

Total

Medals Gold Silver Bronze

1 United States 121 46 37 38

2 China 70 26 18 26

3 England 67 27 23 17

4 Russia 56 19 18 19

5 Germany 42 17 10 15

6 France 42 10 18 14

7 Japan 41 12 8 21

8 Australia 29 8 11 10

9 Italy 28 8 12 8

10 Canada 22 4 3 15

a. Report the 5-number summary for Total Medals won by the top 10 countries at the Rio

Olympics.

b. Create a boxplot for Total Medals won by the top 10 countries at the Rio Olympics. Be

sure to scale it appropriately.


11. (6 points) USA Today (January 24, 2012) reported that ownership of tablet computers and e-

readers is soaring. Suppose you want to estimate the proportion of students at SLCC who

own at least one tablet or e-reader. What sample size would you use in order to estimate this

proportion at the 95% confidence level, with a margin of error of 3%?

12. (6 points) MATCHING: Choose which statistical procedure (from the second column)

would most likely be used to answer each of the three research questions in the first

column. Assume all requirements have been met for using the procedures.

I. Does the proportion of females

participating in major college sports

(e.g. basketball), differ from the

proportion participating in minor

sports (e.g. swimming)?

II. Is the number of females participating

in major college sports (e.g. basketball)

associated with the number of

scholarships available to female

athletes?

III. What is the average number of females

participating in major college sports at

SLCC?

a. Test the difference between two

proportions

b. Construct a confidence interval

estimate for the proportion.

c. Test one mean against a

hypothesized value.

d. Test the difference in means

between two paired or dependent

samples.

e. Construct a confidence interval

estimate for the mean.

f. Test if the correlation coefficient is

significant.


13. (4 points) The data given in the table represent the amount of pressure (psi) exerted by a

stamping machine ( x ), and the amount of scrap brass shavings (in pounds) that are

collected from a machine each hour ( y ). A scatterplot of the data confirms that there is a

linear association and the correlation coefficient is found to be significant.

x y

2.00 2.30

7.80 15.14

14.51 28.65

2.80 4.15

4.01 6.35

6.21 10.52

11.84 24.05

5.11 8.75

11.67 22.22

8.70 17.02

a. Find the equation of the regression line. Round values to the nearest thousandth.

b. What is the best predicted amount of scrap brass shavings for a stamping machine

pressure of 6.25? (Report to the nearest hundredth of a pound.)

14. (6 points) Suppose the amount of money spent by SLCC students each semester on

textbooks is normally distributed with a mean of $195 and a standard deviation of $20. If

you were to randomly sample 100 SLCC students from this population, what is the

probability that the sample mean x amount spent will be between $193 and $197?

Support your answer.


15. (6 points) A survey of 35,353 people asked questions about their happiness and health. One

would think that health plays a role in one’s happiness. You are interested in determining if

there is evidence that healthier people tend also to be happier, treating level of health as the

explanatory variable. A conditional distribution of happiness by health is given below.

Level of Health L

evel

of

Hap

pin

ess

Poor Fair Good Excellent

Not too happy 0.35 0.21 0.10 0.07

Pretty happy 0.48 0.58 0.61 0.47

Very happy 0.17 0.21 0.29 0.46

Totals 1 1 1 1

a. Draw a bar graph of the conditional distribution. Draw three bars, for each level of

happiness (the first bar for people who are not too happy, the second bar for people who

are pretty happy, and the third bar for people who are very happy). The horizontal axis

will represent health and the vertical axis will represent the relative frequency.

b. Does your bar graph indicate that there is an association between health and happiness?

(circle one)

i. No, the change in the percent that are very happy cancels out the change in the

percent who are not too happy

ii. Yes; the percent that are not happy decreases when health improves.

iii. No; the percent that are very happy decreases when health improves.

iv. Yes; the percent that are pretty happy stays constant when health improves.

v. No; the percent that are not happy increases when health improves.


16. (8 points) In the initial test of the Salk vaccine for polio, 400,000 children were selected and

divided into two groups of 200,000. One group was vaccinated with the Salk vaccine while

the second group was vaccinated with a placebo. Of those vaccinated with the Salk vaccine,

33 later developed polio. Of those receiving the placebo, 115 later developed polio.

A hypothesis test is performed at the 0.01 significance level to test the hypothesis that

the Salk vaccine is effective in lowering the polio rate. The test yields a standardized test

statistic of 0 6.742z and a p-value of nearly zero.

a. What are the hypotheses for this test?

b. What decision should be made about the null hypothesis? (circle one)

REJECT 0H or FAIL TO REJECT 0H

c. Does the test provide evidence to support the claim that the Salk vaccine is effective

in lowering the polio rate? Support your decision by explaining the statistical

evidence.

17. (4 points) A set of 8 observations yields a linear correlation coefficient of 0.335r . Does

this indicate a significant linear relationship between the variables? Respond with a

complete sentence, include the value of the correlation coefficient and the appropriate

critical value in your response.


18. (6 points) Clarinex-D is a medication whose purpose is to reduce the symptoms associated

with a variety of allergies. In clinical trials of Clarinex-D, 5% of the patients in the study

experienced insomnia as a side effect. We will randomly select 240 users of Clarinex-D.

a. Demonstrate that this experiment meets all the criteria for a Binomial Experiment.

b. How many of the 240 users of Clarinex-D would we expect to experience insomnia as a

side effect?

c. Would it be unusual to observe exactly 20 patients experiencing insomnia as a side effect

in 240 trials of the probability experiment? Why? Include the appropriate probability

computation in your response.

Statistics Formulas and Tables Descriptive Statistics Probability

� = ∑ �� mean

� = ∑��∑ � approximate mean from grouped data

� = �∑�� sample standard deviation

� = �∑�� ∙��∑ � � approx. std. dev. from grouped data

Sample variance = ��

Interquartile Range: �� = �� − ��

Lower fence = �� − 1.5��

Upper fence = �� + 1.5��

General Addition Rule

�� = �� + �� − �� !"#

multiplication rule for independent events

�� !"# = �� ∙ ��

multiplication rule for dependent events

�� !"# = �� ∙ �� |�

complement rule ��% = 1 − ��

�&� = �!�� &! Permutations (no elements alike)

�!�(!��!⋯�*! Permutations ("� alike, ⋯)

+&� = �!�� &!&! Combinations

Probability Distributions Normal Distribution and Sampling Distributions

mean (expected value) of a discrete random variable

,� = ∑-� ∙ ��.

standard deviation of a discrete random variable

/� = 0∑�� − ,� ∙ ��

�� = +�� ∙ 1� ∙ �1 − 1� � Binomial probability

,� = " ∙ 1 mean for a Binomial distribution

/� = 0" ∙ 1 ∙ �1 − 1 std. dev. for a Binomial distribution

2 = � 34 standard normal score

mean and std. dev. of the sampling distribution of � ,� = , /� = 4

√� Standard Error

mean and std. dev. of the sampling distribution of 1

,78 = 1 /78 = �7�� 7�

Estimating a Population Parameter Hypothesis Testing

Proportion:

�� ( )

2

1p pp z

nα

−±

( )

2

2

2

0.25zn

E

α= sample size, 1 unknown

( ) ( )

2

2

2

ˆ ˆ1z p pn

E

α −= sample size, 1 known

Mean:

2

sx t

nα±

2

2z s

nE

α =

sample size

Difference of Two Proportions (Independent Samples):

� �( )� �( ) � �( )1 1 2 2

21 2

1 2

1 1p p p pp p z

n nα

− −− ± +

proportion – one population

( )0

0

0 0

ˆ

1

p pz

p p

n

−=

−

mean – one population 0

0

xt

s n

µ−=

two population proportions (Independent Samples)

� �( ) ( )

� �( )

1 21 2

0

1 2

1 11

p p p pz

p pn n

− − −=

− +

where � 1 2

1 2

x xp

n n

+=

+

two population proportions (Dependent Samples)

12 21

0

12 21

1f fz

f f

− −=

+

Linear Correlation and Regression

Correlation Coefficient � = ∑9:�;:<=: >?@�;@<

=@ A� �

B8 = CD + C�� estimated eqn. of linear regression line

�� = �� coefficient of determination

Residual = B − B8

Documents

SLCC MATH 1040 FINAL EXAM FALL 2016 NAME: Form A INSTRUCTOR: · SLCC MATH 1040 FINAL EXAM FALL 2016 NAME: _____ Form A INSTRUCTOR: ... finds that 27 out of the 300 in his sample are