Part 13: Statistical Tests – Part 1 13-1/37 Statistics and Data Analysis Professor William Greene...

Part 13: Statistical Tests – Part 113-1/37

Statistics and Data Analysis

Professor William Greene

Stern School of Business

IOMS Department

Department of Economics

Statistics and Data Analysis

Part 13 – Statistical Tests: 1

Statistical Testing

Methodology: The scientific method and statistical testing

Classical hypothesis testing Setting up the test Test of a hypothesis about a mean Other kinds of statistical tests

Mechanics of hypothesis testing Applications

Classical Hypothesis Testing

The scientific method applied to statistical hypothesis testing Hypothesis: The world works according to my hypothesis Testing or supporting the hypothesis

Data gathering Rejection of the hypothesis if the data are inconsistent with it Retention and exposure to further investigation if the data are

consistent with the hypothesis Failure to reject is not equivalent to acceptance.

http://query.nytimes.com/gst/fullpage.html?res=9C00E4DF113BF935A3575BC0A9649C8B63

Methodology

The standard approach would be to hypothesize that there is no link and seek data (evidence) that are (is) inconsistent with the hypothesis.

That is the way the NCI usually carries out an investigation.

This one was different.

Errors in Testing

Correct Decision

Type II Error

Type I ErrorCorrect

Decision

Hypothesis is Hypothesis is True False

I Do Not Reject the Hypothesis

I Reject the Hypothesis

A Legal Analogy: The Null Hypothesis is INNOCENT

Correct DecisionType II Error

Guilty defendant goes free

Type I ErrorInnocent defendant is

convictedCorrect Decision

Null Hypothesis Alternative Hypothesis Not Guilty Guilty

Finding: Verdict Not Guilty

Finding: VerdictGuilty

The errors are not symmetric. Most thinkers consider Type I errors to be more serious than Type II in this setting.

(Worldwide) Standard Methodology

“Statistical” testing Methodology

Formulate the “null” hypothesis Decide (in advance) what kinds of “evidence”

(data) will lead to rejection of the null hypothesis. I.e., define the rejection region)

Gather the data Carry out the test.

Formulating the Hypothesis

Stating the hypothesis: A belief about the “state of nature” A parameter takes a particular value There is a relationship between variables And so on…

The null vs. the alternative By induction: If we wish to find evidence of

something, first assume it is not true. Look for evidence that leads to rejection of

the assumed hypothesis.

Terms of Art

Null Hypothesis: The proposed state of nature

Alternative hypothesis: The state of nature that is believed to prevail if the null is rejected.

Example: Credit Rule

Investigation: I believe that Fair Isaacs relies on home ownership in deciding whether to “accept” an application. Null hypothesis: There is no relationship Alternative hypothesis: They do use

homeownership data. What decision rule should I use?

Some Evidence

= Homeowners

48% of cardholders are homeowners.

38% of nonholders are homeowners.

The Rejection Region

What is the “rejection region?” Data (evidence) that are

inconsistent with my hypothesis Evidence is divided into two types:

Data that are inconsistent with my hypothesis (the rejection region)

Everything else

Application: Breast Cancer On Long Island

Null Hypothesis: There is no link between the high cancer rate on LI and the use of pesticides and toxic chemicals in dry cleaning, farming, etc.

Neyman-Pearson Procedure Examine the physical and statistical evidence If there is convincing covariation, reject the null hypothesis What is the rejection region?

The NCI study: Working hypothesis: There is a link: We will find the

evidence. How do you reject this hypothesis?

Formulating the Testing Procedure

Usually: What kind of data will lead me to reject the hypothesis?

Thinking scientifically: If you want to “prove” a hypothesis is true (or you want to support one) begin by assuming your hypothesis is not true, and look for plausible evidence that contradicts the assumption.

Hypothesis Testing Strategy

Formulate the null hypothesis Gather the evidence Question: If my null hypothesis were

true, how likely is it that I would have observed this evidence? Very unlikely: Reject the hypothesis Not unlikely: Do not reject. (Retain the

hypothesis for continued scrutiny.)

Hypothesis About a Mean

I believe that the average income of individuals in a population is (about) $30,000. H0 : μ = $30,000 (The null)

H1: μ ≠ $30,000 (The alternative)

I will draw the sample and examine the data. The rejection region is data for which the

sample mean is far from $30,000. How far is far????? That is the test.

Application

The mean of a population takes a specific value:

Null hypothesis: H0: μ = $30,000H1: μ ≠ $30,000

Test: Sample mean close to hypothesized population mean?

Rejection region: Sample means that are far from $30,000

Deciding on the Rejection Region

If the sample mean is far from $30,000, I will reject the hypothesis. I choose, the region, for example, < 29,500 or > 30,500

The probability that the mean falls in the rejection region even though the hypothesis is true (should not be rejected) is the probability of a type 1 error. Even if the true mean really is $30,000, the sample mean could fall in the rejection region.

29,500 30,000 30,500

Rejection Rejection

Reduce the Probability of a Type I Error by Making the Rejection Region Smaller

28,500 29,500 30,000 30,500 31,500

Reduce the probability of a type I error by moving the boundaries of the rejection region farther out.

You can make a type I error impossible by making the rejection region very far from the null. Then you would never make a type I error because you would never reject H0.

Probability outside this interval is large.

Probability outside this interval is much smaller.

Setting the α Level

“α” is the probability of a type I error Choose the width of the interval by choosing the

desired probability of a type I error, based on the t or normal distribution. (How confident do I want to be?)

Multiply the corresponding z or t value by the standard error of the mean.

Testing Procedure The rejection region will be the range of

values greater than μ0 + zσ/√N orless than μ0 - zσ/√N

Use z = 1.96 for 1 - α = 95% Use z = 2.576 for 1 - α = 99% Use the t table if small sample and

sampling from a normal distribution.

Deciding on the Rejection Region

If the sample mean is far from $30,000, reject the hypothesis.

Choose, the region, say,

Rejection Rejection

$30,000 1.96N

$30,000 1.96

I am 95% certain that I will not commit a type I error (reject the hypothesis in error). (I cannot be 100% certain.)

The Testing Procedure (For a Mean)

Reject if x > 1.96N

or x - > 1.96N

x - or > 1.96

/ N or z > 1.96

Reject if x < -1.96N

or x - < -1.96N

x - or < -1.96

/ N or z < -1.96

x - 30,000Reject if 1.96

The Test Procedure

Choosing z = 1.96 makes the probability of a Type I error 0.05.

Choosing z = 2.576 would reduce the probability of a Type I error to 0.01.

What to use for σ?

The known value if there is one The sample estimate if random sampling.

Application

0H : = $30,000

N = 13,444 (Huge sample. t is the same as normal)

x = $30,144.3 (Is this far from $30,000?)

s = $15035.5

$30114.3 - $30,000t = = 0.881

$15035.5/ 13,444

The rejection region is |t| > 1.96.

Do not reject the hypothesis.

If you choose 1-Sample Z… to use the normal distribution, Minitab assumes you know σ and asks for the value.

Specify the Hypothesis Test

Minitab assumes 95%. You can choose some other value.

The Test Results (Are In)

2NNi 1 ii 1 i

x xx sMean x , StDev=s= , SE Mean=

N N 1 N

sx 1.96

An Intuitive Approach

Using the confidence interval The confidence interval gives the range of plausible values.

If this range does not include the null hypothesis, reject the hypothesis.If the confidence interval contains the hypothesized value, retain the hypothesis.

Includes $30,000.

The P value

The “P value” is the probability that you would have observed the evidence that you did observe if the null hypothesis were true.

If the P value is less than the Type I error probability (usually 0.05) you have chosen, you will reject the hypothesis.

Insignificant Results

The test results are “significant” if the P value is less than α.

These test results are “insignificant” at the 5% level.

This is 1 – α.

Application: One sided test of a mean

Hypothesis: The mean is greater than some value Business application: Does a new machine that we

might buy produce grommets faster than the one we have now? H0: μ ≤ M (where M is the mean for the old machine.)

H1: μ > M Rejection region: Mean of a sample of production rates

from the new machine is far above M. Buy the new machine,

Academic Application: Do SAT Test Courses work? Null hypothesis: The mean grade on the do-overs is less

than the mean on the original test. Reject means the do-over appears to be better.

Summary

Methodological issues: Science and hypothesis tests

Standard methods: Formulating a testing procedure Determining the “rejection region”

Many different kinds of applications

Part 13: Statistical Tests – Part 1 13-1/37 Statistics and Data Analysis Professor William Greene...

Documents

Part 17: Multiple Regression – Part 1 17-1/26 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department

Research strategy for RNOH-IOMS David Marsh Professor of Clinical Orthopaedics Institute of Orthopaedics and Musculoskeletal Science, University College

Part 20: Aspects of Regression 20-1/26 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics

Fernando-State Department Documents Part III

Part 16: Regression Model Specification 16-1/25 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department

Department Project Server Overview Part 1

Part 16: Linear Regression 16-1/46 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics

FOSTER PROVOST – external packetweb-docs.stern.nyu.edu/ioms/IOMS CVs/Pinedo_CV_2018.pdf · “Scheduling in Flexible Assembly Systems”, Chapter 18 in Reference Book on Intelligent

Part 12: Statistical Inference 12-1/45 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics

Statistical Inference and Regression Analysis: GB.3302.30 Professor William Greene Stern School of Business IOMS Department Department of Economics

Tactical Operations Training · PDF fileIOMS: CSOT: RPL 101 Rappelling Equipment SFA IOMS: CSOT: RPL 102 Rappelling Part I SFA ... Urban Operations Part II SFA IOMS: CSOT: OPS 117

Department Activity(Part II)

Anesthesia department event Part I

Part 0: Introduction 0-1/18 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics

Part 15: Hypothesis Tests 15-1/18 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics

Part 6: Multiple Regression 6-1/35 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics

Anesthesia department event part ii

Part 11: Random Walks and Approximations 11-1/28 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department

Part 21: Statistical Inference 21-1/43 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics

Part 4: Expected Value 4-1/25 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics