13
Testing Normality Many test procedures that we have developed rely on the assumption of Normality. There are many test for Normality of data. One uses the normal to provide cell probabilities for the chi- square goodness-of-fit test. A “better” test is based on the Normal Probability Plot

Testing Normality

  • Upload
    lester

  • View
    46

  • Download
    4

Embed Size (px)

DESCRIPTION

Testing Normality. Many test procedures that we have developed rely on the assumption of Normality. There are many test for Normality of data. One uses the normal to provide cell probabilities for the chi-square goodness-of-fit test. A “better” test is based on the Normal Probability Plot. - PowerPoint PPT Presentation

Citation preview

Page 1: Testing Normality

Testing NormalityMany test procedures that we have developed rely on the assumption of Normality. There are many test for Normality of data. One uses the normal to provide cell probabilities for the chi-square goodness-of-fit test. A “better” test is based on the Normal Probability Plot

Page 2: Testing Normality

Testing Normality Cont’d

Recall: The NPP should be approx linear for normal data, and the correlation coefficient is a measure of linearity.

If r is much less than one, we would conclude that the data doesn’t come from a Normal distribution.

Page 3: Testing Normality

Ryan-Joiner Test1. Order the data x(1),…,x(n)

2. Compute the normal percentiles

3. Compute the correlation coefficient, r, for the (yi,x(i)) pairs

25.

375.

n

iyi

Page 4: Testing Normality

Ryan-Joiner Test

4. State the Null and Alternative Hypotheses

Ho: The population is normal

Ha: The population is not normal

5. Specify alpha and obtain critical values from Table A.14. Compare r to this value

Page 5: Testing Normality

Example

Consider the following data. Use the Ryan-Joiner test to test the assumption of normality at =.10

1.15 1.4 1.34 1.29 1.36 1.26 1.22 1.41.29 1.14 1.32 1.34 1.26 1.36 1.36 1.31.28 1.45 1.29 1.28 1.38 1.55 1.46 1.32

Page 6: Testing Normality

Testing Homogeneity of Populations

*We wish to compare I multinomial populations, each with J categories. *

Take ni samples from the ith population

Let Nij be the number of observations from the ith population in the jth category. Hence, j Nij = ni

Place the data in a I x J table

Page 7: Testing Normality

TableCategory

1 2 …. J Total1 n11 n12 …. n1J n12 n21 n22 n2J n2

Pop. . . . …. . .. . . …. . .. . . …. . .I nI1 nI2 …. nIJ nI

Total n.1 n.2 …. n.J n

Page 8: Testing Normality

Corresponding to each cell, there is a cell probability pij=probability and outcome for the ith population falls into the jth category, where j pij = 1

Category1 2 …. J

1 p11 p12 …. p1J2 p21 p22 p2J

Pop. . . . …. .. . . …. .. . . …. .I pI1 pI2 …. pIJ

Page 9: Testing Normality

TestHo: p1j = p2j = … = pIj , j = 1,…,J

Ha: Some pij pi’j

Under Ho, the common cell probability pj is estimated by

n

np jj

ˆ

Page 10: Testing Normality

Test Cont’dThe estimated expected cell frequency is

n

nnpnE jijiij

ˆˆ

The test statistic is

rows columns ij

ijij

E

EnX

ˆ

ˆ 2

2

Rejection Region: X2 > 2 with d.f = (I-1)(J-1)

Page 11: Testing Normality

Testing for Association

* Individuals are categorized by two categorical variables. We wish to determine whether these variables are associated. *

Row Categories – A1,…,AI

Column Categories – B1,…,BJ

Page 12: Testing Normality

n = Total number of observations

nij = the number of individuals classified as Ai and Bj

Hence, nij = n

Ho: P(AiBj) = P(Ai)P(Bj) for all i,j

Ha: Some P(AiBj) P(Ai)P(Bj)

Page 13: Testing Normality

Expected Frequency:

n

n x nˆ j iijE

Test Statistic:

rows columns ij

ijij

E

EnX

ˆ

ˆ 2

2

Rejection Region: X2 > 2 with d.f = (I-1)(J-1)