Educational Statistics

Preview:

DESCRIPTION

Educational Statistics. GURU K MOORTHY. Outline. Introduction Frequency Distribution Measures of Central Tendency Measures of Dispersion. Outline-Continued. Other Measures Concept of a Population and Sample The Normal Curve Tests for Normality. Learning Objectives. - PowerPoint PPT Presentation

Citation preview

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Educational StatisticsEducational Statistics

GURU K MOORTHYGURU K MOORTHYGURU K MOORTHYGURU K MOORTHY

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

OutlineOutline

Introduction Frequency Distribution Measures of Central Tendency Measures of Dispersion

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Outline-ContinuedOutline-Continued

Other Measures Concept of a Population and Sample The Normal Curve Tests for Normality

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Learning ObjectivesLearning Objectives

When you have completed this chapter you should be able to:

Know the difference between a variable and an attribute.

Perform mathematical calculations to the correct number of significant figures.

Construct histograms for simple and complex data.

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Learning Objectives-cont’d.Learning Objectives-cont’d.

When you have completed this chapter you should be able to:

Calculate and effectively use the different measures of central tendency, dispersion, and interrelationship.

Understand the concept of a universe and a sample.

Understand the concept of a normal curve and the relationship to the mean and standard deviation.

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Learning Objectives-cont’d.Learning Objectives-cont’d.

When you have completed this chapter you should be able to:

Calculate the percent of items below a value, above a value, or between two values for data that are normally distributed.

Calculate the process center given the percent of items below a value

Perform the different tests of normality Construct a scatter diagram and perform

the necessary related calculations.

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Definition of Statistics:

1. A collection of quantitative data pertaining to a subject or group. Examples are blood pressure statistics etc.

2. The science that deals with the collection, tabulation, analysis, interpretation, and presentation of quantitative data

IntroductionIntroduction

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Two phases of statistics:Descriptive Statistics:

Describes the characteristics of a product or process using information collected on it.

Inferential Statistics (Inductive):Draws conclusions on unknown process

parameters based on information contained in a sample.

Uses probability

IntroductionIntroduction

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Types of Data:

Attribute:

Discrete data. Data values can only be integers. Counted data or attribute data. Examples include: How many of the products are

defective? How often are the machines repaired? How many people are absent each day?

Collection of DataCollection of Data

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Types of Data:

Attribute:

Discrete data. Data values can only be integers. Counted data or attribute data. Examples include: How many days did it rain last month? What kind of performance was

achieved? Number of defects, defectives

Collection of Data – Cont’d.Collection of Data – Cont’d.

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Types of Data:

Variable:

Continuous data. Data values can be any real number. Measured data.

Examples include: How long is each item? How long did it take to complete the

task? What is the weight of the product? Length, volume, time

Collection of DataCollection of Data

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Collection of DataCollection of Data

Significant Figures Rounding

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Significant Figures = Measured numbers When you measure something there is

always room for a little bit of error How tall are you 5 ft 9 inches or 5 ft 9.1

inches? Counted numbers and defined numbers ( 12

ins. = 1 ft, there are 6 people in my family)

Significant FiguresSignificant Figures

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Significant figures are used to indicate the amount of variation which is allowed in a number.

It is believed to be closer to the actual value than any other digit.

Significant figures:3.69 – 3 significant digits.36.900 – 5 significant digits.

Significant FiguresSignificant Figures

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Use Scientific Notation3x10^2 (1 significant digit)3.0x10^2 (2 significant digits)

Significant Figures – Cont’d.Significant Figures – Cont’d.

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Rules for Multiplying and Dividing Number of sig. = the same as the number

with the least number of significant digits.6.59 x 2.3 = 1532.65/24 = 1.4 (where 24 is not a

counting number)32.64/24=1.360(24 is a counting

number i.e. 24.00)

Significant FiguresSignificant Figures

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Rules for Adding and Subtracting Result can have no more sig. fig. after the

decimal point than the number with the fewest sig. fig. after the decimal point.38.26 – 6 = 32 (6 is not a counting

number)38.2 -6 = 32.2 (6 is a counting number)38.26 – 6.1 = 32.2 (rounded from 32.16) If the last digit >=5 then round up, else

round down

Significant FiguresSignificant Figures

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Precision

The precision of a measurement is determined by how reproducible that measurement value is.

For example if a sample is weighed by a student to be 42.58 g, and then measured by another student five different times with the resulting data: 42.09 g, 42.15 g, 42.1 g, 42.16 g, 42.12 g Then the original measurement is not very precise since it cannot be reproduced.

Precision and AccuracyPrecision and Accuracy

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Accuracy The accuracy of a measurement is

determined by how close a measured value is to its “true” value.

For example, if a sample is known to weigh 3.182 g, then weighed five different times by a student with the resulting data: 3.200 g, 3.180 g, 3.152 g, 3.168 g, 3.189 g

The most accurate measurement would be 3.180 g, because it is closest to the true “weight” of the sample.

Precision and AccuracyPrecision and Accuracy

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Precision and AccuracyPrecision and Accuracy

Figure 4-1 Difference between accuracy and precision

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Frequency Distribution Measures of Central Tendency Measures of Dispersion

DescribingDescribing DataData

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Ungrouped Data Grouped Data

Frequency DistributionFrequency Distribution

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-72-7There are three types of frequency distributions

Categorical frequency distributions Ungrouped frequency distributions Grouped frequency distributions

Frequency DistributionFrequency Distribution

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-72-7Categorical frequency distributions Can be used for data that can be placed

in specific categories, such as nominal- or ordinal-level data.

Examples - political affiliation, religious affiliation, blood type etc.

CategoricalCategorical

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-82-8 Example :Blood Type Frequency Example :Blood Type Frequency DistributionDistribution

Class Frequency Percent

A 5 20

B 7 28

O 9 36

AB 4 16

CategoricalCategorical

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-92-9Ungrouped frequency distributions Ungrouped frequency distributions - can

be used for data that can be enumerated and when the range of values in the data set is not large.

Examples - number of miles your instructors have to travel from home to campus, number of girls in a 4-child family etc.

UngroupedUngrouped

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-102-10 Example :Number of Miles TraveledExample :Number of Miles Traveled

Class Frequency

5 24

10 16

15 10

UngroupedUngrouped

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-112-11 Grouped frequency distributions Can be used when the range of values

in the data set is very large. The data must be grouped into classes that are more than one unit in width.

Examples - the life of boat batteries in hours.

GroupedGrouped

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-122-12Example: Lifetimes of Boat BatteriesExample: Lifetimes of Boat Batteries

Classlimits

ClassBoundaries

Cumulative

24 - 30 23.5 - 37.5 4 4

38 - 51 37.5 - 51.5 14 18

52 - 65 51.5 - 65.5 7 25

frequencyFrequency

GroupedGrouped

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Number non conforming

Frequency Relative Frequency

Cumulative Frequency

RelativeFrequency

0 15 0.29 15 0.29

1 20 0.38 35 0.67

2 8 0.15 43 0.83

3 5 0.10 48 0.92

4 3 0.06 51 0.98

5 1 0.02 52 1.00

Table 4-3 Different Frequency Distributions of Data Given in Table 4-1

Frequency DistributionsFrequency Distributions

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Frequency Histogram

0

5

10

15

20

25

0 1 2 3 4 5

Number Nonconforming

Freq

uenc

y

Frequency HistogramFrequency Histogram

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Relative Frequency Histogram

0.00

0.05

0.10

0.15

0.20

0.25

0.30

0.35

0.40

0.45

0 1 2 3 4 5

Number Nonconforming

Rela

tive F

req

uen

cy

Relative Frequency Relative Frequency HistogramHistogram

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Cumulative Frequency Histogram

0

10

20

30

40

50

60

0 1 2 3 4 5

Number Nonconforming

Cu

mu

lati

ve F

req

uen

cy

Cumulative Frequency Cumulative Frequency HistogramHistogram

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The histogram is the most important graphical tool for exploring the shape of data distributions.

Check: http://quarknet.fnal.gov/toolkits/ati/histograms.html for the construction ,analysis and understanding of histograms

The HistogramThe Histogram

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The Fast WayStep 1: Find range of distribution, largest - smallest valuesStep 2: Choose number of classes, 5 to 20Step 3: Determine width of classes, one decimal place more than the data, class width = range/number of classesStep 4: Determine class boundariesStep 5: Draw frequency histogram

#classes n

Constructing a HistogramConstructing a Histogram

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Number of groups or cellsIf no. of observations < 100 – 5 to 9

cellsBetween 100-500 – 8 to 17 cellsGreater than 500 – 15 to 20 cells

Constructing a HistogramConstructing a Histogram

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

For a more accurate way of drawing a histogram see the section on grouped data in your textbook

Constructing a HistogramConstructing a Histogram

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Bar Graph Polygon of Data Cumulative Frequency Distribution or

Ogive

Other Types of Other Types of Frequency Distribution Frequency Distribution

GraphsGraphs

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Bar Graph and Polygon of Bar Graph and Polygon of DataData

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Cumulative FrequencyCumulative Frequency

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Figure 4-6 Characteristics of frequency distributions

Characteristics of FrequencyCharacteristics of FrequencyDistribution GraphsDistribution Graphs

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Analysis of HistogramsAnalysis of Histograms

Figure 4-7 Differences due to location, spread, and shape

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Analysis of HistogramsAnalysis of Histograms

Figure 4-8 Histogram of Wash Concentration

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The three measures in common use are the: Average Median Mode

Measures of Central Measures of Central TendencyTendency

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

There are three different techniques available for calculating the average three measures in common use are the:

Ungrouped data Grouped data Weighted average

AverageAverage

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

1

ni

i

XX

n

Average-Ungrouped DataAverage-Ungrouped Data

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

1

1 1 2 2

1 2

... .

...

hi i

i

h h

h

f XX

n

f X f X f X

f f f

h = number of cellsh = number of cellsfi=frequencyfi=frequencyXi=midpointXi=midpoint

Average-Grouped DataAverage-Grouped Data

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

1

1

n

i iiw n

ii

w XX

w

Used when a number of averages are combined with different frequencies

Average-Weighted AverageAverage-Weighted Average

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2m

d mm

ncf

M L if

Lm=lower boundary of the cell with the medianN=total number of observationsCfm=cumulative frequency of all cells below mFm=frequency of median celli=cell interval

Median-Grouped DataMedian-Grouped Data

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Boundaries Midpoint Frequency Computation

23.6-26.5 25.0 4 100

26.6-29.5 28.0 36 1008

29.6-32.5 31.0 51 1581

32.6-35.5 34.0 63 2142

35.6-38.5 37.0 58 2146

38.6-41.5 40.0 52 2080

41.6-44.5 43.0 34 1462

44.6-47.5 46.0 16 736

47.6-50.5 49.0 6 294

Total 320 11549

Table 4-7 Frequency Distribution of the Life of 320 tires in 1000 km

Example ProblemExample Problem

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2m

d mm

ncf

M L if

320154

235.6 3 35.958

Md

Median-Grouped DataMedian-Grouped Data

Using data from Table 4-7

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

ModeMode

The Mode is the value that occurs with the greatest frequency.

It is possible to have no modes in a series or numbers or to have more than one mode.

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Figure 4-9 Relationship among average, median and mode

Relationship Among theRelationship Among theMeasures of Central Measures of Central

TendencyTendency

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Range Standard Deviation Variance

Measures of DispersionMeasures of Dispersion

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The range is the simplest and easiest to calculate of the measures of dispersion.

Range = R = Xh - Xl Largest value - Smallest value in

data set

MeasuresMeasures of Dispersion- of Dispersion-RangeRange

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Sample Standard Deviation:

2

1( )

1

n

iXi X

Sn

2

2

11

/

1

nn

ii

Xi Xi n

Sn

Measures of Dispersion-Measures of Dispersion-Standard DeviationStandard Deviation

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Ungrouped Technique

2 2

1 1( )

( 1)

n n

i in Xi Xi

Sn n

Standard DeviationStandard Deviation

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2 2

11

( ) ( )

( 1)

hh

i i i iii

n f X f Xs

n n

Standard DeviationStandard Deviation

Grouped Technique

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Relationship Between the Relationship Between the Measures of DispersionMeasures of Dispersion

As n increases, accuracy of R decreases Use R when there is small amount of data

or data is too scattered If n> 10 use standard deviation A smaller standard deviation means better

quality

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Relationship Between the Relationship Between the Measures of DispersionMeasures of Dispersion

Figure 4-10 Comparison of two distributions with equal average and range

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Other MeasuresOther Measures

There are three other measures that are frequently used to analyze a collection of data: Skewness Kurtosis Coefficient of Variation

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Skewness is the lack of symmetry of the data.

For grouped data:3

13 3

( ) /h

i iif X X n

as

SkewnessSkewness

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

SkewnessSkewness

Figure 4-11 Left (negative) and right (positive) skewness distributions

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Kurtosis provides information regrading the shape of the population distribution (the peakedness or heaviness of the tails of a distribution).

For grouped data:4

14 4

( ) /h

i iif X X n

as

KurtosisKurtosis

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

KurtosisKurtosis

Figure 4-11 Leptokurtic and Platykurtic distributions

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Correlation variation (CV) is a measure of how much variation exists in relation to the mean.

Coefficient of VariationCoefficient of Variation

(100%)sCV

X

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Population Set of all items that possess a characteristic of interest

Sample Subset of a population

Population and SamplePopulation and Sample

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Parameter is a characteristic of a population, i.o.w. it describes a population Example: average weight of the population, e.g. 50,000 cans made in a month.Statistic is a characteristic of a sample, used to make inferences on the population parameters that are typically unknown, called an estimator Example: average weight of a sample of 500 cans from that month’s output, an estimate of the average weight of the 50,000 cans.

Parameter and StatisticParameter and Statistic

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Characteristics of the normal curve: It is symmetrical -- Half the cases are to

one side of the center; the other half is on the other side.

The distribution is single peaked, not bimodal or multi-modal

Also known as the Gaussian distribution

The Normal CurveThe Normal Curve

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Characteristics:

Most of the cases will fall in the center portion of the curve and as values of the variable become more extreme they become less frequent, with "outliers" at the "tail" of the distribution few in number. It is one of many frequency distributions.

The Normal CurveThe Normal Curve

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The standard normal distribution is a normal distribution with a mean of 0 and a standard deviation of 1. Normal distributions can be transformed to standard normal distributions by the formula:

iXZ

Standard Normal DistributionStandard Normal Distribution

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Relationship between the Relationship between the

Mean and Standard Mean and Standard DeviationDeviation

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Mean and Standard Mean and Standard DeviationDeviation

Same mean but different standard deviation

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Mean and Standard Mean and Standard DeviationDeviation

Same mean but different standard deviation

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

IF THE DISTRIBUTION IS NORMAL

Then the mean is the best measure of central tendencyMost scores “bunched up” in

middleExtreme scores are less frequent,

therefore less probable

Normal DistributionNormal Distribution

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Percent of items included between certain values of the std. deviation

Normal DistributionNormal Distribution

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Histogram Skewness Kurtosis

Tests for NormalityTests for Normality

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Histogram:

ShapeSymmetrical

The larger the sampler size, the better the judgment of normality. A minimum sample size of 50 is recommended

Tests for NormalityTests for Normality

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Skewness (a3) and Kurtosis (a4)” Skewed to the left or to the right (a3=0 for

a normal distribution) The data are peaked as the normal

distribution (a4=3 for a normal distribution)

The larger the sample size, the better the judgment of normality (sample size of 100 is recommended)

Tests for NormalityTests for Normality

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Probability Plots Order the data from the smallest to the

largest Rank the observations (starting from 1 for

the lowest observation) Calculate the plotting position

100( 0.5)iPP

n

Where i = rank PP=plotting position n=sample size

Tests for NormalityTests for Normality

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Procedure: Order the data Rank the observations Calculate the plotting position

Probability PlotsProbability Plots

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Procedure cont’d: Label the data scale Plot the points Attempt to fit by eye a “best

line” Determine normality

Probability PlotsProbability Plots

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Procedure cont’d: Order the data Rank the observations Calculate the plotting position Label the data scale Plot the points Attempt to fit by eye a “best line” Determine normality

Probability PlotsProbability Plots

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Chi-Square Test

2 Chi-squared

Observed value in a cell

Expected value for a cell

i

i

O

E

Where

22

1

( )ik

i

ii

O E

E

Chi-Square Goodness of Fit Chi-Square Goodness of Fit TestTest

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The simplest way to determine if a The simplest way to determine if a cause and-effect relationship exists cause and-effect relationship exists between two variablesbetween two variables

Scatter DiagramScatter Diagram

Figure 4-19 Scatter Diagram

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Supplies the data to confirm a Supplies the data to confirm a hypothesis that two variables are hypothesis that two variables are relatedrelated

Provides both a visual and statistical Provides both a visual and statistical means to test the strength of a means to test the strength of a relationshiprelationship

Provides a good follow-up to cause and Provides a good follow-up to cause and effect diagramseffect diagrams

Scatter DiagramScatter Diagram

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Straight Line FitStraight Line Fit

2 2

[( )( ) /

[( ) / ]

/ ( / )

xy x y nm

x x n

a y n m x n

y a mx

Where m=slope of the line and a is the intercept on the y axis

Recommended