87
Besterfield: Quality Control, 8 th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458. All rights reserved Educational Educational Statistics Statistics GURU K MOORTHY GURU K MOORTHY

Educational Statistics

Embed Size (px)

DESCRIPTION

Educational Statistics. GURU K MOORTHY. Outline. Introduction Frequency Distribution Measures of Central Tendency Measures of Dispersion. Outline-Continued. Other Measures Concept of a Population and Sample The Normal Curve Tests for Normality. Learning Objectives. - PowerPoint PPT Presentation

Citation preview

Page 1: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Educational StatisticsEducational Statistics

GURU K MOORTHYGURU K MOORTHYGURU K MOORTHYGURU K MOORTHY

Page 2: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

OutlineOutline

Introduction Frequency Distribution Measures of Central Tendency Measures of Dispersion

Page 3: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Outline-ContinuedOutline-Continued

Other Measures Concept of a Population and Sample The Normal Curve Tests for Normality

Page 4: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Learning ObjectivesLearning Objectives

When you have completed this chapter you should be able to:

Know the difference between a variable and an attribute.

Perform mathematical calculations to the correct number of significant figures.

Construct histograms for simple and complex data.

Page 5: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Learning Objectives-cont’d.Learning Objectives-cont’d.

When you have completed this chapter you should be able to:

Calculate and effectively use the different measures of central tendency, dispersion, and interrelationship.

Understand the concept of a universe and a sample.

Understand the concept of a normal curve and the relationship to the mean and standard deviation.

Page 6: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Learning Objectives-cont’d.Learning Objectives-cont’d.

When you have completed this chapter you should be able to:

Calculate the percent of items below a value, above a value, or between two values for data that are normally distributed.

Calculate the process center given the percent of items below a value

Perform the different tests of normality Construct a scatter diagram and perform

the necessary related calculations.

Page 7: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Definition of Statistics:

1. A collection of quantitative data pertaining to a subject or group. Examples are blood pressure statistics etc.

2. The science that deals with the collection, tabulation, analysis, interpretation, and presentation of quantitative data

IntroductionIntroduction

Page 8: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Two phases of statistics:Descriptive Statistics:

Describes the characteristics of a product or process using information collected on it.

Inferential Statistics (Inductive):Draws conclusions on unknown process

parameters based on information contained in a sample.

Uses probability

IntroductionIntroduction

Page 9: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Types of Data:

Attribute:

Discrete data. Data values can only be integers. Counted data or attribute data. Examples include: How many of the products are

defective? How often are the machines repaired? How many people are absent each day?

Collection of DataCollection of Data

Page 10: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Types of Data:

Attribute:

Discrete data. Data values can only be integers. Counted data or attribute data. Examples include: How many days did it rain last month? What kind of performance was

achieved? Number of defects, defectives

Collection of Data – Cont’d.Collection of Data – Cont’d.

Page 11: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Types of Data:

Variable:

Continuous data. Data values can be any real number. Measured data.

Examples include: How long is each item? How long did it take to complete the

task? What is the weight of the product? Length, volume, time

Collection of DataCollection of Data

Page 12: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Collection of DataCollection of Data

Significant Figures Rounding

Page 13: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Significant Figures = Measured numbers When you measure something there is

always room for a little bit of error How tall are you 5 ft 9 inches or 5 ft 9.1

inches? Counted numbers and defined numbers ( 12

ins. = 1 ft, there are 6 people in my family)

Significant FiguresSignificant Figures

Page 14: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Significant figures are used to indicate the amount of variation which is allowed in a number.

It is believed to be closer to the actual value than any other digit.

Significant figures:3.69 – 3 significant digits.36.900 – 5 significant digits.

Significant FiguresSignificant Figures

Page 15: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Use Scientific Notation3x10^2 (1 significant digit)3.0x10^2 (2 significant digits)

Significant Figures – Cont’d.Significant Figures – Cont’d.

Page 16: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Rules for Multiplying and Dividing Number of sig. = the same as the number

with the least number of significant digits.6.59 x 2.3 = 1532.65/24 = 1.4 (where 24 is not a

counting number)32.64/24=1.360(24 is a counting

number i.e. 24.00)

Significant FiguresSignificant Figures

Page 17: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Rules for Adding and Subtracting Result can have no more sig. fig. after the

decimal point than the number with the fewest sig. fig. after the decimal point.38.26 – 6 = 32 (6 is not a counting

number)38.2 -6 = 32.2 (6 is a counting number)38.26 – 6.1 = 32.2 (rounded from 32.16) If the last digit >=5 then round up, else

round down

Significant FiguresSignificant Figures

Page 18: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Precision

The precision of a measurement is determined by how reproducible that measurement value is.

For example if a sample is weighed by a student to be 42.58 g, and then measured by another student five different times with the resulting data: 42.09 g, 42.15 g, 42.1 g, 42.16 g, 42.12 g Then the original measurement is not very precise since it cannot be reproduced.

Precision and AccuracyPrecision and Accuracy

Page 19: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Accuracy The accuracy of a measurement is

determined by how close a measured value is to its “true” value.

For example, if a sample is known to weigh 3.182 g, then weighed five different times by a student with the resulting data: 3.200 g, 3.180 g, 3.152 g, 3.168 g, 3.189 g

The most accurate measurement would be 3.180 g, because it is closest to the true “weight” of the sample.

Precision and AccuracyPrecision and Accuracy

Page 20: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Precision and AccuracyPrecision and Accuracy

Figure 4-1 Difference between accuracy and precision

Page 21: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Frequency Distribution Measures of Central Tendency Measures of Dispersion

DescribingDescribing DataData

Page 22: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Ungrouped Data Grouped Data

Frequency DistributionFrequency Distribution

Page 23: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-72-7There are three types of frequency distributions

Categorical frequency distributions Ungrouped frequency distributions Grouped frequency distributions

Frequency DistributionFrequency Distribution

Page 24: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-72-7Categorical frequency distributions Can be used for data that can be placed

in specific categories, such as nominal- or ordinal-level data.

Examples - political affiliation, religious affiliation, blood type etc.

CategoricalCategorical

Page 25: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-82-8 Example :Blood Type Frequency Example :Blood Type Frequency DistributionDistribution

Class Frequency Percent

A 5 20

B 7 28

O 9 36

AB 4 16

CategoricalCategorical

Page 26: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-92-9Ungrouped frequency distributions Ungrouped frequency distributions - can

be used for data that can be enumerated and when the range of values in the data set is not large.

Examples - number of miles your instructors have to travel from home to campus, number of girls in a 4-child family etc.

UngroupedUngrouped

Page 27: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-102-10 Example :Number of Miles TraveledExample :Number of Miles Traveled

Class Frequency

5 24

10 16

15 10

UngroupedUngrouped

Page 28: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-112-11 Grouped frequency distributions Can be used when the range of values

in the data set is very large. The data must be grouped into classes that are more than one unit in width.

Examples - the life of boat batteries in hours.

GroupedGrouped

Page 29: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2-122-12Example: Lifetimes of Boat BatteriesExample: Lifetimes of Boat Batteries

Classlimits

ClassBoundaries

Cumulative

24 - 30 23.5 - 37.5 4 4

38 - 51 37.5 - 51.5 14 18

52 - 65 51.5 - 65.5 7 25

frequencyFrequency

GroupedGrouped

Page 30: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Number non conforming

Frequency Relative Frequency

Cumulative Frequency

RelativeFrequency

0 15 0.29 15 0.29

1 20 0.38 35 0.67

2 8 0.15 43 0.83

3 5 0.10 48 0.92

4 3 0.06 51 0.98

5 1 0.02 52 1.00

Table 4-3 Different Frequency Distributions of Data Given in Table 4-1

Frequency DistributionsFrequency Distributions

Page 31: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Frequency Histogram

0

5

10

15

20

25

0 1 2 3 4 5

Number Nonconforming

Freq

uenc

y

Frequency HistogramFrequency Histogram

Page 32: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Relative Frequency Histogram

0.00

0.05

0.10

0.15

0.20

0.25

0.30

0.35

0.40

0.45

0 1 2 3 4 5

Number Nonconforming

Rela

tive F

req

uen

cy

Relative Frequency Relative Frequency HistogramHistogram

Page 33: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Cumulative Frequency Histogram

0

10

20

30

40

50

60

0 1 2 3 4 5

Number Nonconforming

Cu

mu

lati

ve F

req

uen

cy

Cumulative Frequency Cumulative Frequency HistogramHistogram

Page 34: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The histogram is the most important graphical tool for exploring the shape of data distributions.

Check: http://quarknet.fnal.gov/toolkits/ati/histograms.html for the construction ,analysis and understanding of histograms

The HistogramThe Histogram

Page 35: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The Fast WayStep 1: Find range of distribution, largest - smallest valuesStep 2: Choose number of classes, 5 to 20Step 3: Determine width of classes, one decimal place more than the data, class width = range/number of classesStep 4: Determine class boundariesStep 5: Draw frequency histogram

#classes n

Constructing a HistogramConstructing a Histogram

Page 36: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Number of groups or cellsIf no. of observations < 100 – 5 to 9

cellsBetween 100-500 – 8 to 17 cellsGreater than 500 – 15 to 20 cells

Constructing a HistogramConstructing a Histogram

Page 37: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

For a more accurate way of drawing a histogram see the section on grouped data in your textbook

Constructing a HistogramConstructing a Histogram

Page 38: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Bar Graph Polygon of Data Cumulative Frequency Distribution or

Ogive

Other Types of Other Types of Frequency Distribution Frequency Distribution

GraphsGraphs

Page 39: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Bar Graph and Polygon of Bar Graph and Polygon of DataData

Page 40: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Cumulative FrequencyCumulative Frequency

Page 41: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Figure 4-6 Characteristics of frequency distributions

Characteristics of FrequencyCharacteristics of FrequencyDistribution GraphsDistribution Graphs

Page 42: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Analysis of HistogramsAnalysis of Histograms

Figure 4-7 Differences due to location, spread, and shape

Page 43: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Analysis of HistogramsAnalysis of Histograms

Figure 4-8 Histogram of Wash Concentration

Page 44: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The three measures in common use are the: Average Median Mode

Measures of Central Measures of Central TendencyTendency

Page 45: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

There are three different techniques available for calculating the average three measures in common use are the:

Ungrouped data Grouped data Weighted average

AverageAverage

Page 46: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

1

ni

i

XX

n

Average-Ungrouped DataAverage-Ungrouped Data

Page 47: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

1

1 1 2 2

1 2

... .

...

hi i

i

h h

h

f XX

n

f X f X f X

f f f

h = number of cellsh = number of cellsfi=frequencyfi=frequencyXi=midpointXi=midpoint

Average-Grouped DataAverage-Grouped Data

Page 48: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

1

1

n

i iiw n

ii

w XX

w

Used when a number of averages are combined with different frequencies

Average-Weighted AverageAverage-Weighted Average

Page 49: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2m

d mm

ncf

M L if

Lm=lower boundary of the cell with the medianN=total number of observationsCfm=cumulative frequency of all cells below mFm=frequency of median celli=cell interval

Median-Grouped DataMedian-Grouped Data

Page 50: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Boundaries Midpoint Frequency Computation

23.6-26.5 25.0 4 100

26.6-29.5 28.0 36 1008

29.6-32.5 31.0 51 1581

32.6-35.5 34.0 63 2142

35.6-38.5 37.0 58 2146

38.6-41.5 40.0 52 2080

41.6-44.5 43.0 34 1462

44.6-47.5 46.0 16 736

47.6-50.5 49.0 6 294

Total 320 11549

Table 4-7 Frequency Distribution of the Life of 320 tires in 1000 km

Example ProblemExample Problem

Page 51: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2m

d mm

ncf

M L if

320154

235.6 3 35.958

Md

Median-Grouped DataMedian-Grouped Data

Using data from Table 4-7

Page 52: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

ModeMode

The Mode is the value that occurs with the greatest frequency.

It is possible to have no modes in a series or numbers or to have more than one mode.

Page 53: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Figure 4-9 Relationship among average, median and mode

Relationship Among theRelationship Among theMeasures of Central Measures of Central

TendencyTendency

Page 54: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Range Standard Deviation Variance

Measures of DispersionMeasures of Dispersion

Page 55: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The range is the simplest and easiest to calculate of the measures of dispersion.

Range = R = Xh - Xl Largest value - Smallest value in

data set

MeasuresMeasures of Dispersion- of Dispersion-RangeRange

Page 56: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Sample Standard Deviation:

2

1( )

1

n

iXi X

Sn

2

2

11

/

1

nn

ii

Xi Xi n

Sn

Measures of Dispersion-Measures of Dispersion-Standard DeviationStandard Deviation

Page 57: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Ungrouped Technique

2 2

1 1( )

( 1)

n n

i in Xi Xi

Sn n

Standard DeviationStandard Deviation

Page 58: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

2 2

11

( ) ( )

( 1)

hh

i i i iii

n f X f Xs

n n

Standard DeviationStandard Deviation

Grouped Technique

Page 59: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Relationship Between the Relationship Between the Measures of DispersionMeasures of Dispersion

As n increases, accuracy of R decreases Use R when there is small amount of data

or data is too scattered If n> 10 use standard deviation A smaller standard deviation means better

quality

Page 60: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Relationship Between the Relationship Between the Measures of DispersionMeasures of Dispersion

Figure 4-10 Comparison of two distributions with equal average and range

Page 61: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Other MeasuresOther Measures

There are three other measures that are frequently used to analyze a collection of data: Skewness Kurtosis Coefficient of Variation

Page 62: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Skewness is the lack of symmetry of the data.

For grouped data:3

13 3

( ) /h

i iif X X n

as

SkewnessSkewness

Page 63: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

SkewnessSkewness

Figure 4-11 Left (negative) and right (positive) skewness distributions

Page 64: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Kurtosis provides information regrading the shape of the population distribution (the peakedness or heaviness of the tails of a distribution).

For grouped data:4

14 4

( ) /h

i iif X X n

as

KurtosisKurtosis

Page 65: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

KurtosisKurtosis

Figure 4-11 Leptokurtic and Platykurtic distributions

Page 66: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Correlation variation (CV) is a measure of how much variation exists in relation to the mean.

Coefficient of VariationCoefficient of Variation

(100%)sCV

X

Page 67: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Population Set of all items that possess a characteristic of interest

Sample Subset of a population

Population and SamplePopulation and Sample

Page 68: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Parameter is a characteristic of a population, i.o.w. it describes a population Example: average weight of the population, e.g. 50,000 cans made in a month.Statistic is a characteristic of a sample, used to make inferences on the population parameters that are typically unknown, called an estimator Example: average weight of a sample of 500 cans from that month’s output, an estimate of the average weight of the 50,000 cans.

Parameter and StatisticParameter and Statistic

Page 69: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Characteristics of the normal curve: It is symmetrical -- Half the cases are to

one side of the center; the other half is on the other side.

The distribution is single peaked, not bimodal or multi-modal

Also known as the Gaussian distribution

The Normal CurveThe Normal Curve

Page 70: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Characteristics:

Most of the cases will fall in the center portion of the curve and as values of the variable become more extreme they become less frequent, with "outliers" at the "tail" of the distribution few in number. It is one of many frequency distributions.

The Normal CurveThe Normal Curve

Page 71: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The standard normal distribution is a normal distribution with a mean of 0 and a standard deviation of 1. Normal distributions can be transformed to standard normal distributions by the formula:

iXZ

Standard Normal DistributionStandard Normal Distribution

Page 72: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Relationship between the Relationship between the

Mean and Standard Mean and Standard DeviationDeviation

Page 73: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Mean and Standard Mean and Standard DeviationDeviation

Same mean but different standard deviation

Page 74: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Mean and Standard Mean and Standard DeviationDeviation

Same mean but different standard deviation

Page 75: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

IF THE DISTRIBUTION IS NORMAL

Then the mean is the best measure of central tendencyMost scores “bunched up” in

middleExtreme scores are less frequent,

therefore less probable

Normal DistributionNormal Distribution

Page 76: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Percent of items included between certain values of the std. deviation

Normal DistributionNormal Distribution

Page 77: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Histogram Skewness Kurtosis

Tests for NormalityTests for Normality

Page 78: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Histogram:

ShapeSymmetrical

The larger the sampler size, the better the judgment of normality. A minimum sample size of 50 is recommended

Tests for NormalityTests for Normality

Page 79: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Skewness (a3) and Kurtosis (a4)” Skewed to the left or to the right (a3=0 for

a normal distribution) The data are peaked as the normal

distribution (a4=3 for a normal distribution)

The larger the sample size, the better the judgment of normality (sample size of 100 is recommended)

Tests for NormalityTests for Normality

Page 80: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Probability Plots Order the data from the smallest to the

largest Rank the observations (starting from 1 for

the lowest observation) Calculate the plotting position

100( 0.5)iPP

n

Where i = rank PP=plotting position n=sample size

Tests for NormalityTests for Normality

Page 81: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Procedure: Order the data Rank the observations Calculate the plotting position

Probability PlotsProbability Plots

Page 82: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Procedure cont’d: Label the data scale Plot the points Attempt to fit by eye a “best

line” Determine normality

Probability PlotsProbability Plots

Page 83: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Procedure cont’d: Order the data Rank the observations Calculate the plotting position Label the data scale Plot the points Attempt to fit by eye a “best line” Determine normality

Probability PlotsProbability Plots

Page 84: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Chi-Square Test

2 Chi-squared

Observed value in a cell

Expected value for a cell

i

i

O

E

Where

22

1

( )ik

i

ii

O E

E

Chi-Square Goodness of Fit Chi-Square Goodness of Fit TestTest

Page 85: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

The simplest way to determine if a The simplest way to determine if a cause and-effect relationship exists cause and-effect relationship exists between two variablesbetween two variables

Scatter DiagramScatter Diagram

Figure 4-19 Scatter Diagram

Page 86: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Supplies the data to confirm a Supplies the data to confirm a hypothesis that two variables are hypothesis that two variables are relatedrelated

Provides both a visual and statistical Provides both a visual and statistical means to test the strength of a means to test the strength of a relationshiprelationship

Provides a good follow-up to cause and Provides a good follow-up to cause and effect diagramseffect diagrams

Scatter DiagramScatter Diagram

Page 87: Educational   Statistics

Besterfield: Quality Control, 8th ed.. © 2009 Pearson Education, Upper Saddle River, NJ 07458.All rights reserved

Straight Line FitStraight Line Fit

2 2

[( )( ) /

[( ) / ]

/ ( / )

xy x y nm

x x n

a y n m x n

y a mx

Where m=slope of the line and a is the intercept on the y axis