102
Slide 1 Copyright © 2004 Pearson Education, Inc.

tes9e_ch02

Embed Size (px)

Citation preview

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 1/102

Slide 1

Copyright © 2004 Pearson Education, Inc.

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 2/102

Slide 2

Copyright © 2004 Pearson Education, Inc.

Chapter 2Describing, Exploring, and

Comparing Data2-1 Overview

2-2 Frequency Distributions

2-3 Visualizing Data

2-4 Measures of Center

2-5 Measures of Variation

2-6 Measures of Relative Standing2-7 Exploratory Data Analysis

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 3/102

Slide 3

Copyright © 2004 Pearson Education, Inc.

Created by Tom Wegleitner, Centreville, Virginia

Section 2-1Overview

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 4/102

Slide 4

Copyright © 2004 Pearson Education, Inc.

D escriptive Statistics

summarize or describe the important

characteristics of a known set of population data

Inferential Statisticsuse sample data to make inferences (or generalizations) about a population

Overview

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 5/102

Slide 5

Copyright © 2004 Pearson Education, Inc.

1. Center : A representative or average value thatindicates where the middle of the data set is located

2. Variation : A measure of the amount that the values

vary among themselves

3. D istribution : The nature or shape of the distributionof data (such as bell-shaped, uniform, or skewed)

4. Outliers : Sample values that lie very far away fromthe vast majority of other sample values

5. Time : Changing characteristics of the data over

time

Important Characteristics of D ata

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 6/102

Slide 6

Copyright © 2004 Pearson Education, Inc.

Created by Tom Wegleitner, Centreville, Virginia

Section 2-2Frequency D istributions

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 7/102

Slide 7

Copyright © 2004 Pearson Education, Inc.

Frequency D istribution

lists data values (either individually or bygroups of intervals), along with their corresponding frequencies or counts

Frequency D istributions

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 8/102

Slide 8

Copyright © 2004 Pearson Education, Inc.

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 9/102

Slide 9

Copyright © 2004 Pearson Education, Inc.

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 10/102

Slide 10

Copyright © 2004 Pearson Education, Inc.

Frequency D istributions

Definitions

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 11/102

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 12/102

Slide 12

Copyright © 2004 Pearson Education, Inc.

are the smallest numbers that can actually belong todifferent classes

L ower ClassL imits

L ower Class L imits

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 13/102

Slide 13

Copyright © 2004 Pearson Education, Inc.

U pper Class L imits

are the largest numbers that can actually belong todifferent classes

U pper ClassL imits

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 14/102

Slide 14

Copyright © 2004 Pearson Education, Inc.

are the numbers used to separate classes, butwithout the gaps created by class limits

Class Boundaries

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 15/102

Slide 15

Copyright © 2004 Pearson Education, Inc.

number separating classes

Class Boundaries

- 0.5

99.5

199.5

299.5

399.5

499.5

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 16/102

Slide 16

Copyright © 2004 Pearson Education, Inc.

Class Boundaries

number separating classes

Class

Boundaries

- 0.5

99.5

199.5

299.5

399.5

499.

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 17/102

Slide 17

Copyright © 2004 Pearson Education, Inc.

midpoints of the classes

Class Midpoints

Class midpoints can be found byadding the lower class limit to the

upper class limit and dividing the sumby two.

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 18/102

Slide 18

Copyright © 2004 Pearson Education, Inc.

ClassMidpoints

midpoints of the classes

Class Midpoints

49.5

149.5

249.5

349.5

449.5

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 19/102

Slide 19

Copyright © 2004 Pearson Education, Inc.

Class Widthis the difference between two consecutive lower class limitsor two consecutive lower class boundaries

ClassWidth

100

100

100

100100

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 20/102

Slide 20

Copyright © 2004 Pearson Education, Inc.

1. L arge data sets can be summarized.

2. Can gain some insight into the nature of data.

3. Have a basis for constructing graphs.

Reasons for ConstructingFrequency D istributions

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 21/102

Slide 21

Copyright © 2004 Pearson Education, Inc.

3. Starting point: Begin by choosing a lower limit of the first class.

4. U sing the lower limit of the first class and class width, proceed tolist the lower class limits.

5. L ist the lower class limits in a vertical column and proceed to

enter the upper class limits.6. Go through the data set putting a tally in the appropriate class for

each data value.

Constructing A Frequency Table

1. D ecide on the number of classes (should be between 5 and 20) .

2. Calculate (round up).

class width } (highest value) ± (lowest value)number of classes

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 22/102

Slide 22

Copyright © 2004 Pearson Education, Inc.

Relative Frequency D istribution

relative frequency = class frequencysum of all frequencies

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 23/102

Slide 23

Copyright © 2004 Pearson Education, Inc.

Relative Frequency D istribution

11/40 = 28%

12/40 = 40%

etc.

Total Frequency = 40

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 24/102

Slide 24

Copyright © 2004 Pearson Education, Inc.

Cumulative FrequencyD istribution

CumulativeFrequencies

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 25/102

Slide 25

Copyright © 2004 Pearson Education, Inc.

Frequency Tables

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 26/102

Slide 26

Copyright © 2004 Pearson Education, Inc.

Recap

In this Section we have discussed

Important characteristics of data

Frequency distributionsProcedures for constructing frequency distributions

Relative frequency distributions

Cumulative frequency distributions

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 27/102

Slide 27

Copyright © 2004 Pearson Education, Inc.

Created by Tom Wegleitner, Centreville, Virginia

Section 2-3Visualizing D ata

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 28/102

Slide 28

Copyright © 2004 Pearson Education, Inc.

Visualizing D ata

D epict the nature of shape or shape of the data distribution

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 29/102

Slide 29

Copyright © 2004 Pearson Education, Inc.

Histogram

A bar graph in which the horizontal scale represents

the classes of data values and the vertical scalerepresents the frequencies.

Figure 2-1

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 30/102

Slide 30

Copyright © 2004 Pearson Education, Inc.

Relative Frequency Histogram

Has the same shape and horizontal scale as ahistogram, but the vertical scale is marked withrelative frequencies.

Figure 2-2

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 31/102

Slide 31

Copyright © 2004 Pearson Education, Inc.

Histogramand

Relative Frequency Histogram

Figure 2-1 Figure 2-2

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 32/102

Slide 32

Copyright © 2004 Pearson Education, Inc.

Frequency Polygon

U ses line segments connected to points directlyabove class midpoint values

Figure 2-3

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 33/102

Slide 33

Copyright © 2004 Pearson Education, Inc.

Ogive

A line graph that depicts cumulative frequencies

Figure 2-4

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 34/102

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 35/102

Slide 35

Copyright © 2004 Pearson Education, Inc.

Stem-and L eaf Plot

Represents data by separating each value into twoparts: the stem (such as the leftmost digit) and theleaf (such as the rightmost digit)

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 36/102

Slide 36

Copyright © 2004 Pearson Education, Inc.

Pareto Chart

A bar graph for qualitative data, with the barsarranged in order according to frequencies

Figure 2-6

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 37/102

Slide 37

Copyright © 2004 Pearson Education, Inc.

Pie Chart

A graph depicting qualitative data as slices pf a pie

Figure 2-7

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 38/102

Slide 38

Copyright © 2004 Pearson Education, Inc.

Scatter D iagram

A plot of paired (x,y) data with a horizontal x-axisand a vertical y-axis

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 39/102

Slide 39

Copyright © 2004 Pearson Education, Inc.

Time-Series Graph

D ata that have been collected at different points intime

Figure 2-8

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 40/102

Slide 40

Copyright © 2004 Pearson Education, Inc.

Other Graphs

Figure 2-9

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 41/102

Slide 41

Copyright © 2004 Pearson Education, Inc.

Recap

In this Section we have discussed graphsthat are pictures of distributions.

Keep in mind that the object of this section isnot just to construct graphs, but to learn

something about the data sets ± that is, tounderstand the nature of their distributions.

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 42/102

Slide 42

Copyright © 2004 Pearson Education, Inc.

Created by Tom Wegleitner, Centreville, Virginia

Section 2-4Measures of Center

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 43/102

Slide 43

Copyright © 2004 Pearson Education, Inc.

D efinition

Measure of Center The value at the center or middleof a data set

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 44/102

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 45/102

Slide 45

Copyright © 2004 Pearson Education, Inc.

N otation

7 denotes the addition of a set of values

x is the variable usually used to represent the individualdata values

n represents the number of values in a sample

N represents the number of values in a population

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 46/102

Slide 46

Copyright © 2004 Pearson Education, Inc.

N otation

µ is pronounced µmu¶ and denotes the mean of all valuesin a population

x =n

7 x is pronounced µx-bar¶ and denotes the mean of a setof sample values x

N µ =

7 x

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 47/102

Slide 47

Copyright © 2004 Pearson Education, Inc.

D efinitionsMedian

the middle value when the originaldata values are arranged in order of

increasing (or decreasing) magnitudeoften denoted by x (pronounced µx-tilde¶)~

is not affected by an extreme value

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 48/102

Slide 48

Copyright © 2004 Pearson Education, Inc.

Finding the Median

If the number of values is odd, themedian is the number located in theexact middle of the list

If the number of values is even, themedian is found by computing the

mean of the two middle numbers

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 49/102

Slide 49

Copyright © 2004 Pearson Education, Inc.

5.40 1.10 0.42 0.73 0.48 1.10 0.66

0.42 0.48 0.66 0.73 1.10 1.10 5.40

(in order - odd number of values)

exact middle MED IAN is 0.73

5.40 1.10 0.42 0.73 0.48 1.10

0.42 0.48 0.73 1.10 1.10 5.40

0.73 + 1.10

2

(even number of values ± no exact middleshared by two numbers)

MED IAN is 0.915

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 50/102

Slide 50

Copyright © 2004 Pearson Education, Inc.

D efinitionsMode

the value that occurs most frequently

The mode is not always unique. A data set may be:

BimodalMultimodalN o Mode

denoted by Mthe only measure of central tendency that

can be used with nominal data

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 51/102

Slide 51

Copyright © 2004 Pearson Education, Inc.

a. 5.40 1.10 0.42 0.73 0.48 1.10

b. 27 27 27 55 55 55 88 88 99

c. 1 2 3 6 7 8 9 10

Examples

Mode is 1.10

Bimodal - 27 & 55

N o Mode

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 52/102

Slide 52

Copyright © 2004 Pearson Education, Inc.

Midrange

the value midway between the highestand lowest values in the original dataset

D efinitions

Midrange = highest score + lowest score

2

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 53/102

Slide 53

Copyright © 2004 Pearson Education, Inc.

Carry one more decimal place than ispresent in the original set of values

Round-off Rule for Measures of Center

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 54/102

Slide 54

Copyright © 2004 Pearson Education, Inc.

Assume that in each class, all samplevalues are equal to the class

midpoint

Mean from a FrequencyD istribution

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 55/102

Slide 55

Copyright © 2004 Pearson Education, Inc.

use class midpoint of classes for variable x

Mean from a FrequencyD istribution

x = class midpoint

f = frequency

7 f = n

x = Formula 2-2 f

7 ( f x)7

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 56/102

Slide 56

Copyright © 2004 Pearson Education, Inc.

Weighted Mean

x =

w

7 (w x)

7

In some cases, values vary in their degree of importance, so they are weighted accordingly

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 57/102

Slide 57

Copyright © 2004 Pearson Education, Inc.

Best Measure of Center

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 58/102

Slide 58

Copyright © 2004 Pearson Education, Inc.

SymmetricD ata is symmetric if the left half of its

histogram is roughly a mirror image of its right half.

SkewedD ata is skewed if it is not symmetricand if it extends more to one side thanthe other.

D efinitions

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 59/102

Slide 59

Copyright © 2004 Pearson Education, Inc.

SkewnessFigure 2-11

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 60/102

Slide 60

Copyright © 2004 Pearson Education, Inc.

RecapIn this section we have discussed:

Types of Measures of Center MeanMedian

Mode

Mean from a frequency distribution

Weighted means

Best Measures of Center

Skewness

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 61/102

Slide 61

Copyright © 2004 Pearson Education, Inc.

Created by Tom Wegleitner, Centreville, Virginia

Section 2-5Measures of Variation

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 62/102

Slide 62

Copyright © 2004 Pearson Education, Inc.

Measures of Variation

Because this section introduces the conceptof variation, this is one of the most importantsections in the entire book

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 63/102

Slide 63

Copyright © 2004 Pearson Education, Inc.

D efinition

The range of a set of data is thedifference between the highestvalue and the lowest value

valuehighest lowestvalue

D f

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 64/102

Slide 64

Copyright © 2004 Pearson Education, Inc.

D efinition

The standard deviation of a set of sample values is a measure of variation of values about the mean

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 65/102

S l S d d D i i

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 66/102

Slide 66

Copyright © 2004 Pearson Education, Inc.

Sample Standard D eviation(Shortcut Formula)

Formula 2-5

n (n - 1)s = n (

7 x

2

) - (7 x )

2

St d d D i ti

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 67/102

Slide 67

Copyright © 2004 Pearson Education, Inc.

Standard D eviation -Key Points

The standard deviation is a measure of variation of all values from the mean

The value of the standard deviation s is usuallypositive

The value of the standard deviation s can increasedramatically with the inclusion of one or more

outliers (data values far away from all others)

The units of the standard deviation s are the same asthe units of the original data values

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 68/102

D fi i i

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 69/102

Slide 69

Copyright © 2004 Pearson Education, Inc.

Population variance: Square of the populationstandard deviation W

D efinition

The variance of a set of values is a measure of variation equal to the square of the standarddeviation.

Sample variance: Square of the sample standarddeviation s

V i N i

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 70/102

Slide 70

Copyright © 2004 Pearson Education, Inc.

Variance - N otation

standard deviation squared

sW

2

2 }N otationSample variance

Population variance

R d ff R l

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 71/102

Slide 71

Copyright © 2004 Pearson Education, Inc.

Round-off Rulefor Measures of Variation

Carry one more decimal place thanis present in the original set of data.

Round only the final answer, not values in

the middle of a calculation.

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 72/102

St d d D i ti f

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 73/102

Slide 73

Copyright © 2004 Pearson Education, Inc.

Standard D eviation from aFrequency D istribution

U se the class midpoints as the x values

Formula 2-6

n (n - 1) S =n [7

( f x 2

)]- [7

( f x

)]2

Estimation of Standard

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 74/102

Slide 74

Copyright © 2004 Pearson Education, Inc.

Estimation of StandardD eviation

Range Rule of Thumb

For estimating a value of the standard deviation s ,

U se

Where range = (highest value) ± (lowest value)

Range

4s }

Estimation of Standard

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 75/102

Slide 75

Copyright © 2004 Pearson Education, Inc.

Estimation of StandardD eviation

Range Rule of Thumb

For interpreting a known value of the standard deviation s ,find rough estimates of the minimum and maximum³usual´ values by using:

Minimum ³usual´ value (mean) ± 2 X (standard deviation)}

Maximum ³usual´ value (mean) + 2 X (standard deviation)}

D efinition

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 76/102

Slide 76

Copyright © 2004 Pearson Education, Inc.

D efinition

Empirical (68-95-99.7) Rule

For data sets having a distribution that is approximatelybell shaped, the following properties apply:

About 68% of all values fall within 1 standarddeviation of the mean

About 95% of all values fall within 2 standarddeviations of the mean

About 99.7% of all values fall within 3 standarddeviations of the mean

The Empirical Rule

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 77/102

Slide 77

Copyright © 2004 Pearson Education, Inc.

The Empirical Rule

FIG U RE 2-13

The Empirical Rule

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 78/102

Slide 78

Copyright © 2004 Pearson Education, Inc.

The Empirical Rule

FIG U RE 2-13

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 79/102

D efinition

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 80/102

Slide 80

Copyright © 2004 Pearson Education, Inc.

D efinition

Chebyshev¶s Theorem

The proportion (or fraction) of any set of data lyingwithin K standard deviations of the mean is always atleast 1-1/ K 2, where K is any positive number greater than 1.

For K = 2, at least 3/4 (or 75%) of all values liewithin 2 standard deviations of the mean

For K = 3, at least 8/9 (or 89%) of all values liewithin 3 standard deviations of the mean

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 81/102

Recap

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 82/102

Slide 82

Copyright © 2004 Pearson Education, Inc.

Recap

In this section we have looked at:Range

Standard deviation of a sample and population

Variance of a sample and populationCoefficient of Variation (CV)

Standard deviation using a frequency distribution

Range Rule of Thumb

Empirical D istributionChebyshev¶s Theorem

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 83/102

Slide 83

Copyright © 2004 Pearson Education, Inc.

Created by Tom Wegleitner, Centreville, Virginia

Section 2-6Measures of Relative

Standing

D efinition

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 84/102

Slide 84

Copyright © 2004 Pearson Education, Inc.

z Score (or standard score)

the number of standard deviationsthat a given value x is above or belowthe mean.

D efinition

Measures of Position

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 85/102

Slide 85

Copyright © 2004 Pearson Education, Inc.

Sample Population

x - µz =W

Round to 2 decimal places

Measures of Positionz score

z = x - x

s

Interpreting Z Scores

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 86/102

Slide 86

Copyright © 2004 Pearson Education, Inc.

Interpreting Z Scores

Whenever a value is less than the mean, its

corresponding z score is negativeOrdinary values: z score between ±2 and 2 sd

U nusual Values: z score < -2 or z score > 2 sd

FIG U RE 2-14

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 87/102

Q til

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 88/102

Slide 88

Copyright © 2004 Pearson Education, Inc.

Q 1, Q 2, Q 3divides ranked scores into four equal parts

Quartiles

25% 25% 25% 25%

Q3

Q2

Q1

(minimum) (maximum)

(median)

Percentiles

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 89/102

Slide 89

Copyright © 2004 Pearson Education, Inc.

Percentiles

Just as there are quartiles separating data

into four parts, there are 99 percentilesdenoted P 1, P 2, . . . P 99 , which partition thedata into 100 groups.

Finding the Percentile

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 90/102

Slide 90

Copyright © 2004 Pearson Education, Inc.

Finding the Percentileof a Given Score

Percentile of value x = 100number of values less than x

total number of values

Converting from the

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 91/102

Slide 91

Copyright © 2004 Pearson Education, Inc.

n total number of values in the data set

k percentile being used L locator that gives the position of a value

P k k th percentile

L = nk 100

N otation

Converting from thek th Percentile to the

CorrespondingD

ata Value

Converting

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 92/102

Slide 92

Copyright © 2004 Pearson Education, Inc.

Figure 2-15

gfrom the

k th Percentileto the

CorrespondingD ata Value

Some Other Statistics

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 93/102

Slide 93

Copyright © 2004 Pearson Education, Inc.

Interquartile Range (or IQR): Q 3 - Q 1

10 - 90 Percentile Range: P 90 - P 10

Semi-interquartile Range: 2

Q 3 - Q 1

Midquartile:

2

Q 3 + Q 1

Some Other Statistics

Recap

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 94/102

Slide 94

Copyright © 2004 Pearson Education, Inc.

Recap

In this section we have discussed:

z Scores

z Scores and unusual values

Quartiles

Percentiles

Converting a percentile to corresponding datavalues

Other statistics

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 95/102

Slide 95

Copyright © 2004 Pearson Education, Inc.

Created by Tom Wegleitner, Centreville, Virginia

Section 2-7Exploratory D ata Analysis

(E D A)

D efinition

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 96/102

Slide 96

Copyright © 2004 Pearson Education, Inc.

Exploratory D ata Analysis is theprocess of using statistical tools (such

as graphs, measures of center, andmeasures of variation) to investigatedata sets in order to understand their important characteristics

D efinition

D efinition

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 97/102

Slide 97

Copyright © 2004 Pearson Education, Inc.

D efinition

An outlier is a value that is located veryfar away from almost all the other

values

Important Principles

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 98/102

Slide 98

Copyright © 2004 Pearson Education, Inc.

Important Principles

An outlier can have a dramatic effect on themean

An outlier have a dramatic effect on thestandard deviation

An outlier can have a dramatic effect on thescale of the histogram so that the truenature of the distribution is totallyobscured

D efinitions

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 99/102

Slide 99

Copyright © 2004 Pearson Education, Inc.

For a set of data, the 5-number summary consistsof the minimum value; the first quartile Q 1; themedian (or second quartile Q 2); the third quartile,

Q 3; and the maximum value

A boxplot ( or box-and-whisker-diagram ) is agraph of a data set that consists of a lineextending from the minimum value to themaximum value, and a box with lines drawn at thefirst quartile, Q 1; the median; and the thirdquartile, Q 3

D efinitions

Boxplots

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 100/102

Slide 100

Copyright © 2004 Pearson Education, Inc.

Boxplots

Figure 2-16

Boxplots

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 101/102

Slide 101

Copyright © 2004 Pearson Education, Inc.

Figure 2-17

Boxplots

Slid 102Recap

8/7/2019 tes9e_ch02

http://slidepdf.com/reader/full/tes9ech02 102/102

Slide 102Recap

In this section we have looked at:

Exploratory D ata Analysis

Effects of outliers

5-number summary and boxplots