Upload
yuliandriansyah
View
217
Download
0
Embed Size (px)
Citation preview
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 1/102
Slide 1
Copyright © 2004 Pearson Education, Inc.
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 2/102
Slide 2
Copyright © 2004 Pearson Education, Inc.
Chapter 2Describing, Exploring, and
Comparing Data2-1 Overview
2-2 Frequency Distributions
2-3 Visualizing Data
2-4 Measures of Center
2-5 Measures of Variation
2-6 Measures of Relative Standing2-7 Exploratory Data Analysis
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 3/102
Slide 3
Copyright © 2004 Pearson Education, Inc.
Created by Tom Wegleitner, Centreville, Virginia
Section 2-1Overview
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 4/102
Slide 4
Copyright © 2004 Pearson Education, Inc.
D escriptive Statistics
summarize or describe the important
characteristics of a known set of population data
Inferential Statisticsuse sample data to make inferences (or generalizations) about a population
Overview
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 5/102
Slide 5
Copyright © 2004 Pearson Education, Inc.
1. Center : A representative or average value thatindicates where the middle of the data set is located
2. Variation : A measure of the amount that the values
vary among themselves
3. D istribution : The nature or shape of the distributionof data (such as bell-shaped, uniform, or skewed)
4. Outliers : Sample values that lie very far away fromthe vast majority of other sample values
5. Time : Changing characteristics of the data over
time
Important Characteristics of D ata
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 6/102
Slide 6
Copyright © 2004 Pearson Education, Inc.
Created by Tom Wegleitner, Centreville, Virginia
Section 2-2Frequency D istributions
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 7/102
Slide 7
Copyright © 2004 Pearson Education, Inc.
Frequency D istribution
lists data values (either individually or bygroups of intervals), along with their corresponding frequencies or counts
Frequency D istributions
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 8/102
Slide 8
Copyright © 2004 Pearson Education, Inc.
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 9/102
Slide 9
Copyright © 2004 Pearson Education, Inc.
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 10/102
Slide 10
Copyright © 2004 Pearson Education, Inc.
Frequency D istributions
Definitions
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 12/102
Slide 12
Copyright © 2004 Pearson Education, Inc.
are the smallest numbers that can actually belong todifferent classes
L ower ClassL imits
L ower Class L imits
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 13/102
Slide 13
Copyright © 2004 Pearson Education, Inc.
U pper Class L imits
are the largest numbers that can actually belong todifferent classes
U pper ClassL imits
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 14/102
Slide 14
Copyright © 2004 Pearson Education, Inc.
are the numbers used to separate classes, butwithout the gaps created by class limits
Class Boundaries
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 15/102
Slide 15
Copyright © 2004 Pearson Education, Inc.
number separating classes
Class Boundaries
- 0.5
99.5
199.5
299.5
399.5
499.5
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 16/102
Slide 16
Copyright © 2004 Pearson Education, Inc.
Class Boundaries
number separating classes
Class
Boundaries
- 0.5
99.5
199.5
299.5
399.5
499.
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 17/102
Slide 17
Copyright © 2004 Pearson Education, Inc.
midpoints of the classes
Class Midpoints
Class midpoints can be found byadding the lower class limit to the
upper class limit and dividing the sumby two.
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 18/102
Slide 18
Copyright © 2004 Pearson Education, Inc.
ClassMidpoints
midpoints of the classes
Class Midpoints
49.5
149.5
249.5
349.5
449.5
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 19/102
Slide 19
Copyright © 2004 Pearson Education, Inc.
Class Widthis the difference between two consecutive lower class limitsor two consecutive lower class boundaries
ClassWidth
100
100
100
100100
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 20/102
Slide 20
Copyright © 2004 Pearson Education, Inc.
1. L arge data sets can be summarized.
2. Can gain some insight into the nature of data.
3. Have a basis for constructing graphs.
Reasons for ConstructingFrequency D istributions
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 21/102
Slide 21
Copyright © 2004 Pearson Education, Inc.
3. Starting point: Begin by choosing a lower limit of the first class.
4. U sing the lower limit of the first class and class width, proceed tolist the lower class limits.
5. L ist the lower class limits in a vertical column and proceed to
enter the upper class limits.6. Go through the data set putting a tally in the appropriate class for
each data value.
Constructing A Frequency Table
1. D ecide on the number of classes (should be between 5 and 20) .
2. Calculate (round up).
class width } (highest value) ± (lowest value)number of classes
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 22/102
Slide 22
Copyright © 2004 Pearson Education, Inc.
Relative Frequency D istribution
relative frequency = class frequencysum of all frequencies
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 23/102
Slide 23
Copyright © 2004 Pearson Education, Inc.
Relative Frequency D istribution
11/40 = 28%
12/40 = 40%
etc.
Total Frequency = 40
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 24/102
Slide 24
Copyright © 2004 Pearson Education, Inc.
Cumulative FrequencyD istribution
CumulativeFrequencies
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 25/102
Slide 25
Copyright © 2004 Pearson Education, Inc.
Frequency Tables
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 26/102
Slide 26
Copyright © 2004 Pearson Education, Inc.
Recap
In this Section we have discussed
Important characteristics of data
Frequency distributionsProcedures for constructing frequency distributions
Relative frequency distributions
Cumulative frequency distributions
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 27/102
Slide 27
Copyright © 2004 Pearson Education, Inc.
Created by Tom Wegleitner, Centreville, Virginia
Section 2-3Visualizing D ata
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 28/102
Slide 28
Copyright © 2004 Pearson Education, Inc.
Visualizing D ata
D epict the nature of shape or shape of the data distribution
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 29/102
Slide 29
Copyright © 2004 Pearson Education, Inc.
Histogram
A bar graph in which the horizontal scale represents
the classes of data values and the vertical scalerepresents the frequencies.
Figure 2-1
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 30/102
Slide 30
Copyright © 2004 Pearson Education, Inc.
Relative Frequency Histogram
Has the same shape and horizontal scale as ahistogram, but the vertical scale is marked withrelative frequencies.
Figure 2-2
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 31/102
Slide 31
Copyright © 2004 Pearson Education, Inc.
Histogramand
Relative Frequency Histogram
Figure 2-1 Figure 2-2
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 32/102
Slide 32
Copyright © 2004 Pearson Education, Inc.
Frequency Polygon
U ses line segments connected to points directlyabove class midpoint values
Figure 2-3
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 33/102
Slide 33
Copyright © 2004 Pearson Education, Inc.
Ogive
A line graph that depicts cumulative frequencies
Figure 2-4
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 35/102
Slide 35
Copyright © 2004 Pearson Education, Inc.
Stem-and L eaf Plot
Represents data by separating each value into twoparts: the stem (such as the leftmost digit) and theleaf (such as the rightmost digit)
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 36/102
Slide 36
Copyright © 2004 Pearson Education, Inc.
Pareto Chart
A bar graph for qualitative data, with the barsarranged in order according to frequencies
Figure 2-6
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 37/102
Slide 37
Copyright © 2004 Pearson Education, Inc.
Pie Chart
A graph depicting qualitative data as slices pf a pie
Figure 2-7
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 38/102
Slide 38
Copyright © 2004 Pearson Education, Inc.
Scatter D iagram
A plot of paired (x,y) data with a horizontal x-axisand a vertical y-axis
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 39/102
Slide 39
Copyright © 2004 Pearson Education, Inc.
Time-Series Graph
D ata that have been collected at different points intime
Figure 2-8
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 40/102
Slide 40
Copyright © 2004 Pearson Education, Inc.
Other Graphs
Figure 2-9
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 41/102
Slide 41
Copyright © 2004 Pearson Education, Inc.
Recap
In this Section we have discussed graphsthat are pictures of distributions.
Keep in mind that the object of this section isnot just to construct graphs, but to learn
something about the data sets ± that is, tounderstand the nature of their distributions.
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 42/102
Slide 42
Copyright © 2004 Pearson Education, Inc.
Created by Tom Wegleitner, Centreville, Virginia
Section 2-4Measures of Center
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 43/102
Slide 43
Copyright © 2004 Pearson Education, Inc.
D efinition
Measure of Center The value at the center or middleof a data set
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 45/102
Slide 45
Copyright © 2004 Pearson Education, Inc.
N otation
7 denotes the addition of a set of values
x is the variable usually used to represent the individualdata values
n represents the number of values in a sample
N represents the number of values in a population
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 46/102
Slide 46
Copyright © 2004 Pearson Education, Inc.
N otation
µ is pronounced µmu¶ and denotes the mean of all valuesin a population
x =n
7 x is pronounced µx-bar¶ and denotes the mean of a setof sample values x
N µ =
7 x
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 47/102
Slide 47
Copyright © 2004 Pearson Education, Inc.
D efinitionsMedian
the middle value when the originaldata values are arranged in order of
increasing (or decreasing) magnitudeoften denoted by x (pronounced µx-tilde¶)~
is not affected by an extreme value
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 48/102
Slide 48
Copyright © 2004 Pearson Education, Inc.
Finding the Median
If the number of values is odd, themedian is the number located in theexact middle of the list
If the number of values is even, themedian is found by computing the
mean of the two middle numbers
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 49/102
Slide 49
Copyright © 2004 Pearson Education, Inc.
5.40 1.10 0.42 0.73 0.48 1.10 0.66
0.42 0.48 0.66 0.73 1.10 1.10 5.40
(in order - odd number of values)
exact middle MED IAN is 0.73
5.40 1.10 0.42 0.73 0.48 1.10
0.42 0.48 0.73 1.10 1.10 5.40
0.73 + 1.10
2
(even number of values ± no exact middleshared by two numbers)
MED IAN is 0.915
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 50/102
Slide 50
Copyright © 2004 Pearson Education, Inc.
D efinitionsMode
the value that occurs most frequently
The mode is not always unique. A data set may be:
BimodalMultimodalN o Mode
denoted by Mthe only measure of central tendency that
can be used with nominal data
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 51/102
Slide 51
Copyright © 2004 Pearson Education, Inc.
a. 5.40 1.10 0.42 0.73 0.48 1.10
b. 27 27 27 55 55 55 88 88 99
c. 1 2 3 6 7 8 9 10
Examples
Mode is 1.10
Bimodal - 27 & 55
N o Mode
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 52/102
Slide 52
Copyright © 2004 Pearson Education, Inc.
Midrange
the value midway between the highestand lowest values in the original dataset
D efinitions
Midrange = highest score + lowest score
2
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 53/102
Slide 53
Copyright © 2004 Pearson Education, Inc.
Carry one more decimal place than ispresent in the original set of values
Round-off Rule for Measures of Center
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 54/102
Slide 54
Copyright © 2004 Pearson Education, Inc.
Assume that in each class, all samplevalues are equal to the class
midpoint
Mean from a FrequencyD istribution
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 55/102
Slide 55
Copyright © 2004 Pearson Education, Inc.
use class midpoint of classes for variable x
Mean from a FrequencyD istribution
x = class midpoint
f = frequency
7 f = n
x = Formula 2-2 f
7 ( f x)7
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 56/102
Slide 56
Copyright © 2004 Pearson Education, Inc.
Weighted Mean
x =
w
7 (w x)
7
In some cases, values vary in their degree of importance, so they are weighted accordingly
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 57/102
Slide 57
Copyright © 2004 Pearson Education, Inc.
Best Measure of Center
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 58/102
Slide 58
Copyright © 2004 Pearson Education, Inc.
SymmetricD ata is symmetric if the left half of its
histogram is roughly a mirror image of its right half.
SkewedD ata is skewed if it is not symmetricand if it extends more to one side thanthe other.
D efinitions
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 59/102
Slide 59
Copyright © 2004 Pearson Education, Inc.
SkewnessFigure 2-11
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 60/102
Slide 60
Copyright © 2004 Pearson Education, Inc.
RecapIn this section we have discussed:
Types of Measures of Center MeanMedian
Mode
Mean from a frequency distribution
Weighted means
Best Measures of Center
Skewness
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 61/102
Slide 61
Copyright © 2004 Pearson Education, Inc.
Created by Tom Wegleitner, Centreville, Virginia
Section 2-5Measures of Variation
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 62/102
Slide 62
Copyright © 2004 Pearson Education, Inc.
Measures of Variation
Because this section introduces the conceptof variation, this is one of the most importantsections in the entire book
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 63/102
Slide 63
Copyright © 2004 Pearson Education, Inc.
D efinition
The range of a set of data is thedifference between the highestvalue and the lowest value
valuehighest lowestvalue
D f
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 64/102
Slide 64
Copyright © 2004 Pearson Education, Inc.
D efinition
The standard deviation of a set of sample values is a measure of variation of values about the mean
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 66/102
Slide 66
Copyright © 2004 Pearson Education, Inc.
Sample Standard D eviation(Shortcut Formula)
Formula 2-5
n (n - 1)s = n (
7 x
2
) - (7 x )
2
St d d D i ti
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 67/102
Slide 67
Copyright © 2004 Pearson Education, Inc.
Standard D eviation -Key Points
The standard deviation is a measure of variation of all values from the mean
The value of the standard deviation s is usuallypositive
The value of the standard deviation s can increasedramatically with the inclusion of one or more
outliers (data values far away from all others)
The units of the standard deviation s are the same asthe units of the original data values
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 69/102
Slide 69
Copyright © 2004 Pearson Education, Inc.
Population variance: Square of the populationstandard deviation W
D efinition
The variance of a set of values is a measure of variation equal to the square of the standarddeviation.
Sample variance: Square of the sample standarddeviation s
V i N i
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 70/102
Slide 70
Copyright © 2004 Pearson Education, Inc.
Variance - N otation
standard deviation squared
sW
2
2 }N otationSample variance
Population variance
R d ff R l
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 71/102
Slide 71
Copyright © 2004 Pearson Education, Inc.
Round-off Rulefor Measures of Variation
Carry one more decimal place thanis present in the original set of data.
Round only the final answer, not values in
the middle of a calculation.
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 73/102
Slide 73
Copyright © 2004 Pearson Education, Inc.
Standard D eviation from aFrequency D istribution
U se the class midpoints as the x values
Formula 2-6
n (n - 1) S =n [7
( f x 2
)]- [7
( f x
)]2
Estimation of Standard
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 74/102
Slide 74
Copyright © 2004 Pearson Education, Inc.
Estimation of StandardD eviation
Range Rule of Thumb
For estimating a value of the standard deviation s ,
U se
Where range = (highest value) ± (lowest value)
Range
4s }
Estimation of Standard
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 75/102
Slide 75
Copyright © 2004 Pearson Education, Inc.
Estimation of StandardD eviation
Range Rule of Thumb
For interpreting a known value of the standard deviation s ,find rough estimates of the minimum and maximum³usual´ values by using:
Minimum ³usual´ value (mean) ± 2 X (standard deviation)}
Maximum ³usual´ value (mean) + 2 X (standard deviation)}
D efinition
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 76/102
Slide 76
Copyright © 2004 Pearson Education, Inc.
D efinition
Empirical (68-95-99.7) Rule
For data sets having a distribution that is approximatelybell shaped, the following properties apply:
About 68% of all values fall within 1 standarddeviation of the mean
About 95% of all values fall within 2 standarddeviations of the mean
About 99.7% of all values fall within 3 standarddeviations of the mean
The Empirical Rule
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 77/102
Slide 77
Copyright © 2004 Pearson Education, Inc.
The Empirical Rule
FIG U RE 2-13
The Empirical Rule
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 78/102
Slide 78
Copyright © 2004 Pearson Education, Inc.
The Empirical Rule
FIG U RE 2-13
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 80/102
Slide 80
Copyright © 2004 Pearson Education, Inc.
D efinition
Chebyshev¶s Theorem
The proportion (or fraction) of any set of data lyingwithin K standard deviations of the mean is always atleast 1-1/ K 2, where K is any positive number greater than 1.
For K = 2, at least 3/4 (or 75%) of all values liewithin 2 standard deviations of the mean
For K = 3, at least 8/9 (or 89%) of all values liewithin 3 standard deviations of the mean
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 82/102
Slide 82
Copyright © 2004 Pearson Education, Inc.
Recap
In this section we have looked at:Range
Standard deviation of a sample and population
Variance of a sample and populationCoefficient of Variation (CV)
Standard deviation using a frequency distribution
Range Rule of Thumb
Empirical D istributionChebyshev¶s Theorem
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 83/102
Slide 83
Copyright © 2004 Pearson Education, Inc.
Created by Tom Wegleitner, Centreville, Virginia
Section 2-6Measures of Relative
Standing
D efinition
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 84/102
Slide 84
Copyright © 2004 Pearson Education, Inc.
z Score (or standard score)
the number of standard deviationsthat a given value x is above or belowthe mean.
D efinition
Measures of Position
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 85/102
Slide 85
Copyright © 2004 Pearson Education, Inc.
Sample Population
x - µz =W
Round to 2 decimal places
Measures of Positionz score
z = x - x
s
Interpreting Z Scores
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 86/102
Slide 86
Copyright © 2004 Pearson Education, Inc.
Interpreting Z Scores
Whenever a value is less than the mean, its
corresponding z score is negativeOrdinary values: z score between ±2 and 2 sd
U nusual Values: z score < -2 or z score > 2 sd
FIG U RE 2-14
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 88/102
Slide 88
Copyright © 2004 Pearson Education, Inc.
Q 1, Q 2, Q 3divides ranked scores into four equal parts
Quartiles
25% 25% 25% 25%
Q3
Q2
Q1
(minimum) (maximum)
(median)
Percentiles
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 89/102
Slide 89
Copyright © 2004 Pearson Education, Inc.
Percentiles
Just as there are quartiles separating data
into four parts, there are 99 percentilesdenoted P 1, P 2, . . . P 99 , which partition thedata into 100 groups.
Finding the Percentile
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 90/102
Slide 90
Copyright © 2004 Pearson Education, Inc.
Finding the Percentileof a Given Score
Percentile of value x = 100number of values less than x
total number of values
Converting from the
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 91/102
Slide 91
Copyright © 2004 Pearson Education, Inc.
n total number of values in the data set
k percentile being used L locator that gives the position of a value
P k k th percentile
L = nk 100
N otation
Converting from thek th Percentile to the
CorrespondingD
ata Value
Converting
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 92/102
Slide 92
Copyright © 2004 Pearson Education, Inc.
Figure 2-15
gfrom the
k th Percentileto the
CorrespondingD ata Value
Some Other Statistics
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 93/102
Slide 93
Copyright © 2004 Pearson Education, Inc.
Interquartile Range (or IQR): Q 3 - Q 1
10 - 90 Percentile Range: P 90 - P 10
Semi-interquartile Range: 2
Q 3 - Q 1
Midquartile:
2
Q 3 + Q 1
Some Other Statistics
Recap
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 94/102
Slide 94
Copyright © 2004 Pearson Education, Inc.
Recap
In this section we have discussed:
z Scores
z Scores and unusual values
Quartiles
Percentiles
Converting a percentile to corresponding datavalues
Other statistics
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 95/102
Slide 95
Copyright © 2004 Pearson Education, Inc.
Created by Tom Wegleitner, Centreville, Virginia
Section 2-7Exploratory D ata Analysis
(E D A)
D efinition
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 96/102
Slide 96
Copyright © 2004 Pearson Education, Inc.
Exploratory D ata Analysis is theprocess of using statistical tools (such
as graphs, measures of center, andmeasures of variation) to investigatedata sets in order to understand their important characteristics
D efinition
D efinition
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 97/102
Slide 97
Copyright © 2004 Pearson Education, Inc.
D efinition
An outlier is a value that is located veryfar away from almost all the other
values
Important Principles
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 98/102
Slide 98
Copyright © 2004 Pearson Education, Inc.
Important Principles
An outlier can have a dramatic effect on themean
An outlier have a dramatic effect on thestandard deviation
An outlier can have a dramatic effect on thescale of the histogram so that the truenature of the distribution is totallyobscured
D efinitions
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 99/102
Slide 99
Copyright © 2004 Pearson Education, Inc.
For a set of data, the 5-number summary consistsof the minimum value; the first quartile Q 1; themedian (or second quartile Q 2); the third quartile,
Q 3; and the maximum value
A boxplot ( or box-and-whisker-diagram ) is agraph of a data set that consists of a lineextending from the minimum value to themaximum value, and a box with lines drawn at thefirst quartile, Q 1; the median; and the thirdquartile, Q 3
D efinitions
Boxplots
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 100/102
Slide 100
Copyright © 2004 Pearson Education, Inc.
Boxplots
Figure 2-16
Boxplots
8/7/2019 tes9e_ch02
http://slidepdf.com/reader/full/tes9ech02 101/102
Slide 101
Copyright © 2004 Pearson Education, Inc.
Figure 2-17
Boxplots
Slid 102Recap