Upload
ankitahanda
View
237
Download
0
Embed Size (px)
Citation preview
8/14/2019 Chap003 (1).pptx
1/51
Business Statistics: Communicating with Numbers
By Sanjiv Jaggia and Alison Kelly
McGraw-Hill/Irwin Copyri ght 2013 by The McGraw-H il l Companies, In c. All ri ghts reserved.
8/14/2019 Chap003 (1).pptx
2/51
3-2
Chapter 3 Learning Objectives (LOs)
LO 3.1:Calculate and interpret the arithmetic mean,the median, and the mode.
LO 3.2:Calculate and interpret percentiles and a box
plot.LO 3.3:Calculate and interpret a geometric mean
return and an average growth rate.
LO 3.4: Calculate and interpret the range, the meanabsolute deviation, the variance, thestandard deviation, and the coefficient ofvariation.
8/14/2019 Chap003 (1).pptx
3/51
3-3
Chapter 3 Learning Objectives (LOs)
LO 3.5:Explain mean-variance analysis andthe Sharpe ratio.
LO 3.6:Apply Chebyshevs Theorem and the
empirical rule.LO 3.7:Calculate the mean and the variance
for grouped data.
LO 3.8:Calculate and interpret the covarianceand the correlation coefficient.
8/14/2019 Chap003 (1).pptx
4/51
3-4
As an investment counselor at a large bank,Rebecca Johnson was asked by aninexperienced investor to explain the differencesbetween two top-performing mutual funds:
Vanguards Precious Metals and Mining fund(Metals)
Fidelitys Strategic Income Fund (Income)
The investor has collected sample returns forthese two funds for years 2000 through 2009.These data are presented in the next slide.
Investment Decision
8/14/2019 Chap003 (1).pptx
5/51
3-5
Investment Decision
Rebecca would like to
1. Determine the typical return of the mutual
funds.2. Evaluate the investment risk of the mutual
funds.
8/14/2019 Chap003 (1).pptx
6/51
8/14/2019 Chap003 (1).pptx
7/513-7
3.1 Measures of Central Location
Example: Investment Decision
Use the data in the introductory case to calculate andinterpret the mean return of the Metals fund and themean return of the Income fund.
LO 3.1
8/14/2019 Chap003 (1).pptx
8/513-8
3.1 Measures of Central LocationLO 3.1
The mean is sensitive to outliers.
Consider the salaries of employees at Acetech.
This mean does not reflect the typical salary!
8/14/2019 Chap003 (1).pptx
9/513-9
3.1 Measures of Central Location
The medianis another measure of central locationthat is not affected by outliers.
When the data are arranged in ascending order,
the median is
the middle value if the number of observations isodd, or
the average of the two middle values if thenumber of observations is even.
LO 3.1
8/14/2019 Chap003 (1).pptx
10/513-10
3.1 Measures of Central Location Consider the sorted salaries of employees at
Acetech (odd number).
Median = 90,000
Consider the sorted data from the Metals funds ofthe introductory case study (even number).
Median = (33.35 + 34.30) / 2 = 33.83%.
LO 3.1
3 values above3 values below
8/14/2019 Chap003 (1).pptx
11/513-11
3.1 Measures of Central Location The modeis another measure of central location.
The most frequently occurring value in a data set
Used to summarize qualitative data
A data set can have no mode, one mode
(unimodal), or many modes (multimodal).
LO 3.1
Consider the salary of employees at Acetech
The mode is $40,000 since this value appears mostoften.
8/14/2019 Chap003 (1).pptx
12/513-12
3.2 Percentiles and Box Plots
In general, thepth percentiledivides a data set intotwo parts:
Approximatelyp percent of the observations have
values less than thepth percentile; Approximately (100 p) percent of the
observations have values greater than thepthpercentile.
LO 3.2 Calculate and interpret percentiles and a box plot.
8/14/2019 Chap003 (1).pptx
13/513-13
3.2 Percentiles and Box Plots
Calculating thepth percentile:
First arrange the data in ascending order.
Locate the position, Lp, of thepth percentile byusing the formula:
We use this position to find the percentile as
shown next.
LO 3.2
1100
p
pL n
8/14/2019 Chap003 (1).pptx
14/513-14
3.2 Percentiles and Box Plots
Consider the sorted data from the introductory case.
For the 25th percentile, we locate the position:
Similarly, for the 75thpercentile, we first find:
LO 3.2
25 1100
pL n
2510 1 2.75
100
75 1100
pL n
7510 1 8.25
100
8/14/2019 Chap003 (1).pptx
15/513-15
3.2 Percentiles and Box Plots
Calculating thepth percentile
Once you find Lp, observe whether or not itis aninteger.
If Lp
is an integer, then the Lp
thobservationin thesorted data set is thepth percentile.
If Lp is not an integer, then interpolate betweentwo corresponding observations to approximate
thepth percentile.
LO 3.2
8/14/2019 Chap003 (1).pptx
16/513-16
3.2 Percentiles and Box Plots
Both L25 = 2.75 and L75 = 8.25 are not integers, thus
The 25th percentile is located 75% of the distancebetween the second and third observations, and it is
The 75th percentile is located 25% of the distancebetween the eighth and ninth observations, and it is
LO 3.2
7.34 0.75(8.09 ( 7.34)) 4.23
43.79 0.25(59.45 43.79) 47.71
8/14/2019 Chap003 (1).pptx
17/513-17
A box plot allows you to:
Graphically display the distribution of a data set.
Compare two or more distributions.
Identify outliers in a data set.
Box
* *
WhiskersOutliers
3.2 Percentiles and Box PlotsLO 3.2
8/14/2019 Chap003 (1).pptx
18/513-18
3.2 Percentiles and Box Plots
The box plot displays 5 summary values:
S = smallest value
L = largest value
Q1 = first quartile = 25th percentile
Q2 = median =second quartile = 50th percentile
Q3 = third quartile = 75th percentile
LO 3.2
8/14/2019 Chap003 (1).pptx
19/513-19
Using the results obtained from the Metals fund
data, we can label the box plot with the 5summary values:
Note that IQR = Q3 Q1 =43.48
3.2 Percentiles and Box PlotsLO 3.2
Smallest
Value
Largest
Value
First
Quartile
Third
Quartile
SecondQuartile
47.71 4.23 = 43.48
8/14/2019 Chap003 (1).pptx
20/513-20
3.2 Percentiles and Box Plots
Detecting outliers Calculate IQR = 43.48
Calculate 1.5 IQR, or 1.5 43.48 = 65.22
There are outliers if
Q1S > 65.22, or if
LQ3 > 65.22
There are no outliers in this data set.
8/14/2019 Chap003 (1).pptx
21/513-21
3.3 The Geometric Mean
Remember that the arithmetic mean is an additiveaverage measurement.
Ignores the effects of compounding.
The geometric meanis a multiplicative averagethat incorporate compounding. It is used to
measure: Average investment returns over several years,
Average growth rates.
LO 3.3 Calculate and interpret a geometric mean return and an
average growth rate.
8/14/2019 Chap003 (1).pptx
22/513-22
3.3 The Geometric Mean
For multiperiod returns R1, R2, . . . , Rn, thegeometric mean return GRis calculated as:
where nis the number of multiperiod returns.
LO 3.3
8/14/2019 Chap003 (1).pptx
23/51
3-23
3.3 The Geometric Mean
Using the data from the Metals and Income funds,we can calculate the geometric mean returns:
LO 3.3
8/14/2019 Chap003 (1).pptx
24/51
3-24
3.3 The Geometric Mean Computing an average growth rate
For growth rates g1, g2, . . . , gn, the average growth rateGgis calculated as
where nis the number of multiperiod growth rates.
For observationsx1,x2, . . . ,xn, the average growth rateGgis calculated as
where n1 is the number of distinct growth rates.
LO 3.3
8/14/2019 Chap003 (1).pptx
25/51
3-25
3.3 The Geometric Mean For example, consider the sales for Adidas (in
millions of) for the years 2005 through 2009:
Annual growth rates are:
LO 3.3
The average growth rate
using the simplifiedformula is:
8/14/2019 Chap003 (1).pptx
26/51
3-26
3.4 Measures of Dispersion
Measures of dispersion gauge the variability of adata set.
Measures of dispersion include:
Range
Mean Absolute Deviation (MAD)
Variance and Standard Deviation Coefficient of Variation (CV)
LO 3.4 Calculate and interpret the range, the mean absolute deviation,
the variance, the standard deviation, and the coefficient of variation.
8/14/2019 Chap003 (1).pptx
27/51
3-27
3.4 Measures of Dispersion
Range
It is the simplest measure.
It is focusses on extreme values.
LO 3.4
Range Maximum Value Minimum Value
Calculate the range using the data from the Metalsand Income funds
8/14/2019 Chap003 (1).pptx
28/51
3-28
3.4 Measures of Dispersion
Mean Absolute Deviation (MAD)
MAD is an average of the absolute difference ofeach observation from the mean.
LO 3.4
Sample MAD ix xn
Population MAD
ix
N
m
8/14/2019 Chap003 (1).pptx
29/51
3-29
3.4 Measures of Dispersion
Calculate MAD using the data from the Metalsfund.
8/14/2019 Chap003 (1).pptx
30/51
3-30
3.4 Measures of Dispersion
Varianceand standard deviation
For a given sample,
For a given population,
LO 3.4
2
2 2 and1
ix xs s sn
2
2 2 andix
N
m
8/14/2019 Chap003 (1).pptx
31/51
3-31
3.4 Measures of Dispersion
Calculate the variance and the standarddeviation using the data from the Metals fund.
8/14/2019 Chap003 (1).pptx
32/51
8/14/2019 Chap003 (1).pptx
33/51
3-33
Calculate the coefficient of variation (CV)using the data from the Metals fund and theIncome fund.
Metals fund: CV
Income fund: CV
3.4 Measures of DispersionLO 3.4
8/14/2019 Chap003 (1).pptx
34/51
3-34
Mean and median returns for the Metals fund are
24.65% and 33.83%, respectively.
Mean and median returns for the Income fund are8.51% and 7.34%, respectively.
The standard deviation for the Metals fund and theIncome fund are 37.13% and 11.07%, respectively.
The coefficient of variation for the Metals fund andthe Income fund are 1.51 and 1.30, respectively.
Synopsis of Investment Decision
8/14/2019 Chap003 (1).pptx
35/51
3-35
3.5 Mean-Variance Analysis and the Sharpe Ratio
Mean-variance analysis:
The performance of an asset is measured by its rate ofreturn.
The rate of return may be evaluated in terms of its reward(mean) and risk (variance).
Higher average returns are often associated with higherrisk.
The Sharpe ratio uses the mean and variance toevaluate risk.
LO 3.5 Explain mean-variance analysis and the Sharpe Ratio.
8/14/2019 Chap003 (1).pptx
36/51
3-36
3.5 Mean-Variance Analysis and the
Sharpe Ratio
Sharpe Ratio
Measures the extra reward per unit of risk.
For an investment , the Sharpe ratio is computed as:
where is the mean return for the investment
is the mean return for a risk-free assetis the standard deviation for the investment
Sharpe Ratiox R
s
LO 3.5
8/14/2019 Chap003 (1).pptx
37/51
3-37
3.5 Mean-Variance Analysis and the
Sharpe Ratio
Sharpe Ratio Example Compute the Sharpe ratios for the Metals and Income
funds given the risk free return of 4%.
Since 0.56 > 0.41, the Metals fund offers more reward perunit of risk as compared to the Income fund.
LO 3.5
8/14/2019 Chap003 (1).pptx
38/51
3-38
3.6 Chebyshevs Theorem and the Empirical Rule
Chebyshevs Theorem For any data set, the proportion of observations that lie
within k standard deviations from the mean is at least11/k2 , where k is any number greater than 1.
Consider a large lecture class with 280 students. The meanscore on an exam is 74 with a standard deviation of 8. Atleast how many students scored within 58 and 90?
With k = 2, we have 11/22= 0.75. At least 75% of 280 or210 students scored within 58 and 90.
LO 3.6 Apply Chebyshevs Theorem and the empirical rule.
8/14/2019 Chap003 (1).pptx
39/51
3-39
3.6 Chebyshevs Theorem and the Empirical RuleLO 3.6
The Empirical Rule:
Approximately 68% of all observations fall in theinterval .
Approximately 95% of all
observations fall in theinterval .
Almost all observationsfall in the interval
.
x s
2x s
3x s
8/14/2019 Chap003 (1).pptx
40/51
3-40
Reconsider the example of the lecture class with
280 students with a mean score of 74 and astandard deviation of 8. Assume that the distributionis symmetric and bell-shaped. Approximately howmany students scored within 58 and 90?
The score 58 is two standard deviations below themean while the score 90 is two standarddeviations above the mean.
Therefore about 95% of 280 students, or0.95(280) = 266 students, scored within 58 and90.
3.6 Chebyshevs Theorem and the Empirical RuleLO 3.6
8/14/2019 Chap003 (1).pptx
41/51
3-41
3.7 Summarizing Grouped Data
When data are grouped or aggregated, we usethese formulas:
where miand iare the midpoint and frequency of the ithclass, respectively.
LO 3.7 Calculate the mean and the variance for grouped data.
2
2
2
Mean:
Variance:1
Standard Deviation:
i i
i i
mx
nm x
sn
s s
8/14/2019 Chap003 (1).pptx
42/51
8/14/2019 Chap003 (1).pptx
43/51
3-43
3.7 Summarizing Grouped DataLO 3.7
Calculate the sample variance and the standard
deviation.
First calculate the sum of the weighted squareddifferences from the mean.
Dividing this sum by (n1) = 361 = 35 yields a variance of10.635($)2.
The square root of the variance yields a standard deviationof $103.13.
8/14/2019 Chap003 (1).pptx
44/51
3-44
3.7 Summarizing Grouped DataLO 3.7
Weighted Mean
Let w1, w2, . . . , wn denote the weights of the
sample observationsx1,x2, . . . ,xnsuch that, then
i ix w x
1 2 1
nw w w
8/14/2019 Chap003 (1).pptx
45/51
3-45
3.7 Summarizing Grouped DataLO 3.7
A student scores 60 on Exam 1, 70 on Exam 2, and
80 on Exam 3. What is the students average scorefor the course if Exams 1, 2, and 3 are worth 25%,25%, and 50% of the grade, respectively?
Define w1= 0.25, w2= 0.25, and w3= 0.50.
The unweighted mean is only 70 as it does notincorporate the higher weight given to the score onExam 3.
0.25(60) 0.25(70) 0.5(80) 72.50i ix w x
8/14/2019 Chap003 (1).pptx
46/51
3-46
3.8 Covariance and Correlation
The covariance(sxyor xy) describes thedirection of the linear relationship between
two variables,xand y.
The correlation coefficient (rxyor rxy)
describes both the direction and strength ofthe relationship betweenxand y.
LO 3.8 Calculate and interpret the covariance and the correlation coefficient.
8/14/2019 Chap003 (1).pptx
47/51
3-47
3.8 Covariance and CorrelationLO 3.8
The sample covariance sxyis computed as
1
i i
xy
x x y ys
n
The population covariance xyis computed as
i x i y xy
x y
N
m m
8/14/2019 Chap003 (1).pptx
48/51
3-48
3.8 Covariance and CorrelationLO 3.8
The sample correlation rxyis computed as
xy
xy
x y
sr
s s
The population correlation rxyis computed as
Note, 1 < rxy< +1 or 1 < rxy< +1
xy
xy
x y
r
8/14/2019 Chap003 (1).pptx
49/51
3-49
3.8 Covariance and CorrelationLO 3.8
Let s calculate the covariance and the correlation
coefficient for the Metals (x) and Income (y) funds.
Positive Relationship
Also recall: 24.65, 37.13, 8.51, 11.07x yx s y s
8/14/2019 Chap003 (1).pptx
50/51
3-50
3.8 Covariance and CorrelationLO 3.8
We use the following table for the calculations.
Covariance:
Correlation:
8/14/2019 Chap003 (1).pptx
51/51
S Some Excel CommandsLOs 3.1, 3.4, and 3.8
Measures of central location and dispersion: select
the relevant data, and choose Data > Data Analysis> Descriptive Statistics.
Covariance: For sample covariance, chooseFormulas > Insert Function > COVARIANCE.S. Forpopulation covariance, use COVARIANCE.P.
Correlation: For sample correlation or populationcorrelation, choose Formulas > Insert Function >CORREL.