Business Statistics Outline Dealing with decision problem when the face of uncertainty are...

Business Statistics

Outline

Dealing with decision problem when the face of uncertainty are important.

Descriptive Statistics

Sampling and Sampling Distributions

Point and Interval Estimation

Hypothesis Testing

Non-parametric Test - Chi-square Test

Analysis of Variance

Outline (cont.)

Time Series and Forecasting

Survey and sampling methods

Multivariate Analysis

Bayesian Statistics and Decision Analysis

Session 1

Population and sampleMeasures of Central Tendency

Mean, Median, Mode

Measures of Dispersion Variance, Standard deviation

Percentile, Inter-quartile range

Grouped data and histogramOther data representations

Population and Sample

• Population The population consists of the set of all measurements in which the investigator is interested. The population is also called the universe.

• Sample A sample is a subset of measurements selected from the population. Sampling from the population is often done randomly i.e. such that every possible sample of n elements will have equal chance of being selected. A sample created in this way is called simple random sample or random sample.

A medical manufacturer interested in marketing a new drug may be required the

Food and Drug Administration (FDA) to prove that the drug does not cause any

serious side effect.

The sampling was made by selecting a sample of people randomly, the result of

tests of drug using on this sample may then be used in a statistical inference about the entire population of people who may use

the drug if it will be introduced.

Example 1.1.

Population

Sample

Simple Random sampling

Population

Sample

Biased Sampling

Illustration for simple random sampling

Measures of Central Tendency

Mean Arithmetic Mean - AMGiven a set of data , the arithmetic

mean is defined as follows:

Mode The mode of a data set is the value that occurs most frequently

n/xAMi

This kind of mean is the most frequently used.

Harmonic Mean - HM

This kind of mean is used when dealing with velocity.

• Population Mean

• Sample Mean

MedianThe median of a set of observations is a special point, it lies in position that half of the data lie below it and half above it.

Set 1: Ordering 7, 9, 15, 18, 20; median is 15Set 2: Ordering 15.8 20.7 21.1 22.5 33.4 40.3

Median = (21.1 + 22.5)/2 = 21.8

Example 1.2.

Find median of the following two sets of data.Set 1: 15 20 7 9 18 (n=5)Set 2: 20.7 22.5 15.8 40.3 33.4 21.1 (n=6)

Measurements of Dispersion

The variance of a set of observations is the average squared deviation of the data points from their mean.

Variance and Standard Deviation

Sample Variance1n

Note The denominator is of (n-1)

Population Variance

The standard deviation of a set of observations is the square root of the variance of the set

Variance and Standard Deviation

Percentiles

The Pth percentile of a group of numbers is that value below which lie P% (P percent) of the numbers in the group. The position is given by (n+1)* P /100 where n is the number of data points. (GRE , GMAT Test)

QuartilesThe percentage points that break the data set into 4 groups by the quarters-1st quarter, 2nd quarter and 3rd quarter

• 1st quartile Q1 is the 25th percentile.• 2nd quartile Q2 is the 50th percentile.

• 3rd quartile Q3 is the 75th percentile.

Inter-Quartile Range IQR = Q3 - Q1

Example 1.3.Given a data set including 22 points:88, 56, 64, 45, 52, 76, 54, 79, 38, 98, 69, 77, 71, 45, 60, 78, 90, 81, 87, 44, 80, 41. Find the 20th, 30th and 90th percentiles. Also find the IQR. What are mean, mode and median? What is the variance of the set ?

Grouped Data and Histogram

• Classes We divide the data values into classes which have the same length and cover all data points. Each class represents for a mi observation value.

• Frequencies fi The number of observations in each class. Total frequencies is number of observations N. The relative frequency of each class is the ratio of individual frequency and N.• Histogram

• Mean and Variance of grouped data

Population N/)mf(K

N/))m(f(K

Variance

MeanSample

Variancen/)mf(xK

1n/))xm(f(sK

Where K is number of classes, n is number observations of sample.

The number of errors in a text books was found. Number of errors per page is placed in column (mi) while column (fi) shows the number of pages contains errors. The following table and charts show histogram of errors distribution:

Example1.4

mi mi.mi fi Relative fi fi.mi fi.mi.mi0 0 102 0.204 0 01 1 138 0.276 138 1382 4 140 0.28 280 5603 9 79 0.158 237 7114 16 33 0.066 132 5285 25 8 0.016 40 200

500 1 827 2137

0.276 0.28

Example1.4

Other Descriptive Statistics

Index numbers

Simple index numbers

A index number is a number that measures the relative change in a set of measurements over time.

Index number for period i = 100 (value in period i / value in base period)

Year Price Index New Index73 121 100.000 84.61574 121 100.000 84.61575 122 100.826 85.31576 133 109.917 93.00777 136 112.397 95.10578 138 114.050 96.50379 143 118.182 100.00080 144 119.008 100.69981 144 119.008 100.69982 156 128.926 109.09183 162 133.884 113.28784 167 138.017 116.78385 230 190.083 160.83986 250 206.612 174.825

Price and Index

70 75 80 85 90

Consumer Price Index - Laspeyres Index

Laspeyres Index gives us a measurement for a change of quantity and price of items.

Items 1993 1994 1995Price Quantity Price Quantity

Price Quantity

Beef 238 50 240 52 233 54Pork 140 26 162 24 162 20Eggs 85 15 102 12 80 10Milk 105 85 112 91 113 92Bread 51 30 54 28 55 28Potatoes180 10 191 12 160 11Tomatoes 46 5 50 6 53 4Oranges 42 7 53 7 52 8

100*q.p

q.p)i(IndexLaspeyres

• Compute the Laspeyres Index:– Select year 1993 as a base year

• For 1993: Sum of quantity x price = 29594• For 1994: Sum of quantity x price = 31413• For 1995: Sum of quantity x price = 30546

– Laspeyres Index:• For 1993: 100• For 1994: 106.15• For 1993: 103.22

Stem-and-Leaf Displays

A way for re-arranging data to allow the data “speak for themselves”.

Given the data set: 11, 12, 12, 13, 14, 15, 15, 16, 20, 21, 21, 21, 21, 22, 25, 25, 26, 27, 28, 29, 29, 31, 32, 34, 35, 36, 38, 41, 42, 45, 47, 50, 52, 55, 60, 62

Example

The Stem-and-leaf display

1 122345562 01111255678993 1245684 12575 0256 02

Q 1 Q 3

Median

Inner fenceQ 1 - 1.5 (IQR)

Outer fenceQ 1 - 3( IQR)

Inner fenceQ 3 + 1.5 (IQR)

Outer fenceQ 3 + 3 (IQR)

Smallest observation Largest observation

Suspected outlierOutlier

Box-Whiskers plot

Examples for Box-Whiskers plot

Right skewed

Left skewed

Symmetric

Small variance

Suspectedoutlier

Outlier

Inner fence Outer fence

Box-Whisker plot (or Box plot) are useful for the following purposes.

•To identify the spread of data set.•To identify the location of data set based on median. •To identify possible skewness of the distribution.•To identify suspected outlier and outlier.•To quickly compare data sets.

Look at example in SPSS

Business Statistics Outline Dealing with decision problem when the face of uncertainty are...

Documents

Lecture VI Statistics. Lecture questions Mathematical statistics Sampling Statistical population and sample Descriptive statistics

Sampling and Descriptive Statistics - NTNUberlin.csie.ntnu.edu.tw/Courses/Introduction to...Sampling and Descriptive Statistics Berlin ChenBerlin Chen Department of Computer Science

CH.6 Random Sampling and Descriptive Statistics · 2014-02-07 · CH.6 Random Sampling and Descriptive Statistics • Population vs Sample • Random sampling ... • The compressive

Contents Basic Probability Descriptive Statistics Discrete Random Variables Continuous Random Variables Examples of Random Variables Sampling

QBM117 Business Statistics Descriptive Statistics Numerical Descriptive Measures

Basic Statistics. Content Data Types Descriptive Statistics Graphical Summaries Distributions Sampling and Estimation Confidence Intervals

Chapter 2 Descriptive Statistics - MMathematics...Descriptive Statistics As described inChapter 1 "Introduction", statistics naturally divides into two branches, descriptive statistics

Descriptive Statistics Descriptive Statistics describe a set of data

Descriptive Statistics - StatPlus · Descriptive Statistics The DESCRIPTIVE STATISTICS procedure displays univariate summary statistics for selected variables. Descriptive statistics

Sampling and Basic Descriptive Statistics. Basic concepts and Techniques. Lecture 6 Leah Wild

SAMPLING AND DESCRIPTIVE STATISTICSathreya/psweur/chapters/07.pdf · SAMPLING AND DESCRIPTIVE STATISTICS The distinction between Probability and Statistics is somewhat fuzzy, but

Chapter 2 | Descriptive Statistics 67 2|DESCRIPTIVE STATISTICS

Session 8 SAMPLING THEORYcourses.aiu.edu/STATISTICS/8/Sampling Theory.pdfSession 8 SAMPLING THEORY STATISTICS SAMPLING THEORY STATISTICS STATISTICS ANALYTIC Sampling Theory A probability

Statistical Quality Control N.Obeidi Descriptive Statistics Descriptive Statistics include: Descriptive Statistics include: – The Mean- measure of central

Descriptive Statistics Organize Descriptive Statistics ...acfoos/Courses/381/02... · Descriptive Statistics PSYC 381 Arlo Clark-Foos • Descriptive Statistics – Organize – Summarize

6 Sampling and Basic Descriptive Statistics

Sample Descriptive Statistics This Table Presents Descriptive Statistics

Sampling and Descriptive Statistics - NTNUberlin.csie.ntnu.edu.tw/Courses/Introduction to... · 2008. 3. 5. · Statistics-Berlin Chen 3 Sampling (2/2) • Definition: A simple random

Dr. Héctor AllendeReview of Probability and Statistics 1 A Review of Probability and Statistics Descriptive statistics Probability Random variables Sampling

1 Chapter 1: Sampling and Descriptive Statistics