43
Research Methods William G. Zikmund Basic Data Analysis: Descriptive Statistics Health Economics Research Method 2003/2 Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging, ordering, and manipulating data to generate descriptive information

Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Research Methods

William G. Zikmund

Basic Data Analysis: Descriptive

Statistics

Health Economics Research Method 2003/2

Descriptive Analysis

• The transformation of raw data into a form

that will make them easy to understand and

interpret; rearranging, ordering, and

manipulating data to generate descriptive

information

Page 2: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Type ofMeasurement

Nominal

Two

categories

More than

two categories

Frequency table

Proportion (percentage)

Frequency table

Category proportions

(percentages)

Mode

Type of descriptive analysis

Type ofMeasurement

Type of descriptive analysis

Ordinal Rank orderMedian

Page 3: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Type ofMeasurement

Type of descriptive analysis

Interval Arithmetic mean

Type ofMeasurement

Type of descriptive analysis

RatioIndex numbers

Geometric meanHarmonic mean

Page 4: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Tabulation

• Tabulation - Orderly arrangement of data in

a table or other summary format

• Frequency table

• Percentages

Health Economics Research Method 2003/2

Frequency Table

• The arrangement of statistical data in a row-

and-column format that exhibits the count

of responses or observations for each

category assigned to a variable

Page 5: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Measure ofCentral Measure of

Type of Scale Tendency Dispersion

Nominal Mode NoneOrdinal Median PercentileInterval or ratio Mean Standard deviation

Central Tendency

Health Economics Research Method 2003/2

Cross-Tabulation

• A technique for organizing data by groups,

categories, or classes, thus facilitating

comparisons; a joint frequency distribution

of observations on two or more sets of

variables

• Contingency table- The results of a cross-

tabulation of two variables, such as survey

questions

Page 6: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Cross-Tabulation

• Analyze data by groups or categories

• Compare differences

• Contingency table

• Percentage cross-tabulations

Health Economics Research Method 2003/2

Base

• The number of respondents or observations

(in a row or column) used as a basis for

computing percentages

Page 7: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Elaboration and Refinement

• Moderator variable

– A third variable that, when introduced into an

analysis, alters or has a contingent effect on the

relationship between an independent variable

and a dependent variable.

– Spurious relationship

• An apparent relationship between two variables that

is not authentic.

TwoTwo

ratingrating

scalesscales 4 quadrants4 quadrants

twotwo--dimensionaldimensional

tabletable ImportanceImportance--

PerformancePerformance

Analysis)Analysis)

Quadrant Analysis

Page 8: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Data Transformation

• Data conversion

• Changing the original form of the data to a

new format

• More appropriate data analysis

• New variables

Data Transformation

Summative Score =

VAR1 + VAR2 + VAR 3

Health Economics Research Method 2003/2

Page 9: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Collapsing a Five-Point Scale

• Strongly Agree

• Agree

• Neither Agree nor

Disagree

• Disagree

• Strongly Disagree

• Strongly Agree/Agree

• Neither Agree nor

Disagree

• Disagree/Strongly

Disagree

Health Economics Research Method 2003/2

Index Numbers

• Score or observation recalibrated to indicate

how it relates to a base number

• CPI - Consumer Price Index

Health Economics Research Method 2003/2

Page 10: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Calculating Rank Order

• Ordinal data

• Brand preferences

Health Economics Research Method 2003/2

Tables

• Bannerheads for columns

• Studheads for rows

Page 11: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Charts and Graphs

• Pie charts

• Line graphs

• Bar charts

– Vertical

– Horizontal

Health Economics Research Method 2003/2

Page 12: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Line Graph

0

10

20

30

40

50

60

70

80

90

1st Qtr 2nd Qtr 3rd Qtr 4th Qtr

East

West

North

Health Economics Research Method 2003/2

Bar Graph

Page 13: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

643 Networking

213 print ad

179 Online recruitment site

112 Placement firm

18 Temporary agency

How did you find your last job?

7006005004003002001000

Networking

print ad

Online recruitment site

Placement firm

Temporary agency

55.2 %

18.3 %

15.4 %

9.6 %

1.5 %

WebSurveyor Bar Chart

Computer Programs

• SPSS

• SAS

• SYSTAT

• Microsoft Excel

• WebSurveyor

Health Economics Research Method 2003/2

Page 14: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Microsoft Excel -Data Analysis

The Paste Function Provides

Numerous Statistical Operations

Page 15: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Computer Programs

• Box and whisker plots

• Interquartile range - midspread

• Outlier

Page 16: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Interpretation

• The process of making pertinent inferences

and drawing conclusions

• concerning the meaning and implications of

a research investigation

Research Methods

William G. Zikmund

Univariate Statistics

Page 17: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Univariate Statistics

• Test of statistical significance

• Hypothesis testing one variable at a time

Health Economics Research Method 2003/2

Hypothesis

• Unproven proposition

• Supposition that tentatively explains certain

facts or phenomena

• Assumption about nature of the world

Page 18: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Hypothesis

• An unproven proposition or supposition that

tentatively explains certain facts or

phenomena

– Null hypothesis

– Alternative hypothesis

Health Economics Research Method 2003/2

Null Hypothesis

• Statement about the status quo

• No difference

Page 19: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Alternative Hypothesis

• Statement that indicates the opposite of the

null hypothesis

Health Economics Research Method 2003/2

Significance Level

• Critical probability in choosing between the

null hypothesis and the alternative

hypothesis

Page 20: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Significance Level

• Critical Probability

• Confidence Level

• Alpha

• Probability Level selected is typically .05 or

.01

• Too low to warrant support for the null

hypothesis

0.3 : =µoH

The null hypothesis that the mean is equal to 3.0:

Health Economics Research Method 2003/2

Page 21: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

0.3 :1 ≠µH

The alternative hypothesis that the mean does not equal to 3.0:

Health Economics Research Method 2003/2

A Sampling Distribution

µ=3.0 xHealth Economics Research Method 2003/2

Page 22: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

µ=3.0 x

α=.025 α=.025

Health Economics Research Method 2003/2

A Sampling Distribution

LOWER

LIMIT

UPPER

LIMIT

µ=3.0Health Economics Research Method 2003/2

A Sampling Distribution

Page 23: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Critical values of µµµµ

Critical value - upper limit

n

SZZS

X++= or µµ

+=

225

5.196.1 0.3

Health Economics Research Method 2003/2

( )1.096.1 0.3 +=

196. 0.3 +=

196.3=

Health Economics Research Method 2003/2

Critical values of µµµµ

Page 24: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Critical value - lower limit

n

SZZS

X- or - µµ=

=

225

5.196.1- 0.3

Health Economics Research Method 2003/2

Critical values of µµµµ

( )1.096.1 0.3 −=

196. 0.3 −=

804.2=

Health Economics Research Method 2003/2

Critical values of µµµµ

Page 25: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Region of Rejection

LOWER

LIMITUPPER

LIMIT

µ=3.0

Health Economics Research Method 2003/2

Hypothesis Test µµµµ ====3333....0000

2.804 3.196µ=3.0 3.78Health Economics Research Method 2003/2

Page 26: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Accept null Reject null

Null is true

Null is false

CorrectCorrect--

no errorno errorType IType I

errorerror

Type IIType II

errorerrorCorrectCorrect--

no errorno error

Health Economics Research Method 2003/2

Type I and Type II Errors

Type I and Type II Errors

in Hypothesis Testing

State of Null Hypothesis Decisionin the Population Accept Ho Reject Ho

Ho is true Correct--no error Type I error

Ho is false Type II error Correct--no error

Health Economics Research Method 2003/2

Page 27: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Calculating Zobs

xs

xz

obs

Health Economics Research Method 2003/2

X

obsS

XZ

Health Economics Research Method 2003/2

Alternate Way of Testing the

Hypothesis

Page 28: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

X

obsS

Zµ−

=78.3

1.

0.378.3 −=

1.

78.0= 8.7=

Health Economics Research Method 2003/2

Alternate Way of Testing the

Hypothesis

Health Economics Research Method 2003/2

Choosing the Appropriate

Statistical Technique

• Type of question to be answered

• Number of variables

– Univariate

– Bivariate

– Multivariate

• Scale of measurement

Page 29: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

PARAMETRIC

STATISTICS

NONPARAMETRIC

STATISTICS

Health Economics Research Method 2003/2

t-Distribution

• Symmetrical, bell-shaped distribution

• Mean of zero and a unit standard deviation

• Shape influenced by degrees of freedom

Health Economics Research Method 2003/2

Page 30: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Degrees of Freedom

• Abbreviated d.f.

• Number of observations

• Number of constraints

or

Xlc StX .. ±=µ

n

StX lc ..limitUpper +=

n

StX lc ..limitLower −=

Health Economics Research Method 2003/2

Confidence Interval Estimate

Using the t-distribution

Page 31: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

= population mean

= sample mean

= critical value of t at a specified confidence

level

= standard error of the mean

= sample standard deviation

= sample size

..lct

µX

XS

Sn

Health Economics Research Method 2003/2

Confidence Interval Estimate Using

the t-distribution

xcl stX ±=µ

17

66.2

7.3

=

=

=

n

S

X

Health Economics Research Method 2003/2

Confidence Interval Estimate Using

the t-distribution

Page 32: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

07.5

)1766.2(12.27.3limitupper

=

+=

Health Economics Research Method 2003/2

33.2

)1766.2(12.27.3limitLower

=

−=

Health Economics Research Method 2003/2

Page 33: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Hypothesis Test Using the

t-Distribution

Suppose that a production manager believes

the average number of defective assemblies

each day to be 20. The factory records the

number of defective assemblies for each of the

25 days it was opened in a given month. The

mean was calculated to be 22, and the standard deviation, ,to be 5.

XS

Health Economics Research Method 2003/2

Univariate Hypothesis Test

Utilizing the t-Distribution

Page 34: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

20 :

20 :

1

0

=

µ

µ

H

H

Health Economics Research Method 2003/2

nSSX

Health Economics Research Method 2003/2

Page 35: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

The researcher desired a 95 percent

confidence, and the significance level becomes

.05.The researcher must then find the upper

and lower limits of the confidence interval to

determine the region of rejection. Thus, the

value of t is needed. For 24 degrees of freedom (n-1, 25-1), the t-value is 2.064.

Health Economics Research Method 2003/2

Univariate Hypothesis Test

Utilizing the t-Distribution

:limitLower 25/5064.220 .. −=−

Xlc Stµ( )1064.220−=

936.17=

Health Economics Research Method 2003/2

Page 36: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

:limitUpper

25/5064.220 .. +=+Xlc Stµ

( )1064.220+=064.20=

Health Economics Research Method 2003/2

X

obsS

Xt

µ−=

1

2022 −=

1

2=

2=Health Economics Research Method 2003/2

Univariate Hypothesis Test

t-Test

Page 37: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Health Economics Research Method 2003/2

Testing a Hypothesis about a

Distribution

• Chi-Square test

• Test for significance in the analysis of

frequency distributions

• Compare observed frequencies with

expected frequencies

• “Goodness of Fit”

i

ii

E

EOx

Chi-Square Test

Health Economics Research Method 2003/2

Page 38: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

x² = chi-square statistics

Oi = observed frequency in the ith cell

Ei = expected frequency on the ith cell

Health Economics Research Method 2003/2

Chi-Square Test

ji

ij

Chi-Square TestEstimation for Expected Number

for Each Cell

Health Economics Research Method 2003/2

Page 39: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

Chi-Square TestEstimation for Expected Number

for Each Cell

Ri = total observed frequency in the ith row

Cj = total observed frequency in the jth column

n = sample size

Health Economics Research Method 2003/2

( ) ( )2

2

22

1

2

112

E

EO

E

EOX

−+

−=

Health Economics Research Method 2003/2

Univariate Hypothesis Test

Chi-square Example

Page 40: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

( ) ( )50

5040

50

506022

2 −+

−=X

4=

Health Economics Research Method 2003/2

Univariate Hypothesis Test

Chi-square Example

Hypothesis Test of a Proportion

π is the population proportion

p is the sample proportion

π is estimated with p

Health Economics Research Method 2003/2

Page 41: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

5.:H

5.:H

1

0

¹π

Health Economics Research Method 2003/2

Hypothesis Test of a Proportion

( )( )100

4.06.0=pS

100

24.=

0024.= 04899.=

Health Economics Research Method 2003/2

Page 42: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

pS

pZobs

π−=

04899.

5.6. −=

04899.

1.= 04.2=

Health Economics Research Method 2003/2

0115.Sp=

000133.Sp=

1200

16.Sp

=

1200

)8)(.2(.Sp

=

n

pqSp

=

20.p =200,1n =

Health Economics Research Method 2003/2

Hypothesis Test of a Proportion:

Another Example

Page 43: Research Methodspioneer.chula.ac.th/~ppongsa/2900600/LMRM02.pdf · 2011-12-27 · 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East West North Health Economics Research Method 2003/2 Bar Graph

0115.Sp=

000133.Sp=

1200

16.Sp

=

1200

)8)(.2(.Sp

=

n

pqSp

=

20.p =200,1n =

Health Economics Research Method 2003/2

Hypothesis Test of a Proportion:

Another Example

Indeed .001thebeyondtsignificantisit

level..05theatrejectedbeshouldhypothesisnulltheso1.96,exceedsvalueZThe

348.4Z

0115.

05.Z

0115.

15.20.Z

S

pZ

p

=

=

−=

π−=

Health Economics Research Method 2003/2

Hypothesis Test of a Proportion:

Another Example