Upload
sebastian-camacho
View
50
Download
1
Embed Size (px)
DESCRIPTION
SPSS Problem # 7. Page 467 13.5 Page 416 12.2. Cookbook due Wednesday May 4 th !!. What if. You were asked to determine if psychology and sociology majors have significantly different class attendance (i.e., the number of days a person misses class) You would simply do a two-sample t-test - PowerPoint PPT Presentation
Citation preview
SPSS Problem # 7
• Page 467 – 13.5
• Page 416– 12.2
Cookbook due WednesdayMay 4th!!
What if. . .
• You were asked to determine if psychology and sociology majors have significantly different class attendance (i.e., the number of days a person misses class)
• You would simply do a two-sample t-test– two-tailed
• Easy!
But, what if. . .
• You were asked to determine if psychology, sociology, and biology majors have significantly different class attendance
• You would do a one-way ANOVA
But, what if. . .
• You were asked to determine if psychology majors had significantly different class attendance than sociology and biology majors.
• You would do an ANOVA with contrast codes
But, what if. . .
• You were asked to determine the effects of both college major (psychology, sociology, and biology) and gender (male and female) on class attendance
• You now have 2 IVs and 1 DV
• You could do two separate analyses– Problem: “Throw away” information that could explain
some of the “error”– Problem: Will not be able to determine if there is an
interaction
Factorial Analysis of Variance
• Factor = IV
• Factorial design is when every level of every factor is paired with every level of every other factor
Psychology Sociology Biology
Male X X X
Female X X X
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06
Main effect of gender
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06
Main effect of major
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06
Interaction between gender and major
Sum of Squares
• SS Total
– The total deviation in the observed scores
• Computed the same way as before
2..)( XXSSTotal
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06SStotal = (2-2.06)2+ (3-2.06)2+ . . . . (1-2.06)2 = 30.94
*What makes this value get larger?
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06SStotal = (2-2.06)2+ (3-2.06)2+ . . . . (1-2.06)2 = 30.94
*What makes this value get larger?
*The variability of the scores!
Sum of Squares
• SS A
– Represents the SS deviations of the treatment means around the grand mean
– Its multiplied by nb to give an estimate of the population variance (Central limit theorem)
– Same formula as SSbetween in the one-way
2. ..)( XXnbSS iA
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06SSA = (3*3) ((1.78-2.06)2+ (2.33-2.06)2)=1.36
*Note: it is multiplied by nb because that is the number of scores each mean is based on
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06SSA = (3*3) ((1.78-2.06)2+ (2.33-2.06)2)=1.36
*What makes these means differ?
*Error and the effect of A
Sum of Squares
• SS B
– Represents the SS deviations of the treatment means around the grand mean
– Its multiplied by na to give an estimate of the population variance (Central limit theorem)
– Same formula as SSbetween in the one-way
2. ..)( XXnaSS jB
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06SSB = (3*2) ((3.17-2.06)2+ (2.00-2.06)2+ (1.00-2.06)2)= 14.16
*Note: it is multiplied by na because that is the number of scores each mean is based on
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06SSB = (3*2) ((3.17-2.06)2+ (2.00-2.06)2+ (1.00-2.06)2)= 14.16
*What makes these means differ?
*Error and the effect of B
Sum of Squares
• SS Cells
– Represents the SS deviations of the cell means around the grand mean
– Its multiplied by n to give an estimate of the population variance (Central limit theorem)
2..)( XXnSS ijCells
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06SSCells = (3) ((2.67-2.06)2+ (1.00-2.06)2+. . . + (0.33-2.06)2)= 24.35
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06SSCells = (3) ((2.67-2.06)2+ (1.00-2.06)2+. . . + (0.33-2.06)2)= 24.35
What makes the cell means differ?
Sum of Squares
• SS Cells
• What makes the cell means differ?
• 1) error• 2) the effect of A (gender)• 3) the effect of B (major)• 4) an interaction between A and B
Sum of Squares
• Have a measure of how much cells differ– SScells
• Have a measure of how much this difference is due to A– SSA
• Have a measure of how much this difference is due to B– SSB
• What is left in SScells must be due to error and the interaction between A and B
Sum of Squares
• SSAB = SScells - SSA – SSB
• 8.83 = 24.35 - 14.16 - 1.36
Sum of Squares
• SSWithin
• The total deviation in the scores not caused by • 1) the main effect of A• 2) the main effect of B• 3) the interaction of A and B
• SSWithin = SSTotal – (SSA + SSB + SSAB)
6.59 = 30.94 – (14.16 +1.36 + 8.83)
Sum of Squares
• SSWithin
2)( ijWithin XXSS
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06
SSWithin = ((2-2.67)2+(3-2.67)2+(3-2.67)2) + . .. + ((1-.33)2 + (0-.33)2 + (0-2..33)2 = 6.667
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06
SSWithin = ((2-2.67)2+(3-2.67)2+(3-2.67)2) + . .. + ((1-.33)2 + (0-.33)2 + (0-2..33)2 = 6.667*What makes these values differ from the cell means?*Error
Compute df
Source df SS
A 1.36
B 14.16
AB 8.83
Within 6.59
Total 30.94
Source df SS
A 1.36
B 14.16
AB 8.83
Within 6.59
Total 17 30.94
dftotal = N - 1
Source df SS
A 1 1.36
B 2 14.16
AB 8.83
Within 6.59
Total 17 30.94
dftotal = N – 1
dfA = a – 1
dfB = b - 1
Source df SS
A 1 1.36
B 2 14.16
AB 2 8.83
Within 6.59
Total 17 30.94
dftotal = N – 1
dfA = a – 1
dfB = b – 1
dfAB = dfa * dfb
Source df SS
A 1 1.36
B 2 14.16
AB 2 8.83
Within 12 6.59
Total 17 30.94
dftotal = N – 1
dfA = a – 1
dfB = b – 1
dfAB = dfa * dfb
dfwithin= ab(n – 1)
Compute MS
Source df SS
A 1 1.36
B 2 14.16
AB 2 8.83
Within 12 6.59
Total 17 30.94
Compute MS
Source df SS MS
A 1 1.36 1.36
B 2 14.16 7.08
AB 2 8.83 4.42
Within 12 6.59 .55
Total 17 30.94
What does each MS mean?
Source df SS MS
A 1 1.36 1.36
B 2 14.16 7.08
AB 2 8.83 4.42
Within 12 6.59 .55
Total 17 30.94
22)( naMSE eB
22)( nbMSE eA 2)( eBMSE
22)( nMSE eA
Compute F
Source df SS MS
A 1 1.36 1.36
B 2 14.16 7.08
AB 2 8.83 4.42
Within 12 6.59 .55
Total 17 30.94
Source df SS MS F
A 1 1.36 1.36 2.47
B 2 14.16 7.08 12.87
AB 2 8.83 4.42 8.03
Within 12 6.59 .55
Total 17 30.94
2
22
e
e nb
Compute F
2
22
e
e na
2
22
e
e n
Test each F value for significance
Source df SS MS F
A 1 1.36 1.36 2.47
B 2 14.16 7.08 12.87
AB 2 8.83 4.42 8.03
Within 12 6.59 .55
Total 17 30.94
F critical values (may be different for each F test)
Use df for that factor and the df within.
Test each F value for significance
Source df SS MS F
A 1 1.36 1.36 2.47
B 2 14.16 7.08 12.87
AB 2 8.83 4.42 8.03
Within 12 6.59 .55
Total 17 30.94
F critical A (1, 12) = 4.75
F critical B (2, 12) = 3.89
F critical AB (2, 12) = 3.89
Test each F value for significance
Source df SS MS F
A 1 1.36 1.36 2.47
B 2 14.16 7.08 12.87*
AB 2 8.83 4.42 8.03*
Within 12 6.59 .55
Total 17 30.94
F critical A (1, 12) = 4.75
F critical B (2, 12) = 3.89
F critical AB (2, 12) = 3.89
15.500 3 5.167 9.300 .002
1.389 1 1.389 2.500 .140
14.111 2 7.056 12.700 .001
8.778 2 4.389 7.900 .006
24.278 5 4.856 8.740 .001
6.667 12 .556
30.944 17 1.820
(Combined)
GENDER
MAJOR
Main Effects
GENDER *MAJOR
2-Way Interactions
Model
Residual
Total
DAYS
Sum ofSquares df
MeanSquare F Sig.
Unique Method
ANOVAa,b
DAYS by GENDER, MAJORa.
All effects entered simultaneouslyb.
Interpreting the Results
• Main Effects
• Easy – just like a one-way ANOVA
15.500 3 5.167 9.300 .002
1.389 1 1.389 2.500 .140
14.111 2 7.056 12.700 .001
8.778 2 4.389 7.900 .006
24.278 5 4.856 8.740 .001
6.667 12 .556
30.944 17 1.820
(Combined)
GENDER
MAJOR
Main Effects
GENDER *MAJOR
2-Way Interactions
Model
Residual
Total
DAYS
Sum ofSquares df
MeanSquare F Sig.
Unique Method
ANOVAa,b
DAYS by GENDER, MAJORa.
All effects entered simultaneouslyb.
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06
Interpreting the Results
• Interaction– Does the effect of one IV on the DV depend on the
level of another IV?
15.500 3 5.167 9.300 .002
1.389 1 1.389 2.500 .140
14.111 2 7.056 12.700 .001
8.778 2 4.389 7.900 .006
24.278 5 4.856 8.740 .001
6.667 12 .556
30.944 17 1.820
(Combined)
GENDER
MAJOR
Main Effects
GENDER *MAJOR
2-Way Interactions
Model
Residual
Total
DAYS
Sum ofSquares df
MeanSquare F Sig.
Unique Method
ANOVAa,b
DAYS by GENDER, MAJORa.
All effects entered simultaneouslyb.
Sociology Psychology Biology Mean
Female 2.00 1.00 1.00
3.00 .00 2.00
3.00 2.00 2.00
Mean1j 2.67 1.00 1.67 1.78
Males 4.00 2.00 1.00
3.00 4.00 .00
4.00 3.00 .00
Mean2j
Mean.j
3.67
3.17
3.00
2.00
0.33
1.00
2.33
2.06Want to plot out the cell means
0
0.5
1
1.5
2
2.5
3
3.5
4
female
male
Sociology Psychology Biology
Practice
• 2 x 2 Factorial
• Determine if
• 1) there is a main effect of A
• 2) there is a main effect of B
• 3) if there is an interaction between AB
Practice
0123456789
10
B1 B2
A1
A2
A: NO
B: NO
AB: NO
Practice
0123456789
10
B1 B2
A1
A2
A: YES
B: NO
AB: NO
Practice
0123456789
10
B1 B2
A1
A2
A: NO
B: YES
AB: NO
Practice
0123456789
10
B1 B2
A1
A2
A: YES
B: YES
AB: NO
Practice
0123456789
10
B1 B2
A1
A2
A: YES
B: YES
AB: YES
Practice
0123456789
10
B1 B2
A1
A2
A: YES
B: NO
AB: YES
Practice
0123456789
10
B1 B2
A1
A2
A: NO
B: YES
AB: YES
Practice
0123456789
10
B1 B2
A1
A2
A: NO
B: NO
AB: YES
Practice
• Page 467
– 13.5– 13.6
544.622 4 136.156 4.645 .004
188.578 2 94.289 3.217 .052
356.044 2 178.022 6.074 .005
371.956 4 92.989 3.172 .025
916.578 8 114.572 3.909 .002
1055.200 36 29.311
1971.778 44 44.813
(Combined)
DELAY
AREA
Main Effects
DELAY *AREA
2-Way Interactions
Model
Residual
Total
TIME
Sum ofSquares df
MeanSquare F Sig.
Unique Method
ANOVAa,b
TIME by DELAY, AREAa.
All effects entered simultaneouslyb.
Neutral Area A Area B Mean50 28.6 16.8 24.4 23.27
100 28 23 16 22.33150 28 26.8 26.4 27.07
Mean 28.2 22.2 22.27 24.22
544.622 4 136.156 4.645 .004
188.578 2 94.289 3.217 .052
356.044 2 178.022 6.074 .005
371.956 4 92.989 3.172 .025
916.578 8 114.572 3.909 .002
1055.200 36 29.311
1971.778 44 44.813
(Combined)
DELAY
AREA
Main Effects
DELAY *AREA
2-Way Interactions
Model
Residual
Total
TIME
Sum ofSquares df
MeanSquare F Sig.
Unique Method
ANOVAa,b
TIME by DELAY, AREAa.
All effects entered simultaneouslyb.
0
5
10
15
20
25
30
35
50 100 150
Neutral
Area A
Area B
0
5
10
15
20
25
30
35
Neutral Area A Area B
50
100
150
Why is this important?
• Requirement
• Understand research articles
• Do research for yourself
• Real world
The Three Goals of this Course
• 1) Teach a new way of thinking
• 2) Teach “factoids”
Mean
r =
2
22
e
e nb
2)( ijWithin XXSS
YX
XY
SS
COVr
)(Z . . . . )(Z )(Z Y pp2211Z
)1(
)1(2
2
Rp
RpNF
aa
r YPS
1e b c a
a 2
).(
What you have learned!
• Chapter 1 – Introduced to statistics and learned key words – Scales of measurement– Populations vs. Samples
• Learned how to organize scores of one variable using:
– frequency distributions– graphs
What you have learned!
• Chapter 2 – Learned ways to describing data
• Measures of central tendency– Mean– Median– Mode
• Variability– Range– IQR– Standard Deviation– Variance
What you have learned!
• Chapter 3 – Learned about issues related to the normal curve:
– Z Scores
– Find the percentile of a give score– Find the score for a given percentile
What you have learned!
• Chapter 4 – Logic of hypothesis testing
– Is this quarter fair?– Sampling distribution
• CLT
– The probability of a given score occuring
What you have learned!
• Chapter 5 – Basic issues related to probability
– Joint probabilities– Conditional probabilities
– Different ways events can occur• Permutations• Combinations
– The probability of winning the lottery
– Binomial Distributions• Probability of winning the next 4 out of 10 games of Blingoo
What you have learned!
• Chapter 6 – Ways to analyze categorical data
– Chi square as a measure of independence• Phi coefficient
– Chi square as a measure of goodness of fit
What you have learned!
• Chapter 9 – Ways to analyze two continuous variables
– Correlation
– Regression
What you have learned!
• Chapter 10 – Other methods for correlations
– Pearson Formulas• Point-Biserial• Phi Coefficent• Spearman’s rho
– Non-Pearson Formulas• Kendall’s Tau
What you have learned!
• Chapter 15 – How to analyze continuous data with two or more IVs
– Multiple Regression• Causal Models• Standardized vs. unstandarized • Multiple R• Semipartical correlations
– Common applications• Mediator Models• Moderator Mordels
What you have learned!
• Chapter 7 – Significance testing applied to means
– One Sample t-tests
– Two Sample t-tests• Equal N• Unequal N• Dependent samples
What you have learned!
• Chapter 11 – Significance testing applied to two or more means
– ANOVA
– Computation of ANOVA
– Logic of ANOVA• Variance• Expected Mean Square• Sum of Squares
What you have learned!
• Chapter 12 – Extending ANOVA
– What to do with an omnibus ANOVA• Multiple t-tests• Linear Contrasts• Orthogonal Contrasts• Trend Analysis
– Controlling for Type I errors• Bonferroni t• Fisher Least Significance Difference• Studentized Range Statistic• Dunnett’s Test
What you have learned!
• Chapter 13 – How to analyze catagorical data with two or more IVs
– Factorial ANOVA
– Computation and logic of Factorial ANOVA
– Interpreting Results• Main Effects• Interactions
The Three Goals of this Course
• 1) Teach a new way of thinking
• 2) Teach “factoids”
• 3) Self-confidence in statistics