1 of 65
Inferential Statistics I: The t-test
Experimental Methods and Statistics
Department of Cognitive Science
Michael J. Kalsher
2 of 65
OutlineDefinitions
Descriptive vs. Inferential Statistics
The t-test
- One-group t-test
- Dependent-groups t-test
- Independent-groups t-test
3 of 65
The t-test: Basic Concepts
• Types of t-tests- Independent Groups vs. Dependent
Groups• Rationale for the tests
- Assumptions
• Interpretation• Reporting results• Calculating an Effect Size• t-tests as GLM
4 of 65
William Sealy Gosset (1876–1937)
Famous as a statistician, best known by his pen name Student and for his work on Student's t-distribution.
Beer and Statistics: A Winning Combination!
5 of 65
6 of 65
7 of 65
The One Group t testThe One-group t test is used to compare a sample mean to a specific value (e.g., a population parameter; a neutral point on a
Likert-type scale).
Examples:1. A study investigating whether stock brokers differ from the general population on
some rating scale where the mean for the general population is known.
2. An observational study to investigate whether scores differ from some neutral point on a Likert-type scale.
Calculation of ty : ty = Mean Difference
Standard Error (of the mean difference)Note: The symbol ty indicates
this is a t test for a single group mean.
8 of 65
9 of 65
10 of 65
Assumptions
The one-group t test requires the following statistical assumptions:
1. Random and Independent sampling.
2. Data are from normally distributed populations. Note: The one-group t test is generally considered robust against violation of
this assumption once N > 30.
11 of 65
Computing the one-group t test by hand
12 of 65
13 of 65
14 of 65
Critical Values: One-Group t testNote: Degrees of Freedom = N - 1
15 of 65
Computing the one-group t test using SPSS
16 of 65
Move DV to box labeled “Test variable(s):
Type in “3” as a proxy for the population mean.
17 of 65
SPSS Output
18 of 65
Reporting the Results: One Group t test
The results showed that the students’ rated level of agreement with the statement “I feel good about myself” (M=3.4) was not significantly different from the scale’s neutral point (M=3.0), t(4)=.784. However, it is important to note several important limitations with this result, including the use of self-report measures and the small sample size (five participants). Additional research is needed to confirm, or refute, this initial finding.
19 of 65
20 of 65
21 of 65
22 of 65
23 of 65
24 of 65
25 of 65
26 of 65
27 of 65
28 of 65
29 of 65
30 of 65
31 of 65
32 of 65
33 of 65
34 of 65
35 of 65
36 of 65
37 of 65
11. Select both Time 1 and Time 2, then move to the box labeled “Paired Variables.”
12. Next, “click”, “Paste”.
38 of 65
39 of 65
The Independent Groups t test: Between-subjects designs
Assumption: Participants contributing to the two means come from different groups; therefore, each person contributes only one score to the data.
Calculation of t: t = Mean Difference
Standard Error (of the mean difference)
40 of 65
Standard Error: How well does my sample represent the population?
When someone takes a sample from a population, they are taking one of many possible samples--each of which has its own mean (and s.d.).
We can plot the sample means as a frequency distribution or sampling distribution.
Sample Mean
Fre
quen
cy
5
4
3
2
1
0
6
10
Sampling Distribution
41 of 65
Standard Error: How well does my sample represent the population?
The Standard Error, or Standard Error of the Mean, is an estimate of the standard deviation of the sampling distribution of means, based on the data from one or more random samples.
• Large values tell us that sample means can be quite different, and therefore, a given sample may not be representative of the population.
• Small values tell us that the sample is likely to be a reasonably accurate reflection of the population.
• An approximation of the standard error can be calculated by dividing the sample standard deviation by the square root of the sample size
SE = N
42 of 65
Standard Error: Applied to Differences
We can extend the concept of standard error to situations in which we’re examining differences between means.
The standard error of the differences estimates the extent to which we’d expect sample means to differ by chance alone--it is a measure of the unsystematic variance, or variance not caused by the experiment.
An estimate of the standard error can be calculated by dividing the sample standard deviation by the square root of the sample size.
SE = N
43 of 65
44 of 65
Computing the independent-groups t test by hand
45 of 65
Sample ProblemA college administrator reads an article in USA Today suggesting that liberal arts professors tend to be more anxious than faculty members from other disciplines within the humanities and social sciences. To test whether this is true at her university, she carries out a study to determine whether professors teaching liberal arts courses are more anxious than professors teaching behavioral science courses.
Sample data are gathered on two variables: type of professor and level of anxiety.
Anxiety Scores
Liberal Arts
Behavioral Science
45 58
63 59
62 63
51 68
54 74
63 68
52 52
54 66
64 69
49 57
46 of 65
47 of 65
48 of 65
49 of 65
Cri
tica
l Val
ues
:
Ind
epen
den
t G
rou
ps
t te
st
Note: Degrees of Freedom = N1 + N2 - 2
50 of 65
51 of 65
On average, the mean level of anxiety among a sample of liberal arts professors (M = 55.7) was significantly lower than the mean level of anxiety among a sample of behavioral science professors (M = 63.4), t(18) = -2.54, p < .05, r2 = .26. The effect size estimate indicates that the difference in anxiety level between the two groups of professors represents a large effect.
Reporting the Results: Independent Groups t test
52 of 65
Computing the independent-groups t test using SPSS
53 of 65
Sample Problem
A researcher is interested in comparing the appetite suppression effects of two drugs, fenfluramine and amphetamine, in rat pups. Five-day-old rat pups are randomly assigned to be injected with one of the two drugs. After injection, pups are allowed to eat for two hours. Percent weight gain is then measured.
Compute the independent groups t-test using the data at right.
Is this a true experiment, quasi-experiment, or observational study?
Percent Weight Gain
Fenfluramine Amphetamine
2 8
3 10
3 4
4 7
4 9
5 3
6 7
6 12
6 6
7 8
54 of 65
55 of 65
56 of 65
57 of 65
SPSS Output: Independent-Groups t test
Group Statistics
10 4.60 1.647 .521
10 7.40 2.675 .846
DrugType1
2
WeightgainN Mean Std. Deviation
Std. ErrorMean
Independent Samples Test
1.110 .306 -2.819 18 .011 -2.800 .993 -4.887 -.713
-2.819 14.964 .013 -2.800 .993 -4.918 -.682
Equal variancesassumed
Equal variancesnot assumed
WeightgainF Sig.
Levene's Test forEquality of Variances
t df Sig. (2-tailed)Mean
DifferenceStd. ErrorDifference Lower Upper
95% ConfidenceInterval of the
Difference
t-test for Equality of Means
58 of 65
Calculating Effect Size: Independent Samples t test
r = t2
t2 + df
(-2.819)2
(-2.819)2 + 18
7.95
7.95 + 18
r =.5534
r2 = .306
Note: Degrees of freedom calculated by adding the two sample sizes and then subtracting the number of samples:
df = 10 + 10 – 2 = 18
59 of 65
On average, the percent weight gain of five-day-old rat pups receiving amphetamine (M = 7.4, SE = .85) was significantly higher than the percent weight gain of rat pups receiving fenfluramine (M = 4.6, SE = .52), t(18) = -2.82, p < .05, r2 = .31. The effect size estimate indicates that the difference in weight gain caused by the type of drug given represents a large, and therefore substantive, effect.
Reporting the Results: Independent Groups t test
60 of 65
61 of 65
62 of 65
63 of 65
64 of 65
65 of 65