Upload
annabella-day
View
213
Download
0
Tags:
Embed Size (px)
Citation preview
Inferential statistics
Hypothesis testing
Questions statistics can help us answer
• Is the mean score (or variance) for a given population different from the one predicted?
• Are two populations different on some characteristic?
• Are two (or more) variables related in the population under study?
• For all three above: Is the finding due to pure chance?– Requires probability sample or randomization
Statistical significance• Could my findings be due to chance? (luck)• Researcher sets acceptable level prior to data
collection– Less than 5 times in 100 [p < .05]– Less than 1 time in 100 [p < .01]
• If the researcher chooses p < .01, then the results are considered ‘statistically significant’ if fewer than 1 sample in 100 would generate a result “like the one found in the study” (depending on what you’re looking for)
• Statistical significance is not the same as theoretical or social significance– Very sensitive to sample size
Sample difference from ideal number
• A sample statistic may be compared to an ideal number (often 50% or the population percentage established by a census) to determine whether the group it represents has some characteristic.
• For example, is the population of Telecommunications students more than half male?
• Take a sample of Tel students and compare the % who are male to 50%– Depending on the percentage of males in the sample
and the size of the sample, you may be able to say that the finding that Tel students are more likely to be male is “statistically significant”
Differences between two samples• You may want to see if two samples are different
on some characteristic• Compare means of two samples on the outcome
measure• If the difference between the two groups is large in
comparison to the variance within groups, then there will be a statistically significant difference between groups
For example:
• Are Telecommunications students more likely to be male than are Geography students?
Comparisons among groups
• Commonly used in experimental studies• t-test
– preferred where two groups can be compared according to a hypothesis
• ANOVA– comparison among multiple groups– allows for factorial designs– good at dealing with interactions– F ratio
Main effect of commercial
0
5
10
15
20
25
30
35
Commercial A Commercial B
MaleFemale
Main effect of gender
0
5
10
15
20
25
30
35
Commercial A Commercial B
MaleFemale
Interaction
0
5
10
15
20
25
30
35
Commercial A Commercial B
MaleFemale
Relationships among variables
• Eyeballing a scatterplot
Statistics of association
• If we want a more precise test as to what extent two variables are related, we would look at a statistic of association
• Different statistics are used depending upon the kind of scale we are concerned with and the assumed population distribution of the variables– Parametric– Nonparametric
• Two nominal—Chi-square (non-parametric)• Two ordinal—Spearman’s rho• Two ratio—Pearson’s r
Covariance among variables
• Correlation• How much of the variance in one measure can be
accounted for by variance in another?• Essentially, when we know where someone stands on
one variable, how much does it help us predict where they stand on another?
• Stats: Pearson’s r; r2 coefficient of determination
• Regression
Linear regression
• Minimizes total squared distances between individual data points and regression line– y=ax+b
• allows for prediction of behavior of dependent variable
• preferred to simple correlation
Multiple variables
• We can statistically analyze the relations among multiple variables at the same time
• Attempt to determine unique contribution of a single independent variable– Identify, then remove the effect of variables not
currently being tested– Evaluate relationship between predictor
variable of interest and “corrected” scores on dependent variable
Common statistical methods
• Multiple correlation• Multiple regression
– Variables may be entered according to theoretical model or their predictive strength
So:
• Inferential statistics do two jobs:– They provide an estimate of how likely it is that
your findings are due to chance– They provide an estimate of the ‘size’ of the
relationship between two or more variables
• The appropriate statistics vary by the method you used to collect data, the assumed population distribution of your variables and the scale types you employed to measure them