21
2/10/11

2/10/11. Infer properties of the population from what is observed in the sample An inference is a generalization, as inferences go beyond the data

Embed Size (px)

Citation preview

Page 1: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

2/10/11

Page 2: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Infer properties of the population from what is observed in the sample

An inference is a generalization, as inferences go beyond the data

Assumption: the sample is representative of the population Random or probability sample

Page 3: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

A point estimate looks at one value and is an unbiased estimate of the population meanNot accurate!

An interval estimate or confidence interval can estimate with a specified degree of confidence that the population mean is within a certain range of values

Page 4: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Probability of obtaining any of the possible values in a statistic

Enable researcher to infer with a determined level of confidence, the population parameters from the sample statistics

Page 5: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Used with continuous data A statement that the shape of the sampling

distribution of the mean will approximate a normal curve if the sample is sufficiently large

If random samples of a fixed n are drawn from any population, as n increases the distribution of the sample mean approaches a normal distribution

Approximation of a normal distribution is n ≥ 30 Sample sizes less than 30 use t distributions to

identify curve

Page 6: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

A range of values or scores from sample data that probably include the true value (population parameter)

Necessary to tell the amount of error present or the accuracy of the estimate

Page 7: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

A statistical decision on whether the finding in a study reflect chance or real effects, at a given level of probabilityThe larger the sample size (n), the more

likely we are to reject H0

Page 8: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

1. State alternative hypothesis (HA) – claims results are real or significant (independent variable influenced dependent variable)

2. State null hypothesis (H0) –claims that any difference in data was due to chance (independent variable had no effect upon dependent variable)

Page 9: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

3. Set decision level (α)common level in medicine = 0.05 results are not significant if p > 0.05

4. Calculate probability of H0 being true

the smaller decision level α, less likely we are to reject H0

Page 10: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

5. Decision rule: p (H0 is true) ≤ α; reject H0 (Accept HA)

p (H0 is true) > α; retain H0

Important: Cannot PROVE H0, can ONLY reject or retain

Page 11: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

There is always a risk of making an error when inferences are made

To test a null hypothesis requires both:A test of significance, andA selected probability level that indicates

how much risk you are willing to take that the decision you make is wrong

Page 12: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Type I- we reject the H0 when it is really true and claim HA is supported when it is actually falseSmaller α, less chance of making a Type I

error Type II- we retain the H0 when it is really

false and conclude the alternate hypothesis was not supported (β error)Larger n, less chance of making Type II

error

Page 13: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Tests of significance are almost always two-tailed which allows for the possibility that a difference may occur in either direction

A one-tailed test assumes that if a difference occurs, it will favor one direction

More likely to reject H0 if use a one tail test (directional HA) than a two-tail test (non-directional HA)

Page 14: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Represents the number of scores which are free to vary when calculating the statistic

Dependent upon the number of participants and number of groupsExample for correlation coefficient (r),

df = N (number of participants) – 2 Each test of significance has its own

formula for determining degrees of freedom

Page 15: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

1. Determine if significance test will be one-tailed or two-tailed;

2. Select a probability level;3. Compute a test of significance4. Consult the appropriate tables to

determine significance of your results Determined by the intersection of

probability level and df

Page 16: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Parametric test are preferred because they are more powerful, but they require assumptions:

1.Normal population distribution2.Variables must be interval or ratio3.Randomized sample4.Variances of population comparison

groups are equal

Page 17: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Nonparametric tests are less powerful, but they make no assumptions about the shape of the distributionUsed for nominal or ordinal data setsTakes a larger sample to reach same level

of significance as a parametric test

Page 18: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Interval or Ratio Two groups:

t- test for independent groups t- test for dependent groups

Three or more groups:Analysis of variance (ANOVA) for

independent groups (F ratio)ANOVA for dependent groups (F)

Page 19: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

t-test is used to determine if 2 means are significantly different by comparing the actual mean observed with the difference expected by chance

t-value is compared to t-table values; if value is ≥ than table value, the null hypothesis is rejected

Page 20: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Computes an F-ratio Total variance is divided into 2 sources:

Variance between groups (numerator)Variance within groups (denominator)

Page 21: 2/10/11.  Infer properties of the population from what is observed in the sample  An inference is a generalization, as inferences go beyond the data

Nominal Chi-square (X2)-

df = k (# groups or categories) -1

measure of difference between observed frequency with expected frequency