19
Quantitative Methods Quantitative Methods Part 3 T- Statistics

Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Embed Size (px)

Citation preview

Page 1: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Quantitative MethodsQuantitative Methods

Part 3T- Statistics

Page 2: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Standard DeviationStandard Deviation

Measures the spread of scores within the data set◦Population standard deviation is

used when you are only interested in your own data

◦Sample standard deviation is used when you want to generalise for the rest of the population

Page 3: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Z - ScoresZ - ScoresA specific method for describing a specific location within a distribution

◦Used to determine precise location of an in individual score◦Used to compare relative positions of 2 or more scores

Page 4: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Normally Distributed (Bell Normally Distributed (Bell shaped) shaped)

Page 5: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Distribution of the MeansDistribution of the Means

X X X X

0 1 2 3 4 5 6 7 8 9

• Frequency Distribution of 4 scores (2, 4, 6,8)

• It is flat and not bell shaped• Mean of population is (2+4+6+8)/4 = 5

Page 6: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Distribution of the MeansDistribution of the Means

• Take all possible samples of pairs of scores (2,4,6,8)• Use random sampling and replace each individual sample into data set• Calculate average of all sample pairs

2+2 /2 = 22+4 /2 = 34+2 /2 = 3

4+4 /2 = 44+6 /2 = 56+4 /2 = 5

2+6 /2 = 46+2 /2 = 42+8 /2 = 58+2 /2 = 5

0 1 2 3 4 5 6 7 8 9

X XX

XXX

XXXX

X X XXX

X

Page 7: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

“For any population with a mean μ and standard deviation σ , the distribution of sample means for sample size n will have a mean of μ and standard deviation of σ/√n and will approach a normal distribution as n gets very large.”

How big should the sample size be? n=30X

X X X

X X X X X

X X X X X X X

0 1 2 3 4 5 6 7 8 9

Central Limit TheoremCentral Limit Theorem

Page 8: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

σ/√n is used to calculate the Standard Error of the sample mean

Sample data = X The mean of each sample = Then the standard error becomes It identifies how much the observed sample

mean differs from the un-measurable population mean μ.

So to be more confident that our sample mean is a good measure of the population mean, then the standard error should be small. One way we can ensure this is to take large samples.

Standard ErrorStandard Error

Page 9: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

The population of SATs scores is normal with μ= 500, σ =100. What is the chance that a sample of n=25 students has a mean score =540? Since the distribution is normal, we can use the z-score

First calculate Standard Error ◦ 100/5 = 20

Then Z-Score◦ 540-500/20 =2

z-value is 2, therefore around 98% of the sample means are below this and only 2% are above. So we conclude that the chance of getting a sample mean of 540 is 2%, so we are 98% confident that this sample mean, if recorded in an experiment is false.

Example Example Z = - Z = - μμ// σ σ

Page 10: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

T - StatisticsT - StatisticsSo far we’ve looked at mean and

sd of populations and our calculations have had parameters

But how do we deduce something about the population beyond our sample?

We can use T-Statistic

Page 11: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

T - StatisticsT - StatisticsRemember SD from last week?

Great for population of N but not for sample of n

Why n -1? Because we can only freely

choose n-1 (Degree of freedom = df)

Page 12: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

T - StatisticsT - StatisticsStandard ErrorZ-Score redone to show above =

To T-Statistic, we substitute σ (SD of population) with s (SD of sample)

But what about μ ?

Page 13: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Hypothesis TestingHypothesis TestingSample of computer game

players n =16Intervention = inclusion of rich

graphical elementsLevel has 2 rooms

◦Room A = lots of visuals◦Room B = very bland

Put them in level 60 minutesRecord how long they spend in B

Page 14: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

ResultsResults

Average time spent in B = 39 minutes

Observed “sum of squares” for the sample is SS = 540.

A B

Page 15: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Stage1: Formulation of Stage1: Formulation of HypothesisHypothesis

: “null hypothesis”, that the visuals have no effect on the behaviour.

: “alternate hypothesis”, that the visuals do have an effect on the players’ behaviour.

If visuals have no effect, how long on average should they be in room B?

Null hypothesis is crucial; here we can infer that μ = 30 and get rid of the population one

Page 16: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Stage 2: Locate the critical Stage 2: Locate the critical regionregion

We use the T-table to help us locate this, enabling us to reject or accept the null hypothesis. To get we need:◦Number of degree of freedom (df) 16

-1 =15◦Level of significance of confidence◦Locate in T-table (2tails)= critical

value of t=-2.131, t=2.131

Page 17: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Stage 3: Calculate Stage 3: Calculate statisticsstatisticsCalculate sample sd = 6

Sample Standard Error = 6 / 4 =1.5

T-Statistic = 6The μ 30 came from the null

hypothesis if visuals had no effect, then the player would spend 30 minutes in both rooms A and B.

Page 18: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

Stage 4: DecisionStage 4: DecisionCan we reject the , that the visuals

have no effect on the behaviour?◦T = 6 which is well beyond the value of

2.313 which indicates where chance kicks in.

So yes we can safely reject it and say it does affect behaviour

Which room do they prefer?◦They spent on average 39 minutes in

Room B which is bland

Page 19: Quantitative Methods Part 3 T- Statistics. Standard Deviation Measures the spread of scores within the data set ◦ Population standard deviation is used

WorkshopWorkshopWork on Workshop 6 activitiesYour journal (Homework)Your Literature Review

(Complete/update)

ReferencesReferences Dr C. Price’s notes 2010 Gravetter, F. and Wallnau, L. (2003) Statistics for the

Behavioral Sciences, New York: West Publishing Company