4
CONFIDENCE INTERVAL, GRAPHS & EXPLAINATON (DESCRIPTION) : AMMARA HAQUE (13883): The confidence interval (CI) of a mean tells you how precisely you have determined For example, you measure weight in a small sample (N!), and compute the mean. Tha is very unli"ely to e#ual the population mean. The si$e of the li"ely discrepancy si$e and varia%ility of the sample. If your sample is small and varia%le, the sample mean is li"ely to %e #uite far fr population mean. If your sample is large and has little scatter, the sample mean w very close to the population mean. &tatistical calculations com%ine sample si$e an (standard deviation) to generate a Confidence Interval (CI) for the population mea suggests, the Confidence Interval (CI) is a range of values. To interpret the confidence interval of the mean, you must assume that all the val were independently and randomly sampled from a population whose values are distri% according to a aussian distri%ution. If you accept those assumptions, there is a that the !* Confidence Interval (CI) contains the true population mean. In other generate many !* Confidence Interval (CI)s from many samples, you can expect the Confidence Interval (CI) to include the true population mean in !* of the cases, include the population mean value in the other !*. +oreover, any text%oo" that offers a correct explanation of C pro%a%ly includes a d figure , which shows !* Confidence Interval (CI)s around the sample means for a independent samples, each of si$e n - , ta"en from a Normal population. It is a familiar diagram, %ut the software provides controls that allow the user to sets of samples, and see the running total of intervals that include and also to co aspects of the display. /laying with the software can illustrate several %asic fact estimates, including0 . The level of confidence C is the percentage of intervals that, in the long run, include population mean, which is fixed %ut un"nown. -. If C !, sets of - samples on average contain one Confidence Interval (CI) that does not include 1ut often a set contains none, and occasionally a set contains three or eve hundreds of sets are ta"en, however, the overall percentage of intervals containing close to C . It is a %asic fact of randomness that in the short run results can appear together in a close range as opposed to %eing scattered. 3. It is worth examining the whole set of intervals, %ut also %earing in mind that practice ta"es only one sample, and so has to draw conclusions from 4ust that one interval could %e any single one of those displayed.

Confidence Interval Ammara

  • Upload
    ammara

  • View
    225

  • Download
    0

Embed Size (px)

DESCRIPTION

ahaha

Citation preview

CONFIDENCE INTERVAL, GRAPHS & EXPLAINATON (DESCRIPTION) : AMMARA HAQUE (13883):The confidence interval (CI) of a mean tells you how precisely you have determined the mean.For example, you measure weight in a small sample (N=5), and compute the mean. That mean is very unlikely to equal the population mean. The size of the likely discrepancy depends on the size and variability of the sample.If your sample is small and variable, the sample mean is likely to be quite far from the population mean. If your sample is large and has little scatter, the sample mean will probably be very close to the population mean. Statistical calculations combine sample size and variability (standard deviation) to generate a Confidence Interval (CI) for the population mean. As its name suggests, the Confidence Interval (CI) is a range of values. To interpret the confidence interval of the mean, you must assume that all the values wereindependently and randomly sampled from a population whose values are distributed according to aGaussiandistribution. If you accept those assumptions, there is a 95% chance that the 95% Confidence Interval (CI) contains the true population mean. In other words, if you generate many 95% Confidence Interval (CI)s from many samples, you can expect the 95% Confidence Interval (CI) to include the true population mean in 95% of the cases, and not to include the population mean value in the other 5%.Moreover, any textbook that offers a correct explanation of C probably includes a diagram like figure 1, which shows 95% Confidence Interval (CI)s around the sample means for a set of 20 independent samples, each of size n= 20, taken from a Normal population.

It is a familiar diagram, but the software provides controls that allow the user to take additional sets of samples, and see the running total of intervals that include and also to control many aspects of the display. Playing with the software can illustrate several basic facts about interval estimates, including:

1. The level of confidence C is the percentage of intervals that, in the long run, include the population mean, which is fixed but unknown.

2. If C = 95, sets of 20 samples on average contain one Confidence Interval (CI) that does not include But often a set contains none, and occasionally a set contains three or even more. If hundreds of sets are taken, however, the overall percentage of intervals containing will be very close to C. It is a basic fact of randomness that in the short run results can appear quite grouped together in a close range as opposed to being scattered.

3. It is worth examining the whole set of intervals, but also bearing in mind that a researcher in practice takes only one sample, and so has to draw conclusions from just that one interval. That interval could be any single one of those displayed.

4. If , the population SD, is assumed known, all intervals are the same width; if is not assumed known and s, the sample SD, is used to calculate each interval, intervals vary inwidth from sample to sample as in figure 1.

5. Such variation is large for small n , and is still noticeable even for n as large as 50.

6. Points 1 and 2 above the conclusions about capture of apply both for the constant widthintervals based on and z, and for the varying-width intervals based on s and t.

7. The unknown is more often captured near the centre of an interval than near the lower orupper limit, or end-point, of an interval.

8. If 95% Confidence Interval (CI) are displayed, those few intervals that miss usually only just miss , but very occasionally an interval misses by a considerable distance.

9. Standard Error ( SE )bars are intervals that extend one standard error above and one standarderror below the sample means M. Standard error (SE) bars are about half the width of 95% Confidence Interval (CI), if is known or n is roughlyat least 10.

10. Increasing C gives longer intervals, reducing C gives shorter intervals. Therefore a 99% CI islonger than the corresponding 95% Confidence Interval (CI), whereas the 90% CI is shorter. Choosing C = 68 gives intervals very similar in length to SE bars, if is known or nis roughly at least 10.

Fig. 1. The 95% Confidence Interval (CI)s for the population mean , for a set of 20 independent replications of a study. Each sample has size n = 20. The horizontal line is . The Confidence Interval (CI)s are based on sample estimates of the population SD and so vary in width from sample to sample. Open circles are the means whose bars do not include . In the long run, 95% of the Confidence Interval (CI) s are expected to include (18 do here). Note that the Confidence Interval (CI) varies from sample to sample, but is fixed and in practice always unknown.

Fig. 2. Some of the many ways to display interval estimates. Graphics ad show a 95% Confidence Interval (CI). In c, SE bars are also shown, and in d the 50% Confidence Interval (CI). Confidence Interval (CI) is also marked. In e, four Confidence Interval (CI)s are shown, with C marked above, and SE bars are also included; the margin of error is shown below, for assumed known, as the z value. The relative chance of capturing at various points is indicated in f by the values, in g by width, and in h approximately by shading.