Central Limit Theorem - Dartmouth College

Preview:

Citation preview

Central Limit Theorem

Β§ number of measurements needed to estimate the mean

Β§ convergence of the estimated mean Β§ a better approximation than the

Chebyshev Inequality (LLN).

Xingru Chenxingru.chen.gr@dartmouth.edu

XC 2020

Central Limit Theorem

𝑋! 𝑋"

𝑋# 𝑋$

𝑋% β‹―

𝑋& 𝑋'normal

density function

+

+

+

+

+

XC 2020

Central Limit Theorem

𝑋! 𝑋"

𝑋# 𝑋$

𝑋% β‹―

𝑋& 𝑋'normal

density function

++

+

+

+

If 𝑆' is the sum of 𝑛 mutually independent random variables, then the distribution function of 𝑆' is well-approximated by a normal density function.

XC 2020

CENTRAL LIMIT THEOREM FOR BERNOULLI TRIALS

binomial distribution

XC 2020

Bernoulli Trials

A Bernoulli trials process is a sequence of 𝑛 chance experiments such that

Β§ Each experiment has two possible outcomes, which we may call success and failure.

Β§ The probability 𝑝 of success on each experiment is the same for each experiment, and this probability is not affected by any knowledge of previous outcomes. The probability π‘ž of failure is given by π‘ž = 1 βˆ’ 𝑝.

50%Toss a Coinhead & tail

60%Weather Forecast

not rain & rain

12.5%Wheel of Fortune50 points & others

𝒑Bernoulli Trialsoutcome 1 & 2

XC 2020

Binomial Distribution

Β§ Let 𝑛 be a positive integer and let 𝑝 be a real number between 0 and 1.

Β§ Let 𝐡 be the random variable which counts the number of successes in a Bernoulli trials process with parameters 𝑛 and 𝑝.

Β§ Then the distribution 𝑏(𝑛, 𝑝, π‘˜) of 𝐡 is called the binomial distribution.

Binomial Distribution

𝑏 𝑛, 𝑝, π‘˜ =π‘›π‘˜π‘(π‘ž')(

1 2 3 4 5

βœ— βœ“ βœ— βœ— βœ“

XC 2020

Binomial Distribution

Β§ Consider a Bernoulli trials process with probability 𝑝 for success on each trial.

Β§ Let 𝑋* = 1 or 0 according as the 𝑖th outcome is a success or failure.

Β§ Let 𝑆' = 𝑋& + 𝑋% +β‹―+ 𝑋'. Then 𝑆' is the number of successes in 𝑛 trials.

Β§ We know that 𝑆' has as its distribution the binomial probabilities 𝑏(𝑛, 𝑝, π‘˜).

XC 2020

𝑛 increases

drifts off to the right

flatten out

XC 2020

The Standardized Sum of 𝑆4

Β§ Consider a Bernoulli trials process with probability 𝑝 for success on each trial.

Β§ Let 𝑋* = 1 or 0 according as the 𝑖th outcome is a success or failure.

Β§ Let 𝑆' = 𝑋& + 𝑋% +β‹―+ 𝑋'. Then 𝑆' is the number of successes in 𝑛 trials.

Β§ We know that 𝑆' has as its distribution the binomial probabilities 𝑏(𝑛, 𝑝, π‘˜).

Β§ The standardized sum of 𝑆' is given by

𝑆'βˆ— =,!)'-'-.

.

πœ‡ = β‹― 𝜎% = β‹―

XC 2020

The Standardized Sum of 𝑆4

Β§ Consider a Bernoulli trials process with probability 𝑝 for success on each trial.

Β§ Let 𝑋* = 1 or 0 according as the 𝑖th outcome is a success or failure.

Β§ Let 𝑆' = 𝑋& + 𝑋% +β‹―+ 𝑋'. Then 𝑆' is the number of successes in 𝑛 trials.

Β§ We know that 𝑆' has as its distribution the binomial probabilities 𝑏(𝑛, 𝑝, π‘˜).

Β§ The standardized sum of 𝑆' is given by

𝑆'βˆ— =,!)'-'-.

.

Β§ 𝑆'βˆ— always has expected value 0 and variance 1.πœ‡ = 0

𝜎% = 1

XC 2020

The Standardized Sum of 𝑆4

=𝑆'βˆ— =𝑆' βˆ’ π‘›π‘π‘›π‘π‘ž

𝑏(𝑛, 𝑝, 1)𝑏(𝑛, 𝑝, 2)

𝑏(𝑛, 𝑝, 3)𝑏(𝑛, 𝑝, 4)

𝑏(𝑛, 𝑝, 5)

𝑏(𝑛, 𝑝, 0)

⋯𝑏(𝑛, 𝑝, 𝑛)

𝑏(𝑛, 𝑝, π‘˜)

0 1 2 3 4 5 … 𝑛

0 βˆ’ π‘›π‘π‘›π‘π‘ž

1 βˆ’ π‘›π‘π‘›π‘π‘ž

2 βˆ’ π‘›π‘π‘›π‘π‘ž

3 βˆ’ π‘›π‘π‘›π‘π‘ž

4 βˆ’ π‘›π‘π‘›π‘π‘ž

5 βˆ’ π‘›π‘π‘›π‘π‘ž

… 𝑛 βˆ’ π‘›π‘π‘›π‘π‘ž

π‘˜

π‘₯XC 2020

𝑆'βˆ— =𝑆' βˆ’ π‘›π‘π‘›π‘π‘ž

πœ‡ = 0

𝜎% = 1

XC 2020

standardized sum

standard normal distribution

shapes: sameheights: different

why?

XC 2020

standardized sum standard normal distribution

shapes: sameheights: different why?

standardized sum: βˆ‘(/0' β„Ž* = 1

standard normal distribution: ∫)1

21𝑓 π‘₯ 𝑑π‘₯ = 1

XC 2020

standardized sum: βˆ‘(/0' β„Ž* = 1

standard normal distribution: ∫)1

21𝑓 π‘₯ 𝑑π‘₯ = 1 β‰ˆ βˆ‘(/0' 𝑓 π‘₯* 𝑑π‘₯

C(/0

'

β„Ž* = 1 C(/0

'

𝑓 π‘₯* 𝑑π‘₯ β‰ˆ 1

=β„Ž( = 𝑓 π‘₯( 𝑑π‘₯

=𝑓 π‘₯( =β„Ž(𝑑π‘₯

XC 2020

=β„Ž) = 𝑏(𝑛, 𝑝, π‘˜)=𝑆'βˆ— =𝑆' βˆ’ π‘›π‘π‘›π‘π‘ž =𝑑π‘₯ = π‘₯)*& βˆ’ π‘₯) =

1π‘›π‘π‘ž

𝑏(𝑛, 𝑝, 1)𝑏(𝑛, 𝑝, 2)

𝑏(𝑛, 𝑝, 3)𝑏(𝑛, 𝑝, 4)

𝑏(𝑛, 𝑝, 5)

𝑏(𝑛, 𝑝, 0)

⋯𝑏(𝑛, 𝑝, 𝑛)

0 1 2 3 4 5 … 𝑛

0 βˆ’ π‘›π‘π‘›π‘π‘ž

1 βˆ’ π‘›π‘π‘›π‘π‘ž

2 βˆ’ π‘›π‘π‘›π‘π‘ž

3 βˆ’ π‘›π‘π‘›π‘π‘ž

4 βˆ’ π‘›π‘π‘›π‘π‘ž

5 βˆ’ π‘›π‘π‘›π‘π‘ž

… 𝑛 βˆ’ π‘›π‘π‘›π‘π‘ž

=π‘˜ = π‘›π‘π‘žπ‘₯) + 𝑛𝑝 π‘Ž : the integer nearest to π‘Ž

XC 2020

β„Ž( = 𝑏(𝑛, 𝑝, π‘˜)

𝑑π‘₯ = π‘₯(2& βˆ’ π‘₯( =1π‘›π‘π‘ž

π‘˜ = π‘›π‘π‘žπ‘₯( + 𝑛𝑝

=𝑓 π‘₯( = 3"45= π‘›π‘π‘ž 𝑏(𝑛, 𝑝, π‘›π‘π‘žπ‘₯( + 𝑛𝑝 )

XC 2020

Central Limit Theorem for Binomial Distributions

Β§ For the binomial distribution 𝑏(𝑛, 𝑝, π‘˜) we have

lim'β†’1

π‘›π‘π‘žπ‘ 𝑛, 𝑝, 𝑛𝑝 + π‘₯ π‘›π‘π‘ž = πœ™(π‘₯),

where πœ™(π‘₯) is the standard normal density.

Β§ The proof of this theorem can be carried out using Stirling’s approximation.

Stirling’s FormulaThe sequence 𝑛! is asymptotically equal

to 𝑛'𝑒)' 2πœ‹π‘›.

XC 2020

Approximating Binomial Distributions

=lim'β†’1

π‘›π‘π‘žπ‘ 𝑛, 𝑝, 𝑛𝑝 + π‘₯ π‘›π‘π‘ž = πœ™(π‘₯)

?𝑏 𝑛, 𝑝, π‘˜ β‰ˆ β‹―

normalbinomial

binomial normal

XC 2020

Approximating Binomial Distributions

=lim'β†’,

π‘›π‘π‘žπ‘ 𝑛, 𝑝, 𝑛𝑝 + π‘₯ π‘›π‘π‘ž = πœ™(π‘₯) ?𝑏 𝑛, 𝑝, π‘˜ β‰ˆ β‹―

π‘˜ = 𝑛𝑝 + π‘₯ π‘›π‘π‘ž

π‘₯ =π‘˜ βˆ’ π‘›π‘π‘›π‘π‘ž

𝑏 𝑛, 𝑝, π‘˜ β‰ˆπœ™(π‘₯)π‘›π‘π‘ž

𝑏(𝑛, 𝑝, π‘˜) β‰ˆ1π‘›π‘π‘ž

πœ™(π‘˜ βˆ’ π‘›π‘π‘›π‘π‘ž

)

XC 2020

Example

Toss a coin

Find the probability of exactly 55 heads in 100 tosses of a coin.

=lim'β†’,

π‘›π‘π‘žπ‘ 𝑛, 𝑝, 𝑛𝑝 + π‘₯ π‘›π‘π‘ž = πœ™(π‘₯)

𝑏(𝑛, 𝑝, π‘˜) β‰ˆ1π‘›π‘π‘ž

πœ™(π‘˜ βˆ’ π‘›π‘π‘›π‘π‘ž

)

XC 2020

Example

Toss a coin

Find the probability of exactly 55 heads in 100 tosses of a coin.

𝑏(𝑛, 𝑝, π‘˜) β‰ˆ1π‘›π‘π‘ž

πœ™(π‘˜ βˆ’ π‘›π‘π‘›π‘π‘ž

)𝑛 = 100 𝑝 =12

𝑛𝑝 = 50 π‘›π‘π‘ž = 5

π‘˜ = 55

π‘₯ =π‘˜ βˆ’ π‘›π‘π‘›π‘π‘ž = 1 𝑏(100,

12, 55) β‰ˆ

15πœ™(1)

XC 2020

Example

Toss a coin

Find the probability of exactly 55 heads in 100 tosses of a coin.

𝑏(𝑛, 𝑝, π‘˜) β‰ˆ1π‘›π‘π‘ž πœ™(

π‘˜ βˆ’ π‘›π‘π‘›π‘π‘ž )

𝑏 100,12, 55 β‰ˆ

15πœ™ 1 =

15(12πœ‹

𝑒)&/%)

Standard normal distribution 𝒁

πœ‡ = 0 and 𝜎 = 1

𝑓8 π‘₯ = &%9:

𝑒) 5); #/%:# = &%9𝑒)5#/%.

XC 2020

𝑏(𝑛, 𝑝, 1)𝑏(𝑛, 𝑝, 2)

𝑏(𝑛, 𝑝, 3)𝑏(𝑛, 𝑝, 4)

𝑏(𝑛, 𝑝, 5)

𝑏(𝑛, 𝑝, 0)

⋯𝑏(𝑛, 𝑝, 𝑛)

0 1 2 3 4 5 … 𝑛

0 βˆ’ π‘›π‘π‘›π‘π‘ž

1 βˆ’ π‘›π‘π‘›π‘π‘ž

2 βˆ’ π‘›π‘π‘›π‘π‘ž

3 βˆ’ π‘›π‘π‘›π‘π‘ž

4 βˆ’ π‘›π‘π‘›π‘π‘ž

5 βˆ’ π‘›π‘π‘›π‘π‘ž

… 𝑛 βˆ’ π‘›π‘π‘›π‘π‘ž

A specific outcome

XC 2020

𝑏(𝑛, 𝑝, 1)𝑏(𝑛, 𝑝, 2)

𝑏(𝑛, 𝑝, 3)𝑏(𝑛, 𝑝, 4)

𝑏(𝑛, 𝑝, 5)

𝑏(𝑛, 𝑝, 0)

⋯𝑏(𝑛, 𝑝, 𝑛)

0 1 2 3 4 5 … 𝑛

0 βˆ’ π‘›π‘π‘›π‘π‘ž

1 βˆ’ π‘›π‘π‘›π‘π‘ž

2 βˆ’ π‘›π‘π‘›π‘π‘ž

3 βˆ’ π‘›π‘π‘›π‘π‘ž

4 βˆ’ π‘›π‘π‘›π‘π‘ž

5 βˆ’ π‘›π‘π‘›π‘π‘ž

… 𝑛 βˆ’ π‘›π‘π‘›π‘π‘ž

Outcomes in an Interval

XC 2020

Central Limit Theorem for Binomial Distributions

Β§ Let 𝑆' be the number of successes in n Bernoulli trials with probability 𝑝 for success, and let π‘Ž and 𝑏 be two fixed real numbers.

Β§ Then

lim'β†’21

𝑃 π‘Ž ≀ ,!)'-'-.

≀ 𝑏 = ∫<=πœ™ π‘₯ 𝑑π‘₯ = NA(π‘Ž, 𝑏).

Β§ We denote this area by NA(π‘Ž, 𝑏).

π‘Ž ≀𝑆' βˆ’ π‘›π‘π‘›π‘π‘ž

= 𝑆'βˆ— ≀ 𝑏 π‘Ž π‘›π‘π‘ž + 𝑛𝑝 ≀ 𝑆' ≀ 𝑏 π‘›π‘π‘ž + 𝑛𝑝

XC 2020

lim'β†’21

𝑃 π‘Ž ≀ ,!)'-'-.

≀ 𝑏 = NA(π‘Ž, 𝑏).

π‘Ž ≀𝑆' βˆ’ π‘›π‘π‘›π‘π‘ž

= 𝑆'βˆ— ≀ 𝑏 π‘Ž π‘›π‘π‘ž + 𝑛𝑝 ≀ 𝑆' ≀ 𝑏 π‘›π‘π‘ž + 𝑛𝑝

𝑖 ≀ 𝑆' ≀ 𝑗 π‘Ž =𝑖 βˆ’ π‘›π‘π‘›π‘π‘ž

≀ 𝑆'βˆ— ≀𝑗 βˆ’ π‘›π‘π‘›π‘π‘ž

= 𝑏

lim'β†’21

𝑃 𝑖 ≀ 𝑆' ≀ 𝑗 = lim'β†’21

𝑃 *)'-'-.

≀ 𝑆'βˆ— ≀>)'-'-.

= NA(*)'-'-.

, >)'-'-.

).

Approximating Binomial Distributions

XC 2020

𝑖 ≀ 𝑆' ≀ 𝑗 π‘Ž =𝑖 βˆ’ π‘›π‘π‘›π‘π‘ž

≀ 𝑆'βˆ— ≀𝑗 βˆ’ π‘›π‘π‘›π‘π‘ž

= 𝑏

lim'β†’21

𝑃 𝑖 ≀ 𝑆' ≀ 𝑗 = lim'β†’21

𝑃 *)'-'-.

≀ 𝑆'βˆ— ≀>)'-'-.

= NA(*)'-'-.

, >)'-'-.

).

Approximating Binomial Distributions (more accurate)

lim@β†’BC

𝑃 𝑖 ≀ 𝑆@ ≀ 𝑗 = NA(DE!"E@F

@FG,HB!"E@F

@FG).

𝑖 𝑗

XC 2020

Example

Toss a coin

A coin is tossed 100 times. Estimate the probability that the number of heads lies between 40 and 60 (including the end points).

=lim'β†’21

𝑃 𝑖 ≀ 𝑆' ≀ 𝑗 = NA(𝑖 βˆ’ 12 βˆ’ 𝑛𝑝

π‘›π‘π‘ž ,𝑗 + 12 βˆ’ 𝑛𝑝

π‘›π‘π‘ž )

XC 2020

Toss a coin

A coin is tossed 100 times. Estimate the probability that the number of heads lies between 40 and 60 (including the end points).

𝑛 = 100 𝑝 =12

𝑛𝑝 = 50 π‘›π‘π‘ž = 5

𝑖 = 40

𝑖 βˆ’ 12 βˆ’ π‘›π‘π‘›π‘π‘ž

= βˆ’2.1

𝑗 = 60

𝑗 + 12 βˆ’ π‘›π‘π‘›π‘π‘ž

= 2.1

the outcome will not deviate by more than two standard deviations from the expected value

XC 2020

Toss a coin

A coin is tossed 100 times. Estimate the probability that the number of heads lies between 40 and 60 (including the end points).

𝑖 = 40

𝑖 βˆ’ 12 βˆ’ π‘›π‘π‘›π‘π‘ž = βˆ’2.1

𝑗 = 60

𝑗 + 12 βˆ’ π‘›π‘π‘›π‘π‘ž = 2.1

=lim$β†’&'

𝑃 𝑖 ≀ 𝑆$ ≀ 𝑗 = NA(𝑖 βˆ’ 12 βˆ’ 𝑛𝑝

π‘›π‘π‘ž,𝑗 + 12 βˆ’ 𝑛𝑝

π‘›π‘π‘ž)

=lim$β†’&'

𝑃 40 ≀ 𝑆$ ≀ 60 = NA βˆ’2.1,2.1= 2NA 0, 2.1

2𝜎 2.1𝜎

XC 2020

Approximating Binomial Distribution

π‘Ίπ’βˆ— = 𝒙lim$β†’'

π‘›π‘π‘žπ‘ 𝑛, 𝑝, 𝑛𝑝 + π‘₯ π‘›π‘π‘ž = πœ™(π‘₯)

𝑺𝒏 = π’Œ

𝑏(𝑛, 𝑝, π‘˜) β‰ˆ1π‘›π‘π‘ž πœ™(

π‘˜ βˆ’ π‘›π‘π‘›π‘π‘ž )

𝒂 ≀ π‘Ίπ’βˆ— ≀ 𝒃

lim'β†’*,

𝑃 π‘Ž ≀ -!.'/'/0

≀ 𝑏 = NA(π‘Ž, 𝑏).

π’Š ≀ 𝑺𝒏 ≀ 𝒋

lim$β†’&'

𝑃 𝑖 ≀ 𝑆$ ≀ 𝑗 = NA(()!")$*

$*+,,&!")$*

$*+).

CLT

XC 2020

CENTRAL LIMIT THEOREM FOR DISCRETE INDEPENDENT TRIALS

any independent trials process such that the individual trials have finite variance

XC 2020

The Standardized Sum of 𝑆4

Β§ Consider an independent trials process with common distribution function π‘š π‘₯defined on the integers, with expected value πœ‡ and variance 𝜎%.

Β§ Let 𝑆' = 𝑋& + 𝑋% +β‹―+ 𝑋' be the sum of 𝑛 independent discrete random variables of the process.

Β§ The standardized sum of 𝑆' is given by

𝑆'βˆ— =,!)';':#

.

Β§ 𝑆'βˆ— always has expected value 0 and variance 1.

𝐸(𝑆') = β‹―

𝑉(𝑆') = β‹―

XC 2020

independent trials process

𝑃(𝑆' = π‘˜)

π‘˜ π‘₯)=π‘˜ βˆ’ π‘›πœ‡π‘›πœŽ%

=𝑆'βˆ— =𝑆' βˆ’ π‘›πœ‡π‘›πœŽ%

normal

π‘›πœŽ-𝑃(𝑆$ = π‘˜)

XC 2020

Approximation Theorem

Β§ Let 𝑋&, 𝑋%, β‹―, 𝑋' be an independent trials process and let 𝑆' = 𝑋& + 𝑋% +β‹―+ 𝑋'.

Β§ Assume that the greatest common divisor of the differences of all the values that the 𝑋( can take on is 1.

Β§ Let 𝐸 𝑋( = πœ‡ and 𝑉 𝑋( = 𝜎%.

Β§ Then for 𝑛 large,

𝑃(𝑆' = π‘˜) β‰ˆ &':#

πœ™(()';':#

).

Β§ Here πœ™(π‘₯) is the standard normal density.

XC 2020

Central Limit Theorem for a Discrete Independent Trials Process

Β§ Let 𝑆' = 𝑋& + 𝑋% +β‹―+ 𝑋' be the sum of n discrete independent random variables with common distribution having expected value πœ‡ and variance 𝜎%.

Β§ Then, for π‘Ž < 𝑏,

lim'β†’21

𝑃 π‘Ž ≀ ,!)';':#

≀ 𝑏 = &%9∫<

= 𝑒)5#/%𝑑π‘₯ = NA(π‘Ž, 𝑏).

XC 2020

Approximating a Discrete Independent Trials Process

𝑃(𝑆' = π‘˜) β‰ˆ1π‘›πœŽ%

πœ™(π‘˜ βˆ’ π‘›πœ‡π‘›πœŽ%

)

lim'β†’*,

𝑃 𝑐 ≀ 𝑆' ≀ 𝑑 =

NA(1.'2'3"

, 4.'2'3"

).

𝑺𝒏 = π’Œ

𝒄 ≀ 𝑺𝒏 ≀ 𝒅

CLT

XC 2020

Approximating a Discrete Independent Trials Process

lim'β†’*,

𝑃 𝑖 ≀ 𝑆' ≀ 𝑗 =

NA(5.#".'/

'/0,6*#".'/

'/0).

π’Š ≀ 𝑺𝒏 ≀ 𝒋

𝑃(𝑆' = π‘˜) β‰ˆ1π‘›πœŽ%

πœ™(π‘˜ βˆ’ π‘›πœ‡π‘›πœŽ%

)

𝑺𝒏 = π’Œ

𝑏(𝑛, 𝑝, π‘˜) β‰ˆ1π‘›π‘π‘ž

πœ™(π‘˜ βˆ’ π‘›π‘π‘›π‘π‘ž

)

𝑺𝒏 = π’Œ

lim'β†’*,

𝑃 𝑐 ≀ 𝑆' ≀ 𝑑 =

NA(1.'2'3"

, 4.'2'3"

).

𝒄 ≀ 𝑺𝒏 ≀ 𝒅

Bernoulli

XC 2020

ExampleA surveying instrument makes an error of -2, -1, 0, 1, or 2 feet with equal probabilities when measuring the height of a 200-foot tower.

XC 2020

Find the expected value and the variance for the height obtained using this instrument once.

Example

height = error + 200

error -2 -1 0 1 2

probability15

15

15

15

15

𝐸 error =15βˆ’2 βˆ’ 1 + 0 + 1 + 2 = 0

𝑉 error =15(βˆ’2)%+(βˆ’1)%+0% + 1% + 2% = 2

𝐸 height= 0 + 200 = 200

𝑉 height = 2

XC 2020

Estimate the probability that in 18 independent measurements of this tower, the average of the measurements is between 199 and 201, inclusive.

Example

πœ‡ = 200 𝜎% = 2

=lim'β†’21

𝑃 𝑐 ≀ 𝑆' ≀ 𝑑 = NA(𝑐 βˆ’ π‘›πœ‡π‘›πœŽ%

,𝑑 βˆ’ π‘›πœ‡π‘›πœŽ%

).

199 ≀𝑆'𝑛=𝑆'18

≀ 201𝑛 = 18

XC 2020

Example

πœ‡ = 200 𝜎% = 2

=lim'β†’*,

𝑃 𝑐 ≀ 𝑆' ≀ 𝑑 = NA(𝑐 βˆ’ π‘›πœ‡π‘›πœŽ%

,𝑑 βˆ’ π‘›πœ‡π‘›πœŽ%

).

199 ≀𝑆'𝑛=𝑆'18

≀ 201𝑛 = 18

𝑃 199 ≀𝑆'18

≀ 201 = 𝑃 3582 ≀ 𝑆' ≀ 3618

β‰ˆ NA3582 βˆ’ 3600

36,3618 βˆ’ 3600

36= NA(βˆ’3, 3).

XC 2020

03

01

02

Β§ independent

Discrete Random Variables

§ independent§ identical

Bernoulli Trials

§ independent§ identical

Discrete Trials

Central Limit Theorem

XC 2020

Central Limit Theorem

Β§ Let 𝑋&, 𝑋%, β‹―, 𝑋' be a sequence of independent discrete random variables. There exist a constant 𝐴, such that 𝑋* ≀ 𝐴 for all 𝑖.

Β§ Let 𝑆' = 𝑋& + 𝑋% +β‹―+ 𝑋' be the sum. Assume that 𝑆' β†’ ∞.

Β§ For each 𝑖, denote the expected value and variance of 𝑋* by πœ‡* and 𝜎*%, respectively.

Β§ Define the expected value and variance of 𝑆' to be π‘š' and 𝑠'%, respectively.

Β§ For π‘Ž < 𝑏,

lim'β†’21

𝑃 π‘Ž ≀ ,!)@!A!

≀ 𝑏 = &%9∫<

= 𝑒)5#/%𝑑π‘₯ = NA(π‘Ž, 𝑏).

XC 2020

CENTRAL LIMIT THEOREM FOR CONTINUOUS INDEPENDENT TRIALScontinuous random variables with a common density function

XC 2020

Central Limit Theorem

Β§ Let 𝑆' = 𝑋& + 𝑋% +β‹―+ 𝑋' be the sum of 𝑛 independent continuous random variables with common density function 𝑝 having expected value πœ‡ and variance 𝜎%.

Β§ Let

𝑆'βˆ— =,!)';':#

.

Β§ Then we have, for all π‘Ž < 𝑏,

lim'β†’21

𝑃 π‘Ž ≀ 𝑆'βˆ— ≀ 𝑏 = &%9∫<

= 𝑒)5#/%𝑑π‘₯ = NA(π‘Ž, 𝑏).

XC 2020

Example

Β§ Suppose a surveyor wants to measure a known distance, say of 1 mile, using a transit and some method of triangulation.

Β§ He knows that because of possible motion of the transit, atmospheric distortions, and human error, any one measurement is apt to slightly in error.

Β§ He plans to make several measurements and take an average.

Β§ He assumes that his measurements are independent random variables with common distribution of mean πœ‡ = 1 and standard deviation 𝜎 =0.0002.

XC 2020

Example

Β§ He can say that if 𝑛 is large, the average ,!

'has a density function that

is approximately normal, with mean 1mile, and standard deviation 0.000%

'miles.

Β§ How many measurements should he make to be reasonably sure that his average lies within 0.0001 of the true value?

XC 2020

Example

πœ‡ = 1

Expected Value

𝜎 = 0.0002

Standard Deviation𝑆' = 𝑋& + 𝑋% +β‹―+ 𝑋'

Sum of 𝒏 independent measurements

Β§ He can say that if 𝑛 is large, the average ,!' has a density function that is approximately normal, with mean 1 mile, and standard deviation 0.000%

'miles.

𝜎U = (0.0002)UVariance 𝐴' =

𝑋& + 𝑋% +β‹―+ 𝑋'𝑛

𝐸 𝐴' = πœ‡π· 𝐴' =

πœŽπ‘›

𝑉 𝐴' =𝜎%

𝑛

Average of 𝒏 independent measurements

XC 2020

Example

Chebyshev inequality

LLN

𝑆'βˆ— =𝑆' βˆ’ π‘›πœ‡π‘›πœŽ%

CLT

Β§ How many measurements should he make to be reasonably sure that his average lies within 0.0001 of the true value?

XC 2020

𝝈𝟐

𝜺𝟐

𝑷(|𝑿 βˆ’ 𝝁|β‰₯ 𝜺)

inequalityChebyshev𝑷(|𝑿 βˆ’ 𝝁| β‰₯ 𝜺) ≀

𝝈𝟐

𝜺𝟐

This is a sample text. Insert your desired text

here.

This is a sample text. Insert your desired text here.

Let 𝑋 be a discrete random variable with expected value πœ‡ = 𝐸(𝑋) and variance 𝜎% = 𝑉(𝑋), and let πœ€ > 0 be any positive number. Then

not necessarily positive

XC 2020

Example

Chebyshev inequality

LLN

Β§ How many measurements should he make to be reasonably sure that his average lies within 0.0001 of the true value?

πœ‡ = 1 𝜎% = (0.0002)%

=𝑃(|𝑋 βˆ’ πœ‡| β‰₯ πœ€) β‰€πœŽ%

πœ€%

𝑃 𝐴' βˆ’ πœ‡ β‰₯ 0.0001 β‰€πœŽ%

π‘›πœ€%=

0.0002 %

𝑛 0.0001 % =4𝑛

XC 2020

Example

Chebyshev inequality

LLN

Β§ How many measurements should he make to be reasonably sure that his average lies within 0.0001 of the true value?

=𝑃(|𝑋 βˆ’ πœ‡| β‰₯ πœ€) β‰€πœŽ%

πœ€%

𝑃 𝐴' βˆ’ πœ‡ β‰₯ 0.0001 β‰€πœŽ%

π‘›πœ€%=4𝑛= 0.05 𝑛 = 80

XC 2020

Example

𝑆'βˆ— =𝑆' βˆ’ π‘›πœ‡π‘›πœŽ%

CLT

=lim'β†’21

𝑃 π‘Ž ≀ 𝑆'βˆ— ≀ 𝑏 = NA(π‘Ž, 𝑏)

𝑃 𝐴' βˆ’ πœ‡ ≀ 0.0001 = 𝑃 𝑆' βˆ’ π‘›πœ‡ ≀ 0.5π‘›πœŽ

= 𝑃𝑆' βˆ’ π‘›πœ‡π‘›πœŽ%

≀ 0.5 𝑛 = 𝑃 βˆ’0.5 𝑛 ≀ 𝑆'βˆ— ≀ 0.5 𝑛

β‰ˆ NA βˆ’0.5 𝑛, 0.5 𝑛

πœ‡ = 1 𝜎% = (0.0002)%

XC 2020

Example

𝑆'βˆ— =𝑆' βˆ’ π‘›πœ‡π‘›πœŽ%

CLT

=lim'β†’21

𝑃 π‘Ž ≀ 𝑆'βˆ— ≀ 𝑏 = NA(π‘Ž, 𝑏)

𝑃 𝐴' βˆ’ πœ‡ ≀ 0.0001 β‰ˆ NA βˆ’0.5 𝑛, 0.5 𝑛 = 0.95

0.5 𝑛 = 2 𝑛 = 16

XC 2020

Example

Chebyshev inequality

LLN

𝑆'βˆ— =𝑆' βˆ’ π‘›πœ‡π‘›πœŽ%

CLT

Β§ How many measurements should he make to be reasonably sure that his average lies within 0.0001 of the true value?

𝑛 = 80 𝑛 = 16

XC 2020

Recommended