25
Randomized Algorithms CS648 Lecture 2 Randomized Algorithm for Approximate Median Elementary Probability theory 1

Randomized Algorithms Randomized Algorithms CS648 Lecture 2 Randomized Algorithm for Approximate Median Elementary Probability theory Lecture 2 Randomized

Embed Size (px)

Citation preview

Randomized AlgorithmsCS648

Lecture 2• Randomized Algorithm for Approximate Median• Elementary Probability theory

1

RANDOMIZED MONTE CARLO ALGORITHM FOR

APPROXIMATE MEDIAN

2

This lecture was delivered at slow pace and its flavor was that of a tutorial.

Reason: To show that designing and analyzing a randomized algorithm demands right insight and just elementary probability.

A simple probability exercise

There is a coin which gives HEADS with probability ¼ and TAILS with probability ¾. The coin is tossed times. What is the probability that we get at least HEADS ? [Stirling’s approximation for Factorial: ]

3

Probability of getting “at least HEADS in tosses”

Probability of getting at least heads:

4

Using Stirling’s approximation

Since , so …

Inverse exponential in .

Approximate median

Definition: Given an array A[] storing n numbers and ϵ > 0, compute an element whose rank is in the range [(1- ϵ)n/2, (1+ ϵ)n/2].

Best Deterministic Algorithm: • “Median of Medians” algorithm for finding exact median • Running time: O(n) • No faster algorithm possible for approximate median

5

Can you give a short proof ?

½ - Approximate medianA Randomized Algorithm

Rand-Approx-Median(A)1. Let k c log n;2. S ∅;3. For i=1 to k 4. x an element selected randomly uniformly from A;5. S S U {x};6. Sort S.7. Report the median of S.

Running time: O(log n loglog n)

6

Analyzing the error probability of Rand-approx-median

7

Elements of A arranged in Increasing order of values

n/4 3n/4

Right QuarterLeft Quarter

When does the algorithm err ?To answer this question, try to characterize what

will be a bad sample S ?

Analyzing the error probability of Rand-approx-median

Observation: Algorithm makes an error only if k/2 or more elements sampled from the Right Quarter (or Left Quarter).

8

n/4

Left Quarter Right Quarter

Elements of A arranged in Increasing order of values

3n/4 Median of S

Analyzing the error probability of Rand-approx-median

Pr[ An element selected randomly from A is from Right quarter] = ?? Pr[ Out of k elements sampled from A, at least k/2 are from Right quarter] = ?? for

9

Elements of A arranged in Increasing order of values

n/4 3n/4

Right QuarterLeft Quarter

Exactly the same as the coin tossing exercise we did !

¼

Main result we discussed

Theorem: The Rand-approx-median algorithm fails to report ½ -approximate median from array A[1.. ] with probability at most.

Homework: Design a randomized Monte Carlo algorithm for computing ϵ-approximate median of array A[1.. ] with running time O(log n loglog n) and error probability for any given constants ϵ and .

[Do this homework sincerely without any friend’s help.]

10

ELEMENTARY PROBABILITY THEORY

(IT IS SO SIMPLE THAT YOU UNDERESTIMATE ITS ELEGANCE AND POWER)

11

Elementary probability theory(Relevant for CS648)

• We shall mainly deal with discrete probability theory in this course.• We shall take the set theoretic approach to explain probability theory.

Consider any random experiment :o Tossing a coin 5 times.o Throwing a dice 2 times.o Selecting a number randomly uniformly from [1..n].

How to capture the following facts in the theory of probability ?1. Outcome will always be from a specified set.2. Likelihood of each possible outcome is non-negative.3. We may be interested in a collection of outcomes.

12

Probability SpaceDefinition: Probability space associated with a random experiment is an ordered pair (Ω,P), where• Ω is the set of all possible outcomes of the random experiment• P : Ω R such that

– P(ω) ≥ 0 for each ωϵ Ω

Elements of Ω are called elementary events or sample points.

13

Ω

Event in a Probability SpaceDefinition: An event A in a probability space (Ω,P) is a subset of Ω. The probability of event A is defined as

For sake of compact notation, we extend P for events as described above.

14

Exercises

A randomized algorithm can also be viewed as a random experiment.1. What is the sample space associated with Randomized Quick sort ?2. What is the sample space associated with Rand-approx-median

algorithm ?

15

An Important Advice

In the following slides, we shall state well known equations (highlighted in yellow boxes) from probability theory.• You should internalize them fully. • We shall use them crucially in this course.• Make sincere attempts to solve exercises that follow.

16

Union of two Events

Given two events A and B defined over a probability space (,P), what is P(AUB) ?

Try to prove it by showing the following: Each ω ϵ AUB contributes exactly P(ω) in the right hand side.

17

AB

P(AUB) = P(A) + P(B) P(A∩B)

Ω

Union of three Events

Given three events A₁, A₂, A₃, defined over a probability space (,P), what is P(A₁ U A₂ U A₃) ?

Try to prove this equation as well by showing the following: Each ω ϵ A₁ U A₂UA₃ contributes exactly P(ω) in the right hand side.

18

A B

C

P(A₁ U A₂UA₃) = P(A₁) + P(A₂) + P( A₃) P(A₁∩A₂) P(A₂∩A₃) P(A₁∩A₃) + P(A₁∩A₂∩A₃)

Ω

Exercises• For events ,…, defined over a probability space (,P), prove that P() = … )

• There are letters envelopes. For each letter, there is a unique envelope in which it should be placed. A careless postman places the letters randomly into envelopes (one letter in each envelope). What is the probability that no letter is placed correctly (into the envelope meant for it) ?

19

Conditional Probability

Happening of some event influences the likelihood of happening of other events. This notion is formally captured by conditional probability as follows.

Probability of event A conditioned on event B, compactly represented as P[A|B], means the following.Given that event B has happened, what is the probability that event A has also happened ?You might have seen and used the following equation for conditional probability.

Can you give suitable reason to justify the validity of the above equation ?In particular, give justification for ] in numerator and ] in denominator in this equation.

20

P[A|B] =

Exercises

• A man possesses five coins, two of which are double-headed, one is double-tailed, and two are normal. He shuts his eyes, picks a coin at random, and tosses it. What is the probability that the lower face of the coin is a head ? He opens his eyes and sees that the coin is showing heads; what it the probability that the lower face is a head ? He shuts his eyes again, and tosses the coin again. What is the probability that the lower face is a head ? He opens his eyes and sees that the coin is showing heads; what is the probability that the lower face is a head ? He discards this coin, picks another at random, and tosses it. What is the probability that it shows heads ?

21

Partition of sample space and an “important Equation”

A set of events ,…, defined over a probability space (,P) is said to induce a partition of if• = • =∅ for all

Given an event B, how can we express P(B) in terms of a given partition ?

22

P(B) = )

ΩB

Exercises

• There are sticks each of different heights. There are vacant slots arranged along a line and numbered from 1 to as we move from left to right. The sticks are placed into the slots according to a uniformly random permutation. A stick placed at th slot is said to be a dominating stick if its height is largest among all sticks placed in slots 1 to . Find the probability that th slot contains a dominating stick.

23

Independent Events

Two events A and B defined over a probability space (,P) are said to be independent if happening of one of them has no influence on the probability of the another event. Mathematically, it means that P(A|B)= P(A) and P(B|A)=P(B)

The following equation also compactly captures independence of two events.

Question: Can two independent events ever be disjoint ?

24

P(A ∩ B) = P(A) · P(B)

Exercises

1. Two fair dice are rolled. Show that the event that their sum is 7 is independent of the score shown by the first die.

2. Let (,P) be a probability space where = {1,2,…,p} for a given prime number p, and each elementary event has probability 1/p. Show that if two events A and B defined over (,P) are independent, then at least one of A and B is either ∅ or .

25