9
1 Probability Review E: set of equally likely outcomes A: an event E A outcomes likely equally of number happen to event for ways of number ) ( ) ( ) ( P A E n A n A Conditional Probability (Probability of A given B) Independent Events: ) ( P ) | ( P ) and ( P ) ( P ) and ( P ) ( ) and ( ) | ( P B B A B A B B A B n B A n B A ) ( P ) ( P ) and ( P ) ( P ) | ( P B A B A A B A Combined Probability Mutually Exclusive Events: ) and ( ) ( ) ( ) or ( P B A P B P A P B A ) ( ) ( ) or ( P 0 ) and ( B P A P B A B A P A B A and B E

Probability Review

  • Upload
    kaloni

  • View
    42

  • Download
    1

Embed Size (px)

DESCRIPTION

E. E: set of equally likely outcomes A : an event. A. Combined Probability Mutually Exclusive Events:. E. B. A and B. A. Conditional Probability (Probability of A given B) Independent Events:. Probability Review. Probability Example. Pets in a Pet Store. P(black pet) P(dog) - PowerPoint PPT Presentation

Citation preview

Page 1: Probability Review

1

Probability Review

E: set of equally likely outcomesA: an event

E

A

outcomeslikely equally ofnumber

happen toevent for waysofnumber

)(

)()(P

A

En

AnA

Conditional Probability (Probability of A given B)

Independent Events:

)(P)|(P) and (P )(P

) and (P

)(

) and ()|(P BBABA

B

BA

Bn

BAnBA

)(P)(P) and (P )(P)|(P BABAABA

Combined Probability

Mutually Exclusive Events:

) and ()()()or (P BAPBPAPBA

)()()or (P

0) and (

BPAPBA

BAP

AB A and B

E

Page 2: Probability Review

2

Probability Example

Pets in a Pet Store

black tawny

kitten 5 17

puppy 10 23

• P(black pet)• P(dog)• P(black dog)• P(dog|black pet)• P(black pet | dog)• P(dog or black)

• P(black pet) = 15/55 = 3/11• P(puppy) = 33/55 = 3/5• P(black puppy) = 10/55 = 2/11• P(puppy|black pet) = 10/15 = 2/3• P(black pet | puppy) = 10/33• P(puppy or black) = (5+10+23)/55 =38/55

black tawny total

kitten 5 17 22

puppy 10 23 33

total 15 40 55

Page 3: Probability Review

3

DNA Sequence

Bases (nucleotides)A: adenineG: guanineC: cytosineT: thymine

Purinesadenine and guanine

Pyrimidinescytosine and thymine

Page 4: Probability Review

4

Mutations to DNA Sequences

Base substitutionsS0: GCCATCTGAAS1: GCTATTTGGAS2: GCTATGTGAA

(a) transition:pur to pur or pyr to pyrEg A changed to G

(b) transversion:pur to pyr or pyr to purEg T changed to G

DeletionsS0: GCCATCTGAAS1: GCATCTGAA

InsertionsS0: GCCATCTGAAS1: GCCGATCTGAA

ReversalsS0: GCCATCTGAAS1: GCGTCTACAA

Page 5: Probability Review

5

Probability of a base at a site of a sequence

S0: GCCATCTGAAGTACTTGGACCATGCTGTTCAGAGGGTCGTX

Best estimate of the probability of each base at site X • P(A) • P(G) • P(C) • P(T)

Best estimate of the probability of each base at site X • P(A) = 8/40• P(G) = 12/40• P(C) = 9/40• P(T) = 11/40

S1: ACCACCTGAAGCACTAGGGCGATGCCGTTTAGAGAGTTGTX n(A)=10 n(G)=11 n(C)=9 n(T)=10

Estimate of the probability of each base at site X • P(A) • P(G) • P(C) • P(T)

Estimate of the probability of each base at site X (based on S1)• P(A) = 10/40• P(G) = 11/40• P(C) = 9/40• P(T) = 10/40

n(A)=8 n(G)=12 n(C)=9 n(T)=11 n(E)=40

Page 6: Probability Review

6

Probability of a base at a site in aligned sequences

S0: GCCATCTGAAGTACTTGGACCATGCTGTTCAGAGGGTCGTX S1: ACCACCTGAAGCACTAGGGCGATGCCGTTTAGAGAGTTGTX

Best estimate of the probability of a base in one sequence given a base in another•P(A1|A0) •P(A1|G0)

S0: GCCATCTGAAGTACTTGGACCATGCTGTTCAGAGGGTCGTX S1: ACCACCTGAAGCACTAGGGCGATGCCGTTTAGAGAGTTGTX

Best estimate of the probability of a base in one sequence given a base in another•P(A1|A0) = 7/8•P(A1|G0)

Best estimate of the probability of a base in one sequence given a base in another•P(A1|A0) = 7/8•P(A1|G0) = 2/12

S0: GCCATCTGAAGTACTTGGACCATGCTGTTCAGAGGGTCGTX S1: ACCACCTGAAGCACTAGGGCGATGCCGTTTAGAGAGTTGTX

Table comparing bases at site X in aligned sequences S0 and S1

S1 \ S0 A G C T

A 7 2

G

C

T

Total 8 12

S1 \ S0 A G C T Total

A 7 2 0 1 10

G 1 10 1 0 12

C 0 0 6 3 9

T 0 0 2 7 9

Total 8 12 9 11 40

Page 7: Probability Review

7

Conditional Probability

Table comparing bases at site X in aligned sequences S0 and S1

S1 \ S0 A G C T Total

A 7 2 0 1 10

G 1 10 1 0 12

C 0 0 6 3 9

T 0 0 2 7 9

Total 8 12 9 11 40

Estimate of the conditional probability of a base at a site in sequence S1 given a particular base at that site in sequence S0

S1 \ S0 A G C T

A 7/8 2/12 0 1/11

G 1/8 10/12 1/9 0

C 0 0 6/9 3/11

T 0 0 2/9 7/11

Total 1 1 1 1

Page 8: Probability Review

8

Conditional Probability

Table of conditional probability of a base at a site in sequence S1

given a particular base at that site in sequence S0

S1 \ S0 A G C T

A P(A1|A0) P(A1|G0) P(A1|C0) P(A1|T0)

G P(G1|A0) P(G1|G0) P(G1|C0) P(G1|T0)

C P(C1|A0) P(C1|G0) P(C1|C0) P(C1|T0)

T P(T1|A0) P(T1|G0) P(T1|C0) P(T1|T0)

Total 1 1 1 1

Objective: Create a model for the conditional probabilities in the above table and use the table to predict the probabilities of a particular base at a site in the future. For example, P(A1)

)()|()()|()()|()()|()(

) and (or ) and (or ) and (or ) and ()(

0010010010011

010101011

TPTAPCPCAPGPGAPAPAAPAP

TAPCAPGAPAAPAP

Page 9: Probability Review

9

Building a model to predict how sequences change

The probability of a base at a site in the future can be written as a matrix product. For example:

)(

)(

)(

)(

)|( )|( )|( )|()(

)()|()()|()()|()()|()(

0

0

0

0

010101011

0010010010011

TP

CP

GP

AP

TAPCAPGAPAAPAP

TPTAPCPCAPGPGAPAPAAPAP

001

01010101

01010101

01010101

01010101

0

0

0

0

0

and Then

)|( )|( )|( )|(

)|( )|( )|( )|(

)|( )|( )|( )|(

)|( )|( )|( )|(

and

)(

)(

)(

)(

Let

PMPPMP

TTPCTPGTPATP

TCPCCPGCPACP

TGPCGPGGPAGP

TAPCAPGAPAAP

M

TP

CP

GP

AP

P

tt

The matrix M is called a transition matrix. Entries are probabilities. Columns sum to one. This is an example of a Markov Model.