32
1 White Parts from: Technical overview for machin e-learning researcher – slide s from UAI 1999 tutorial Part II

1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

  • View
    215

  • Download
    1

Embed Size (px)

Citation preview

Page 1: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

1

White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorial

Part II

Page 2: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

2

Page 3: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

3

Page 4: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

4

= Ct,h

Example: for (ht + htthh), we get p(d|m) = 3!2!/6!

Page 5: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

5

Page 6: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

6

Numerical example for the network X1 X2

Imaginary sample sizes denoted N’ijk

Data: (true, true) and (true, false)

Page 7: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

7

Page 8: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

8

Used so far

Desired

Page 9: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

9

How do we assign structure and parameter priors ?

Structure priors: Uniform, partial order (allowed/prohibited edges), proportional to similarity to some a priori network.

Page 10: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

10

BDeK2

Page 11: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

11

Page 12: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

12

Page 13: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

13

Page 14: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

14

)|()1(

)1()|( h

yxyxxx

yyhxyxy mpmp

Page 15: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

15

Page 16: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

16

Page 17: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

17

Page 18: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

18

Example: Suppose the hyper distribution for (X1,X2) is Dir( a00, a01 ,a10, a11).

So how to generate parameter priors?

Page 19: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

19

Example: Suppose the hyper distribution for (X1,X2) is Dir( a00, a01 ,a10, a11)This determines a Dirichlet distribution for the parameters of both directed models.

Page 20: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

20

Page 21: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

21

Summary: Suppose the parameters for (X1,X2) are distributed Dir( a00, a01 ,a10, a11).Then, parameters for X1 are distributed Dir(a00+a01 ,a10+a11).Similarly, parameters for X2 are distributed Dir(a00+a10 ,a01+a11).

Page 22: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

22

BDe score:

Page 23: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

23

Page 24: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

24

Page 25: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

25

Page 26: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

26

Example: f(x+y) = f(x) f(y)Example: f(x+y) = f(x) f(y)Solution: (ln f )`(x+y) = (ln f )`(x) Solution: (ln f )`(x+y) = (ln f )`(x) and so: (ln f )`(x) = constantand so: (ln f )`(x) = constantHence: (ln f )(x) = linear functionHence: (ln f )(x) = linear functionhence: f(x) = c ehence: f(x) = c eaxax Assumptions: Positive everywhere, DifferentiableAssumptions: Positive everywhere, Differentiable

Functional Equations Example

Page 27: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

27

The bivariate discrete case

Page 28: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

28

The bivariate discrete case

Page 29: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

29

The bivariate discrete case

Page 30: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

30

The bivariate discrete case

Page 31: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

31

Page 32: 1 White Parts from: Technical overview for machine-learning researcher – slides from UAI 1999 tutorialTechnical overview for machine-learning researcher

32