14
Markov Indecision Markov Indecision Processes Processes Harry Q. Bovik, Harry Q. Bovik, Judy Q. Judy Q. Goldsmith Goldsmith , Q. Andrew , Q. Andrew Klapper, Michael Q. Klapper, Michael Q. Littman Littman Journal of Machine Learning Gossip

Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Embed Size (px)

Citation preview

Page 1: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Markov Indecision Markov Indecision ProcessesProcesses

Harry Q. Bovik, Harry Q. Bovik, Judy Q. Judy Q. GoldsmithGoldsmith, Q. Andrew , Q. Andrew

Klapper, Michael Q. LittmanKlapper, Michael Q. Littman

Journal of Machine Learning Gossip

Page 2: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

What’s it all about, Alpha?What’s it all about, Alpha?

Choose inactionsChoose inactions Optimize expected Optimize expected

outcomesoutcomes Minimize regretsMinimize regrets Mazimize confusion Mazimize confusion

(getting it all together(getting it all together Accentuate the Accentuate the

positvepositve

Page 3: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Definition

Let M = <I, I, I, I>, where

I is an indexed set of states, including a distinguished state iC , the state of confusion

I is the set of indecisionsI is the set of inactions

The next state is a function of the current state, current indecision, and current inaction. The process ignores all this and chooses a next state via state transgression.

Page 4: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Transgressions• The Markov indEciSion proceSs, or MESS,

ends in a state of disbelief. • Associated with each state and inaction is

something called the futility level, aka the frustration value.

• The agent tries to optimize net fultility over time, usually post-hock.

• This is called RETROOPTIMAL POLICY DETERMINATION

Page 5: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Relevant Clip ArtRelevant Clip Art

Page 6: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

APPLICATIONS

Page 7: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

•Deciding when to purchase airline tickets*

•The 2000 US Presidential Election

Page 8: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

What I really want to tell you What I really want to tell you about giving a talk…..about giving a talk…..

Page 9: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Organizing the technical material

• What is the problem being solved?

• What’s the solution?

• How is it new, what does it improve on?

• How does it fit into the bigger picture of related work?

Page 10: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Basics of technical presentations

• Don’t use it until you’ve defined it

• Tell us what you’re gonna tell us;

• Tell us;

• Tell us what you’ve told us.

• Take home message: “This is why you should read my paper”

Page 11: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Slides and posters

• Make it readable---eschew fancy backgrounds and small fonts!

• Use the visuals to support your verbal presentation; if the audience wanted to read long paragraphs, they’d go directly to the paper!

Page 12: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Verbal presentation

• Don’t mumble

• Do move around, use varying voice tone

• Sound like you think the work is interesting

• Respond positively to questions

Page 13: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Most important!

• Practice

• Practice out loud

• Time an out-loud practice talk

• Revise to fit the time and practice again

• Practice the first 2 minutes and last 2 minutes again

Page 14: Markov Indecision Processes Harry Q. Bovik, Judy Q. Goldsmith, Q. Andrew Klapper, Michael Q. Littman Journal of Machine Learning Gossip

Now you know….