107
Statistical Machine Learning Devika Subramanian Computer Science and Electrical Engineering Rice University

Statistical Machine Learning - Rice University

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Statistical Machine Learning - Rice University

Statistical Machine Learning

Devika SubramanianComputer Science and Electrical Engineering

Rice University

Page 2: Statistical Machine Learning - Rice University

h The context

Environment

Systemsensorsaction

sensorsactuators sensors

Goal: build systems that “learn from experience”

Page 3: Statistical Machine Learning - Rice University

The Mars BioplexNeed for models learned by the system it lf t d t i The Mars Bioplex itself to determine control actions.

Page 4: Statistical Machine Learning - Rice University

QuestionsHow can we design and build systems that adapt and learn from their that adapt and learn from their experience? Is there a general theory of Is there a general theory of embedded learning? Can we build machine learning tools Can we build machine learning tools that can be used "off the shelf"?

Page 5: Statistical Machine Learning - Rice University

How we build systems nowAnalyze problem

Interview human experts, gather requirements, psystem dynamics, understand how decisions are made.

Design a solutionDesign a solutionHandcraft system models and devise algorithms for decision makingfor decision making

ImplementTestTest

Page 6: Statistical Machine Learning - Rice University

When methodology breaks down (1)

There is no human expertise for the task.task.

Example 1: Energy allocation policies in Bioplex.pExample 2: Optimization sequences for a compiler.p

Page 7: Statistical Machine Learning - Rice University

When methodology breaks down (2)

There are human experts but they cannot articulate decision making cannot articulate decision making criteria.

Example 1: any perceptual task (e g face Example 1: any perceptual task (e.g., face recognition, speech and handwriting recognition).gExample 2: most perceptual-motor tasks (e.g., playing video games).

Page 8: Statistical Machine Learning - Rice University

When methodology breaks down (3)

Task solutions need to be customized for each individual.for each individual.

Example 1: Spam filtering/Speech recognition.gExample 2: Tracking human learning.Example 3: Program-specific compiler Example 3 Program specific compiler optimization sequences.

Page 9: Statistical Machine Learning - Rice University

When methodology breaks down (4)

The system dynamics or the environment changes rapidly. environment changes rapidly.

Example 1: Outdoor robot navigation.Example 2: Mars Bioplex energy Example 2: Mars Bioplex energy allocation decisions.Example 3: Predicting conflict levels in Example 3 Predicting conflict levels in regions of the world.

Page 10: Statistical Machine Learning - Rice University

Learning in context

Machine learning

Artificial Intelligence

Algorithms Systems/Software E. Data Mining

Control Th. OR (MDPs) Statistics Appl. Math.

Page 11: Statistical Machine Learning - Rice University

Statistical machine learning

Environment )Pr(: SASM →×

Model of environment

Feedback

Systemsensorsaction

⎤⎡⎟⎞

⎜⎛

AS:

*

πpolicy

⎥⎦

⎤⎢⎣

⎡⎟⎠

⎞⎜⎝

⎛= ∑

ttt ssrE ))(,(maxarg* ππ

πOptimalpolicy

Page 12: Statistical Machine Learning - Rice University

Machine learning (1)There is no human expertise for the task.

Example 1: Energy allocation policies in Bioplex.p y p pExample 2: Optimization sequences for a compiler.

M hi l l Machine learners learn Value functions Q:SxA Re by interaction

Actively choosing actions a using QActively choosing actions at using Q.Observing system transitions (st,at,st+1,rt)Updating Q on the basis of observations

{ }πγγπ ,,|...),( 22

1 aassrrrEasQ ttttt ==+++= ++

Page 13: Statistical Machine Learning - Rice University

Machine learning (2)There are human experts but they cannot articulate decision making criteria.

E l 1 l k ( f Example 1: any perceptual task (e.g., face recognition).Example 2: most perceptual-motor tasks (e.g., p p p ( g ,playing video games).

Machine learners learnP li i i S* A i S* P (A) b b i Policies: pi:S* A or pi:S* Pr(A) by observing (st,at,st+1) system transitions over time, where trajectories are chosen by human experts.

Page 14: Statistical Machine Learning - Rice University

Machine learning (3)Task solutions need to be customized for each individual.

E l 1 fil i / h i iExample 1: Spam filtering/Speech recognition.Example 2: Tracking human learning.Example 3: Program-specific compiler Example 3: Program specific compiler optimization sequences.

Machine learners learnPolicies: pi:S* A or pi:S* Pr(A) by observing system trajectories (st,at,st+1) over time, where trajectory is chosen by individual.j y y

Page 15: Statistical Machine Learning - Rice University

Machine learning (4)The system dynamics or the environment changes rapidly.

E l 1 O d b i iExample 1: Outdoor robot navigation.Example 2: Mars Bioplex energy allocation decisions.Example 3: Predicting conflict levels in regions of the world.

Machine learners learnMachine learners learnTo detect changes (discontinuities and trends).Reacquire policies and value functions as soon as Reacquire policies and value functions as soon as change is detected.

Page 16: Statistical Machine Learning - Rice University

Statistical Machine LearningReplaces guesswork/intuition with analysis of real data.Instead of guessing a control policy/prediction function/

h l d b f Machine learning provides a basis for deriving them from observed data.

Result: more robust more adaptive Result: more robust, more adaptive systems!

Page 17: Statistical Machine Learning - Rice University

OutlineMotivationStatistical machine learningStatistical machine learning

Scenario 1: Mars BioplexScenario 2: VirgilgScenario 3: Adaptive compilersScenario 4: Analyzing conflicty gScenario 5: Tracking human learning

Conclusions

Page 18: Statistical Machine Learning - Rice University

Scenario 1Controlling the Mars Bioplex

Approach 1: assemble a team of control ppengineers to write software to make resource allocation decisions for the system.Approach 2: use available Approach 2: use available sensor/actuator/reward data to learn dynamic models and value functions for the Bioplex and

l l l ll l calculate optimal resource allocation policies.

Thanks to NASA-NRA and D. Kortenkamp and P. Bonasso of NASA JSC.

Page 19: Statistical Machine Learning - Rice University

Statistical machine learning

Bioplex )Pr(: SASM →×

Model of environment

Feedback

controllerBioplexsensors

Bioplexactuators

Bioplex treatedas a MarkovDecision Processsensorsactuators

⎤⎡⎟⎞

⎜⎛

AS:

*

πpolicyDecision Process

⎥⎦

⎤⎢⎣

⎡⎟⎠

⎞⎜⎝

⎛= ∑

ttt ssrE ))(,(maxarg* ππ

πOptimalpolicy

Page 20: Statistical Machine Learning - Rice University

The Bioplex: take 1

Dirty H2O

PotableH2O

WaterRS

OGS CRSPlants

Crew

O2 H2 CO2

Air and water recycling systems interact! One major sourceAir and water recycling systems interact! One major sourceof complexity in the overall system.

Page 21: Statistical Machine Learning - Rice University

Modeling Bioplex as an MDPAction space: turning on/off the CRS, OGS, WRS.

l d Sensor space: O2, H2, CO2, potable H20 and dirty H2O values.Local reward functionLocal reward function

Rewards for keeping crew alive (in proportion to length of time they are kept alive) y pPunishment for starving crew of water and/or air and/or food.Punishment for killing plantsPunishment for killing plants.Punishment for overflowing tanks.

Page 22: Statistical Machine Learning - Rice University

Result from MDP TheoryThere exist optimal value functions

d di i l li i

),(max),(* asQasQ π

π=

and corresponding optimal policies

** ),(maxarg)( ** asQsa

Page 23: Statistical Machine Learning - Rice University

So what?

Page 24: Statistical Machine Learning - Rice University

The Reinforcement Learning gAlgorithm

Q(s,a) = 0 for all s in S and a in A.Repeat till end of trial

s current states = current statePick action a which maximizes Q(s,a) [policy selection] (can be ε-greedy)Do a and observe new state s’ and local reward r Do a and observe new state s and local reward r. Update

Q(s,a) = (1-α)Q(s,a) + α(r + max{a’ in A} Q(s’,a’)). [Temporal difference update][Temporal difference update]

Set s to s’.Repeat step above till Q converges.p p g

Page 25: Statistical Machine Learning - Rice University

Speeding up convergence by Speed ng up convergence by nearest neighbor generalization

When Q(s,a) is updated, updates are applied to N nearest states within a set radius, with the update decayed exponentially according to the distance from s. Allows for faster convergence because we no Allows for faster convergence, because we no longer have to visit every single state to update it. The trajectory of a simulation run cuts a wider

h h h h swath through the state space.

Page 26: Statistical Machine Learning - Rice University

Generalized RL Results

Klein, Subramanian, Kortenkamp, Bell ICES 2004

Page 27: Statistical Machine Learning - Rice University

Simulation Behavior

Page 28: Statistical Machine Learning - Rice University

Back of the envelope…Given the biomass store of 300 kg, and food stocks of 2000 kg, the maximum mission lifetime for this configuration is approximately 10220g pp yhours: it is the time it takes for the crew to consume all available food. RL achieves 10000 hours.Since the simulator uses 10 square meters of wheat, it doesn’t grow enough food for mission to be extended beyond 10220 hours. be extended beyond 10220 hours. So given perfect water and oxygen recycling the limiting resource in this simulation configuration is food!food!

Page 29: Statistical Machine Learning - Rice University

Scenario 1 summaryReinforcement learning is an effective tool for exploring the state space of a complex coupled dynamical system.It can find interesting policies rapidly. It

k f h d can work fast enough to respond to abrupt changes to the system and redesign control policies (because convergence can occur as policies (because convergence can occur as quickly as 10-20 trials).

Page 30: Statistical Machine Learning - Rice University

OutlineMotivationStatistical machine learningStatistical machine learning

Scenario 1: Mars BioplexScenario 2: VirgilgScenario 3: Adaptive compilersScenario 4: Predicting conflictgScenario 5: Tracking human learning

Conclusions

Page 31: Statistical Machine Learning - Rice University

Scenario 2Outdoor robot navigation with GPS and odometry.and odometry.

Approach 1: get a human expert to construct sensor error models for GPS.Approach 2: get robot to gather correlated GPS and odometry data and ylearn error models.

Page 32: Statistical Machine Learning - Rice University

Virgil: The Rice Tour Guide

RWIATRV Jrequipped with cheap

On-boardodometry

with cheapdifferentialGPS ($2K)Sonars and

bumpers forbumpers forcollision avoidance

Supported by a grant from Rice’s Engineering School

Page 33: Statistical Machine Learning - Rice University

The task and its challengesTask: To give tours of the Rice University Campus.Challenges

Environment cannot be modified to assist robot.Low budget project --- no expensive sensors!Robot needs to interact with people.L li i f h 40 Localization errors of more than 40 cm cannot be tolerated. Cost of failure high!

Page 34: Statistical Machine Learning - Rice University

The tour

We can localizeVirgil to 1mresolution w/GPS of 10-40mGPS of 10 40mresolution.

Page 35: Statistical Machine Learning - Rice University

The Darpa Grand Challenge

CNN video of Darpa Grand Challenge

Page 36: Statistical Machine Learning - Rice University

Statistical machine learning

Virgil )Pr(: SASM →×

Odometry model

Via pointSensor model

navigatorsensorsaction

P(GPS reading|s)

AGS →×:πProportionalControllerController

Real problem is state estimation!

Page 37: Statistical Machine Learning - Rice University

Bayesian localizationEstimate robot’s coordinates s=(x,y,θ) from sensor data (GPS) by using Bayesian filtering.

1111 )(),|()|()( ∫= tttttttt dssbasspsopsb η

predictcorrect

1111 )(),|()|()( −−−−∫ tttttttt dssbasspsopsb η

odometryGPS model odometryGPS model

Page 38: Statistical Machine Learning - Rice University

Bayesian filter at work

sp(s)

Page 39: Statistical Machine Learning - Rice University

GPS sensor characteristics

Near buildings In free space

Page 40: Statistical Machine Learning - Rice University

Characteristics of GPS dataAbrupt shifts in GPS data quality when satellites drop out of view (e.g., when satellites drop out of view (e.g., when robot is near concrete buildings or travels under trees).or travels under trees).Gradual drifts in GPS data quality caused by atmospheric effectscaused by atmospheric effects.

Page 41: Statistical Machine Learning - Rice University

Sensor modelsA single Gaussian for the GPS sensor model P(o|s) requires a large variance to

mm d t th i ti n in d t lit accommodate the variation in data quality with location.

This causes slow convergence in localization This causes slow convergence in localization estimates even when data quality is high!Shifts in GPS data quality are not handled well; e g when robot emerges into a clear area with e.g., when robot emerges into a clear area with many visible satellites, it takes a while before localization accuracy reflects the quality of the GPS dataGPS data.

Page 42: Statistical Machine Learning - Rice University

Unsupervised learningModel is a conditional probability density function p(o|s) for GPS y preading o in location s.

Parametric (e.g., Gaussian)Non-parametric (histogram)Semi-parametric (mixture)

mean

Training data are samples (o1,s1), …, (on,sn).

Page 43: Statistical Machine Learning - Rice University

Learning GPS sensor modelsNeed location-specific P(o|s).Which locations? How to learn them?

Joystick Virgil through course where we allow it to gather correlated GPS and odometry data.Learn mean and variances of GPS readings Learn mean and variances of GPS readings throughout the route; merge distributions that are “close enough”.We discovered a very strong correlation We discovered a very strong correlation between the number of visible satellites and GPS mean and variance.

Page 44: Statistical Machine Learning - Rice University

Adaptive localizationFor abrupt shifts in GPS data quality: Learn p(o|s) indexed by number of visible

lli U “ i h d li ” d satellites. Use “gain scheduling” and swap in appropriate sensor model for current number of visible satellites at each time tnumber of visible satellites at each time t.For gradual shifts in GPS data quality: use “exponential forgetting”, i.e., use a p f g g , ,window of N (=100) GPS observations and update state estimate based on it.

Page 45: Statistical Machine Learning - Rice University

Virgil video

Page 46: Statistical Machine Learning - Rice University

Toast localization

Page 47: Statistical Machine Learning - Rice University

OutlineMotivationStatistical machine learningStatistical machine learning

Scenario 1: Mars BioplexScenario 2: VirgilgScenario 3: Adaptive compilersScenario 4: Analyzing conflicty gScenario 5: Tracking human learning

Conclusions

Page 48: Statistical Machine Learning - Rice University

Scenario 3Find good optimization sequences for compiling programs with respect to given objectives.

Approach 1: get human experts to devise good c mpil ti n sequences f r h le r n e f compilation sequences for a whole range of programs.Approach 2: learn a probabilistic model of what pp p fconstitutes a good compilation sequence and use it to perform biased random sampling of the combinatorial compilation spacecombinatorial compilation space.

Page 49: Statistical Machine Learning - Rice University

Statistical machine learningCompiler

+program ℜ→× ASM :

Opt. cost model

program

rewardNot available,will have tobe learned!

Sequencedesigner Sensors

(measurements on program action( f

be learned!

(measurements on program and compiler state)

(sequence ofoptimizations)

Find optimal sequence of optimizations for a specific program.

Page 50: Statistical Machine Learning - Rice University

The problemThe problem

Combinatorial space of sequences (actions).Discrete, unknown, non-linear model M:S x A Re.

Page 51: Statistical Machine Learning - Rice University

The fmin experimentTook one program, fmin

150 lines of Fortran, 44 basic blocksbasic blocksExhibited complex behavior in other experimentsexperiments

Generated all sequences of length 10 from 5 opt

9765625 combinations

FMIN

Page 52: Statistical Machine Learning - Rice University

Cost distributionCost distribution

CooperSubramanianTorczonet al

un-optimized

et. al.2002, 2003,2004

Page 53: Statistical Machine Learning - Rice University

The landscape

Page 54: Statistical Machine Learning - Rice University

Steps to local minima

Page 55: Statistical Machine Learning - Rice University

Structure of sequence spaceMany local minima!90% local minima are within 10% of 90% local minima are within 10% of best solution. 10% are 20-30% worse.Short hill climbing runsShort hill-climbing runs.Suggests that hill climbing with

d i d i i randomized restarts is an appropriate algorithm.

Page 56: Statistical Machine Learning - Rice University

Hill climbing with randomized H ll cl mb ng w th random zed restarts (HCRR)

for m restarts do1. Start with s = a randomly generated y gsequence.2. (local improvement) for k random

i hb f neighbors of sGenerate Hamming 1 neighbor s’ of sIf s’ better than s make s = s’ and go back to If s better than s, make s = s and go back to 2.

3. If no local improvement, restart at 1.

Page 57: Statistical Machine Learning - Rice University

Role of learningfor m restarts do

1. Start with s = a randomly generated y gsequence.2. (local improvement) for k random

i hb f neighbors of sGenerate neighbor s’ of sIf s’ better than s make s = s’ and go back to

Biasneighbor If s better than s, make s = s and go back to

2.3. If no local improvement, restart at 1.

ggenerationby N-gramN-grammodels over subsequences.

Page 58: Statistical Machine Learning - Rice University

Neighbor generation

p p s nlppolpp,l,o,s,n

40 neighbors (each with the same probability ofb i t d)being generated)

• HCRR runs yield sequences and their costs.• Sort out good sequences.• Learn n-gram models of optimizations on good seq

Page 59: Statistical Machine Learning - Rice University

Summary of Scenario 3Approach:

Learn n-gram models pi(oj…oi+n) of the impact p j pof optimizations on specific programs.Use these models to perform a biased random HCRR search of the combinatorial spaceHCRR search of the combinatorial space.

ResultsOn Spec and Media benchmarks approach has On Spec and Media benchmarks, approach has yielded improvements of 15-40% over human-designed universal optimization sequences.

Page 60: Statistical Machine Learning - Rice University

OutlineMotivationStatistical machine learningStatistical machine learning

Scenario 1: Mars BioplexScenario 2: VirgilgScenario 3: Adaptive compilersScenario 4: Analyzing conflicty gScenario 5: Tracking human learning

Conclusions

Page 61: Statistical Machine Learning - Rice University

Scenario 4Is it possible to monitor news media from regions all over the world over extended periods of time, extracting low-level event information from them, and piece them t th t t m ti ll t k d together to automatically track and predict conflict in all the regions of the world? world?

Page 62: Statistical Machine Learning - Rice University

Analyzing and predicting flconflict

Rice Terascale Cluster

Thanks to NSF ITR- 0219673 (with Ric Stoll)

Page 63: Statistical Machine Learning - Rice University

Example of event extractionExample of event extraction

NY Times

Headline 03/31/2003: Iraq moving more Headline 03/31/2003: Iraq moving more troops to guard Baghdad from South.Text: ………

Coding: 03/31/2003 IRQ US/Brit –9.5

Who did what to whom and when?

Not in headline, needs to be inferred!

Page 64: Statistical Machine Learning - Rice University

Extracting eventsInformation extraction from news stories

Approach 1: assemble a team of human experts pp pto write rules for extracting actors, targets and actions from sentences.Approach 2: assemble a training corpus of Approach 2: assemble a training corpus of sentences with extracted components, and learn rules for identifying different

components in news stories.

Page 65: Statistical Machine Learning - Rice University

Supervised learningSupervised learning

Page 66: Statistical Machine Learning - Rice University

Identifying relevant storiesFiltering stories not pertaining to conflict.

Only about 10% of the stories contain events that are t b t t dto be extracted.The rest are interpretations of conflictual events (e.g., op-eds), or are events not about conflict (e.g.,

t h i t t t i t )sports, human-interest stories, etc.).We have trained a Naïve Bayes classifier (with specificity and sensitivity of over 90%) for this p y y )phase. Further improvements using semantic information about words (WordNet) are being implementedimplemented.

Page 67: Statistical Machine Learning - Rice University

Analyzing conflict (Mideast)y g f (M )

Page 68: Statistical Machine Learning - Rice University

Change point detectionbiweek Date

rangeevent

17-35 11/79 to 8/80 Start of Iran/Iraq war

105-111 4/83 to 7/83 Beirut suicide attack

244 1/91 to 2/91 Desert Storm

413-425 1/95 to 7/95 Rabin assassination/start of Intifada

483-518 10/97 to 2/99 US/Iraq confrontation via Richard Butler/arms i tinspectors

522-539 4/99 to 11/99 Camp David summit hosted by Clinton

Page 69: Statistical Machine Learning - Rice University

Analyzing conflict (Cold war)Analyzing conflict (Cold war)

Page 70: Statistical Machine Learning - Rice University

Change point detectionWeek(s) Date(s) CommentsWeek(s) Date(s) Comments

56 – 83 67/1 – 67/8 [Red] Six Day War

132 68/7 [Red] Czech Crisis (138; 68/8), but NPT signed

187-265 69/8 – 71/1 [Red] Cienfuegos (Cuban sub base)

271-289 71/3 – 71/7 [Blue] Soviet ships leave Cienfuegos.

295-312 71/8 – 71/12 [Red] Soviets criticize Nixon trip to China Moscow 295 312 71/8 71/12 [Red] Soviets criticize Nixon trip to China, Moscow summit

316-334 72/1 – 72/5 [Blue] Moscow summit, SALT, ABM Treaties

341-363 72/7 – 72/12 [Red] US “Christmas bombing” of North Vietnam

364-390 72/12 – 73/6 [Blue] US-Soviet Summit (Brezhnev to US)

391-585 73/6 – 77/3 [Red] October War Crisis; Angola; Carter criticizes Soviets on human rights

620 649 77/11 78/6 [Red] Soviet UN employees arrested Carter criticizes 620-649 77/11 – 78/6 [Red] Soviet UN employees arrested. Carter criticizes dissident trials. Soviet stage large maneuvers in Central Europe. Ginzburg & Shchransky sentenced.

Page 71: Statistical Machine Learning - Rice University

OutlineMotivationStatistical machine learningStatistical machine learning

Scenario 1: Mars BioplexScenario 2: VirgilgScenario 3: Adaptive compilersScenario 4: Analyzing conflicty gScenario 5: Tracking human learning

Conclusions

Page 72: Statistical Machine Learning - Rice University

Scenario 5Tracking human learning on the NRL Navigation taskNavigation task

Approach 1: get cognitive scientists to write models of how humans learn.Approach 2: gather visual and motor data from subjects and learn models jdirectly from them.

Page 73: Statistical Machine Learning - Rice University

Statistical machine learning

)Pr(: SASM →×

Game dynamics

Feedback

Systemsensors (instrument panel)action (joystick)

)Pr(: AS →πpolicy

Track the evolution of the policy used by human and Track the evolution of the policy used by human, and alter training protocol to improve the speed and efficacy of learning.

Page 74: Statistical Machine Learning - Rice University

The NRL Navigation Task

Page 75: Statistical Machine Learning - Rice University

Th NRL N i ti T kThe NRL Navigation Task

Page 76: Statistical Machine Learning - Rice University

Learning curves (success)Learning curves (success)100

70

8090

S3

40

5060

70

cces

s %

S3S4S5S1

1020

30

40

Suc S1

S2

0

10

1 51 101

151

201

251

301

351

401

451

501

551

601

651

701

751

Episode

Page 77: Statistical Machine Learning - Rice University

Mathematical characteristics Mathemat cal character st cs of the NRL task

A partially observable Markov decision process which can be made fully observable b t ti f t t ith i by augmentation of state with previous action.State space of size 1014 at each step a State space of size 1014, at each step a choice of 153 actions (17 turns and 9 speeds)speeds).Challenging for both humans and machines.

Page 78: Statistical Machine Learning - Rice University

Challenges for a human learner

A task with a significant strategic and a visual-motor component.visual motor component.Need for rapid decision making with incomplete information.pThe sheer number (1014) of sensor panel configurations and action choices (153).g ( )Binary feedback at end of episode (200 steps).

Page 79: Statistical Machine Learning - Rice University

Building Representative Models

Behavioral equivalence (similarity in learning curves)learning curves)

Task

a(t) Plcess

%

( )

p(t)

Player

time

Succ

Modela’(t)p(t)t m

Page 80: Statistical Machine Learning - Rice University

Models of policy

Consider stateless stochastic models of the form pi:sensors → Pr(actions)the form pi:sensors → Pr(actions).

Associate with every observed sensor panel configuration, the distribution of actions taken gby the player at that configuration.

Advantage: no need to abstract sensor space.Model construction can be done in real time!

Page 81: Statistical Machine Learning - Rice University

Surely, this can’t work!

There are 1014 sensor configurations possible in the NRL Navigation task.possible in the NRL Navigation task.However, there are between 103 to 104 of those configurations actually observed by g y yhumans in a training run of 600 episodes.Exploit sparsity in sensor configuration p p y gspace to build a direct model of subject’s policy.

Page 82: Statistical Machine Learning - Rice University

M d l t tiModel construction

Segmentation of episodic data

episodespStart oftraining

End oftraining

Fit models of the form sensors Pr(actions) on the stationary segments.Pr(actions) on the stationary segments.

Page 83: Statistical Machine Learning - Rice University

Model Derivatived /dt dm/dt =

wiiswiswiKLdiv ))()2(( +Π−+−+Πw

wiiswiswiKLdiv )),(),2,(( +Π++Π

empirical optimum: w = 20, s = 5Computed by Monte Carlo sampling (st bili s ft 5% f t i s (stabilizes after 5% of entries are sampled)

ww

Overlap = s

Page 84: Statistical Machine Learning - Rice University

Model derivative for Cea

Page 85: Statistical Machine Learning - Rice University

Before shift: Cea (episode 300)

Page 86: Statistical Machine Learning - Rice University

After shift: Cea (episode 320)

Page 87: Statistical Machine Learning - Rice University

Model derivative for HeiModel derivative for Hei

Page 88: Statistical Machine Learning - Rice University

Nearest-neighbor action computationcomputation

To compute action a associated with current sensor configuration sg

take 100 neighbors of s in policy table.perform locally weighted regression p f y g g(LWR) on these (s,a) pairs.

Page 89: Statistical Machine Learning - Rice University

Evaluation protocolEvaluation protocol

Same mine configurations as subject.Model switched on segment boundaries.Model switched on segment boundaries.Cross-validation method on each segment:g

Train on 9/10ths of dataTest on left-out chunk

Page 90: Statistical Machine Learning - Rice University

Subject Cea: Day 5: 1

Subject Model

Page 91: Statistical Machine Learning - Rice University

Subject Cea: Day 5: 2

Subject Model

Page 92: Statistical Machine Learning - Rice University

Subject Cea: day 5: 3

Subject Model

Page 93: Statistical Machine Learning - Rice University

Subject Cea: Day 5: 4

Subject Model

Page 94: Statistical Machine Learning - Rice University

b Subject Cea: Day 5: 5

Subject Model

Page 95: Statistical Machine Learning - Rice University

Subject Cea: Day 5: 6

Subject Model

Page 96: Statistical Machine Learning - Rice University

Subject Cea: Day 5: 7

Subject Model

Page 97: Statistical Machine Learning - Rice University

Subject Cea: Day 5: 8

Subject Model

Page 98: Statistical Machine Learning - Rice University

Subject Cea: Day 5: 9

Subject Model

Page 99: Statistical Machine Learning - Rice University

Comparison with global methodsComparison with global methods

Page 100: Statistical Machine Learning - Rice University

Tracking human learningg m g

Page 101: Statistical Machine Learning - Rice University

Summary of Scenario 5We can model subjects on the NRL task in real-time, achieving excellent fits to their learning curves using the available visual-motor data curves, using the available visual motor data stream.One of the first in cognitive science to directly use objective visual motor performance data to use objective visual-motor performance data to derive high-level strategy models on a complex task.N l ith f d t ti h i t d New algorithms for detecting change-points and building predictive stochastic models for massive, noisy, non-stationary, vector time series data.

Page 102: Statistical Machine Learning - Rice University

Overall SummaryStandard software system design methods fail in many applications. methods fail in many applications. Statistical machine learning methods actively gather performance data to actively gather performance data to make robust, adaptive systems.

Page 103: Statistical Machine Learning - Rice University

Statistical machine learningMachine learning is already at the heart of speech recognition and handwriting recognition.Statistical machine learning is transforming Statistical machine learning is transforming natural language, bioinformatics, data mining and mobile robotics.St ti ti l hi l i i ti Statistical machine learning is creating opportunities in compilers, networking, and computer security.Statistical machine learning is opening up interactions with political science, biology and cognitive science.g

Page 104: Statistical Machine Learning - Rice University

Role of SML in S+EData is a new source of power for scientists and engineers.Statistical machine learning should be taught to our students. By combining engineered frameworks with models learned from data, we can develop hi h f b t d ti high-performance, robust, adaptive, systems of the future.

Page 105: Statistical Machine Learning - Rice University

AcknowledgementsNASA NRA (with D. Kortenkamp and P. Bonasso, NASA JSC)

fl l ( h Conflict analysis: NSF ITR 0219673 (with Ric Stoll)Adaptive compilers: NSF ITR 0205303Adaptive compilers: NSF ITR 0205303(with Keith Cooper and Linda Torczon)Human learning: ONR N00014-96-1-0538Human learning: ONR N00014 96 1 0538Virgil: School of Engineering, Rice University

Page 106: Statistical Machine Learning - Rice University

The futureMr. Gates scoffed at the notion, advanced by some, that the computer industry was a mature business of waning opportunity. In one question-and-answer session, a student

k d if th ld b th t h l asked if there could ever be another technology company as successful as Microsoft.

"If you invent a breakthrough in y gartificial intelligence, so machines can learn, that is worth 10 Microsofts."

--- B. Gates,NY Times March 1, 2004,

Page 107: Statistical Machine Learning - Rice University

Scientific drivers