Vowpal Wabbit 2017 Update

John Langford

http://hunch.net/~vw/

git clone git://github.com/JohnLangford/vowpal_wabbit.git

What is Vowpal Wabbit

1. Large Scale linear regression (*)
2. Online Learning (*)
3. Active Learning (*)
4. Learning Reduction (*)
5. Sparse Models
6. Baseline (Alberto)
7. Optimized Exploration algorithms (Alberto)
8. Cost Sensitive Active Learning (Akshay)
9. Active Learning to Search (Hal)
10. Java Interface (Jon Morra)
11. JSON/Decision Service (Markus)

(*) Old stuff

Community

1. BSD license.
2. Mailing list >500 members; GitHub >1K forks, >1K issues, >100 contributors.

Sparse Models

Scenario: You want to train a model with many potential parameters but use little RAM at test time.

Step 1: vw -b 26 --l1 1e-7 <training set>
(memory footprint is 1GB)

Step 2: vw -t --sparse_weights <test set>
(memory footprint is 100MB)
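
A runnable sketch of the same two steps, assuming the standard -f/-i flags for saving and loading the regressor and hypothetical file names:

> vw -b 26 --l1 1e-7 -f model.vw train.dat
> vw -t -i model.vw --sparse_weights test.dat

The --l1 penalty drives most of the 2^26 weights to zero during training; --sparse_weights then keeps the weights in a sparse structure at test time, so memory scales with the nonzero weights rather than the full dense table.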

Baseline and Contextual Bandits

Alberto Bietti (Inria), alberto@bietti.me

Baseline

Setting: online regression
- Problem: range of targets (e.g. offset) is unknown
- Bias term (weight for "constant" features) can be slow to learn
- Hurts performance of learning / exploration algorithms

Goal: adapt quickly and automatically to the range of targets

Baseline

Solution:
- Learn a baseline regressor separately from the rest (usage sketched below)
  - from constant features on each example: --baseline
  - or from a separate global constant example: --baseline --global_only
- Residual regression on the other features

Note: the learning rate is multiplied by the max label to converge faster than other normalized updates
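
A minimal sketch of the two modes on an ordinary regression run (training file name assumed):

> vw --baseline train.dat
> vw --baseline --global_only train.dat

The first learns the baseline from the constant feature of each example; the second keeps a single global constant shared by all examples.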

Contextual Bandits

Baseline: example for cb loss estimates

Reward/loss estimation is key (e.g. doubly robust)

> vw ds.txt --cbify 10 --cb_explore_adf --cb_type dr --epsilon 0.05
0.682315
> vw ... --loss0 9 --loss1 10
0.787594
> vw ... --loss0 9 --loss1 10 --baseline
0.710636
> vw ... --loss0 9 --loss1 10 --baseline --global_only
0.636140

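For reference, --cb_type dr selects the standard doubly-robust loss estimate: with logged action a chosen with probability p, observed loss ℓ, and learned regressor ℓ̂, the loss of any action a′ in context x is estimated as

ℓ̂_DR(x, a′) = ℓ̂(x, a′) + 1{a′ = a} · (ℓ − ℓ̂(x, a)) / p

When the losses carry an offset (here encoded as 9/10 instead of 0/1 via --loss0 9 --loss1 10), a regressor that is slow to learn that offset inflates the correction term; learning the offset quickly with --baseline is what recovers the lower average loss above.
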
Contextual bandits: bagging

Bagging: "bootstrapped Thompson sampling"
- Each update is performed Poisson(1) times
- For only one policy, greedy performs better (always update once)
- --bag n --greedify treats the first policy like greedy (usage sketched below)
- Often works better, especially for small n

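A minimal usage sketch on the cbify demo from the earlier slide (dataset name assumed):

> vw ds.txt --cbify 10 --cb_explore_adf --bag 5 --greedify

Each of the 5 bagged policies trains on a Poisson(1)-reweighted view of the stream; with --greedify the first policy updates exactly once per example, making it behave like greedy.
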
Contextual bandits: cover

Cover: maintains a set of diverse policies good for explore/exploit
- New parameterization: --cover n [--psi 0.01] [--nounif] (usage sketched below)
- ψ controls the diversity cost for training policies (ψ = 0 → all ERM policies)
- ε_t = 1/√(Kt) always
- --nounif disables exploration on ε actions (not chosen by any policy)
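
A minimal usage sketch with the same cbify setup (dataset name assumed):

> vw ds.txt --cbify 10 --cb_explore_adf --cover 4 --psi 0.01 --nounif

Here 4 cover policies are maintained, ψ = 0.01 sets their diversity cost, and --nounif turns off the residual uniform exploration over actions that no policy chooses.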

Contextual bandits: miscellaneous

- Most changes are only in the ADF code
- --cbify K --cb_explore_adf for using ADF code in cbify
- --loss0 [0] --loss1 [1] to specify different loss encodings
- Cover + MTR uses MTR for the first (ERM) policy, DR for the rest (usage sketched below)
- Upcoming: reduce uniform ε exploration in ε-greedy using disagreement test (from active learning)
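
A minimal sketch of the Cover + MTR combination noted above (dataset name assumed; --cb_type mtr selects the multi-task regression estimator):

> vw ds.txt --cbify 10 --cb_explore_adf --cover 4 --cb_type mtr

MTR then drives the first (ERM) policy while the remaining cover policies fall back to DR.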