45
A Look into the Future by Learning from the Past Roger S. Barga Cloud Machine Learning, Cloud and Enterprise Microsoft Corporation

Barga DIDC'14 Invited Talk

Embed Size (px)

Citation preview

Page 1: Barga DIDC'14 Invited Talk

A Look into the Future by Learning from the PastRoger S. Barga

Cloud Machine Learning, Cloud and Enterprise

Microsoft Corporation

Page 2: Barga DIDC'14 Invited Talk

This isn’t an academic talk…

Page 3: Barga DIDC'14 Invited Talk

This isn’t an applied research talk…

Page 4: Barga DIDC'14 Invited Talk
Page 5: Barga DIDC'14 Invited Talk
Page 6: Barga DIDC'14 Invited Talk

1 1 5 4 3

7 5 3 5 3

5 5 9 0 6

3 5 2 0 0

Page 7: Barga DIDC'14 Invited Talk
Page 8: Barga DIDC'14 Invited Talk

1. Learn it when you can’t code it

2. Learn it when you can’t scale it

3. Learn it when you have to adapt/personalize

4. Learn it when you can’t track it

Page 9: Barga DIDC'14 Invited Talk

• Distributed

computing and

storage

• Deep Neural

Networks

• Learning =

Scalable,

Adaptive

Computation for

Various Big

Data

2011 (“Big

Data, DNN”)

• Wide

application in

products

• Statistical

Modeling of

Data

• Learning =

Parameter

Estimation or

Inference

2005

(“Graphical

Models”)

• Statistical

Learning Theory

• Scoring Systems

• Learning =

Optimization of

Convex

Functions

2000

(“Kernel

Machines”)

• Expert Systems

• Decision-Tree

Learning (C4.5)

• Learning =

Methods to

automatically

build Expert

Systems

1990

(“Symbolic”)

• Neural

Networks

• Artificial

Intelligence

• Learning =

Adaptation of

Neurons based

on External

Stimuli

1980

(“Neuro”)

Page 10: Barga DIDC'14 Invited Talk

• Distributed

computing and

storage

• Deep Neural

Networks

• Learning =

Scalable,

Adaptive

Computation for

Various Big

Data

2011 (“Big

Data, DNN”)

• Wide

application in

products

• Statistical

Modeling of

Data

• Learning =

Parameter

Estimation or

Inference

2005

(“Graphical

Models”)

• Statistical

Learning Theory

• Scoring Systems

• Learning =

Optimization of

Convex

Functions

2000

(“Kernel

Machines”)

• Expert Systems

• Decision-Tree

Learning (C4.5)

• Learning =

Methods to

automatically

build Expert

Systems

1990

(“Symbolic”)

• Neural

Networks

• Artificial

Intelligence

• Learning =

Adaptation of

Neurons based

on External

Stimuli

1980

(“Neuro”)

Page 11: Barga DIDC'14 Invited Talk

• Distributed

computing and

storage

• Deep Neural

Networks

• Learning =

Scalable,

Adaptive

Computation for

Various Big

Data

2011 (“Big

Data, DNN”)

• Wide

application in

products

• Statistical

Modeling of

Data

• Learning =

Parameter

Estimation or

Inference

2005

(“Graphical

Models”)

• Statistical

Learning Theory

• Scoring Systems

• Learning =

Optimization of

Convex

Functions

2000

(“Kernel

Machines”)

• Expert Systems

• Decision-Tree

Learning (C4.5)

• Learning =

Methods to

automatically

build Expert

Systems

1990

(“Symbolic”)

• Neural

Networks

• Artificial

Intelligence

• Learning =

Adaptation of

Neurons based

on External

Stimuli

1980

(“Neuro”)

Page 12: Barga DIDC'14 Invited Talk

• Distributed

computing and

storage

• Deep Neural

Networks

• Learning =

Scalable,

Adaptive

Computation for

Various Big

Data

2011 (“Big

Data, DNN”)

• Wide

application in

products

• Statistical

Modeling of

Data

• Learning =

Parameter

Estimation or

Inference

2005

(“Graphical

Models”)

• Statistical

Learning Theory

• Scoring Systems

• Learning =

Optimization of

Convex

Functions

2000

(“Kernel

Machines”)

• Expert Systems

• Decision-Tree

Learning (C4.5)

• Learning =

Methods to

automatically

build Expert

Systems

1990

(“Symbolic”)

• Neural

Networks

• Artificial

Intelligence

• Learning =

Adaptation of

Neurons based

on External

Stimuli

1980

(“Neuro”)

Page 13: Barga DIDC'14 Invited Talk

• Distributed

computing and

storage

• Deep Neural

Networks

• Learning =

Scalable,

Adaptive

Computation for

Various Big

Data

2011 (“Big

Data, DNN”)

• Wide

application in

products

• Statistical

Modeling of

Data

• Learning =

Parameter

Estimation or

Inference

2005

(“Graphical

Models”)

• Statistical

Learning Theory

• Scoring Systems

• Learning =

Optimization of

Convex

Functions

2000

(“Kernel

Machines”)

• Expert Systems

• Decision-Tree

Learning (C4.5)

• Learning =

Methods to

automatically

build Expert

Systems

1990

(“Symbolic”)

• Neural

Networks

• Artificial

Intelligence

• Learning =

Adaptation of

Neurons based

on External

Stimuli

1980

(“Neuro”)

Page 14: Barga DIDC'14 Invited Talk

The future will belong to those who can turn

their historical data into predictive models…

Page 15: Barga DIDC'14 Invited Talk

Vision Analytics

Recommenda-

tion engines

Advertising

analysis

Weather

forecasting for

business planning

Social network

analysis

Legal

discovery and

document

archiving

Pricing analysis

Fraud

detection

Churn

analysis

Equipment

monitoring

Location-based

tracking and

services

Personalized

Insurance

Machine learning and predictive models are core new capabilities that will touch everything in the new enterprise

Page 16: Barga DIDC'14 Invited Talk
Page 17: Barga DIDC'14 Invited Talk
Page 18: Barga DIDC'14 Invited Talk

training data (expensive) synthetic training data (cheaper)

Page 19: Barga DIDC'14 Invited Talk

solve hard problems

value from Big Data

data analytics

Page 20: Barga DIDC'14 Invited Talk

Machine learning enables nearly every

value proposition of web search.

Page 21: Barga DIDC'14 Invited Talk

Hundreds of thousands of machines…

Hundreds of metrics and signals per machine…

Which signals correlate with the real cause of a problem?

How can we extract effective repair actions?

Page 22: Barga DIDC'14 Invited Talk

solve hard problems

value from Big Data

data analytics

human intelligence

Page 23: Barga DIDC'14 Invited Talk
Page 24: Barga DIDC'14 Invited Talk
Page 25: Barga DIDC'14 Invited Talk
Page 26: Barga DIDC'14 Invited Talk

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

1993

1994

1995

1996

1997

1998

1999

2000

2001

2002

2003

2004

2005

2006

2007

2008

2009

2010

2011

2012

WER %

Page 27: Barga DIDC'14 Invited Talk

Training

English training data

English words,

with some errors

English

speech input

More

with fewer errors

How much data? – About the same as a human needs…

Runtime

Page 28: Barga DIDC'14 Invited Talk

Training

English training data

English words,

with some errors

English

speech input

with fewer errors

Runtime

Can we learn the internal representation of human speech?

French training data

or French words

or French

Chinese training data

or Chinese words

or Chinese

Page 29: Barga DIDC'14 Invited Talk
Page 30: Barga DIDC'14 Invited Talk
Page 31: Barga DIDC'14 Invited Talk

Shetland Sheepdog (0.72) Shoe Store (0.56) Attack Aircraft Carrier (0.81)

Steel Arch Bridge (0.74) Ballplayer, Baseball Player (0.86) Catamaran (0.51)

Wood Rabbit, Cottontail, Cottontail Rabbit (0.18)

Page 32: Barga DIDC'14 Invited Talk

The first image returned is Rajiv Gandhi (her husband) in the Answer.

An image of Lindsay Lohan appears in the Images Answer

not really

X

X

Page 33: Barga DIDC'14 Invited Talk
Page 34: Barga DIDC'14 Invited Talk
Page 35: Barga DIDC'14 Invited Talk

solve hard problems

value from Big Data

data analytics

human intelligence

engineering practices

Page 36: Barga DIDC'14 Invited Talk

intelligence will become ambient

intelligence from machine learning

Page 37: Barga DIDC'14 Invited Talk

55

57

59

61

63

65

67

69

71

Overall NDCG

Bing NDCG Google NDCG

Page 38: Barga DIDC'14 Invited Talk

The razor-toothed piranhas of the genera

Serrasalmus and Pygocentrus are the most

ferocious freshwater fish in the world. In

reality they seldom attack a human.

Template

matching

Page 39: Barga DIDC'14 Invited Talk

The razor-toothed piranhas of the genera

Serrasalmus and Pygocentrus are the most

ferocious freshwater fish in the world. In

reality they seldom attack a human.

pypygygogoc

Pygocentrus

Page 40: Barga DIDC'14 Invited Talk

The razor-toothed piranhas of the genera

Serrasalmus and Pygocentrus are the most

ferocious freshwater fish in the world. In

reality they seldom attack a human.

Sentence-level

decoding

The razor-toothed piranhas of the genera

Serrasalmus and Pygocentrus are the most

ferocious freshwater fish in the world. In

reality they seldom attack a human.

Page 41: Barga DIDC'14 Invited Talk
Page 42: Barga DIDC'14 Invited Talk
Page 43: Barga DIDC'14 Invited Talk
Page 44: Barga DIDC'14 Invited Talk

Massive

The Intelligent Cloud

Machine

Learning &

Analytics

Crowd

Sourcing

Massive &

Diverse Data

The Cloud - Where Everything Comes Together

Page 45: Barga DIDC'14 Invited Talk