61
ÉCOLE NORMALE SUPÉRIEURE RESEARCH UNIVERSITY PARIS

Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

  • Upload
    duongtu

  • View
    223

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

1/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Reinforcement learningfor intensional data management

Pierre Senellart

ÉCOLE NORMALE

S U P É R I E U R E

RESEARCH UNIVERSITY PARIS

22 Feb. 2018, Télécom ParisTech, Data Science Seminar

Page 2: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

2/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Uncertain data is everywhere

Numerous sources of uncertain data:

� Measurement errors

� Data integration from contradicting sources

� Imprecise mappings between heterogeneous schemas

� Imprecise automatic processes (information extraction,

natural language processing, etc.)

� Imperfect human judgment

� Lies, opinions, rumors

Page 3: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

2/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Uncertain data is everywhere

Numerous sources of uncertain data:

� Measurement errors

� Data integration from contradicting sources

� Imprecise mappings between heterogeneous schemas

� Imprecise automatic processes (information extraction,

natural language processing, etc.)

� Imperfect human judgment

� Lies, opinions, rumors

Page 4: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

3/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Structured data is everywhere

Data is structured, not �at:

� Variety of representation formats of data in the wild:

� relational tables

� trees, semi-structured documents

� graphs, e.g., social networks or semantic graphs

� data streams

� complex views aggregating individual information

� Heterogeneous schemas

� Additional structural constraints: keys, inclusion

dependencies

Page 5: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

4/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Intensional data is everywhereLots of data sources can be seen as intensional: accessing all the

data in the source (in extension) is impossible or very costly,

but it is possible to access the data through views, with some

access constraints, associated with some access cost.

� Indexes over regular data sources

� Deep Web sources: Web forms, Web services

� The Web or social networks as partial graphs that can be

expanded by crawling

� Outcome of complex automated processes: information

extraction, natural language analysis, machine learning,

ontology matching

� Crowd data: (very) partial views of the world

� Logical consequences of facts, costly to compute

Page 6: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

5/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Interactions betweenuncertainty, structure, intensionality

� If the data has complex structure, uncertain models should

represent possible worlds over these structures (e.g.,

probability distributions over graph completions of a

known subgraph in Web crawling).

� If the data is intensional, we can use uncertainty to

represent prior distributions about what may happen if we

access the data. Sometimes good enough to reach a

decision without having to make the access!

� If the data is a RDF graph accessed by semantic Web

services, each intensional data access will not give a single

data point, but a complex subgraph.

Page 7: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

6/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Intensional Data Management

� Jointly deal with Uncertainty, Structure, and the fact that

access to data is limited and has a cost, to solve a user's

knowledge need

� Lazy evaluation whenever possible

� Evolving probabilistic, structured view of the current

knowledge of the world

� Solve at each step the problem: What is the best access to

do next given my current knowledge of the world and the

knowledge need

� Knowledge acquisition plan (recursive, dynamic, adaptive)

that minimizes access cost, and provides probabilistic

guarantees

Page 8: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

formulation

answer & explanation

optimization

modeling

priors

intensional access

knowledge update

Knowledge

need

Current

knowledge

of the world

Knowledge

acquisition plan

Uncertain

access result

Structured

source pro�les

Page 9: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

formulation

answer & explanation

optimization

modeling

priors

intensional access

knowledge update

Knowledge

need

Current

knowledge

of the world

Knowledge

acquisition plan

Uncertain

access result

Structured

source pro�les

Page 10: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

formulation

answer & explanation

optimization

modeling

priors

intensional access

knowledge update

Knowledge

need

Current

knowledge

of the world

Knowledge

acquisition plan

Uncertain

access result

Structured

source pro�les

Page 11: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

formulation

answer & explanation

optimization

modeling

priors

intensional access

knowledge update

Knowledge

need

Current

knowledge

of the world

Knowledge

acquisition plan

Uncertain

access result

Structured

source pro�les

Page 12: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

formulation

answer & explanation

optimization

modeling

priors

intensional access

knowledge update

Knowledge

need

Current

knowledge

of the world

Knowledge

acquisition plan

Uncertain

access result

Structured

source pro�les

Page 13: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

formulation

answer & explanation

optimization

modeling

priors

intensional access

knowledge update

Knowledge

need

Current

knowledge

of the world

Knowledge

acquisition plan

Uncertain

access result

Structured

source pro�les

Page 14: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

formulation

answer & explanation

optimization

modeling

priors

intensional access

knowledge update

Knowledge

need

Current

knowledge

of the world

Knowledge

acquisition plan

Uncertain

access result

Structured

source pro�les

Page 15: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

formulation

answer & explanation

optimization

modeling

priors

intensional access

knowledge update

Knowledge

need

Current

knowledge

of the world

Knowledge

acquisition plan

Uncertain

access result

Structured

source pro�les

Page 16: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

8/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

What this talk is about

� A personal perspective on how to approach intensional

data management

� Various applications of intensional data management and

how we solved them (own research and my students')

� Main tool used: reinforcement learning (bandits, Markov

decision processes)

� Focus on one such application: database tuning

Page 17: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

9/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Outline

Intensional data management

Reinforcement learning

Applications

Focus: Database Tuning

Conclusion

Page 18: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

10/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Reinforcement learning

� Deals with agents learning to interact with a partially

unknown environment

� Agents can perform actions, resulting in rewards (or

penalties) and in a possible state change

� Agents learn about the world by interacting with it

� Goal: �nd the right sequence of actions that maximizes

overall reward, or minimizes overall penalty

� Classic tradeo�: exploration vs exploitation

Page 19: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

11/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Multi-armed bandits

� Stateless model

� k > 1 di�erent actions, with

unknown rewards

� often assumed that the

rewards are from a

parametrized probabilistic

distribution (Bernoulli,

Gaussian, Exponential, etc.)

Page 20: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

12/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Markov decision process (MDP)

� Finite set of states

� In each state, actions lead:

� to a state change

� to a reward

� Depending on cases:

� state changes may be

deterministic or

probabilistic

� state changes may be known

or unknown (to be learned)

� rewards may be known or

unknown (to be learned)

� current state may even be

unknown! (poMDP)

Page 21: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

13/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Outline

Intensional data management

Reinforcement learning

Applications

Focus: Database Tuning

Conclusion

Page 22: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

14/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Adaptive focused crawling (stateless)[Gouriten et al., 2014]

with

3

5 0

4

0

0

3

5

3

4

2

2

3

3

5 0

4

0

0

3

5

3

4

2

2

3

2 3

0

00

10

0

1

0

1

3

00

0

0

� Problem: E�ciently crawl nodes

in a graph such that total score is

high

� Challenge: The score of a node is

unknown till it is crawled

� Methodology: Use various

predictors of node scores, and

adaptively select the best one so

far with multi-armed bandits

Page 23: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

14/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Adaptive focused crawling (stateless)[Gouriten et al., 2014]

with

3

5 0

4

0

0

3

5

3

4

2

2

3

3

5 0

4

0

0

3

5

3

4

2

2

3

2 3

0

00

10

0

1

0

1

3

00

0

0

� Problem: E�ciently crawl nodes

in a graph such that total score is

high

� Challenge: The score of a node is

unknown till it is crawled

� Methodology: Use various

predictors of node scores, and

adaptively select the best one so

far with multi-armed bandits

Page 24: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

14/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Adaptive focused crawling (stateless)[Gouriten et al., 2014]

with

3

5 0

4

0

0

3

5

3

4

2

2

3

3

5 0

4

0

0

3

5

3

4

2

2

3

2 3

0

00

10

0

1

0

1

3

00

0

0

� Problem: E�ciently crawl nodes

in a graph such that total score is

high

� Challenge: The score of a node is

unknown till it is crawled

� Methodology: Use various

predictors of node scores, and

adaptively select the best one so

far with multi-armed bandits

Page 25: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

15/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Adaptive focused crawling (stateful)

with

� Problem: E�ciently crawl nodes

in a graph such that total score is

high, taking into account

currently crawled graph

� Challenge: Huge state space

� Methodology: MDP, clustering of

state space, together with linear

approximation to value functions

Page 26: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

16/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Optimizing crowd queries for data mining[Amsterdamer et al., 2013a,b]

with

� Problem: To �nd patterns in the

crowd, what is the best question

to ask the crowd next?

� Challenge: No a priori

information on crowd data

� Methodology: Model all possible

questions as actions in a

multi-armed bandit setting, and

�nd a trade-o� between

exploration and exploitation

Page 27: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

17/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Online in�uence maximization[Lei et al., 2015]

with

Action Log

User | Action | Date

1 1 2007-10-102 1 2007-10-123 1 2007-10-142 2 2007-11-104 3 2007-11-12

. . .

1

42

3

0.5

0.1 0.9

0.50.2

. . .

Real World Simulator

1

42

3

. . .

Social Graph

1

42

3

. . .

Weighted Uncertain Graph

One Trial

Sampling

1

2

Activation SequencesUpdate

3

Solution Framework

� Problem: Run in�uence

campaigns in social networks,

optimizing the amount of

in�uenced nodes

� Challenge: In�uence probabilities

are unknown

� Methodology: Build a model of

in�uence probabilities and focus

on in�uent nodes, with an

exploration/exploitation trade-o�

Page 28: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

18/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Routing of Autonomous Taxis[Han et al., 2016]

with

� Problem: Route a taxi to

maximize its pro�t

� Challenge: Real-world data, no a

priori model of the world

� Methodology: MDP, with

standard Q-learning and

customized

exploration/exploitation strategy

Page 29: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

19/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Cost-Model�Free Database Tuning[Basu et al., 2015, 2016]

with

� Problem: Automatically �nd

which indexes to create in a

database for optimal performance

� Challenge: The workload and

cost model are unknown

� Methodology: Model database

tuning as a Markov decision

process and use reinforcement

learning techniques to iteratively

learn a cost model and workload

characteristics

Page 30: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

20/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Outline

Intensional data management

Reinforcement learning

Applications

Focus: Database Tuning

Motivation

Problem Formulation

Adaptive Tuning Algorithm

COREIL: Index Tuner

Performance Evaluation

Conclusion

Page 31: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

21/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Outline

Intensional data management

Reinforcement learning

Applications

Focus: Database Tuning

Motivation

Problem Formulation

Adaptive Tuning Algorithm

COREIL: Index Tuner

Performance Evaluation

Conclusion

Page 32: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

22/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Motivation

� Current query optimizers depend on pre-determined cost

models

� But cost models can be highly erroneous

the cardinality model. In my experience, the cost modelmay introduce errors of at most 30% for a givencardinality, but the cardinality model can quite easilyintroduce errors of many orders of magnitude! I’llgive a real-world example in a moment. With sucherrors, the wonder isn’t “Why did the optimizer pick abad plan?” Rather, the wonder is “Why would theoptimizer ever pick a decent plan?”

Guy Lohman, IBM Research, ACM SIGMOD Blog 2014

Page 33: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

23/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Proposed Solution

� We propose and validate a tuning strategy to do without

such a pre-de�ned model

� The process of database tuning is modeled as a Markov

decision process (MDP)

� A reinforcement learning based algorithm is developed to

learn the cost function

� COREIL replaces the need of pre-de�ned knowledge of cost

in index tuning

Page 34: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

24/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Outline

Intensional data management

Reinforcement learning

Applications

Focus: Database Tuning

Motivation

Problem Formulation

Adaptive Tuning Algorithm

COREIL: Index Tuner

Performance Evaluation

Conclusion

Page 35: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

25/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Problem

Database Schema: R

WarehouseW

Customer 2Customer 1 Customer 3

Queries

1) New order

2) Delivery

3) Stock

Tables

1) History

2) Stock

3) New orders

4) Stocks

s

Set of all Database Configurations: S = {s}

… …..

t = 201 Customer 1, New order

t = 202 Stock

t = 203 Customer 2, Delivery

… …..

Schedule of queries and updates: Q

Page 36: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

26/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Transition

1ts ts

tqQuery

DB configurationUpdated

DB configuration

1,t ts s

cost ,t ts q

Query execution

1 1( , , ) , cost ,t t t t t t tC s s q s s s q Per-stage cost

Configuration update

Page 37: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

27/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Mapping to MDP

1ts ts

tq

1,t ts s

cost ,t ts q

Query execution

1 1( , , ) , cost ,t t t t t t tC s s q s s s q Per-stage cost

Configuration update

States

Action

Penalty function

Page 38: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

28/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

MDP Formulation

� State: Database con�gurations s 2 S

� Action: Con�guration changes st�1 ! st along with query

qt execution

� Penalty function: Per-stage cost of the action C(st�1; st; q̂t)

� Transition function: Transition from one state to another

on an action are deterministic

� Policy: A sequence of con�guration changes depending on

the incoming queries

Page 39: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

29/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Problem Statement

� For a policy � and discount factor 0 < < 1 the

cumulative penalty function or the cost-to-go function can

be de�ned as,

V �(s) , E

"1Xt=1

t�1C(st�1; st; q̂t)

#satisfying

8>><>>:s0 = s

st = �(st�1; q̂t);

t � 1

� Goal: Find out an optimal policy �� that minimizes the

cumulative penalty or the cost-to-go function

Page 40: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

30/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Features of The Model

� The schedule is sequential

� The issue of concurrency control is orthogonal

� Query qt is a random variable generated from an unknown

stochastic process

� It is always cheaper to do a direct con�guration change

� There is no free con�guration change

Page 41: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

31/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Outline

Intensional data management

Reinforcement learning

Applications

Focus: Database Tuning

Motivation

Problem Formulation

Adaptive Tuning Algorithm

COREIL: Index Tuner

Performance Evaluation

Conclusion

Page 42: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

32/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Policy Iteration

A dynamic programming approach to solve MDP

� Begin with an initial policy �0 and initial con�guration s0

� Find an estimate V�0(s0) of the cost-to-go function

� Incrementally improve the policy using the current

estimate of the cost-to-go function. Mathematically,

V�t(s) = min

s02S

��(s; s0) + E

�cost(s0; q)

�+ V

�t�1(s0)�

� Carry on the improvement till there is no (or �) change in

policy

Page 43: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

33/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Problems with Policy Iteration

� Problem 1: The curse of dimensionality makes direct

computation of V hard

� Problem 2: There may be no proper model available

beforehand for the cost function cost(s; q)

� Problem 3: The probability distribution of queries being

unknown, it is impossible to compute the expected cost of

query execution

Page 44: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

34/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Solution: Reducing the Search Space

Proposition

Let s be any con�guration and q̂ be any observed query. Let

�� be an optimal policy. If ��(s; q̂) = s0, then

cost(s; q̂)� cost(s0; q̂) � 0: Furthermore, if �(s; s0) > 0, i.e., if

the con�gurations certainly change, then

cost(s; q̂)� cost(s0; q̂) > 0:

Thus, the reduced subspace of interest

Ss;q̂ = fs0 2 S j cost(s; q̂) > cost(s0; q̂)g

Page 45: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

35/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Solution: Learning the Cost Model

� Changing the con�guration from s to s0 can be considered

as executing a special query q(s; s0)

� Then the cost model can be approximated as

�(s; s0) = cost(s; q(s; s0)) � ���T���(s; q(s; s0))

� This approximation can be improved recursively using

Recursive Least Square Estimation (RLSE) algorithm

� Similar linear projection ���(s) can be used to approximate

the cost-to-go function V�t(s)

Page 46: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

36/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Outline

Intensional data management

Reinforcement learning

Applications

Focus: Database Tuning

Motivation

Problem Formulation

Adaptive Tuning Algorithm

COREIL: Index Tuner

Performance Evaluation

Conclusion

Page 47: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

37/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

What is COREIL?

COREIL is an index tuner, that

� instantiates our reinforcement learning framework

� tunes the con�gurations di�ering in their secondary indexes

� handles the con�guration changes corresponding to the

creation and deletion of indexes

� inherently learns the cost model and solves an MDP for

optimal index tuning

Page 48: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

38/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

COREIL: Reducing the State Space

� I be the set of all possible indexes

� Each con�guration s 2 S is an element of the power set 2jIj

� r(q̂) be the set of recommended indexes for a query q̂

� d(q̂) be the set of indexes being modi�ed (update, insertion

or deletion) by q̂

� The reduced search space is

Ss;q̂ = fs0 2 S j (s� d(q̂)) � s0 � (s [ r(q̂))g

� For B+ trees, pre�x closure hr(q̂)i replaces r(q̂) for better

approximation

Page 49: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

39/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

COREIL: Feature Mapping Cost-to-go Function

� We can de�ne

�s0(s) ,

8<:1; if s0 � s

�1; otherwise:8s; s0 2 S

Theorem

There exists a unique ��� = (�s0)s02S which approximates the

value function as

V (s) =Xs02S

�s0�s0(s) = ���T���(s)

Page 50: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

40/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

COREIL: Feature Mapping Per-stage Cost

� ���(s; q̂) captures the di�erence between the index set

recommended by the database system and that of the

current con�guration

� ���(s; q̂) takes values either 1 or 0 whether a query modi�es

any index in the current con�guration

� We de�ne the feature mapping

��� = (���T ; ���T )T

to approximate the functions � and cost

Page 51: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

41/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Outline

Intensional data management

Reinforcement learning

Applications

Focus: Database Tuning

Motivation

Problem Formulation

Adaptive Tuning Algorithm

COREIL: Index Tuner

Performance Evaluation

Conclusion

Page 52: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

42/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Dataset and Workload

� The dataset and workload conform to the TPC-C

speci�cation

� They are generated by the OLTP-Bench tool

� Each of the 5 transactions are associated with 3 � 5 SQL

statements (query/update)

� Response time of processing corresponding SQL statement

is measured using IBM DB2

� The scale factor (SF) used here is 2

Page 53: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

43/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

E�ciency

0 500 1;000 1;500 2;000 2;500 3;0000

1;000

2;000

3;000

4;000

5;000

Query #

Tim

e(m

s)

COREILWFIT

Page 54: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

44/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Box-plot Analysis

COREIL WFIT

200

400

600

800Tim

e(m

s)

Page 55: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

45/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Overhead Cost Analysis

0 500 1;000 1;500 2;000 2;500 3;0000

500

1;000

1;500

Query #

Tim

e(m

s)

COREILWFIT

Page 56: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

46/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

E�ectiveness

0 500 1;000 1;500 2;000 2;500 3;000

101

102

103

Query #

Tim

e(m

s)

COREILWFIT

Page 57: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

47/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Outline

Intensional data management

Reinforcement learning

Applications

Focus: Database Tuning

Conclusion

Page 58: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

48/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

In brief

� Intensional data management arises in a large variety of

settings, whenever there is a cost to accessing data

� Reinforcement learning (bandits, MDPs) is a key tool in

dealing with such data

� Various complications in data management settings: huge

state space, no a priori model for rewards/penatlies,

delayed rewards. . .

� Rich �eld of applications for RL research!

Page 59: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

49/49

Intensional data management Reinforcement learning Applications Focus: Database Tuning Conclusion

Merci.

Page 60: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

Bibliography I

Yael Amsterdamer, Yael Grossman, Tova Milo, and Pierre

Senellart. Crowd mining. In Proc. SIGMOD, pages 241�252,

New York, USA, June 2013a.

Yael Amsterdamer, Yael Grossman, Tova Milo, and Pierre

Senellart. Crowd miner: Mining association rules from the

crowd. In Proc. VLDB, pages 241�252, Riva del Garda,

Italy, August 2013b. Demonstration.

Debabrota Basu, Qian Lin, Weidong Chen, Hoang Tam Vo,

Zihong Yuan, Pierre Senellart, and Stéphane Bressan.

Cost-model oblivious database tuning with reinforcement

learning. In Proc. DEXA, pages 253�268, Valencia, Spain,

September 2015.

Page 61: Reinforcement learning for intensional data managementpierre.senellart.com/talks/telecom-20180222.pdf · Structured data is everywhere ... known subgraph in Web crawling). If the

Bibliography IIDebabrota Basu, Qian Lin, Weidong Chen, Hoang Tam Vo,

Zihong Yuan, Pierre Senellart, and Stéphane Bressan.

Regularized cost-model oblivious database tuning with

reinforcement learning. Transactions on Large-Scale Data

and Knowledge-Centered Systems, 28:96�132, 2016.

Georges Gouriten, Silviu Maniu, and Pierre Senellart. Scalable,

generic, and adaptive systems for focused crawling. In Proc.

Hypertext, pages 35�45, Santiago, Chile, September 2014.

Douglas Engelbart Best Paper Award.

Miyoung Han, Pierre Senellart, Stéphane Bressan, and Huayu

Wu. Routing an autonomous taxi with reinforcement

learning. In Proc. CIKM, Indianapolis, USA, October 2016.

Industry track, short paper.

Siyu Lei, Silviu Maniu, Luyi Mo, Reynold Cheng, and Pierre

Senellart. Online in�uence maximization. In Proc. KDD,

pages 645�654, Sydney, Australia, August 2015.