
1

Quality of Service Guarantees for Multimedia Digital Libraries and Beyond

Gerhard Weikum

weikum@cs.uni-sb.de

http://www-dbs.cs.uni-sb.de

2

Vannevar Bush's Memex (1945): Collect all human knowledge into computer storage

Size of today's and tomorrow's applications:

Everything you see or hear: 1 MB/s × 50 years ≈ 2 PB

Library of Congress: 20 TB books + 200 TB maps + 500 TB video + 2 PB audio

Challenges:
• size of data
• performance & QoS
• intelligent search

3

Multimedia Data Management

[Architecture figure: clients connected over a high-speed network with QoS guarantees to a data server (server memory buffer on top of a parallel disk system); the server manages discrete data, index data, and continuous data, and itself provides QoS guarantees.]

4

The Need for Performance and QoS Guarantees

Examples of today's best-effort services:
"Internal Server Error. Our system administrator has been notified. Please try again later."
"Check Availability (Look-Up Will Take 8-25 Seconds)"

Observations:
• Service performance is best-effort only
• Response time is unacceptable during peak load because of queueing delays
• Performance is mostly unpredictable!

5

From Best Effort to Performance & QoS Guarantees

"Our ability to analyze and predict the performance of the enormously complex software systems ... are painfully inadequate"
(Report of the US President's Information Technology Advisory Committee)

• Very slow servers are like unavailable servers
• Tuning for peak load requires predictability of the (workload, configuration) → performance function
• Self-tuning requires mathematical models
• Stochastic guarantees for a huge number of clients

6

Outline

The Need for Performance Guarantees

Towards a Science of QoS Guarantees

QoS for Continuous-Data Streams

Caching and Prefetching for Discrete Data

Self-tuning Servers using Stochastic Predictions

7

Performance and Service Quality of Continuous-Data Streams

• Quality of service (QoS): (almost) no "glitches"
• High throughput (= many concurrently active streams)
⇒ admission control

8

Data Placement and Scheduling

Partitioning of C-data objects with VBR (variable bit rate) into CTL fragments (of constant time length)
Coarse-grained striping with round-robin allocation
Periodic, variable-order scheduling organized in rounds of duration T (= fragment time length)

[Figure: timeline 0, T, 2T, 3T for three disks, each serving the fragments of streams 1, 2, 3 in varying order within each round; admission control decides per round whether a new stream may start: "Yes, go ahead!" / "No way! Now go ahead!"]

9

Admission Control with Stochastic QoS Guarantees

Worst-case QoS: Admit at most N streams such that N · Tmax ≤ T

Stochastic QoS: Admit at most N streams such that P[total service time > T] ≤ threshold ε
- tolerable by most multimedia applications
- appropriate with many workload and system parameters being random variables
- allows much better resource utilization compared to worst-case modeling

10

Mathematical Tools

X, Y, ...: continuous random variables with non-negative, real values

(cumulative) distribution function of X: F_X(x) = P[X ≤ x]

probability density function of X: f_X(x) = F_X'(x)

Laplace-Stieltjes transform (LST) of X: f*_X(s) = ∫_0^∞ e^(−sx) f_X(x) dx = E[e^(−sX)]

Convolution: F_{X+Y}(z) = ∫_0^z f_X(x) F_Y(z−x) dx, i.e. f*_{X+Y}(s) = f*_X(s) · f*_Y(s)

Chernoff bound: P[X ≥ t] ≤ inf { e^(−θt) · f*_X(−θ) | θ > 0 }
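To make these tools concrete, here is a small numeric sketch (my own example, not from the talk): for X ~ Exp(λ) the LST is f*_X(s) = λ/(λ+s), so f*_X(−θ) = E[e^(θX)] exists for θ < λ and the Chernoff bound can be evaluated by a grid search over θ, then compared against the exact tail e^(−λt).

```python
import math

def lst_exp(lam, s):
    # Laplace-Stieltjes transform of Exp(lam): E[e^{-sX}] = lam / (lam + s)
    return lam / (lam + s)

def chernoff_tail(lam, t, steps=10000):
    # P[X >= t] <= inf_{0 < theta < lam} e^{-theta t} * f*(-theta)
    best = 1.0
    for i in range(1, steps):
        theta = lam * i / steps          # stay below lam so f*(-theta) exists
        best = min(best, math.exp(-theta * t) * lst_exp(lam, -theta))
    return best

lam, t = 1.0, 5.0
exact = math.exp(-lam * t)               # exact tail of Exp(1)
bound = chernoff_tail(lam, t)
# the bound is valid (exact <= bound) though not tight
```

As the comparison shows, the bound always dominates the true tail probability; its appeal is that it only needs the LST, which composes multiplicatively under sums of independent delays.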

11

Total Service Time Per Round(With N Streams Per Disk)

T T T Tserv seek rot i trans ii

N

i

N

, ,

11f f f fserv seek rot

Ntrans

N* * * *

T N tseekZ

NSEEKseek

( ) :1

1 f s eseek

s SEEK* ( )

f se

s ROTrot

s ROT

* ( ) 1

f t

ROTrot ( ) 1

f sC ROT

C ROT strans* ( )/

/

F t F

tROT

Ctrans size( )

with

f x x esizex( ) ( ) / ( ) 1

P T t e fservt

serv[ ] inf * ( )

0
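A sketch of how the Chernoff-bound admission test compares with worst-case admission, under simplifying assumptions of my own (deterministic seek and transfer time per stream, uniformly distributed rotational delay, and hypothetical parameter values, not the talk's disk model):

```python
import math

def mgf_uniform(theta, rot):
    # E[e^{theta * U}] for rotational delay U ~ Uniform(0, rot)
    return (math.exp(theta * rot) - 1.0) / (theta * rot)

def p_late_bound(n, T, t_seek, rot, t_trans, steps=4000, theta_max=400.0):
    # Chernoff bound on P[T_serv > T] for
    # T_serv = n*(t_seek + t_trans) + sum of n iid uniform rotational delays
    best = 1.0
    for i in range(1, steps + 1):
        theta = theta_max * i / steps
        log_mgf = theta * n * (t_seek + t_trans) + n * math.log(mgf_uniform(theta, rot))
        best = min(best, math.exp(log_mgf - theta * T))
    return best

def admit_max(T, t_seek, rot, t_trans, eps):
    # largest N for which the stochastic guarantee P[T_serv > T] <= eps holds
    n = 0
    while p_late_bound(n + 1, T, t_seek, rot, t_trans) <= eps:
        n += 1
    return n

# hypothetical parameters (seconds): 5 ms seek, 10 ms transfer per fragment,
# 10 ms rotation time; round length T = 1 s, lateness threshold eps = 1e-3
T, eps = 1.0, 1e-3
n_stoch = admit_max(T, 0.005, 0.010, 0.010, eps)
t_max = 0.005 + 0.010 + 0.010              # worst case per stream
n_worst = int(T / t_max + 1e-9)            # worst-case rule: N * Tmax <= T
```

With these numbers the stochastic test admits several streams more than the worst-case rule while still bounding the lateness probability by ε, which is exactly the resource-utilization argument made on the admission-control slide.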


13

Stochastic versus Worst-Case QoS Guarantees

[Plot: lateness probability p_late (0 to 1.2) vs. number of streams N (12 to 40); curves: analytic model vs. real (measured).]

14

[Plot, zoomed: p_late (0 to 0.3) vs. N (12 to 30); curves: analytic model vs. real (measured), with the worst-case admission limit marked.]

15

Generalization to Mixed-Workload Servers

[Figure: rounds 0, T, 2T, 3T, 4T; arrivals of discrete-data requests, departures of completed discrete-data requests, and their response times spanning several rounds.]

Additional performance guarantee for discrete-data requests: P[response time ≤ t] ≥ γ (e.g., with t = 2 s and γ = 0.95)

Needs clever scheduling and a sophisticated stochastic model to provide both continuous-data and discrete-data guarantees

16

QoS & Performance Guarantees for Mixed-Workload Servers

For continuous data:
P[glitch frequency of a stream > tolerance] ≤ threshold
P[admission/startup delay of a stream > tolerance] ≤ threshold

For discrete data:
P[response time > tolerance t] ≤ threshold ε (e.g., t = 2 seconds, ε = 5 percent)

Auto-Configuration of Data Server:
A detailed analytic model can derive the minimum-cost server configuration for specified QoS & performance requirements, incl. differentiated QoS for multiple user/request classes

17

Outline

The Need for Performance Guarantees

Towards a Science of QoS Guarantees

QoS for Continuous-Data Streams

Caching and Prefetching for Discrete Data

Self-tuning Servers using Stochastic Predictions

18

The Need for Caching in Storage Hierarchies

[Figure: storage hierarchy from clients via a proxy and the Internet to a search engine and DL server with ontologies, XML etc.; capacity tiers of 50 GB, 5 TB, and 100 TB.]

Very high access latency! ⇒ Caching & Prefetching

19

Basic Caching Policies

LRU: Drop the page that has been least recently used

[Example figure: reference timeline 1 ... 24 up to now, with references X, Y to pages A, B, C, D.]

LRU-k: Drop the page with the oldest k-th last reference
estimates heat(p) = k / (t_now − t_k(p)), where t_k(p) is the time of the k-th most recent reference to p
optimal under the IRM (independent reference model)
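The LRU-k rule can be coded compactly. The sketch below is a minimal illustration of the policy as stated on the slide (not the reference implementation from the LRU-k paper); it keeps the last k reference times per page, even after eviction, and evicts the page with the largest backward k-distance:

```python
from collections import defaultdict

class LRUK:
    """Minimal LRU-k sketch: evict the cached page whose k-th most recent
    reference is oldest; pages with fewer than k references count as
    infinitely old and are therefore preferred eviction victims."""

    def __init__(self, capacity, k=2):
        self.capacity, self.k = capacity, k
        self.clock = 0
        self.hist = defaultdict(list)   # page -> times of its last k references
        self.cache = set()

    def access(self, page):
        self.clock += 1
        h = self.hist[page]             # reference history survives eviction
        h.append(self.clock)
        if len(h) > self.k:
            h.pop(0)
        hit = page in self.cache
        if not hit:
            if len(self.cache) >= self.capacity:
                # k-th last reference time, or -inf if fewer than k references;
                # ties among <k-reference pages would need a subsidiary policy
                victim = min(self.cache,
                             key=lambda p: self.hist[p][0]
                             if len(self.hist[p]) == self.k else float("-inf"))
                self.cache.remove(victim)
            self.cache.add(page)
        return hit
```

For k = 2 this drops one-time "scan" pages before pages with an established re-reference pattern, which is the behavior the heat estimate above formalizes.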

20

LRU-k Optimality

IRM: pages 1 ... n with reference probabilities π_1 ... π_n (π_i ≥ π_{i+1}) and backward distances b_1 ... b_n

[Example figure: a reference string over pages 1, 2, 3 on the timeline up to now; b_2 is the backward distance of page 2.]

P[x has ref. prob. π_i | b_x = d] = P[b_x = d | x has π_i] · P[x has π_i] / Σ_{h=1..n} P[b_x = d | x has π_h] · P[x has π_h]

E[ref. prob. of x | b_x = d] = Σ_{h=1..n} π_h · P[x has π_h | b_x = d]
= Σ_{h=1..n} π_h · π_h^k · (1−π_h)^(d−k) · (1/n) / Σ_{i=1..n} π_i^k · (1−π_i)^(d−k) · (1/n)

Theorem: b_x ≤ b_y ⇒ E[π_x | b_x] ≥ E[π_y | b_y]
(i.e., evicting the page with the largest backward k-distance evicts the page with the smallest expected reference probability)

21

LRU-k as Maximum Likelihood Estimator

IRM: pages 1 ... n with reference probabilities π_1 ... π_n (π_i ≥ π_{i+1}) and backward distances b_1 ... b_n

For observation b_1, ..., b_n with b_i < b_{i+1}, maximize the likelihood

L = Π_{i=1..n} P[b_i | π_i] = Π_{i=1..n} C(b_i − 1, k − 1) · π_i^k · (1 − π_i)^(b_i − k)

∂ ln L / ∂π_i = k/π_i − (b_i − k)/(1 − π_i) = 0  ⇒  π_i = k / b_i

so ranking pages by backward k-distance b_i is ranking them by the ML estimate k/b_i of their reference probability
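A quick numeric check of the ML derivation (my own illustration, not from the slides): maximizing the likelihood C(b−1, k−1) · π^k · (1−π)^(b−k) over a fine grid of π recovers the estimate π = k/b.

```python
from math import comb

def likelihood(pi, b, k):
    # P[the k-th most recent reference of a page with ref. prob. pi
    #   lies exactly b steps back], under the IRM
    return comb(b - 1, k - 1) * pi**k * (1 - pi)**(b - k)

b, k = 50, 2
grid = [i / 1000 for i in range(1, 1000)]
pi_hat = max(grid, key=lambda pi: likelihood(pi, b, k))
# the maximum sits at pi = k/b = 0.04, the LRU-k heat estimate
```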

22

Cache Size Configuration

Cost / throughput consideration:
Keep a page in cache if C_cache ≤ C_disk
(e.g., memory at $50 / 32 MB vs. a $500 disk at 64 accesses/s, 1 KB pages: break-even inter-access interval ≈ 0.21 min)

Cost / response-time consideration:
Keep a page in cache if C_cache ≤ C_wait
(additionally charging the cost of users waiting, e.g., in $ per hour)

Response-time guarantee:
Minimum cache size M such that RT_percentile(..., f(hit ratio(M, g(...))), ...) ≤ RT_goal
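The cost/throughput rule can be turned into a small break-even calculator in the style of Gray's five-minute rule. The prices and rates below are hypothetical placeholders, not the slide's figures:

```python
def breakeven_interval_s(disk_price, ios_per_sec, mem_price_per_mb, page_kb):
    # Keep a page cached if its inter-access time is below this interval:
    # compare the cost of the disk-arm capacity one access/second consumes
    # with the cost of the memory that holds the page.
    io_cost = disk_price / ios_per_sec               # $ per sustained access/s
    mem_cost = mem_price_per_mb * page_kb / 1024.0   # $ to keep the page cached
    return io_cost / mem_cost

# hypothetical prices (NOT the slide's figures): $500 disk at 100 accesses/s,
# $1.50 per MB of memory, 8 KB pages
t_breakeven = breakeven_interval_s(500, 100, 1.50, 8)
```

The resulting interval shifts with technology prices, which is exactly why the slide frames cache sizing as a configuration decision rather than a fixed constant.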

23

LRU-k Cache Hit Rate (for Cache Size M)

P(W) := E[# distinct pages referenced at least k times in a window of W references]
= Σ_{i=1..n} Σ_{j=k..W} C(W, j) · π_i^j · (1 − π_i)^(W−j)

W~ := P^(−1)(M)

p_i := P[page i resides in cache] = Σ_{j=k..W~} C(W~, j) · π_i^j · (1 − π_i)^(W~−j)

H(M) := P[reference is a cache hit] = Σ_{i=1..n} π_i · p_i

[Plot: hit rate H(M) [%] (20 to 80) vs. cache size M, for LRU, LRU-2, LRU-3, and A0.]
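The P(W) → W~ → p_i → H(M) chain above can be evaluated directly. The sketch below is my own illustration with an assumed Zipf-like reference distribution; it inverts P numerically over integer window sizes:

```python
from math import comb

def lru_k_hit_rate(pi, M, k):
    # Analytic LRU-k hit rate under the IRM, following the slide's chain
    # P(W) -> W~ = P^{-1}(M) -> p_i -> H(M)
    def at_least_k(p, W):
        # P[a page with ref. prob. p is referenced >= k times in W references]
        return sum(comb(W, j) * p**j * (1 - p)**(W - j) for j in range(k, W + 1))
    def P(W):
        # E[# distinct pages referenced >= k times in a window of W references]
        return sum(at_least_k(p, W) for p in pi)
    W = 1
    while P(W) < M:   # numeric inversion: smallest window that "fills" the cache
        W += 1
    return sum(p * at_least_k(p, W) for p in pi)

# assumed Zipf-like reference probabilities over 20 pages, cache size 5
weights = [1 / i for i in range(1, 21)]
total = sum(weights)
pi = [w / total for w in weights]
h1 = lru_k_hit_rate(pi, 5, k=1)   # k = 1: classical LRU analogue
h2 = lru_k_hit_rate(pi, 5, k=2)
```

Models of this form are what let the talk size a cache for a target hit rate analytically instead of by trial and error.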

24

Stochastic Response Time Guarantee
with cache size M, block size S, and a multi-zone disk with known seek-time function, Z tracks of capacity Cmin ≤ Ci ≤ Cmax, rotation time T

f_R(t) = Σ_{i=1..n} π_i · ( p_i · f_Rcache(t) + (1 − p_i) · f_Rdisk(t) )

f*_R(s) = Σ_{i=1..n} π_i · ( p_i · f*_Rcache(s) + (1 − p_i) · f*_Rdisk(s) )

with LST f*_X(s) = ∫_0^∞ e^(−st) f_X(t) dt

M/G/1 queue (Pollaczek-Khinchine):
f*_Rdisk(s) = f*_serv(s) · s · (1 − ρ_disk) / ( s − λ_disk · (1 − f*_serv(s)) )
with λ_disk = λ · Σ_{i=1..n} π_i · (1 − p_i), ρ_disk = λ_disk · E[t_serv]
and f*_serv(s) = f*_seek(s) · f*_rot(s) · f*_trans(s)

Chernoff bound: P[R ≥ t] ≤ inf { e^(−θt) · f*_R(−θ) | θ > 0 }
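The Pollaczek-Khinchine transform above can be sanity-checked against the one case with a simple closed form: for exponential service the M/G/1 queue becomes M/M/1, whose response time is exactly Exp(μ − λ). This check is my own addition, not from the slides:

```python
def lst_mm1_response(s, lam, mu):
    # Pollaczek-Khinchine transform of the M/G/1 response time,
    # specialized to exponential service: f*_serv(s) = mu / (mu + s)
    f_serv = mu / (mu + s)
    rho = lam / mu
    return f_serv * s * (1 - rho) / (s - lam * (1 - f_serv))

# For M/M/1 the response time is exactly Exp(mu - lam), whose LST is
# (mu - lam) / (mu - lam + s); the PK formula must reproduce it.
lam, mu = 0.8, 1.0
err = max(abs(lst_mm1_response(s, lam, mu) - (mu - lam) / (mu - lam + s))
          for s in (0.1, 0.5, 1.0, 2.0))
```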

25

Extended LRU-k-based Policies

Generalization to variable-size documents:
temperature(d) = heat(d) / size(d); drop documents with lowest temperature

Generalization to non-uniform / hierarchical storage:
benefit(d) = temperature(d) · cost_fetch(d); drop documents with lowest benefit

Generalization to cooperative caching in a computer cluster:
cost_fetch(d) = cost_remote-cache(d) if d is a replica, cost_disk(d) if d is a singlet

Speculative Prefetching

[Figure: archive → cache]

Mask high access latency ⇒ speculative prefetching
Keep long-term beneficial data in cache ⇒ throttling of prefetching

Prefetch x iff benefit(x, T) > Σ { benefit(y, T) | y ∈ victims }

with benefit(x, T) = E[# accesses to x in time T] / size(x) · (RT_archive(x) − RT_cache(x))

with time horizon T = "max" (RT_archive)

Context-aware Prefetching and Caching

Model session behavior as a Markov chain with continuous state-residence times

[Figure: sessions as CTMCs over documents, e.g., access to doc. i leading to doc. f with p_if = 0.8, further transitions with probabilities 0.1, 0.3, 0.9, ...; state-residence time distributions P[time in i ≤ t] with means such as H_i = E[...] = 10 s, H_k = 10 s, H_f = 30 s; new sessions arrive with rate λ, H_{N+1} = c/λ.]

Superimpose the CTMCs of all active sessions
Incorporate arrivals of new sessions

CTMC-based Access Prediction

Given: states d_i (i = 1, ..., N+c) with transition probabilities p_ij and mean residence times H_i (departure rates ν_i = 1/H_i)

Uniformization with ν = max{ν_i}:
p̂_ij = (ν_i / ν) · p_ij for i ≠ j, and p̂_ii = 1 − ν_i / ν

Transient analysis for time horizon T:

N(x, T) = E[# accesses to x in T] = Σ_j E[T_{state(s), j}(T)] · (1/H_j) · p_jx

where E[T_ij(t)] = (1/ν) · Σ_{n=0..∞} [ 1 − e^(−νt) · Σ_{m=0..n} (νt)^m / m! ] · p̂^(n)_ij

with p̂^(m)_ij = Σ_{k=1..N+c} p̂^(m−1)_ik · p̂_kj and p̂^(0)_ij = 1 if i = j, 0 if i ≠ j
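The uniformization recipe above, expected sojourn times E[T_ij(t)] from powers of the uniformized chain weighted by Poisson tail probabilities, can be implemented directly. A small sketch of my own, assuming p_ii = 0 as in a pure jump chain and truncating the Poisson series:

```python
import math

def expected_sojourn(P, H, t, n_trunc=200):
    """Expected total time spent in each state j during [0, t], starting in
    state i, for a CTMC given by jump probabilities P[i][j] (with P[i][i] == 0)
    and mean residence times H[i]; computed via uniformization."""
    n = len(P)
    nu = max(1.0 / h for h in H)                     # uniformization rate
    # uniformized DTMC: Phat[i][j] = (nu_i/nu)*P[i][j], Phat[i][i] = 1 - nu_i/nu
    Phat = [[(1.0 / H[i]) / nu * P[i][j] if i != j else 1.0 - (1.0 / H[i]) / nu
             for j in range(n)] for i in range(n)]
    # E[T_ij(t)] = (1/nu) * sum_m P[Poisson(nu*t) > m] * Phat^m[i][j]
    ET = [[0.0] * n for _ in range(n)]
    power = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = math.exp(-nu * t)                         # P[Poisson(nu*t) = 0]
    poisson_cdf = term
    for step in range(n_trunc):
        tail = 1.0 - poisson_cdf                     # P[Poisson(nu*t) > step]
        for i in range(n):
            for j in range(n):
                ET[i][j] += tail * power[i][j] / nu
        power = [[sum(power[i][k] * Phat[k][j] for k in range(n))
                  for j in range(n)] for i in range(n)]
        term *= nu * t / (step + 1)
        poisson_cdf += term

    return ET

# toy example: two documents visited alternately, mean residence 1 s and 0.5 s
ET = expected_sojourn([[0.0, 1.0], [1.0, 0.0]], [1.0, 0.5], t=2.0)
```

A useful invariant for testing is that each row of E[T_ij(t)] sums to t, since the process is always in exactly one state; the per-state sojourn times then feed the access-rate prediction N(x, T) on the slide.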

MCMin Prefetching and Caching Algorithm

• access tracking and online bookkeeping for statistics
• periodic evaluation of N(state(s), T) for active sessions, based on approximate CTMC transient analysis ⇒ prefetching candidates
• Prefetch x iff benefit(x, T) > Σ { benefit(y, T) | y ∈ victims }
with benefit(x, T) = N(x, T) / size(x) · (RT_archive(x) − RT_cache(x) − Δ_penalty(x))
and Δ_penalty(x) = t'_serv(x) − t_serv(x)
• plus appropriate device scheduling at the server

Overhead:
• size of bookkeeping data < 0.02%
• compute time per access ≈ 1 ms
• both dynamically adjustable

Performance Experiments

Simulations based on WWW-server access patterns

[Chart: mean response time [s] (0 to 50) vs. cache size / archive size (0.20%, 1%, 2%) for the policies Lazy, Temp, and MCMin.]

Applicability of LRU-k and MCMin Family

• for Internet and intranet proxies and clients, with careful management of access statistics
• for data hoarding in mobile clients: when a client goes to low (or zero) connectivity, prefetch near-future relevant data and programs
• for (stochastically) guaranteed response time, as opposed to best-effort caching
• for caching of (partial) search results w.r.t. heterogeneous data servers, in data warehouses, digital libraries, etc.
• for adaptive broadcast of data feeds in networks with asymmetric bandwidth

Interesting Research Problems

? Response-time guarantee for MCMin
? Optimal (online) decisions about the amount of bookkeeping
? Caching and prefetching for differentiated QoS (multiple user/request classes)
? Caching of (partial) search results and prefetching for (speculative) query evaluation in ranked (XML) retrieval

33

Outline

The Need for Performance Guarantees

Towards a Science of QoS Guarantees

QoS for Continuous-Data Streams

Caching and Prefetching for Discrete Data

Self-tuning Servers using Stochastic Predictions

34

Advancing the State of the Art on QoS

Benefit of stochastic models and derived algorithms/systems over commercial state-of-the-art systems (e.g., Oracle Media Server, MS NetShow Theater Server, etc.):

+ Predictable Performance
+ Substantially Better Cost/Performance
+ Major Building Blocks for a Configuration Tool for Specified QoS Guarantees and Self-tuning, Zero-admin Operation

35

QoS in (Web) Query Processing

Responsiveness & cost-effectiveness (performance)

Credibility

Timeliness

Accuracy. Example: Select ... Sum(O.Amount)

Comprehensiveness. Example: association rules of the kind Software Engineering & Y2K ⇒ Astrology

Combined with IR & multimedia. Examples:
Where ... P About {"Mining", "19th Century"} ...
Where ... P.Category = "CDs" And P Sounds Like ...

36

The End

"Low-hanging fruit" engineering: a 90% solution with 10% intellectual effort
⇒ self-tuning servers with guaranteed performance

"Web engineering" for end-to-end QoS will rediscover stochastic modeling or will fail

Need libraries of composable building blocks with predictable behavior and (customizable) QoS guarantees

Conceivable killer argument: infinite RAM & network bandwidth and zero latency (for free)

But:
• An engineer is someone who can do for a dime what any fool can do for a dollar.
• Predictions are very difficult, especially about the future.
