Hierarchical Caching and Prefetching for Continuous Media Servers with Smart Disks



By: Amandeep Singh and Parth Kushwaha

Index:
• Introduction
• Media Servers
• Proposed Algorithms
  - Sweep and Prefetch (S&P)
  - Gradual Prefetching
  - Grouped Periodic Multi-Round Prefetching (GPMP)
• Performance Evaluation
• The Experiment
• Results
• Conclusion
• Future Use
• References

Introduction

• Due to the increase in CPU performance, I/O systems have become the performance bottleneck.

• The main reason for this bottleneck is the mechanical movement of the disk head.

• This work introduces several algorithms that exploit emerging smart-disk technologies to increase data throughput on media servers.

Media Server

• A media server is a device that stores and shares media files such as audio and video.

• It offers clients access to media types such as video, audio, and so on.

• To avoid glitches, the media server must retrieve data from secondary storage at a specific rate.

Working of a Media Server

• To avoid glitches, the media server requires a double buffer in main memory to ensure a data stream's continuous playback.

• Data read from the disk fills one half of the buffer while the other half is used to play the video.

• When the data in the first buffer is consumed, a switch occurs: the media server plays the video from the second buffer and uses the now-empty buffer for storage (see the sketch below).
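The switch logic can be made concrete with a short sketch; the block size and class names here are illustrative assumptions, not the paper's implementation.

```python
# Minimal double-buffering sketch. BLOCK_SIZE and the API are
# illustrative assumptions, not the paper's actual implementation.
BLOCK_SIZE = 256 * 1024  # bytes fetched per stream per round (assumed)

class DoubleBuffer:
    def __init__(self):
        self.buffers = [bytearray(BLOCK_SIZE), bytearray(BLOCK_SIZE)]
        self.play_idx = 0  # buffer currently feeding playback

    def fill_idle(self, data: bytes):
        """Disk reads land in the buffer that is not being played."""
        idle = 1 - self.play_idx
        self.buffers[idle][:len(data)] = data

    def switch(self) -> bytearray:
        """Once the playing buffer is consumed, swap roles: the freshly
        filled buffer plays while the empty one is refilled."""
        self.play_idx = 1 - self.play_idx
        return self.buffers[self.play_idx]
```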

Contd..

• A media server serves multiple requests concurrently.

• Therefore, the media server serves streams in rounds of a fixed length.

• During a round, the system reads one block for each stream (see the sketch below).
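A minimal sketch of this round-based service loop, assuming a fixed round length, a `read_block` callable, and stream objects with a `next_block()` method (all illustrative names):

```python
# Round-based service sketch: one block per active stream per round.
# round_length, read_block() and Stream.next_block() are assumptions.
import time

def serve_rounds(streams, read_block, round_length=1.0):
    while streams:
        start = time.monotonic()
        for s in streams:               # one block for each stream
            read_block(s.next_block())  # fills the stream's idle buffer
        # idle out the remainder so every round has the same fixed length
        elapsed = time.monotonic() - start
        time.sleep(max(0.0, round_length - elapsed))
```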


Contd..

• Each request to be served in a round is added to a service list.

• A resolution mechanism maps each requested video block to a disk block.

• The I/O controller routes the requested disk-block list to the disk drives.

• Then the low-level scheduler schedules the corresponding disk blocks for each drive, reducing disk-head positioning overhead (a minimal sketch follows).
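As an illustration of how a low-level scheduler can cut head-positioning overhead, the sketch below orders requests into a single SCAN (elevator) sweep; identifying blocks by starting sector is an assumption, and this is not DiskSim's or the paper's code:

```python
# SCAN-order sketch (illustrative; not the paper's or DiskSim's code).
def scan_order(block_sectors, head_pos):
    """Serve all blocks at or past the head first, in ascending sector
    order, then the rest, so the head sweeps the platter once."""
    ahead = sorted(s for s in block_sectors if s >= head_pos)
    behind = sorted(s for s in block_sectors if s < head_pos)
    return ahead + behind

# Example: head currently at sector 500
print(scan_order([120, 830, 510, 40, 660], head_pos=500))
# -> [510, 660, 830, 40, 120]
```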

Contd..

• The disk head then transfers the requested data from the disk surface to the disk buffer cache.

• Over the I/O bus, this buffer-cache data is then transferred to the server's RAM.

Contd..

• Several algorithms have been introduced to achieve maximum throughput, i.e., the maximum number of streams the disk can serve.

• Two important factors that limit maximum throughput are:

1. Time required to transfer data from the disk to main memory

2. Cost of cache memory

Proposed Algorithms

• The proposed algorithms increase maximum throughput while keeping the round duration constant, with relatively low memory requirements.

• To improve the drive's maximum stream throughput, the algorithms use caching and prefetching techniques.

• Throughput increases because prefetched requests are served from the disk's cache, without head-positioning overhead.

Sweep and Prefetch (S&P) Algorithm

• Blocks fetched on demand from arbitrary disk locations are called randomly retrieved blocks; each such retrieval incurs head-positioning overhead.

• In this algorithm, while the disk head is reading a block, it also prefetches the adjacent blocks and stores them in a cache; these prefetches incur no head-positioning overhead.

CONTD..

• In this example, at most 25 randomly retrieved blocks fit in one round.

• The exchange ratio between randomly retrieved and prefetched blocks is 5/8: retrieving 8 prefetched (sequential) blocks takes as much time as retrieving 5 random blocks (see the helper below).
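The 25-block budget and the 5/8 ratio from the slides can be checked with a tiny helper; the function itself is just illustrative arithmetic:

```python
# Round-budget check: v random reads plus p prefetched reads fit in a
# round when v + (5/8) * p <= 25 (constants taken from the slides).
RANDOM_BUDGET = 25   # max randomly retrieved blocks per round
TRADE_RATIO = 5 / 8  # 8 prefetched reads cost as much as 5 random ones

def fits_in_round(v: int, p: int) -> bool:
    return v + TRADE_RATIO * p <= RANDOM_BUDGET

# Giving up 5 random reads frees exactly enough time for 8 prefetches:
assert fits_in_round(20, 8)      # 20 + 5.0 = 25, fits
assert not fits_in_round(21, 8)  # 21 + 5.0 = 26, does not fit
```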

Contd..

• During rounds 1 and 2 of fig. (a), all blocks are randomly retrieved.

• During round 1 of fig. (b), the last five blocks are served through the high-level cache buffer.

• Since the exchange ratio is 5/8, the time freed by those 5 random retrievals can be used to prefetch 8 blocks from the disk.

Contd..

• During round 2 of fig. (b), the first 8 blocks have already been prefetched, so they need not be retrieved from disk.

• In the time saved by those 8 prefetched blocks, the round can accommodate 3 additional randomly accessed blocks and 8 more prefetched blocks.

Issues With Sweep & Prefetch:

• Until maximum throughput is reached, S&P serves streams without prefetching.

• S&P requires 3 cache buffers for each stream that is cached at the higher level:
  - 2 for the double buffer
  - 1 for the multi-disk controller

• Thus each stream's block will be skipped in some round, creating extra start-up latency.

Gradual Prefetching:

• Forces the server to work under S&P at all times, regardless of the number of concurrent streams.

• At any time, the disk head prefetches blocks for half of all supported streams.

• For every 2 newly admitted streams, 1 will have its next block prefetched in the first round.

• Since the server always operates as if at the maximum number of supported streams, there is always time to prefetch an additional block for half of all new streams.

• Hence, no stream incurs extra start-up latency and triple buffering is no longer required (a sketch of the admission rule follows).
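A minimal sketch of this admission rule, assuming a simple stream object (all names are illustrative):

```python
# Gradual-prefetching admission sketch: of every 2 newly admitted
# streams, 1 gets its next block prefetched in its first round.
# The Stream class and counters are illustrative assumptions.
class Stream:
    def __init__(self, stream_id):
        self.stream_id = stream_id
        self.prefetch_first_round = False

class GradualAdmitter:
    def __init__(self):
        self.admitted = 0

    def admit(self, stream: Stream) -> Stream:
        self.admitted += 1
        # every second admission is marked for a first-round prefetch
        stream.prefetch_first_round = (self.admitted % 2 == 0)
        return stream
```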

Gradual Prefetching example, round by round (v = randomly retrieved blocks, p = blocks prefetched for later rounds; each round's effective cost v + 5p/8 stays ≈ 25, and the streams serviced equal v plus the blocks prefetched in the previous round):

• Round 1: v = 19, p = 9, cost ≈ 19 + 5.6 ≈ 25, streams serviced = 19
• Round 2: v = 17, p = 12, cost ≈ 17 + 7.5 ≈ 25, streams serviced = 26
• Round 3: v = 16, p = 14, cost ≈ 16 + 8.7 ≈ 25, streams serviced = 28
• Round 4: v = 16, p = 14, cost ≈ 16 + 8.7 ≈ 25, streams serviced = 30

Grouped Periodic Multi-Round Prefetching (GPMP):

• This algorithm temporarily stores prefetched blocks in the host’s cache.

• Epoch: an epoch, or virtual round, is the total duration of a fixed number of actual rounds.

• At the system level, GPMP offers finer-grained disk requests per stream, more flexible configuration, and higher service quality because of lower start-up latencies.

• During a GPMP round, the media server serves all streams.

• It delivers to the host all the blocks needed to sustain playback of the supported streams.

• These blocks don't have to be retrieved from disk in each round.

• During a round, one group of v streams (a fraction of the supported streams) is randomly retrieved.

• Time remains in the round for u prefetches for each of these streams (u = number of blocks prefetched per stream).

• In the next round, the blocks sustaining playback of the next group are read from disk, again with u prefetched blocks per stream.

• All streams are still served in each round: the N − v streams not retrieved from disk are served from the cache, having been prefetched in previous rounds.

• The u prefetched blocks sustain playback for u rounds, after which the same streams read from disk in a given round are read from disk again.

• Epoch length is k = u + 1.

• The total number of supported streams is N = v(u + 1).

• "Played" blocks are immediately discarded (a worked example follows).
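Plugging illustrative numbers into these formulas shows how the group size v and prefetch depth u trade off; the input values below are assumptions for illustration, not the paper's:

```python
# GPMP bookkeeping from the slides: epoch length k = u + 1 rounds and
# total supported streams N = v * (u + 1). Inputs are illustrative.
def gpmp_capacity(v: int, u: int):
    k = u + 1        # rounds per epoch (virtual round)
    N = v * (u + 1)  # streams sustained across the whole epoch
    return k, N

# e.g. 16 streams randomly retrieved per round, 2 prefetches each:
k, N = gpmp_capacity(v=16, u=2)
print(k, N)  # 3 rounds per epoch, 48 supported streams
```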

Performance evaluation

• To evaluate the algorithms' performance, the continuous media server is simulated with a pregenerated workload similar to that of typical server use.

• DiskSim is used. Out of the box, it does not address onboard-cache issues such as the ability to enable or disable caching, to switch prefetching on and off, or to change disk-cache segmentation.

• The DiskSim code is modified to add these capabilities.

The Experiment

• A video library holds videos 90 to 120 minutes long, ordered according to popularity following a Zipfian distribution.

• New video requests arrive according to a Poisson distribution. Disk partitioning is assumed.

• The S&P algorithms are implemented using a trace generator to pass hints and evaluate the algorithms (a workload sketch follows).
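A minimal workload-generation sketch under the stated assumptions (Zipf-ranked popularity, Poisson arrivals); the library size, arrival rate, and skew parameter are illustrative values, not the paper's:

```python
# Illustrative workload generator: Zipfian video popularity plus
# Poisson request arrivals. All parameter values are assumptions.
import random

def zipf_weights(n_videos: int, skew: float = 1.0):
    w = [1.0 / rank ** skew for rank in range(1, n_videos + 1)]
    total = sum(w)
    return [x / total for x in w]

def generate_requests(n_videos=100, rate=0.5, duration=3600.0, seed=42):
    rng = random.Random(seed)
    weights = zipf_weights(n_videos)
    t, requests = 0.0, []
    while t < duration:
        t += rng.expovariate(rate)  # Poisson process inter-arrival gaps
        video = rng.choices(range(n_videos), weights=weights)[0]
        requests.append((t, video))  # (arrival time, video rank)
    return requests
```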

Results

• Sweep and S&P are tested for throughput with:
  - disk cache sizes (d): 2, 4, 8, 12 MB
  - round lengths (r): 0.5, 1, 1.5, 2, 3, 4 sec

• Throughput improves 20 to 70% for r = 0.5 to 1.5 sec and d = 2 to 12 MB.

• Longer round lengths explode memory requirements.

• GPMP outperforms Sweep, achieving higher throughput but poorer start-up latency.

• The evaluation of a single-disk media server uses r = 0.25 to 1 sec.

• A low request arrival rate permits a low value of k, and hence lower start-up latencies.

• As the request arrival rate increases, queuing delays occur and GPMP configurations become more beneficial for higher throughput.

Conclusion

• The techniques presented deliver higher (60 to 70%) throughput compared to Sweep strategies for continuous media blocks.

• Sweep does not exploit onboard buffers.

• Disk manufacturers offer no prefetching techniques for media retrieval that account for the concept of rounds.

• The techniques allow I/O-request transfers to proceed in parallel with other disk-to-buffer transfers.

Expected future use:

• Current technology trends suggest that these techniques will show even better results on future disk products, because transfer rates will improve and more powerful controllers with bigger embedded caches are certain to follow.

References:

• Stavros Harizopoulos (Carnegie Mellon University), Costas Harizakis, and Peter Triantafillou (Technical University of Crete), "Hierarchical Caching and Prefetching for Continuous Media Servers with Smart Disks."

• http://en.wikipedia.org

• http://www.howstuffworks.com
