Continuous sampling from distributed database

Page 1: Continuous sampling from distributed database

Continuous Sampling from Distributed Database

Marzia Ahmed Jui (090232)

Nazmul Hossain Bilash (090236)

11th November, 2012

Page 2: Continuous sampling from distributed database

Overview

Distributed Database
Sampling
Continuous Distributed Streaming
Sliding Window
Problem Definition
Binary Bernoulli Sampling
Threshold Protocol
Steps for Sampling with Replacement
Conclusion

Page 3: Continuous sampling from distributed database

Distributed Database

A collection of logically interrelated databases distributed over a computer network.

Page 4: Continuous sampling from distributed database

Sampling

A statistical method for obtaining representative data or observations from a group.

Page 5: Continuous sampling from distributed database

Sampling (continued)

Random Sampling
Distinct Sampling
Concise Sampling
Counting Sampling
Priority Sampling

Page 6: Continuous sampling from distributed database

Continuous Distributed Streaming

Continuously track some function at a designated coordinator.

Page 7: Continuous sampling from distributed database

Sliding Window

Sequence-based windows.
Time-based windows.
Time-based windows are more difficult to handle.
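The two window types can be sketched as follows. This is a minimal, single-machine illustration, not something taken from the slides: the class names `SequenceWindow` and `TimeWindow` and their methods are hypothetical. It shows one reason time-based windows are harder to handle: the number of live elements depends on the arrival rate rather than on a fixed count.

```python
from collections import deque


class SequenceWindow:
    """Keeps the most recent `size` elements (sequence-based window)."""

    def __init__(self, size):
        self.items = deque(maxlen=size)

    def add(self, item):
        self.items.append(item)  # the deque drops the oldest item when full

    def contents(self):
        return list(self.items)


class TimeWindow:
    """Keeps the elements that arrived in the last `span` time units."""

    def __init__(self, span):
        self.span = span
        self.items = deque()  # (timestamp, item) pairs in arrival order

    def add(self, timestamp, item):
        self.items.append((timestamp, item))

    def contents(self, now):
        # Expire everything that arrived at or before now - span. How many
        # items remain depends on the arrival rate, not on a fixed count.
        while self.items and self.items[0][0] <= now - self.span:
            self.items.popleft()
        return [item for _, item in self.items]
```

A sequence-based window always holds a known number of elements once it is full, whereas a time-based window can hold anything from zero elements to the whole stream.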

Page 8: Continuous sampling from distributed database

Problem Definition

Primary concern: total communication cost between the coordinator and the sites.
No direct communication is allowed between the sites themselves.
Reservoir sampling problem.
The current total number of elements is unknown.
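For reference, the classic single-stream form of reservoir sampling can be sketched as below. It keeps a uniform sample of `s` elements without knowing the stream length in advance. This is the standard centralized algorithm, shown only as background. It is not the distributed protocol from the slides, and the function name `reservoir_sample` is an assumption.

```python
import random


def reservoir_sample(stream, s, rng=random):
    """Return a uniform sample of s elements from a stream of unknown length."""
    sample = []
    for n, item in enumerate(stream, start=1):
        if n <= s:
            sample.append(item)  # the first s items fill the reservoir
        else:
            j = rng.randrange(n)  # uniform integer in [0, n)
            if j < s:
                sample[j] = item  # the n-th item is kept with probability s/n
    return sample
```

Run naively in a distributed setting, every arriving element would have to be reported to the coordinator, which is exactly the communication cost the problem definition wants to avoid.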

Page 9: Continuous sampling from distributed database

Binary Bernoulli Sampling

Implements random sampling.
Makes the sampling protocols more convenient.
Does not need to materialize each bit string.
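One way to read the last point is sketched below. The assumption, which is not spelled out on the slide, is that each element conceptually owns a random bit string and is selected in round `j` when its first `j` bits are all zero, which happens with probability 2^-j. The bits can then be drawn lazily, stopping at the first 1, so the string is never stored. The function names and the `max_bits` cap are hypothetical.

```python
import random


def zero_prefix_length(rng=random, max_bits=64):
    """Count the leading zero bits of a conceptual random bit string.

    Bits are drawn one at a time and drawing stops at the first 1, so the
    full string is never materialized.
    """
    length = 0
    while length < max_bits and rng.getrandbits(1) == 0:
        length += 1
    return length


def selected_in_round(prefix_length, j):
    """An element is selected in round j if its first j bits are all zero."""
    return prefix_length >= j
```

Storing only the prefix length is enough to answer the selection question for every later round, because an element selected in round `j + 1` is also selected in round `j`.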

Page 10: Continuous sampling from distributed database

Threshold Protocol

Initiated by a message to begin.
Identifies the moment when new elements have arrived across all sites.

Page 11: Continuous sampling from distributed database

Steps for Sampling with Replacement

Level sampling at sites.
Collection of sample elements.
Production of a sample.
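The three steps can be pictured with a small simulation. This is a rough sketch under several assumptions, not the protocol from the slides: the function names, the `4 * s` threshold, the random interleaving of arrivals, and the message accounting are all hypothetical. For simplicity the last step draws a fixed-size subset of distinct elements from what the coordinator has collected.

```python
import random


def draw_level(rng, max_bits=64):
    """Number of leading zero bits in a lazily generated random bit string."""
    level = 0
    while level < max_bits and rng.getrandbits(1) == 0:
        level += 1
    return level


def distributed_sample(site_streams, s, seed=0):
    """Simulate k sites and one coordinator; return (sample, messages sent)."""
    rng = random.Random(seed)
    round_j = 0      # current round, known to every site via broadcast
    collected = []   # (level, element) pairs held by the coordinator
    messages = 0
    k = len(site_streams)
    iterators = [iter(stream) for stream in site_streams]
    active = list(range(k))
    while active:
        site = rng.choice(active)  # arbitrary interleaving of arrivals
        try:
            element = next(iterators[site])
        except StopIteration:
            active.remove(site)
            continue
        # Step 1: level sampling at the site.
        level = draw_level(rng)
        if level < round_j:
            continue  # dropped locally, no communication needed
        # Step 2: the coordinator collects the forwarded element.
        collected.append((level, element))
        messages += 1
        while len(collected) > 4 * s:  # hypothetical threshold
            round_j += 1
            messages += k  # broadcast the new round to every site
            collected = [(l, e) for l, e in collected if l >= round_j]
    # Step 3: produce a sample of at most s elements.
    pool = [e for _, e in collected]
    return rng.sample(pool, min(s, len(pool))), messages
```

For example, `distributed_sample([range(0, 1000), range(1000, 2000)], s=10)` returns a sample together with the number of messages exchanged, which can be compared against the 2000 messages that forwarding every element would cost.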

Page 12: Continuous sampling from distributed database

Conclusion

Works for any arrival distribution.
Stochastic streaming.

Page 13: Continuous sampling from distributed database

Thank You

Any Questions?