Example: Rumor Performance Evaluation

Andy WangCIS 5930-03

Computer SystemsPerformance Analysis

Motivation• Optimistic peer replication is popular

– Intermittent connectivity– Availability of replicas for concurrent

updates– Convergence and correctness for updates

• Example: Rumor, Coda, Ficus, Lotus Notes, Outlook Calendar, CVS

Background• Replication provides high availability• Optimistic replication allows immediate

access to any replicated item, at the risk of permitting concurrent updates

• Reconciliation process makes replicas consistent (i.e., two replicas for peer-to-peer)

Background Continued• Conflicts occur when different replicas

of the same file are updated subsequent to the previous reconciliation

Optimistic Replication Example

Log on Desktop10:00 Update10:25 Update

Log on Portable10:00 Update10:25 Update

connected

Log on Desktop10:00 Update10:25 Update10:40 Update

Log on Portable10:00 Update10:25 Update10:51 Update

disconnected

Example Continued

Log on Desktop10:00 Update10:25 Update10:40 Update

Log on Portable10:00 Update10:25 Update10:51 Update

disconnected

Log on Desktop10:00 Update10:25 Update10:40 Update10:51 Update

Log on Portable10:00 Update10:25 Update10:40 Update10:51 Update

connected

• Run reconciliation• Detect a conflict• Propagate updates

Goal• Understand the cost characteristics of

the reconciliation process for Rumor

Services• Reconciliation

– Exchange file system states– Detect new and conflicting versions

• If possible, automatically resolve conflicts• Else, prompt user to resolve conflicts

– Propagate updates

Outcomes• Two reconciled replicas become

consistent for all files and directories• Some files remain inconsistent and

require user to resolve conflicts

Metrics• Time

– Elapsed time • From the beginning to the completion of a

reconciliation request– User time (time spent using CPU)– System time (time spent in the kernel)

• Failure rate– Number of incomplete reconciliations and

infinite loops (none observed)

Metrics not Measured• Disk access time

– Require complex instrumentations • E.g., buffering, logging, etc.

• Network and memory resources– Not heavily used

• Correctness– Difficult to evaluate

Monitor Implementation

Spool-to-dump Spool-to-dumpRecon

Scanner Rfindstored Rrecon Server

Perl library

Reconciliation Process

• Top-level Perl time command

Parameters• System parameters

– CPU (speed of local and remote servers)– Disk (bandwidth, fragmentation level)– Network (type, bandwidth, reliability)– Memory (size, caching effects, speed)– Operating system (type, version, VM

management, etc.)

Parameters (Continued)• Workload parameters

– Number of replicas– Number of files and directories– Number of conflicts and updates– Size of volumes (file size)

Workloads• Update characteristics extracted from

Geoff Kuenning’s traces

File accessRead-only

access

Read-write access

Nonshared access Shared access

Read access

Write access

2-way sharing 3+way sharing

Read access

Write access

Read access

Write access

Experimental Settings• Machine model: Dell Latitude XP• CPU: x486 100 MHz• RAM: 36MB• Ethernet: 10Mb• Operating system: Linux 2.0.x• File system: ext3

Experimental Settings• Should have documented the following

as well– CPU: L1 and L2 cache sizes– RAM: Brand and type– Disk: brand, model, capacity, RPM, and

the size of on-disk cache– File system version

Experimental Design• 255 full factorial design • Linear regression or multivariate linear

regression to model major factors• Target: 95% confidence interval

255 Full Factorial Design

• Number of replicas: 2 and 6• Number of files: 10 and 1,000• File size: 100 and 22,000 bytes• Number of directories: 10 and 100• Number of updates: 10 and 450

– Capped at 10 updates for 10 files• Number of conflicts: 0 /* typical */

255 Full Factorial Analysis

• Experiment errors < 3%

0 5 10 15 20 25 30 350

20406080

100120140160

elapsed time

measured timepredicted time

experiment number

time (sec-onds)

0 5 10 15 20 25 30 3505

10152025303540

user time

experiment number

time (sec-onds)

0 5 10 15 20 25 30 350123456

system time

experiment number

time (sec-onds)

Variation of Effects• All major effects

significant at 95% confidence interval

#files#dirs

file size * #files

file size

#updates0

100top 5 effects for elapsed time

% variation

#files

#updates

#files * #updates

file size

file size * #files

020406080

100top 5 effects for system time

% variation

#files

#replicas

#replicas *

#files

#files * #updates

020406080

top 5 effects for user time

% variantion

Residuals vs. Predicted Time

• Clusters caused by dominating effects of files

0 20 40 60 80 100 120 140

-20-15-10

101520

elapsed time

predicted time

residuals

0 5 10 15 20 25 30 35 40

user time

predicted time

residuals0.5 1 1.5 2 2.5 3 3.5 4 4.5 5

-0.5-0.4-0.3-0.2-0.1

00.10.20.30.40.5

system time

predicted time

residuals

Residuals vs. Experiment Numbers

• Residuals show homoscedasticity, almost

0 20 40 60 80 100 120 140 160 180

user time

experiment number

residuals0 20 40 60 80 100 120 140 160 180

-0.5-0.4-0.3-0.2-0.1

00.10.20.30.40.5

system time

experiment number

residuals

0 20 40 60 80 100 120 140 160 180

-20-15-10

101520

elapsed time

experiment number

residuals

Quantile-Quantile Plot• Residuals are

normally distributed, almost

-3 -2 -1 0 1 2 3 4

-20-15-10

101520

f(x) = 5.61253143490396 x + 4.93495530436048E-16R² = 0.97570585239607

elapsed time

normal quantiles

residual quantiles

-3 -2 -1 0 1 2 3 4

f(x) = 0.124183670176851 x − 3.226948188583E-16R² = 0.952366702694788

user time

normal quantiles

residual quantiles-3 -2 -1 0 1 2 3 4

-0.5-0.4-0.3-0.2-0.1

00.10.20.30.40.5

f(x) = 0.112484959649303 x − 5.06606047559798E-18R² = 0.986338838838569

system time

normal quantiles

residual quantiles

Multivariate Regression• Number of replicas: 2• Number of files: 4 levels, 10-600• File size: 22,000 bytes• Number of directories: 4 levels, 10-60• Number of updates: 0• Number of conflicts: 0 /* typical */• Number of repetitions: 5 per data point

Multivariate Regression• Experiment errors <

7%• All coefficients are

significant

0 10 20 30 40 50 60 70 80 9005

10152025303540

user time

experiment number

time (seconds)

0 10 20 30 40 50 60 70 80 900

20406080

100120140

elapsed time

experiment number

time (seconds)

0 10 20 304050 6070 80900

system time

experiment number

time (sec-onds)

• Elapsed time shows a bi-model trend

• User time shows an exponential trend

5 10 15 20 25 30 35

-1-0.8-0.6-0.4-0.2

00.20.40.60.8

user time

predicted time

residuals1 1.2 1.4 1.6 1.8 2 2.2 2.4 2.6 2.8

-0.5-0.4-0.3-0.2-0.1

00.10.20.3

system time

predicted time

residuals

30 40 50 60 70 80 90 100 110 120

elapsed time

predicted time

residuals

• Not so good for elapsed time and user time

0 10 20 30 40 50 60 70 80 90

elapsed time

experiment number

residuals

0 10 20 30 40 50 60 70 80 90

-1-0.8-0.6-0.4-0.2

00.20.40.60.8

user time

experiment number

residuals0 10 20 30 40 50 60 70 80 90

-0.5-0.4-0.3-0.2-0.1

00.10.20.3

system time

experiment number

residuals

Quantile-Quantile Plot• Residuals are not

normally distributed for elapsed time and user time

-3 -2 -1 0 1 2 3

15f(x) = 5.6774814834728 x − 3.74753980933428E-14R² = 0.84068455127645

elapsed time

normal quantiles

residual quantiles

-3 -2 -1 0 1 2 3

-1-0.8-0.6-0.4-0.2

00.20.40.60.8

1f(x) = 0.481071580575666 x − 1.8682654604378E-15R² = 0.924255360680913

user time

normal quantiles

residual quantiles

-3 -2 -1 0 1 2 3

-0.5-0.4-0.3-0.2-0.1

00.10.20.3

f(x) = 0.132069999118134 x − 2.51384352224851E-15R² = 0.978920253463901

system time

normal quantiles

residual quantiles

Log Transform (User Time)

• ANOVA tests failed miserably

0.9 1 1.1 1.2 1.3 1.4 1.5 1.6

-0.06-0.05-0.04-0.03-0.02-0.01

00.010.020.030.04

user time

predicted time

residuals

0 10 20 30 40 50 60 70 80 90

-0.06-0.05-0.04-0.03-0.02-0.01

00.010.020.030.04

user time

experiment number

residuals -3 -2 -1 0 1 2 3

-0.06-0.05-0.04-0.03-0.02-0.01

00.010.020.030.04

f(x) = 0.0222199973685429 x − 1.28549373927752E-15R² = 0.870897001030419

user time

normal quantiles

residual quantiles

Residual Analyses (User Time)

• No indications that transforms can help…

5 10 15 20 25 30 35 400

mean user time

standard deviation of

residuals

5 10 15 20 25 30 35 400

mean user time

variance of residuals

0 200 400 600 800 1000 12000

mean user time squared

standard deviation of

residuals

Possible Explanations• i-node related factors

– Number of files per directory block– Crossing block boundary may cause

anomalies• Caching effects

– Reboot needed across experiments

Linear Regression• Number of files: 100, 150, 200, 250,

252, 253, 300, 350, 400, 450 – Test for the boundary-crossing condition as

the number of files exceeds one block– Note that Rumor has hidden files

• Number of repetitions: 5 per data point• Flush cache (reboot) before each run

Linear Regression• R2 > 80%• All coefficients are

significant

200300

400500

elapsed time

measured timepredicted time95% confidence interval

number of files

time (seconds)

200300

400500

system time

number of files

time (seconds)

0 1002003004005000

user time

number of files

time (seconds)

• Elapsed time shows a bi-model trend

• User time shows an exponential trend

35 40 45 50 55 60 65 70 75 80 85

elapsed time

predicted time

residuals

1.2 1.4 1.6 1.8 2 2.2 2.4

-0.2-0.15

-0.1-0.05

0.10.15

0.20.25

system time

predicted time

residuals

8 10 12 14 16 18 20 22 24 26

-0.4-0.3-0.2-0.1

00.10.20.30.40.50.6

user time

predicted time

residuals

• Elapsed time shows a rising bi-modal trend– Randomization of

experiments may help

0 10 20 30 40 50 60

elapsed time

experiment number

residuals

0 10 20 30 40 50 60

-0.2-0.15

-0.1-0.05

0.10.15

0.20.25

system time

experiment number

residuals

0 10 20 30 40 50 60

-0.4-0.3-0.2-0.1

00.10.20.30.40.50.6

user time

experiment number

residuals

Quantile-Quantile Plot• Error residuals for

elapsed time is not normal – Perhaps piece-wise

normal

-3 -2 -1 0 1 2 3

15f(x) = 5.82178334927256 x + 2.58046606262658E-15R² = 0.87800554257113

elapsed time

normal quantiles

residual quantilas

-3 -2 -1 0 1 2 3

-0.2-0.15

-0.1-0.05

0.10.15

0.20.25

f(x) = 0.0976338391551245 x − 4.46690697919164E-16R² = 0.969293820421059

system time

normal quantiles

residual quantilas

-3 -2 -1 0 1 2 3

-0.4-0.3-0.2-0.1

00.10.20.30.40.50.6

f(x) = 0.213446556701086 x + 1.49533417053058E-15R² = 0.970879846787612

user time

normal quantiles

residual quantilas

Possible Explanations• i-node related factors: No• Caching effects: No• Hidden factors: Maybe• Bugs: Maybe

Conclusion• Identified the number of files as the

dominating factor for Rumor running time

• Observed the existence of an unknown factor in the Rumor performance model

White Slide

Example: Rumor Performance Evaluation

Documents

Buy on Rumor, Sell on News

Player Campaign Missions Rumor Potentially Clarified

El Rumor Del Patio. Doc[1]

A rumor of war

Black Rain Rumor 20130627

REVISTA RUMOR - NOTAS DE PRENSA

Past evaluation example

Rumor Routing Algorithm For sensor Networks

2008 human rna rumor virus review.pdf

Islam & the Rumor of Sects

Example of powerpoint evaluation 2011

Datasheet Rumor LG:Datasheet Rumor LG

Past evaluation example 2

GM's Cookbook - The Rumor Mill #01

15 El Rumor

The rumor in the media communication

Example: Rumor Performance Evaluation Andy Wang CIS 5930 Computer Systems Performance Analysis

Information Acquisition in Rumor‐Based Bank Runsfaculty.chicagobooth.edu/zhiguo.he/research/rumorbankrun.pdf · Information Acquisition in Rumor-Based ... three anonymous referees,

Rumor has it office politics

Myth, Rumor, and History