StarFish: highly-available block storage Eran Gabber Jeff Fellin Michael Flaster Fengrui Gu Bruce...

StarFish: highly-available block storageEran Gabber

Jeff Fellin

Michael Flaster

Fengrui Gu

Bruce Hillyer

Wee Teck Ng

Banu O¨ zden

Elizabeth Shriver

2003 USENIX Annual Technical Conference

Presenter: D00922019 林敬棋

IntroductionImportant data need to be

protected.◦Making replicas.

Replication on remote sites◦Reduce the amount of data lost in

failure.◦Decrease the time required to

recover from catastrophic site failure.

StarFishA highly-available geographically-

dispersed block storage system.◦Does not require expensive

dedicated communication lines to all replicas to achieve highly-available .

◦Achieves good performance even during recovery from a replica failure.

◦Single-owner access semantics.

ArchitectureStarFish consists of

◦One Host Element(HE) Provides storage virtualization and read

cache.

◦N Storage Element(SE) Q: write quorum size. Synchronous updates to a quorum of Q

SEs, and asynchronous updates to the rest.

Recommended Setup

N = 3, Q = 2

MAN : Metropolitan Area NetworkWAN :Wide Area Network

Another Deployment

SE RecoveryWrite log

◦HE keeps a circular buffer of recent writes.

◦Each SE maintains a circular buffer of recent writes on a log disk.

Three types of recovery◦Quick recovery◦Replay recovery◦Full recovery

Availability and ReliabilityAssume that the failure and

recovery processes of the network links and SEs are i.i.d Poisson processes with combined mean failure and recovery rates of λ and μ per second.

Similarly, the HE has Poisson-distributed λhe and μhe .

AvailabilityThe steady-state probability that

at least Q SEs are available.

Derived from the standard machine repairman mode.

)1(),( 0

Machine Repairman Model

Availability(cont.)

X ★ 9 : the number of 9s in an availability measure.

Achieve a much higher availability when N = 2Q + 1.

For fixed N, availability decrease with larger quorum size.◦Increasing quorum size trades off

availability for reliability.

ReliabilityThe probability of no data loss.The reliability increases with

larger Q.Two approaches

◦Make Q > floor(N/2) and at least Q SEs are available. Reduce availability and performance.

◦Read-only consistency

Read-only ConsistencyAvailable in read-only mode

during failure.◦Read-only mode obviates the need

for Q SEs to be available to handle updates.

◦Increase availability

iadOnly

NQA)1)(1(

)1)(1(

headOnly

QANANQA

),1(),(Re

Availability with Read-only Consistency

ObservationsIf ρhe = 0, availability is

independent of Q.◦Can always recover from HE.

If ρhe increase, availability increase with Q.

Largest increase occurs from Q = 1 to Q = 2, and bounded by 3/16 when ρ = 1.◦Diminishing gain after Q = 2.◦Suggest Q = 2 in practical system.

Implementation

Performance MeasurementsCompares with a direct-attached

RAID unit.

SettingsDifferent network delays

◦1, 2, 4, 8, 23, 36, 65 msDifferent bandwidth limitations

◦31, 51, 62, 93, 124 Mb/s.Benchmark:

◦Micro-benchmark Read hit Read miss Write

◦PostMark

Effects of network delays and HE cache size

Near SE delay: 4ms; Far SE delay: 8msNo cache miss if HE cache size = 400

ObservationLarge HE cache improves

performance.◦HE can respond to more read

requests without communicating with SE. Does not change write requests.

◦Especially beneficial when local SE has significant delays.

Q = 2 and 400MB cache size is not influenced by the delay to local SE.◦Depend on near SE.

Normal Operation and placement of the far SE

1-8: 1, 2, 4, 8 ms; 4-12: 4, 8, 12 ms 23-65: 23, 36, 65 ms; 31-124:

31,51,62,93,124 Mbps Local SE delay: 0ms

Normal Operation and placement of the far SE(Cont.)

N = 3 8 threads

Normal Operation and placement of the far SE(Cont.)

ObservationPerformance is influenced mostly

by two parameters◦Write quorum size◦Delay to the SE.

StarFish can provide adequate performance when one of the SEs is placed in a remote location.◦At least 85% of the performance of a

direct-attached RAID.

Recovery

Performance degrades more during full recovery.

ConclusionThe StarFish system reveals

significant benefits from a third copy of the data at an intermediate distance.

A StarFish system with 3 replicas, a write quorum size of 2, and read-only consistency yields better than 99.9999% availability assuming individual Storage Element availability of 99%.

StarFish: highly-available block storage Eran Gabber Jeff Fellin Michael Flaster Fengrui Gu Bruce...

Documents

CLIL Using Songsand Chants as a Pronunciation ToolSongs+&+Chants.pdf... · La canzone del sole -Lucio Battisti: WhenI fellin love

, Puppetry.5’9.5”Aerialist, Harness Flying, Partner Acro ...€¦ · Rudy Hogenmiller D. Rudy Hogenmiller D. Stacey Flaster D. Kevin Bellie D. Michael Rashid D. . Jared Stepp

Order To Revoke Renato Fellin · 2011. 6. 21. · Renato Fellin had until February 26, 2010, after service of the Notice, to request a hearing before the Financial Services Tribunal

Adaptively Weighted Multi-task Deep Network for Person ...homepage.fudan.edu.cn/fengrui/files/2020/03/... · The adaptively weighted multi-task deep convolutional neural net-work

Running head: FREEDOM, GOODNESS, POWER, AND …roar.uel.ac.uk/4723/1/Ugazio Negri Fellin JCP 2015.pdf · Ugazio, as Procter (1981, 1996, 2005), shifted attention onto conversational

CLIL Using Songsand Chants as a Pronunciation ToolSongs+&+Chants.pdf · La canzone del sole -Lucio Battisti: WhenI fellin love

McClintock Girls Soccer Dulce Segura 4 Individual … Kulak 27 9-Megan Wood 24 10-Shep Wilson 22 11-Adreanna Sanchez 21 12-Sarah ... 5-Natalie Peterson 17 Colleen Fellin 17 7-Melissa

Ordinances and Resolutionsc.ymcdn.com/sites/mocities.site-ym.com/resource/resmgr/...Presented June 12, 2014 by Pam Fellin Missouri Municipal League Elected Officials Conference Overview

CHILDREN’S EXPERIENCES OF DOMESTIC VIOLENCE – REALITIES AND FANTASIES OF MIGRATION AND MOVEMENT Jane Callaghan, Joanne Alexander, Lisa Fellin, Judith Sixsmith,

Towards Output-Based Regulation A Regulatory Perspective ... · Towards Output-Based Regulation A Regulatory Perspective based ... Turri e Fellin dell’Università di Padova,

Choosing The Right Legal Entity - Flaster/Greenberg...10% of your pre-tax income, is no small feat. In fact, that’s the best argument in favor of using an LLC. But you knew it couldn’t

exelon-patch-10-transdermal-flaster-acfd kisa ürün bilgisi · varsa (6rn., artan eritem, 6dem, papiiller, vezikiiller) ... EXELON PATCH transdermal flaster igin alerjik kontakt

Society For Economic Botany Newsletter PLANTS PEOPLE Society For Economic Botany Newsletter. ... They are all part of our paths. Regards, Trish Flaster, Editor. Society for Economic

Menu Gigi - Amazon S3Gigi.pdf · domenico modugno tomato sauce, mozzarella, chicken breast, red and olives 20.soo federico fellin tomato sauce mozzarella, artichokes ram, wurstel,

Photophoretic trapping of multiple particles in …photomech.ustc.edu.cn/File/2014 oe lfr.pdfPhotophoretic trapping of multiple particles in tapered-ring optical field Fengrui Liu,

Collection of Theses and Research Testing, Psychometrics ...nectar.northampton.ac.uk/5474/1/Ugazio20085474.pdf · (Ugazio, Fellin, Colciago, Pennacchio, & Negri, 2007, 2008) for which

Tom and Tim Fellin Jesse White travels the ... - Life Goes On · featuring James’ kidney transplant story. Secretary White presents a countertop display featuring Francine Dzialo’s

St. Peter the Apostle University and Community Parish ... · 28/10/2018 · Patricia Deri, Alessia DePasquale, Brian Donoghue, Christine Fellin, Jose Lopez, Sara Mette, Joseph Gerity,

Yao, Fengrui; Liu, Can; Chen, Cheng; Zhang, Shuchen; Zhao ... · Fengrui Yao1, Can Liu1, Cheng Chen1, Shuchen Zhang2, Qiuchen Zhao2, Fajun Xiao3, Muhong Wu1, Jiaming Li1, Peng Gao

Regulation A+ Primer - Flaster/Greenberg · Regulation A has labored in obscurity for more than 50 ... In the Regulation A+ Primer, I hope to provide practical guidance on the new