Repairable Fountain Codes

Megasthenis Asteris, Alexandros G. Dimakis

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 32, NO. 5, MAY 2014

2014/5/22

Outline

• Introduction• Problem Description• Prior Work• Repairable Fountain Codes• A Graph Perspective• Simulations• Conclusion

2014/5/22

Introduction

• Fountain codes [2], [3], [4] form a new family of linear erasure codes with several attractive properties.

• For a given set of k input symbols, produces a potentially limitless stream of output symbols.

• Randomly selected subset of (1+ ɛ)k encoded symbols, a decoder should be able to recover the original k input symbols with high probability (w.h.p.) for some small overhead ɛ.

2014/5/22

[2] J. W. Byers, M. Luby, M. Mitzenmacher, and A. Rege, “A digital fountain approach to reliable distribution of bulk data,” [3] M. Luby, “LT codes,” [4] A. Shokrollahi, “Raptor codes,”

Introduction

• This paper design a new family of Fountain codes that combine multiple properties appealing to distributed storage.– systematic form– efficient repair– locality

2014/5/22

Introduction

• A key observation is that in a systematic linear code, locality is strongly connected to the sparsity of parity symbols [11]– i.e., the maximum number of input symbols

combined in a parity symbol.• A parity symbol along with the systematic

symbols covered by it form a local group.– The smaller the size of the local group, the lower

the locality of the symbols in it.

2014/5/22[11] P. Gopalan, C. Huang, H. Simitci, and S. Yekhanin, “On the locality of codeword symbols,”

Introduction

• The availability of a systematic symbol naturally extends the notion of locality, measuring the number of disjoint local groups the symbol belongs to.– define the availability of a systematic symbol as

the number of disjoint sets of encoded symbols which can be used to reconstruct that particular symbol.

2014/5/22

Outline

2014/5/22

Problem Description

• Given k input symbols, elements of a finite field , we want to encode them into n symbols using a linear code.

• An input vector u ∈ produces a codeword v = uG ∈.– by a k × n generator matrix G.

2014/5/22

Problem Description

• G to have the following properties:– Systematic form– Rateless property– MDS property– Low locality– High Availability

2014/5/22

Problem Description

• For any code, any sufficiently large subset of encoded symbols should allow recovery of the original data.– In the case of MDS codes, an information

theoretically minimum subset of k encoded symbols suffices to decode.

– When equipped with systematic form, the generator matrix of an MDS code affords no zero coefficient in the parity generating columns.

2014/5/22

Problem Description

• This paper require that for ɛ > 0, a set of k’= (1+ ɛ)k randomly selected encoded symbols suffice to decode with high probability.– This property called as near-MDS.

• It is impossible to recover the original message with high probability if the parities are linear combinations of fewer than Ω(log k) input symbols.

2014/5/22

Outline

2014/5/22

Prior Work

• In LT codes, the average degree of the output symbols, i.e., the number of input symbols combined into an output symbol, is O (log k).

• However, that sparsity in this case does not imply good locality, since LT codes lack systematic form.

2014/5/22

Prior Work

• Raptor codes, a different class of Fountain codes.– The core idea is to precode the input symbols

prior to the application of an appropriate LT code.– The k input symbols can be retrieved in linear

time by a set of (1 + ɛ)k encoded symbols• However, the original Raptor design does not

feature the highly desirable systematic form.– does not imply good locality.

2014/5/22

Prior Work

• [4], Shokrollahi provided a construction that yields a systematic flavor of Raptor codes.– Encoding is not applied directly on the input

symbols.– Complexity O().

• The source symbols appear in the encoded stream– But the parity symbols are no longer sparse in the

original input symbols,

2014/5/22

[4] A. Shokrollahi, “Raptor codes,”

Prior Work

• [16], Gummadi proposes systematic variants of LT and Raptor codes that feature low (even constant) expected repair complexity.– However, the overhead ɛ required for decoding is

suboptimal: it cannot be made arbitrarily small.

2014/5/22

[16] R. Gummadi, “Coding and scheduling in networks for erasures and broadcast,”

Outline

2014/5/22

• A new family of Fountain codes that are systematic and also have sparse parities.– Each parity symbol is a random linear combination

of up to d = Ω(logk) randomly chosen input symbols.

– Require that a set of k’= (1+ ɛ)k randomly selected encoded symbols

– with probability of failure vanishing like 1/poly(k).

2014/5/22

• Given a vector u of k input symbols in • The code is a linear mapping of u to a vector v

of higher dimension n through a k × n matrix G.

• A single parity symbol is constructed in a two step process.

2014/5/22

• The parity is the linear combination of the symbols selected in the first step, weighted with the coefficients drawn in the second step.– First, d input symbols are successively selected

uniformly at random, independently, with replacement.

– Then, a coefficient is uniformly drawn from

2014/5/22

U is the set of k input symbols

V is the set of nencoded symbols.

• Returning to the matrix representation• Generator matrices G = [ | P ].• Every encoded symbol corresponds to a

column of G.• P, the part responsible for the construction of

the parity symbols– a random matrix whose columns are sparse, each

bearing at most d(k) nonzero entries.

2014/5/22

• In summary, decreasing d(k) improves the locality of the code– How small d(k) can be to ensure that randomly

selected set of k’= (1+ɛ)k symbols is decodable.

2014/5/22

• From the two theorems, it follows that our codes achieve optimal locality with a logarithmic degree for every parity symbol.– Original data is reconstructed in O() using

Maximum Likelihood (ML) decoding– Wiedemann algorithm can reduce complexity to

O(polylog(k)) on average.

2014/5/22

• Theorem 3. Let rk be the total number of parities generated, with each parity symbol constructed as a linear combination of d(k) = c log(k) independently selected symbols uniformly at random with replacement.

• The expected number of parities covering a systematic symbol u is

2014/5/22

Availability

Outline

2014/5/22

A Graph Perspective

• First, consider a balanced random bipartite graph where |U| = |V | = k and each vertex of V is randomly connected to d(k) = c log k nodes in U.

• A classical result by Erd˝os and Renyi [14] shows that these graphs will have perfect matchings with high probability.

2014/5/22

[14] P. Erd˝os and A. R´enyi, “On random matrices,”

A Graph Perspective• However, the graphs are unbalanced– |U| = k and |V | = k’ = (1+ ɛ)k vertices, for ɛ ≥ 0

2014/5/22

A Graph Perspective

2014/5/22

Outline

2014/5/22

Simulations

• This paper experimentally evaluate the probability that decoding fails when a randomly selected subset of encoded symbols is available at the decoder.– set the rate equal to ½.– the generator matrix comprises the k columns of

the identity matrix and k parity generating columns.

– d(k) = c log(k).

2014/5/22

Simulations

2014/5/22

d(k) = c log(k), with c = 6.

Expected decodingoverhead ɛ = 1− 2Pe.

k = 100,300,500.

Simulations

2014/5/22

d(k) = c log(k), with c = 4.

k = 100,300,500.

Outline

2014/5/22

Conclusion

• This paper introduce a new family of Fountain codes that are systematic and also have parity symbols with logarithmic sparsity.

2014/5/22

Conclusion

• Main technical contribution is a novel random matrix result: a family of random matrices with non independent entries have full rank with high probability.– The analysis builds on the connections of matrix

determinants to flows on random bipartite graphs, using techniques from [14], [15].

2014/5/22

[14] P. Erd˝os and A. R´enyi, “On random matrices,”[15] A. G. Dimakis, V. Prabhakaran, and K. Ramchandran, “Decentralized erasure codes for distributed networked storage,”

Conclusion

• One disadvantage of this construction is higher decoding complexity O().– Wiedemann’s algorithm [13] can be used to

decode in O(polylogk) time.– O(k log k) for LT and O(k) for Raptor but offer no

locality.

2014/5/22

[13] D. Wiedemann, “Solving sparse linear equations over finite fields,”

Repairable Fountain Codes

Documents

Do you know probe lens is repairable ?

Scalable Video Multicast Using Expanding Window Fountain Codes

Raptor Codes - en:group [Algo LMA]€¦ · 2. Fountain Codes 2.1. Deﬁnition 2.2. Some type of fountain codes 2.3. LT-Codes 2.4. Raptor Codes 2.5. Systematic codes 3. Analysis of

PAPER ASurveyonNetworkCodes forDistributedStorageusers.ece.utexas.edu/~dimakis/rc_Ieeeproc.pdf · distributed storage applications (e.g., [3] and [5]). Fountain codes [8] and low-density

Impact of Fountain Codes on GPRS channels

Fountain Codes and implementation

Modern Erasure Codes for Distributed Storage Systems · r Erasure Codes Timeline r Classical Codes - (n, k) code r Modern Codes r Locally Repairable Codes (Codes on Codes) r Regenerating

Inactivation Decoding of LT Codes · Page 0/2 F. Lázaro ⋅Fountain Codes ⋅ January 18, 2015 Digital Fountain • The encoder acts like a fountain L Each water drop is a packet

Repairable area tread gauge 0808 164 9151 - etyresthis handy REPAIRABLE AREA TREAD GAUGE, you can quickly find out if a puncture in your tyre is likely to be repairable. Follow the

On Decoding Fountain Codes with Erroneous Received Symbols · Fountain codes were originally proposed for erasure chan-nels, where an ES is either lost or received without errors

•LDPC (Expander) codes • Tornado codes • Fountain codes ...15853-f19/slides/ecc7.pdf · Recap: Low Density Parity Check (LDPC) Codes n n-k ú ú ú ú ú ú ú ú û ù ê ê

Threshold Phenomena and Fountain Codes Amin Shokrollahi EPFL Parts are joint work with M. Luby, R. Karp, O. Etesami

Reliability Prediction of Complex Repairable Systems: an ...eprints.qut.edu.au/16273/1/Yong_Sun_Thesis.pdf · Reliability Prediction of Complex Repairable ... A MECHANICAL SYSTEM

RDB - Repairable Database Systems

Digital Fountain with Tornado Codes and LT Codes K. C. Yang

Fountain Codes Amin Shokrollahi EPFL and Digital Fountain, Inc

Condition Assessment of Depot Level Repairable (DLR

Binary Linear Locally Repairable Codescodes, cyclic difference-set codes, and 4-cycle free regular low-density parity-check (LDPC) codes. We also show the construction of a long LRC

Threshold Phenomena and Fountain Codes Amin Shokrollahi EPFL Joint work with M. Luby, R. Karp, O. Etesami

On Fountain Codes Under ML decoding · On Fountain Codes Under ML decoding Francisco Lazaro´ ... Page 19/55 F. Lazaro´ Fountain Codes ML decoding of Fountain Codes June 23, 2015