Data Persistence in Sensor Networks: Towards Optimal
Encoding for Data Recovery in Partial Network Failures
Abhinav Kamra, Jon Feldman, Vishal Misra, and Dan Rubenstein (DNA Research Group, Columbia University)
Motivation and Model
Typical scenario of sensor networks:
- Large number of nodes deployed to "sense" the environment
- Data collected periodically, pulled/pushed through a sink/gateway node
- Nodes prone to failure (disaster, battery life, targeted attack)
- Want data to survive individual node failures: "data persistence"
Overview
- Erasure codes
- LT-Codes and the Soliton distribution
- Coding for failure-prone sensor networks
- Major results
- A brief sketch of proofs
- A case study of failure-prone sensor networks
Erasure Codes
[Diagram: a message of n blocks is run through the encoding algorithm to produce cn encoded blocks; after transmission, the decoding algorithm recovers the message from any received subset of at least n encoded blocks.]
Luby Transform Codes
- Simple linear codes
- Improvement over "Tornado codes"
- Rateless codes
Erasure Codes: LT-Codes
F = (b1, b2, b3, b4, b5): n = 5 input blocks
LT-Codes: Encoding
F = (b1, b2, b3, b4, b5)

1. Pick degree d1 from a pre-specified distribution (here d1 = 2).
2. Select d1 input blocks uniformly at random (here b1 and b4).
3. Compute their sum (XOR): c1 = b1 ⊕ b4.
4. Output the sum together with the IDs of the chosen blocks.
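The four steps above can be sketched in Python (a minimal illustration, not the authors' code; blocks are modeled as integers, and the degree distribution is passed as explicit (degree, probability) pairs):

```python
import random

def lt_encode_symbol(blocks, degree_dist, rng=random):
    """Generate one LT-encoded symbol from the input blocks.

    degree_dist: list of (degree, probability) pairs to sample from.
    Returns (ids, value): the chosen block IDs and their XOR.
    """
    degrees, probs = zip(*degree_dist)
    d = rng.choices(degrees, weights=probs, k=1)[0]  # step 1: pick degree d
    ids = rng.sample(range(len(blocks)), d)          # step 2: pick d blocks at random
    value = 0
    for i in ids:                                    # step 3: XOR the chosen blocks
        value ^= blocks[i]
    return ids, value                                # step 4: output sum and block IDs
```

Because the degree and the block IDs are drawn fresh for every symbol, an encoder can emit an unbounded stream of such symbols, which is what makes LT-Codes rateless.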
LT-Codes: Encoding
E(F) = (c1, c2, c3, c4, c5, c6, c7): encoded symbols generated from F = (b1, b2, b3, b4, b5)
LT-Codes: Decoding
F = (b1, b2, b3, b4, b5); received E(F) = (c1, c2, c3, c4, c5, c6, c7)

Decoding proceeds iteratively:
1. Find an encoded symbol of degree 1 (here c4 = b5) and recover its block directly.
2. Subtract (XOR) the recovered block out of every symbol that contains it, reducing their degrees:
   c3 ← c3 − b5, c4 ← c4 − b5, c5 ← c5 − b5
3. Repeat with the newly created degree-1 symbols (next, b2 is recovered), until no degree-1 symbol remains or all of F is decoded.

[Animation frames: b5 is recovered from c4 and peeled out of the remaining symbols, after which b2 becomes recoverable.]
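The peeling process in these frames can be sketched in Python (an illustration under the slide's model, with each received symbol given as a (block-ID set, XOR value) pair; the names are ours, not the paper's):

```python
def lt_decode(symbols):
    """Iterative (peeling) LT decoder.

    symbols: list of (iterable of block IDs, XOR value) pairs.
    Returns {block_id: value} for every block it could recover.
    """
    # Mutable copies, so recovered blocks can be peeled out in place.
    symbols = [[set(ids), val] for ids, val in symbols]
    recovered = {}
    while True:
        # Find any symbol of degree 1 (it directly reveals one block).
        ripple = next((s for s in symbols if len(s[0]) == 1), None)
        if ripple is None:
            break                         # no degree-1 symbol left: stop
        b = next(iter(ripple[0]))
        val = ripple[1]
        recovered[b] = val
        # Subtract (XOR) the recovered block out of every symbol containing it.
        for s in symbols:
            if b in s[0]:
                s[0].discard(b)
                s[1] ^= val
    return recovered
```

Unlike all-or-nothing decoders, this loop simply stops when no degree-1 symbol remains, returning whatever subset of the original blocks it managed to recover, which is exactly the partial-recovery setting this talk is about.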
Degree Distribution for LT-Codes

Soliton distribution:
π(1) = 1/N
π(i) = 1/(i(i−1)) for 1 < i ≤ N

- Average degree H(N) ≈ ln(N)
- In expectation, exactly one degree-1 symbol in each round of decoding
- Distribution very fragile in practice
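The Soliton distribution is easy to construct and check numerically. The sketch below (our illustration) builds it with exact fractions; the telescoping sum 1/N + Σ 1/(i(i−1)) collapses to exactly 1, and the average degree 1/N + H(N−1) grows like ln(N):

```python
from fractions import Fraction

def soliton(N):
    """Ideal Soliton distribution over degrees 1..N:
    pi(1) = 1/N, pi(i) = 1/(i*(i-1)) for 1 < i <= N."""
    pi = {1: Fraction(1, N)}
    for i in range(2, N + 1):
        pi[i] = Fraction(1, i * (i - 1))
    return pi
```

Its fragility comes from having only 1/N mass on degree 1: in expectation exactly one degree-1 symbol per decoding round, so any deviation from expectation can stall the decoder (which is why Luby's robust variant adds extra low-degree mass).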
Failure-prone Sensor Networks

All earlier works ask: how many encoded symbols are needed to recover all original symbols (all-or-nothing decoding)?
Failure-prone networks ask instead: how many original symbols can be recovered from a given set of surviving encoded symbols?
Iterative Decoder
[Diagram: four received symbols built from x1 … x5; iterative decoding recovers x3, x1, and x4.]

- 5 original symbols x1 … x5
- 4 encoded symbols received
- Each encoded symbol is the XOR of its component original symbols
Sensor Network Model
- k encoded symbols remaining after failures
- Want to maximize r, the number of recovered original data symbols
- No idea a priori what k will be

Coding is bad for small k:
- N original symbols, k encoded symbols received
- If k ≤ 0.75N, no coding is required
[Plot: symbols recovered vs. k, N = 128]
Proof Sketch

Theorem: To recover the first N/2 symbols, it is best not to do any encoding.

Proof:
1. Let C(i, j) = expected number of symbols recovered from i degree-1 symbols and j symbols of degree 2 or more.
2. C(i, j) ≤ C(i+1, j−1) whenever C(i, j) ≤ N/2:
   a. Sort the given symbols in decoding order.
   b. All degree-1 symbols are decoded before the other symbols.
   c. By (b), the last symbol in decoding order has degree > 1.
   d. Replace this symbol with a random degree-1 symbol.
   e. The new degree-1 symbol is more likely to be useful.
3. Hence more degree-1 symbols give better output.
4. So no coding is best for recovering any first N/2 symbols.
5. With all degree-1 symbols, the Coupon Collector's problem gives ≈ 3N/4 symbols to recover N/2 distinct symbols.
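The Coupon Collector's step can be checked by simulation (a quick sketch of ours, not from the talk): drawing degree-1 symbols uniformly from N blocks, the expected number of draws needed to see N/2 distinct blocks is N(H(N) − H(N/2)) ≈ N·ln 2 ≈ 0.69N, in line with the ≈ 3N/4 figure above.

```python
import random

def draws_to_collect(N, target, rng=random):
    """Draw coupons uniformly from N kinds until `target` distinct ones are seen;
    return the number of draws taken."""
    seen = set()
    draws = 0
    while len(seen) < target:
        seen.add(rng.randrange(N))  # one uniform degree-1 symbol
        draws += 1
    return draws
```

Averaging many trials for N = 128 gives roughly 0.69N draws to collect N/2 distinct symbols.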
Ideal Degree Distribution
Theorem: To recover r data units with r < jN/(j+1), the optimal degree distribution has symbols of degree j or less only.

- Lower degrees are better for small k: if k ≤ k_j, use symbols of degree up to j
- So use k_j − k_{j−1} degree-j symbols in the close-to-optimal distribution
[Plot: symbols recovered vs. k, N = 128]
Case Study: Single-sink Sensor Network
[Diagram: four sensor nodes (1-4) with local storage and a sink; nodes exchange symbols, and nodes 2 and 3 transfer new symbols to the sink.]
Case Study: Single-sink Sensor Network
- Network prone to failure
- Nodes store unencoded symbols at first, and higher-degree symbols as time goes on
- The sink receives low-degree symbols first and higher-degree symbols later
[Diagram: nodes 1-4 and the sink]
Distributed Simulation: Clique Topology
- N = 128 nodes in a clique topology
- The sink receives one symbol per unit time
Distributed Simulation: Chain Topology
- N = 128 nodes in a chain topology: 1 - 2 - 3 - … - N
Related Work

Bulk data distribution (coding is useful):
- Tornado codes: "Efficient Erasure Correcting Codes" by M. Luby et al., IEEE Transactions on Information Theory, vol. 47, no. 2, 2001
- LT-Codes: "LT Codes" by M. Luby, FOCS 2002

Reliable storage in sensor networks:
- Decentralized erasure codes: "Ubiquitous Access to Distributed Data in Large-Scale Sensor Networks through Decentralized Erasure Codes" by A. Dimakis et al., IPSN 2005
- Random linear coding: "How Good is Random Linear Coding Based Distributed Networked Storage?" by M. Medard et al., NetCod 2005