35
Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science Department University of Tennessee CERN May 30, 2002

Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

  • View
    219

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Scalable Sharing of Network Storage

Micah Beck, Research Assoc. Professor

Director, Logistical Computing & Internetworking (LoCI) Lab

Computer Science DepartmentUniversity of TennesseeCERN May 30, 2002

Page 2: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

A Broad View of Networking

• A network is a facility for sharing a pool of resources within a community.

• Sharing is enabled by– Common goals– Standard protocols– Low allocation overhead– Mutual protection

Page 3: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

IP Routing as Scalable Sharing of Bandwidth

• The Internet is a collection of links connected by routers

• Internet Protocol (IP) datagrams flow from sender to reciever along some path of links

• The choice of paths is made by routers according to topological considerations (tempered by policy)

Page 4: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Wide Area Network Services Are Like the Network Itself

• Intermittently inaccessible

• Vulnerable to partition

• Prone to corruption in transit

• Unpredictable latencies/jitter

• End-to-End: Never require a network service to be bigger, better or more complex than wide area access allows

Page 5: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Why Is It Called “End-to-End”?

• If the end-points requires a service that is stronger than the network can provide, they must implement it on top of the network.

• The fundamental tools are – Aggregation (retransmission, fragmentation)– Timers– Protocol state maintained at the end-points

Page 6: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

How Can Storage Be Shared Scalably?

• Don’t require it to be– Unbounded in size– Unbounded in duration– Perfectly available– Perfectly reliable

• Best effort network storage

• Implement these properties end-to-end

Page 7: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Sharing Storage Enables Advanced Network Apps

• Caching

• Staging (Replication)

• Localization (Rendezvous)

• Distribution (Multicast)

• Spooling (E-Mail)

• Temporary State Management (Checkpointing)

Page 8: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

“Logistical” Networking

• Analogy to military or industrial logistics

• Distributing goods requires not only highways but also warehouses (depots)

• Data transmission is the highway

• Storage servers are the depots

• This is not Storage Networking

• Enabling new users and applications

Page 9: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Dimensions in Communication

Routing (spatial) Storage (temporal)

Logistical Networking

Page 10: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Logistical Networking

• Principle Investigators– Micah Beck

– James S. Plank

• Graduate Students– Erica Fuentes

– Xiang Li

• Funding– DoE SciDAC

– NSF Next Gen SW

– NSF Internet Tech.

• Research Staff– Scott Atchley

– Alex Bassi

– Ying Ding

– Hunter Hagewood

– Sharmila Kancherla

– Jeremy Millar

– Terry Moore

– Stephen Soltesz

– Yong Zheng

Page 11: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Internet Backplane Protocol (IBP)

• Servers that make allocation of primitive “byte arrays” available to clients

• Byte arrays are not blocks (more abstract)– Network capabilities

(primitive security)

– Variable extents

• Byte arrays are not files (weaker semantics)– Size & duration are limited

– “Volatile” allocations

– Best effort reliability and availability

– No directory structure, accounting

– No caching, replication

• An end-to-end approach to network storage

Page 12: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

IBP Software Structure

• IBP Depots (servers) are daemons that serve local storage to IBP clients.

• IBP clients only link to a protocol library.

• Clients talk to depots using TCP/IP.– Alternatives are being investigated

• Design is for high-performance/scalability.

Page 13: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Allocation Attributes

• Permanent vs. Time-Limited

• Volatile vs. Stable

• Read/Write semantics:– Byte Array– Pipe– Circular Queue– Completely Destructive Write

Page 14: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Impressions of Networking for High Energy Physics (?)

• Reliable throughput is the limiting factor

• Modes of sharing can be “modified”

• The use of specialized data movement middleware is acceptable

• The data transfer network has very few nodes and small user communities

• Little patience for fancy architectural ideas

Page 15: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

IBP Enables Heterogeneity

• The tyranny of “TCP-friendly” IP– Datagrams and usage policy must be

homogeneous end-to-end– Non-standard IP policies in the middle can leak

out onto public networks

• Good fences make good neighbors

• IBP can terminate a flow and regenerate it in different network, with localized control

Page 16: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Example:Tsunami

• Uses very aggressive UDP transfer with TCP back-link for control

• Runs using Abilene less-than-best-effort “Scavenger Service”

• Tsunami must not be routed onto a link where Scavenger Service is not supported!

• Solution: do not route the packets, require explicit buffering and retransmission

Page 17: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Scheduling Control Over Sharing Without IP QoS

• Consider a high speed link that operates at 10Gb/s, connected to two types of links: – 100Mb/s and 1Gb/s

• Jobs are 1TB transfers– ~20 hours at 100Mb/s, ~2 hours at 1Gb/s

• Suspending a slow job to allow a fast job through delays it by only 10%

Page 18: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Scheduled Sharing of Network Links Is In Your Future!

• Ignoring buffer control won’t make the need for scheduled sharing go away

• When was more control over time & space ever a bad thing?

• Are we revisiting CERNET/UUCP/BITNET/DECnet?A Short History of Internet Protocols at CERN Ben Segal / CERN PDP-NS April, 1995

• IBP is designed according to end-to-end principles (see SIGCOMM ’02 paper)

Page 19: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

The Alternative to Interoperability In Buffering

• High performance data transfer is an isolated service, with “job scheduling” and “data staging” like at computation centers

• As more high performance links are built, concatenation is based on “gateways” rather than a scalable routing infrastructure

• Should networking solutions be built without the benefit of network architecture?

Page 20: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Application Areas

• Source routing• Bandwidth adaption• Reducing (BWdelay)• Reliable multicast• Content Distribution • Remote access to

structured data

• Managing computation state (NetSolve caching)

• Temporary storage• Very large data sets• Collaborative

computing & visualization

Page 21: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Building on IBP

• Many applications assume file semantics– Unbounded size & duration– High reliability & availability– Caching & replication

• In a layered architecture, these are implemented through aggregation and additional intelligence at the next level

Page 22: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

The Network Storage Stack

Physical

Access (OS)

IBP (InternetBackplane Protocol)

LogisticalRuntime System

Logistical Tools

Applications

L-Bone exNode

Page 23: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

ExNode vs inode

exNode

inode

IBP Allocations

the network

local system

disk blocks

kernel

capabilities

block addresses

user

Page 24: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

ExNode Mobility

XML Serialization

The exNode serialization is a portable soft link

Page 25: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Sameple exNodes Services: Fragmentation, Replication

1 2 3

Page 26: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Building a Logistical Operating Environment

• Resource Discovery– Logistical Backbone registry (LBone: Plank)– Can also use DNS, could use MDS

• Programming Tools– exNode library, primitive commands– policy modules (replication, fragmentation)– cool applications (SC demo: IBPster MP3)

Page 27: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science
Page 28: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science
Page 29: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science
Page 30: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science
Page 31: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science
Page 32: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science
Page 33: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Related Work

• Peer-to-Peer Storage– Napster, Gnutella, etc– OceanStore (UC

Berkeley)

• Storage Area Networking– Fiber Channel,

Ultraband, iSCSI– Network Attached

Storage (CMU)

• Overlay Networking– Caching

– Multicast

– Data Grids

• Multimedia Protocols– Content Delivery

– Streaming

– Video on Demand

Page 34: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Logistical Networking: E2E Storage and Bandwidth

• Complex paths and buffers are exposed

• Assumptions are appropriately weak

• Resource management is passive

• Spatial and temporal control

• Applic. autonomy, network transparency

• Resources in the hands of end users

• Futures: more control, greater scope

Page 35: Scalable Sharing of Network Storage Micah Beck, Research Assoc. Professor Director, Logistical Computing & Internetworking (LoCI) Lab Computer Science

Logistical Computing and Internetworking Laboratoryhttp://loci.cs.utk.edu

• Documents and software downloads