27
Achieving Dependable Bulk Throughput in a Hybrid Network Guy Almes <[email protected]> Aaron Brown <[email protected]> Martin Swany <[email protected]> Joint Techs Meeting Univ Wisconsin -- 17 July 2006

Achieving Dependable Bulk Throughput in a Hybrid Network

  • Upload
    varsha

  • View
    26

  • Download
    0

Embed Size (px)

DESCRIPTION

Achieving Dependable Bulk Throughput in a Hybrid Network. Guy Almes Aaron Brown Martin Swany Joint Techs Meeting Univ Wisconsin -- 17 July 2006. Outline. Observations: on user needs and technical opportunities on TCP dynamics - PowerPoint PPT Presentation

Citation preview

Page 1: Achieving Dependable Bulk Throughput in a Hybrid Network

Achieving Dependable Bulk Throughput in a Hybrid Network

Guy Almes <[email protected]>

Aaron Brown <[email protected]>

Martin Swany <[email protected]>

Joint Techs Meeting

Univ Wisconsin -- 17 July 2006

Page 2: Achieving Dependable Bulk Throughput in a Hybrid Network

OutlineObservations: on user needs and technical opportunities on TCP dynamics

Notion of a Session Layer the obvious application a stronger application

Phoebus as HOPI experiment deployment early performance results

Phoebus as an exemplar hybrid network

Page 3: Achieving Dependable Bulk Throughput in a Hybrid Network

On User NeedsIn a variety of cyberinfrastructure-intensive applications, dependable high-speed wide-area bulk data flows are of critical valueExamples: Terabyte data sets in HPC applications Data-intensive TeraGrid applications Access to Sloan Digital Sky Survey and similar very

large data collections

Also, we stress ‘dependable’ rather than ‘guaranteed’ performanceAs science becomes more data-intensive, these needs will be prevalent in many science disciplines

Page 4: Achieving Dependable Bulk Throughput in a Hybrid Network

On Technology DriversNetwork capacity increases, but user throughput increases more slowly

Source: DOE

The cause of this gap relates to TCP dynamics

Page 5: Achieving Dependable Bulk Throughput in a Hybrid Network

On TCP DynamicsConsider the Mathis Equation for Reno

Focus on bulk data flows over wide areasHow can we attack it? Reduce non-congestive packet loss (a lot!) Raise the MTU (but only helps if end-to-end!) Improve TCP algorithms (e.g., FAST, Bic)

RTT is still a factor

Use end-to-end circuits Decrease RTT??

Speed ≤MTU

RTT * loss

Page 6: Achieving Dependable Bulk Throughput in a Hybrid Network

Situation for running example

Page 7: Achieving Dependable Bulk Throughput in a Hybrid Network

The Transport-Layer GatewayA session is the end-to-end chain of segment-specific transport connections In our early work, each of these transport connections

is a conventional TCP connection Each transport-level gateway (depot) receives data

from one connection and pipes it to the next connection in the chain

Physical

Data Link

Network

TransportSession

Physical

Data Link

Network

TransportSession

Physical

Data Link

Network

Transport

User Space

Page 8: Achieving Dependable Bulk Throughput in a Hybrid Network

The Logistical Session Layer

Page 9: Achieving Dependable Bulk Throughput in a Hybrid Network

Obvious ApplicationPlace a depot half-way between hosts A and B, thus cut the RTT roughly in half

Bad news: only a small factor

Good news: it actually does more€

Speed ≤MTU

max(RTT1,RTT2) * loss

Speed ≤ min(MTU1

RTT1 * loss1

,MTU2

RTT2 * loss2)

Page 10: Achieving Dependable Bulk Throughput in a Hybrid Network

Obvious Application: With one depot to reduce RTT

Page 11: Achieving Dependable Bulk Throughput in a Hybrid Network

Stronger ApplicationPlace one depot at HOPI node near the source, and another near the destination

Observe: Abilene Measurement Infrastructure:

2nd percentile: 950 Mb/s median: 980 Mb/s MTU = 9000 bytes; loss is very low

Local infrastructure: MTU and loss are good, but not always very good but the RTT is very small

But with HOPI we can do even better

Page 12: Achieving Dependable Bulk Throughput in a Hybrid Network

The HOPI ProjectThe Hybrid Optical and Packet Infrastructure Project (hopi.internet2.edu)

Leverage both the 10-Gb/s Abilene backbone and a 10-Gb/s lambda of NLR

Explore combining packet infrastructure with dynamically-provisioned lambdas

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Page 13: Achieving Dependable Bulk Throughput in a Hybrid Network

Stronger application: depots near each host

Backbone:large RTT9000-byte MTUvery low non-congestive loss

GigaPoP / Campus:very small RTTsome 1500-byte MTUsome non-congestive loss

Page 14: Achieving Dependable Bulk Throughput in a Hybrid Network

Two ConjecturesSmall RTT does effectively mask moderate imperfections in MTU and loss

End-to-end session throughput is (only a little less than) the minimum of component connection throughputs

Speed ≤ min(MTU1

RTT1 * loss1

,MTU2

RTT2 * loss2,

MTU3

RTT3 * loss3)

Page 15: Achieving Dependable Bulk Throughput in a Hybrid Network

PhoebusPhoebus aims to narrow the performance gap by bringing revolutionary networks like HOPI to users Phoebus is another name for the mythical

Apollo in his role as the “sun god”

Phoebus stresses the ‘session’ concept to enable multiple network/transport infrastructures to be catenated

Phoebus builds on an earlier project called the Logistical Session Layer (LSL)

Page 16: Achieving Dependable Bulk Throughput in a Hybrid Network

Experimental Phoebus Deployment

Place Phoebus depots at each HOPI node

Ingress/egress spans via ordinary Internet2/ Abilene IP infrastructureBackbone span can use either/both of: 10-Gb/s path through Abilene dynamic 10-Gb/s lambda

Initial test user sites: SDSC host with gigE connectivity Columbia Univ host with gigE connectivity

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Page 17: Achieving Dependable Bulk Throughput in a Hybrid Network
Page 18: Achieving Dependable Bulk Throughput in a Hybrid Network

Initial Performance Results

In very early tests: SDSC to losa: about 900 Mb/s losa to nycm: about 5.1 Gb/s nycm to Columbia: about 900 Mb/s direct: 380 ± 88 Mb/s Phoebus: 762 ± 36 Mb/s

In later tests with a variety of file sizes, SDSC to losa performance became worse

Page 19: Achieving Dependable Bulk Throughput in a Hybrid Network

Initial Performance ResultsBandwidth Comparison

0

100

200

300

400

500

600

32 64 128 256 512 1024 2048 4096

Transfer Size in Megabytes

Megabits/second

Direct Phoebus

Page 20: Achieving Dependable Bulk Throughput in a Hybrid Network

Initial Test Results

What about the three components? SDSC to losa depot: 429-491 Mb/s losa depot to nycm depot: 5.13-5.15 Gb/s nycm depot to Columbia: 908-930 Mb/s

Whatever caused that weakness in the SDSC-to-losa path did slow things down

Page 21: Achieving Dependable Bulk Throughput in a Hybrid Network

Plans for Summer 2006

‘Experimental production’ Phoebus, reaching out to interested users

Improve access control and instrumentation: Maintain a log of achieved performance

Test use of dynamic HOPI lambdas

Evaluate Phoebus as a service within newnet

Test use of Phoebus internationally

Page 22: Achieving Dependable Bulk Throughput in a Hybrid Network

Comments on Backbone SpanBackbone could ensure flow performance between pairs of backbone depots

Backbone could provide a Phoebus Service in addition to its “IP” service

Relatively easy to use dynamic lambdas within the backbone portion of the Phoebus infrastructure

Alternatively, the backbone portion could use IP, but a non-TCP transport protocol!

Page 23: Achieving Dependable Bulk Throughput in a Hybrid Network

Comments on the Local (Ingress and Egress) Spans

Near ends, we have good, but not perfect, local/metro-area infrastructure

Relatively hard to deploy dynamic lambdas

Small RTTs allow high-speed TCP flows to be extended to many local sites in a scalable way

Page 24: Achieving Dependable Bulk Throughput in a Hybrid Network

Thus, Phoebus leverages both: innovative wide-area infrastructure and conventional local-area infrastructure

Phoebus can thus extend the value of multi-lambda wide-area infrastructure to many science users on high-quality conventional campus networks

Page 25: Achieving Dependable Bulk Throughput in a Hybrid Network

Ongoing Work

Phoebus deployment on HOPI We’re seeking project participants! Please email for information

ESP-NP ESP = Extensible Session Protocol Implementation on an IXP Network Processor

from Intel The IXP2800 can forward at 10 Gb/s

Page 26: Achieving Dependable Bulk Throughput in a Hybrid Network

AcknowledgementsUD Students: Aaron Brown, Matt Rein

Internet2: Eric Boyd, Rick Summerhill, Matt Zekauskas, ...

HOPI Testbed Support Center (TSC) Team MCNC, Indiana Univ NOC, Univ Maryland

San Diego Supercomputer Center: Patricia Kovatch, Tony Vu

Columbia University: Alan Crosswell, Megan Pengelly, the Unix group

Dept of Energy Office of Science: MICS Early Career Principal Investigator program

Page 27: Achieving Dependable Bulk Throughput in a Hybrid Network

End

Thank you for your attention

Questions?