14
Arun Viswanathan Subi Krishnamurthy Vishal Zinjuvadia Rajeev Manur --Force10 Networks TraceFlow

TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Embed Size (px)

Citation preview

Page 1: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Arun ViswanathanSubi KrishnamurthyVishal ZinjuvadiaRajeev Manur

--Force10 Networks

TraceFlow

Page 2: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Agenda…

MotivationProblem StatementUse-case scenarioTraceFlow Protocol OperationLegacy (non-compliant devices) supportEncapsulationOpen IssuesQ & A

Page 3: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Motivation…

Today’s networks have evolved into more complex heterogeneous entities

LAG & ECMP are commonly used for redundancy and load-balancingE2E paths are typically an intermix of L2/L3/MPLS hops

OAM tools have not kept pace with the OAM requirements of evolving networks.

Destination address based Ping/Traceroute do not provide extra level of information needed in networks that use LAG/ECMP/flow based forwarding function like ACLs, Policy Based RoutingNeed a mechanism to collect specific data along the flow path tohelp diagnose the network problems better.

Many of our customers (SPs, Datacenters) would like to have a solution to this problem.

Page 4: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Problem Statement…Need to extend ping/traceroute to have,

Ability to trace 3/5 tuple user defined flow to exercise the same ECMP path or component link in a LAG as that taken by corresponding data packets. (An example on the following slide)Ability to selectively collect relevant diagnostic data along the flow path.

Increased security built into the protocol for tracing intra-area and inter-area E2E pathsExtend this ping/trace functionality to L2/MPLS hops along the way

Page 5: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Use-case scenario…

LAG-1

X10.10.10.1/30 10.10.10.2/30

10.11.10.2/30

10.11.10.1/30

ECMP

10.21.10.1/30

10.21.10.2/3010.20.10.1/30

10.20.10.2/30

RT1

RT4

RT3RT2

Data-flow

Current Ping/Traceroute

Current Ping/Traceroute

LAG-2

Soft Failure

Page 6: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

TraceFlow ProtocolIt is a protocol to address the requirements we have discussed so far

Design is extendable to future needsWe have presented this in the OPSAWG Working Group at IETF

Internet Draft URL: http://tools.ietf.org/html/draft-zinjuvadia-traceflow-00 (work in progress)

Page 7: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Protocol Operation -- 1

LAG-1 LAG-2

10.10.10.1/30

10.10.10.2/30

10.11.10.2/30

10.11.10.1/30

ECMP 10.21.10.1/30

10.21.10.2/3010.20.10.1/30

10.20.10.2/30

RT1

RT4

RT3RT2

Data-flow

Current Ping/Traceroute

TFRQ TFRQ

TFRSP

TFRSP

Page 8: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Protocol Operation (Fan-out) – 2

LAG-1 LAG-2

10.10.10.1/3010.10.10.2/30

10.11.10.2/30

10.11.10.1/30

ECMP

10.21.10.1/30

10.21.10.2/3010.20.10.1/30

10.20.10.2/30

RT1

RT4

RT3RT2

Data-flow

Current Ping/Traceroute

TFRQ

TFRSP

TFRSP

TFRQ

TFRSP

Page 9: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Protocol Operation (Messages) -- 3

Request Message – Composed of the following Type-Length-Value tuples

Flow Descriptor (first 256 bytes of flow packet)Flow Information (MTU, fragmentation, etc.)Originator addressTerminationRequested informationAuthentication

Response Message – Composed of the following Type-Length-Value tuples

Flow Descriptor (first 256 bytes of flow packet)Encapsulated Packet MaskRecord-RouteInterface Info TLV(incoming/outgoing)Result TLV

Page 10: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Legacy (non-compliant devices) support…

Non-compliant nodes likely to forward TraceFlow messages along a different path compared to data packets for the same traffic flowPotential solution

Use Probe packets that look like data packets with TTL Scoping (like traceroute) to determine the path and the boundary of the legacy segmentThen use regular TraceFlow & hop over the legacy segment along the direction of the actual traffic flow

IssuesProbe packets are indistinguishable from dataWe run the risk of leaking spoofed probe packets into end-node receiver applicationsProtocol complexity increases

Page 11: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Encapsulation…

We are currently proposing UDPHop-by-Hop termination may be an issue

Other optionsNew IP Protocol?New ICMP message types?

Not sure if all hardware can look into ICMP subtypes for re-directing the packet to the CPU?

Page 12: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Open Issues…EncapsulationLegacy support – Do we need it?

Page 13: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Questions/Feedback?

Page 14: TraceFlow - North American Network Operators' Group Protocol Operation Legacy (non-compliant devices) support Encapsulation Open Issues Q & A Motivation…

Thank You