On a New Internet Traffic Matrix (Completion) Problem
Walter Willinger
AT&T Labs–Research

Local Traffic Matrices

• At an individual router
– Gives traffic volumes (number of bytes per time unit: 5 min, 1 hour, 1 day) between every input port and output port on a router
– Typical routers have a small number of ports, from 16 to at most 256

• Available measurements
– NetFlow-enabled routers provide direct measurements
– Routing data
– No need for inference!
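
Since direct flow measurements are available at the router, a local TM amounts to an aggregation of flow records. A minimal sketch, assuming each record has already been reduced to a hypothetical (input port, output port, bytes) tuple for one time bin; the function name and record layout are illustrative and not part of any NetFlow API:

```python
from collections import defaultdict


def local_traffic_matrix(records):
    """Sum bytes per (input port, output port) pair for one time bin.

    `records` is an iterable of (in_port, out_port, n_bytes) tuples.
    This layout is an assumption made for illustration.
    """
    tm = defaultdict(int)
    for in_port, out_port, n_bytes in records:
        tm[(in_port, out_port)] += n_bytes
    return dict(tm)


# Made-up records for a single 5-minute bin.
records = [(1, 2, 1500), (1, 2, 500), (3, 2, 4000), (1, 4, 250)]
tm = local_traffic_matrix(records)
```

Port pairs that never appear in the records are simply absent from the dictionary, which suits a router where most of the port-to-port pairs carry no traffic in a given bin.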

[Figure: Abilene Router (Washington, D.C.)]

[Figure: Local TM (Washington, D.C., 9/1/06)]

[Figure: Top 6 Local TM Elements (Wash. PoP)]

Intra-Domain Traffic Matrices

• For an individual network
– Gives traffic volumes (number of bytes per time unit: 5 min, 1 hour, 1 day) between every ingress router/PoP and egress router/PoP in a network
– Some of the larger networks can have 1000’s of routers or 100’s of PoPs

• Available measurements
– SNMP data provide indirect measurements (per link)
– Routing data

Intra-Domain TM Inference Problem

• Network-wide availability of SNMP data (link loads)

• Relying only on SNMP data, solve AX = Y (A: routing matrix; X: unknown traffic matrix, written as a vector; Y: link measurements)

• In real networks, this is a massively underconstrained problem

• Active area of research in 2000-2010
– Zhang, Roughan, Duffield, and Greenberg (2003)
– Zhang, Roughan, Lund, and Donoho (2003, 2005)
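
To see why AX = Y is underconstrained, it helps to look at a toy case with fewer links than origin-destination flows. The sketch below uses an invented 2-link, 3-flow routing matrix and NumPy's least-squares solver; it only illustrates the non-uniqueness and is not the inference method of the papers cited above.

```python
import numpy as np

# Toy routing matrix: 2 links (rows), 3 origin-destination flows (columns).
# A[i, j] = 1 if flow j crosses link i. All values are invented.
A = np.array([[1.0, 1.0, 0.0],
              [0.0, 1.0, 1.0]])

x_true = np.array([5.0, 3.0, 2.0])  # the unknown flow volumes
y = A @ x_true                      # the link loads SNMP would report

# Fewer independent equations than unknowns.
rank = np.linalg.matrix_rank(A)
n_unknowns = A.shape[1]

# The minimum-norm least-squares solution reproduces the link loads...
x_min_norm, *_ = np.linalg.lstsq(A, y, rcond=None)

# ...and so does any solution shifted along the null space of A.
null_dir = np.array([1.0, -1.0, 1.0])  # A @ null_dir is the zero vector
x_other = x_min_norm + 2.0 * null_dir
```

Both `x_min_norm` and `x_other` match the measured link loads, yet neither equals `x_true`. Choosing among such candidates requires extra assumptions or extra measurements.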

Intra-Domain TM Inference Problem

• Applications
– Network engineering (capacity planning)
– Traffic engineering (what-if scenarios)
– Anomaly detection
– Enormously useful for daily network operations
– Textbook example of theory impacting practice

• Things changed around 2010 …
– NetFlow-enabled routers are now deployed network-wide and provide direct measurements
– Can measure the intra-domain TM directly!
– Inference approach is no longer needed!

Example: Abilene Network

• High-speed education network

• 28 links

• 10 Gbps capacity on each link

• 11 Points of Presence (PoPs) with NetFlow measurement capabilities

[Figure: Abilene Traffic Matrix (9/1/06)]

[Figure: Top 12 Abilene TM Elements (1 week)]

Intra-Domain TM: Open Problems

• Synthesis of realistic TMs
– Can’t be agnostic about the underlying network!
– What information about the underlying network is needed?

• Network-related root causes for observed properties of measured TMs
– Low-rank, deviations from low-rank
– Sparsity

• Which measurements are more critical than others for my network?
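
The properties named above (low rank, deviations from low rank, sparsity) can be quantified on any measured TM. A minimal sketch, assuming the TM is held as a NumPy array; the helper names and the synthetic matrices are illustrative only and do not reproduce the Abilene data:

```python
import numpy as np


def rank_energy(tm, k):
    """Fraction of the squared Frobenius norm captured by the top-k singular values."""
    s = np.linalg.svd(tm, compute_uv=False)
    return float(np.sum(s[:k] ** 2) / np.sum(s ** 2))


def sparsity(tm, threshold=0.0):
    """Fraction of entries at or below `threshold`."""
    return float(np.mean(tm <= threshold))


rng = np.random.default_rng(0)
n = 11  # one row and column per PoP; the values below are synthetic
ingress = rng.uniform(1.0, 10.0, size=n)
egress = rng.uniform(1.0, 10.0, size=n)

tm_rank1 = np.outer(ingress, egress)                       # exactly rank 1
tm_noisy = tm_rank1 + rng.uniform(0.0, 1.0, size=(n, n))   # deviation from rank 1

e_rank1 = rank_energy(tm_rank1, 1)
e_noisy = rank_energy(tm_noisy, 1)
```

A value of `rank_energy(tm, k)` close to 1 for small `k` indicates that the matrix is well approximated by a rank-`k` matrix; the gap to 1 is one way to measure the deviation from low rank.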

What can Intra-Domain TMs tell us?

• How much of the traffic that enters my network in NYC is destined for ATL (per hour, per day)?

• How much of the daily traffic on my network is coming from (which) CDNs?

• How much of the hourly traffic that enters my network in NYC and is destined to ATL is coming from Netflix?

• How much traffic does my network carry (per hour, per day)?
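
Several of these questions reduce to indexing or summing the matrix. A minimal sketch with made-up numbers, assuming an hourly TM stored as a NumPy array whose rows are ingress PoPs and whose columns are egress PoPs; the PoP list and values are hypothetical:

```python
import numpy as np

pops = ["NYC", "ATL", "CHI"]  # hypothetical PoP labels
idx = {name: i for i, name in enumerate(pops)}

# tm[i, j] = bytes entering at PoP i and leaving at PoP j in one hour
# (made-up values).
tm = np.array([[0, 40, 10],
               [25, 0, 5],
               [15, 20, 0]])

nyc_to_atl = tm[idx["NYC"], idx["ATL"]]  # enters in NYC, destined for ATL
entering_nyc = tm[idx["NYC"], :].sum()   # everything that enters in NYC
total_carried = tm.sum()                 # all traffic carried in this hour
```

The questions about CDNs or Netflix would need the same matrix broken down per traffic source, which this sketch does not model.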

A Different Set of Questions

• How much traffic do Sprint and Verizon exchange with one another (per hour, day)?

• How much traffic does Verizon get from Netflix (per day, month)?

• What are the networks that exchange the most traffic with Google?

• How much does Facebook’s traffic increase on a monthly basis?

• How much traffic does the Internet carry per day?

New Problem: Inter-Domain TM

• The Internet is a “network of networks”
– Individual networks are also called Autonomous Systems (ASes)
– Today’s Internet consists of about 30K-40K actively routed ASes
– We are getting a clearer picture of the AS-level topology (i.e., which networks exchange routing information with one another and hence presumably also IP traffic)

• Inter-domain (or AS-level) traffic matrix
– Gives traffic volumes between ASes
– Completely unknown …

Inter-Domain TM: Highly Structured

• Some numbers …
– In 2010, the Internet carried some 20 EB/month
– In late 2009, AT&T carried some 20 PB/day
– There are some 20 AT&T-like large transit providers in today’s Internet

• Some caveats …
– Large transit providers use multiple networks to run their business (e.g., Verizon has some 230 ASes)
– Need to know how to map ASes to companies
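
Putting these numbers side by side gives a rough sense of scale. The back-of-the-envelope sketch below assumes 30-day months, decimal units (1 EB = 1000 PB), and that each of the 20 providers carries an AT&T-like volume. The figures come from different years, and traffic that crosses several providers is counted more than once, so the result is an order-of-magnitude indication only.

```python
PB_PER_EB = 1000      # decimal units (assumption)
DAYS_PER_MONTH = 30   # assumption

internet_eb_per_month = 20  # from the slide (2010)
att_pb_per_day = 20         # from the slide (late 2009)
n_large_transit = 20        # from the slide

# One AT&T-like provider, converted to EB/month (about 0.6).
att_eb_per_month = att_pb_per_day * DAYS_PER_MONTH / PB_PER_EB

# All large transit providers together (about 12 EB/month).
all_transit_eb_per_month = att_eb_per_month * n_large_transit

# Ratio to the Internet-wide figure; not a true share because of
# double counting across providers.
ratio = all_transit_eb_per_month / internet_eb_per_month
```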

On Inter-Domain TM Completion

• Today’s formulation
– About 1% of the inter-domain TM elements are responsible for a majority of all the traffic
– Inter-domain TM has low rank (does it?)
– (Non)standard TM completion problem

• Towards tomorrow’s formulation
– How to insist on strong validation criteria?
– What sort of new measurements are feasible and can be used to check the validity of a solution to today’s formulation of the inter-domain TM completion problem?
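
For readers unfamiliar with the standard matrix completion setting, the sketch below fills missing entries of a matrix by repeatedly truncating its SVD to a fixed rank while keeping the measured entries fixed. This is a generic textbook-style heuristic run on synthetic rank-1 data, not a method proposed in these slides; the function name and parameters are hypothetical, and it presupposes the low-rank property that the slide itself questions.

```python
import numpy as np


def complete_low_rank(observed, mask, rank=1, n_iter=500):
    """Fill unobserved entries by repeated rank-`rank` SVD truncation.

    observed: 2-D array; values where `mask` is False are ignored.
    mask:     boolean array, True where an entry was measured.
    """
    x = np.where(mask, observed, 0.0)
    for _ in range(n_iter):
        u, s, vt = np.linalg.svd(x, full_matrices=False)
        low_rank = (u[:, :rank] * s[:rank]) @ vt[:rank, :]
        # Keep measured entries fixed; update only the unobserved ones.
        x = np.where(mask, observed, low_rank)
    return x


# Synthetic rank-1 matrix (made-up values) with five entries hidden.
truth = np.outer([3.0, 4.0, 5.0, 4.0, 3.0], [2.0, 3.0, 2.0, 3.0, 2.0])
mask = np.ones_like(truth, dtype=bool)
for i, j in [(0, 1), (1, 3), (2, 0), (3, 4), (4, 2)]:
    mask[i, j] = False

estimate = complete_low_rank(truth, mask, rank=1)
max_err = float(np.max(np.abs(estimate - truth)[~mask]))
```

On real inter-domain data there is no `truth` to compare against, which is why the validation questions above matter.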

Internet eXchange Points (IXPs)

[Figure: IXP diagram; labels: AS1, AS2, AS3, AS4, AS5, Content Provider 1, Content Provider 2, layer-2 switch]

Inter-Domain TM and IXPs

• Some numbers …
– There are some 300 IXPs worldwide that see some 10-20% of all Internet traffic
– They involve some 4K ASes
– Most IXPs publish their hourly/daily total traffic volume
– We are getting more and more accurate peering matrices for these 300 IXPs

• New twist …
– How to infer the local TM at each IXP?
– How to measure the local TM at each IXP?

Back to Inter-Domain TM Completion

• Tomorrow’s formulation
– Start with today’s formulation
  • Accounts for large transit providers
– Incorporate IXP-specific information
  • Accounts for large content providers
– New (non)standard TM completion problem

• … and repeat
– What other sources of new measurements?
– Promising candidates: CDNs (Akamai & co.)
– What types of measurements are more critical than others?

Summary

• Intra-domain TM research
– Beautiful example of innovative research with enormous practical benefits for network operators
– The intra-domain TM of an AS is a basic ingredient for a first-principles approach to understanding the AS’s router-level topology (forget “Network Science” …)
– Reminder that “change changes things”

• Inter-domain TM research
– Enormous practical value
– Adds new twist to generic matrix completion problem
– The inter-domain TM as critical ingredient for a first-principles approach to understanding the Internet’s AS-level topology (TBD)
