34
Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 Acknowledgements and particular thanks to also to: Elisa Jasinska, AMS-IX Chris Malayter, Switch and Data Mo Shivji, LINX Wolfgang Hennerbichler,VIX Robert Wozny, PLIX Sebastian Spies, DECIX Thursday, 21 January 2010

Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Embed Size (px)

Citation preview

Page 1: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Route Server Bake-offAndy Davidson, LONAP

UKNOF15, Rochdale, January 21st 2010

Acknowledgements and particular thanks to also to:Elisa Jasinska, AMS-IX

Chris Malayter, Switch and DataMo Shivji, LINX

Wolfgang Hennerbichler, VIXRobert Wozny, PLIX

Sebastian Spies, DECIX

Thursday, 21 January 2010

Page 2: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Agenda

• What Route Servers Do

• Situation in November 2009

• Functional Tests

• Performance/Scalability Tests

Thursday, 21 January 2010

Page 3: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Agenda

• What Route Servers Do

• Situation in November 2009

• Functional Tests

• Performance/Scalability Tests

Thursday, 21 January 2010

Page 4: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Why Operators Care

• At IXP but not peering - and want to

• At IXP and peering with route-server, concerned about stability

• Operator, looking at internal r-s for scaling

Thursday, 21 January 2010

Page 5: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

What route-servers do

bgp

Thursday, 21 January 2010

Page 6: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

What route-servers do

bgpHERE BE

MAGICtraffic

Thursday, 21 January 2010

Page 7: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

What route-servers do

bgpHERE BE

MAGICtraffic

Thursday, 21 January 2010

Page 8: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

What route-servers do

bgpHERE BE

MAGICtraffic

Thursday, 21 January 2010

Page 9: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

What route-servers do

bgpHERE BE

MAGICtraffic

filter

Thursday, 21 January 2010

Page 10: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

What route-servers do

bgpHERE BE

MAGICtraffic

filter

Thursday, 21 January 2010

Page 11: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

What route-servers do

bgpHERE BEDOOM

traffic

filter

Thursday, 21 January 2010

Page 12: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Agenda

• What Route Servers Do

• Situation in November 2009

• Functional Tests

• Performance/Scalability Tests

Thursday, 21 January 2010

Page 13: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Situation

• IXP Community complaints turned into a significant effort to fix situation.

• But we did not know the new limits of this software, and had not discovered any new bugs.

Thursday, 21 January 2010

Page 14: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

So ....

• December 2009, tests of capacity .......and an awful lot of config/tip swapping.

Thursday, 21 January 2010

Page 15: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

So ....

• December 2009, tests of capacity and bugsand an awful lot of config/tip swapping.

Thursday, 21 January 2010

Page 16: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Agenda

• What Route Servers Do

• Situation in November 2009

• Functional Tests

• Performance/Scalability Tests

Thursday, 21 January 2010

Page 17: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Test bugs marked as fixed

• Several bugs raised in Q3/4 2009 relating to brand new features that IXPs asked for.

• Behaviour on New RIB.

• Erronous IPv6 withdraws

Thursday, 21 January 2010

Page 18: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Filtering

• V4 and V6 inbound Prefix filtering (fierce debate about whether route-servers should actually do this!)

• Max prefix limit

• V4 and V6 outbound Prefix filtering (required if multiple RIBs in use. Vyatta Bug.)

• . . . . all worked

Thursday, 21 January 2010

Page 19: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Transparency TestsCisco Dynamips (7200)91.198.97.13 (AS888)

Announcing 10.2.0.0/24

16bit AS speaker only

OBGP MLPAS6002

BIRD MLPAS6006

Juniper - o291.198.97.16 (AS999999)Announcing 10.0.1.0/24

ASN32 speaker with 32 bit ASN

Juniper - o191.198.97.16 (AS999)

Announcing 10.0.0.0/24with community 999:999 and med 999

ASN32 speaker with 16 bit ASN

Olive/Dynamips allow us to trap packet from wire

with tcpdump.

Thursday, 21 January 2010

Page 20: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

ASN32

Cisco Dynamips (7200)91.198.97.13 (AS888)

Announcing 10.2.0.0/24

16bit AS speaker only

OBGP MLPAS6002

BIRD MLPAS6006

Juniper - o291.198.97.16 (AS999999)Announcing 10.0.1.0/24

ASN32 speaker with 32 bit ASN

Juniper - o191.198.97.16 (AS999)

Announcing 10.0.0.0/24with community 999:999 and med 999

ASN32 speaker with 16 bit ASN

Correct Compatibility ModeAS23456 in AS_PATH. NEW_AS_PATH attribute added.

Correct 32 bit ASN properties(AS4_PATH, adjacent to AS999999)

Thursday, 21 January 2010

Page 21: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

MEDs & General attribute transparency

Cisco Dynamips (7200)91.198.97.13 (AS888)

Announcing 10.2.0.0/24

16bit AS speaker only

OBGP MLPAS6002

BIRD MLPAS6006

Juniper - o291.198.97.16 (AS999999)Announcing 10.0.1.0/24

ASN32 speaker with 32 bit ASN

Juniper - o191.198.97.16 (AS999)

Announcing 10.0.0.0/24with community 999:999 and med 999

ASN32 speaker with 16 bit ASN

Common handling of attributes across RS

families to allow IXPs to run multiple RS & send

equivalent BGP messages.

Communities and MEDs unchanged

Fixed in OpenBGPd Q4 2009

Thursday, 21 January 2010

Page 22: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

IPv6

• Run a current version of all route servers for best v6 support

Thursday, 21 January 2010

Page 23: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

MD5

• Problem with sessions continuing to expect tcp-md5 once configuration is removed

• Not yet explained - possibly OS fault

• Recommend that participants do not use md5 on route-server sessions

Thursday, 21 January 2010

Page 24: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Agenda

• What Route Servers Do

• Situation in November 2009

• Functional Tests

• Performance/Scalability Tests

Thursday, 21 January 2010

Page 25: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Ixia XM12

Sessions originate from1.6.1.2 (AS10002).

The IP address and ASnumber increase by one

lab2

lab3

lab4

lab5

lab6

lab7

1.0.0.1/8

Sessions originate from3.1.1.0 (AS9000).

The IP address and ASnumber increase by one

Force10 E300

3.1.0.1/16Elisa's perl bgp generator

3.2.0.1/16OpenBGPd AS65002

3.6.0.1/16BIRD AS65006

3.7.0.1/16Hoofprints

3.4.0.1/16Quagga AS65004

Vlan 20 - 10.0.0.0/8 Mgmt networkVlan 21 - Public IP addressing

lab1

Thursday, 21 January 2010

Page 26: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Performance

• Compressing a week of tests into general themes, that best demonstrate:

• General behaviour under load

• Capacity (sessions, prefixes...)

Thursday, 21 January 2010

Page 27: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

• Quagga. Strange memory profile and high CPU load with flapping sessions.

• Unstable, unusable with flapping participants.

100 Sessions, 500 Pfx, Flapping at random

Thursday, 21 January 2010

Page 28: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

100 Sessions, 500 Pfx, Flapping at random

• BIRD.

Thursday, 21 January 2010

Page 29: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

100 Sessions, 500 Pfx, Flapping at random

• OpenBGPd. Note - two results because two threads.

Thursday, 21 January 2010

Page 30: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Prefixes

• Limited really by RAM

• More prefixes, more convergence time

• Memory limit is prefixes * RIBs

• OpenBSD 1GB limit per thread (32 Bit)

Thursday, 21 January 2010

Page 31: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Sessions

• BIRD struggles by 1,000 sessions

• OpenBGP starts 1,000 sessions fine, but adding 1,000 RIBs/Prefixes = RAM starved

• Not just describing OS limits, e.g. file desc

• Not a problem for all but the largest IXPs

Thursday, 21 January 2010

Page 32: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Conclusions

• Where’s the doom ?

• Where’s the gloom ?

• Where are the BUGS ?!!1

Thursday, 21 January 2010

Page 33: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Conclusions

• Where’s the doom ?

• Where’s the gloom ?

• Where are the BUGS ?!!

• There are new, more scalable route-server daemons available today.

• The ‘vendors’ did fix our bugs

• ISPs tempted to connect should do so

Thursday, 21 January 2010

Page 34: Route Server Bake-off - uknof.org.uk · Route Server Bake-off Andy Davidson, LONAP UKNOF15, Rochdale, January 21st 2010 ... lab7 1.0.0.1/8 Sessions originate from 3.1.1.0 (AS9000)

Any Questions ?Any Answers ?

Andy Davidson <[email protected]>

Thursday, 21 January 2010