Memory-Efficient and Scalable Virtual Routers Using FPGA



Author: Hoang Le, Thilan Ganegedara and Viktor K. Prasanna
Publisher: ACM/SIGDA FPGA '11
Presenter: Zi-Yang Ou
Date: 2011/10/12

About FPGA’11


Introduction

Network virtualization is a technique for consolidating multiple networking devices onto a single hardware platform, so as to make efficient use of networking resources. It provides an abstraction of the network functionality away from the underlying physical network.

Figure: separated vs. merged virtual routers.

Contributions

- A simple merging algorithm that makes the total required memory sensitive to the total number of virtual prefixes rather than to the number of routing tables.
- A tree-based architecture for IP lookup in virtual routers that achieves high throughput and supports quick updates.
- Use of external SRAMs to support large virtual routing tables of up to 16M virtual prefixes.
- A scalable design with linear storage complexity and resource requirements.

Set-Bounded Leaf-Pushing Algorithm

To use tree search algorithms, the given set of prefixes must first be processed to eliminate the overlap between prefixes. This elimination process results in a set of disjoint prefixes; leaf pushing achieves it in three steps:

1. Build a trie from the given routing table.
2. Grow the trie to a full tree.
3. Push all the prefixes to the leaf nodes.

The prefix tables expand about 1.6x after leaf pushing, and about 90% of the prefixes are already leaf prefixes.
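A minimal software sketch of those three steps, assuming a binary trie over prefix bit-strings (class and function names and the dict-based input are illustrative, not taken from the paper):

```python
from typing import Dict, List, Optional, Tuple

class TrieNode:
    def __init__(self) -> None:
        self.children: List[Optional["TrieNode"]] = [None, None]
        self.nhi: Optional[int] = None   # next-hop info if a prefix ends here

def build_trie(prefixes: Dict[str, int]) -> TrieNode:
    """Step 1: build a binary trie from {prefix bit-string: next-hop info}."""
    root = TrieNode()
    for bits, nhi in prefixes.items():
        node = root
        for b in bits:
            i = int(b)
            if node.children[i] is None:
                node.children[i] = TrieNode()
            node = node.children[i]
        node.nhi = nhi
    return root

def leaf_push(node: TrieNode, inherited: Optional[int] = None) -> None:
    """Steps 2-3: grow the trie to a full tree and push every prefix to the leaves."""
    nhi = node.nhi if node.nhi is not None else inherited
    if node.children == [None, None]:        # leaf: it keeps the pushed prefix
        node.nhi = nhi
        return
    for i in (0, 1):                         # internal node: make it full ...
        if node.children[i] is None:
            node.children[i] = TrieNode()
    node.nhi = None                          # ... and push its prefix downwards
    for child in node.children:
        leaf_push(child, nhi)

def leaf_prefixes(node: TrieNode, bits: str = "") -> List[Tuple[str, Optional[int]]]:
    """Collect the resulting disjoint (leaf) prefixes after leaf_push()."""
    if node.children == [None, None]:
        return [(bits, node.nhi)]
    out: List[Tuple[str, Optional[int]]] = []
    for i, child in enumerate(node.children):
        out += leaf_prefixes(child, bits + str(i))
    return out
```

For example, leaf-pushing the two overlapping prefixes {"0": 1, "01": 2} yields the disjoint set {"00": 1, "01": 2, "1": None}.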

Set-Bounded Leaf-Pushing Algorithm

There are three steps involved in the algorithm:

1. Move the leaves of the trie into Set 1.
2. Trim the leaf-removed trie.
3. Leaf-push the resulting trie and move its leaves into Set 2.

With l the fraction of leaf prefixes (about 0.9) and k the leaf-pushing expansion factor (about 1.6):

N_S1 = lN
N_S2 = kN(1 − l)
N' = N_S1 + N_S2 = lN + kN(1 − l) = N(k + l − kl)

For l = 0.9 and k = 1.6, N' = 1.06N.
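Plugging in the measured values quoted above (about 90% leaf prefixes, about 1.6x expansion), a quick numeric check of the set sizes against plain leaf pushing:

```python
def set_bounded_sizes(n: int, l: float = 0.9, k: float = 1.6):
    """Set 1 keeps the existing leaves; only the remaining (1 - l)*N prefixes
    are leaf-pushed (expanding by k) into Set 2."""
    n_s1 = l * n              # N_S1 = l * N
    n_s2 = k * n * (1 - l)    # N_S2 = k * N * (1 - l)
    return n_s1, n_s2

s1, s2 = set_bounded_sizes(1_000_000)
print(round((s1 + s2) / 1_000_000, 2))   # 1.06, versus 1.6 for plain leaf pushing
```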

2-3 Tree


Merging Algorithm
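The merging slides themselves are figure-only in this transcript. As a rough software analogue (an assumption about the entry layout, not the authors' exact procedure), merging the per-router disjoint prefix sets can be viewed as a k-way merge of (VID, prefix, length, NHI) entries into one ordered sequence, over which a single shared search tree (the paper uses a 2-3 tree) is built:

```python
import heapq
from typing import List, Tuple

# Assumed entry layout for this sketch: (vid, prefix, length, nhi),
# with `prefix` left-aligned in the address width.
Entry = Tuple[int, int, int, int]

def merge_virtual_tables(tables: List[List[Entry]]) -> List[Entry]:
    """k-way merge of the per-router disjoint prefix sets into one list
    ordered by (vid, prefix). Each input list must already be sorted;
    the merged list can then be loaded into a single shared search tree
    (the paper uses a 2-3 tree)."""
    return list(heapq.merge(*tables))   # default tuple order: vid first, then prefix
```

This matches the contribution stated earlier: total memory scales with the total number of virtual prefixes rather than with the number of routing tables.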

IP Lookup Algorithm for Virtual Router

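As a software analogue of the lookup (again an assumption; the paper performs the search in a pipelined 2-3 tree on the FPGA), a destination address tagged with a virtual router ID can be resolved by a predecessor search over the merged, ordered entries, since each router's prefixes are disjoint after leaf pushing:

```python
import bisect
from typing import List, Optional, Tuple

# Same assumed entry layout as the merging sketch: (vid, prefix, length, nhi).
Entry = Tuple[int, int, int, int]

def lookup(merged: List[Entry], vid: int, addr: int, width: int = 32) -> Optional[int]:
    """Return the next-hop info of the unique disjoint prefix of router `vid`
    that covers `addr`, or None. Because each router's prefixes are disjoint,
    at most one entry can match, so a single predecessor search suffices."""
    keys = [(e[0], e[1]) for e in merged]            # (vid, prefix) search keys
    i = bisect.bisect_right(keys, (vid, addr)) - 1   # last key <= (vid, addr)
    if i < 0:
        return None
    evid, eprefix, elen, enhi = merged[i]
    shift = width - elen
    if evid == vid and (addr >> shift) == (eprefix >> shift):
        return enhi
    return None
```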

Memory Requirement


Here |S1| and |S2| are the sizes of Set 1 and Set 2, L the prefix (address) width, L_P the prefix-length field, L_VID the virtual router ID field, L_NHI the next-hop information field, and L_Ptr1, L_Ptr2 the pointer fields for the entries of Set 1 and Set 2.

M1 = |S1|(L + L_P + L_VID + L_NHI + 2L_Ptr1)
M2 = |S2|(L + L_P + L_VID + L_NHI + 2L_Ptr2)
M = M1 + M2 = (|S1| + |S2|)(L + L_P + L_VID + L_NHI) + 2|S1|L_Ptr1 + 2|S2|L_Ptr2

With L_VID = log m for m virtual routers:

M = (|S1| + |S2|)(L + L_P + log m + L_NHI) + |S1|L_Ptr1 + |S2|L_Ptr2

Since |S1| + |S2| = 1.06N, the storage complexity is O(N log m).
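A small helper that evaluates the first form of M for given field widths (parameter names are illustrative):

```python
def memory_bits(s1: int, s2: int, L: int, Lp: int, Lvid: int, Lnhi: int,
                Lptr1: int, Lptr2: int) -> int:
    """Total memory M = M1 + M2 in bits, following
    M_i = |S_i| * (L + L_P + L_VID + L_NHI + 2 * L_Ptr_i)."""
    m1 = s1 * (L + Lp + Lvid + Lnhi + 2 * Lptr1)
    m2 = s2 * (L + Lp + Lvid + Lnhi + 2 * Lptr2)
    return m1 + m2
```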

Overall Architecture


Memory Management


Virtual Routing Table Update


Scalability

The per-level memory requirement grows exponentially as we go from one level of the tree to the next, so most of the memory is concentrated in the last few levels. Therefore, we can move the last few stages onto external SRAMs.
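A rough illustration of why the last levels dominate, assuming a balanced search tree whose per-level node count grows by a constant fanout (a 2-3 tree node has 2 or 3 children):

```python
def level_sizes(total_nodes: int, fanout: float = 2.0) -> list:
    """Approximate per-level node counts of a balanced search tree with the
    given fanout. The last levels hold most of the nodes, so they are the
    natural candidates for external SRAM."""
    levels, n = [], 1
    while sum(levels) + n < total_nodes:
        levels.append(n)
        n = int(n * fanout)
    levels.append(total_nodes - sum(levels))   # whatever remains in the last level
    return levels

sizes = level_sizes(1_000_000)
print(sum(sizes[-2:]) / 1_000_000)   # the last two levels hold roughly 3/4 of the nodes
```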

IMPLEMENTATION

M = (|S1| + |S2|)(L + L_P + L_VID + L_NHI) + |S1|L_Ptr1 + |S2|L_Ptr2
M = N_S(L + L_P + L_VID + L_NHI + L_Ptr)
M_IPv4 = N_S(32 + 5 + 5 + 6 + 20) = 68 N_S
M_IPv6 = N_S(128 + 7 + 5 + 6 + 20) = 164 N_S

A state-of-the-art FPGA device with 36 Mb of on-chip memory (e.g. Xilinx Virtex-6) can support up to 530K IPv4 prefixes, or up to 220K IPv6 prefixes, without using external SRAM.
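A quick check of these capacity figures, using the per-prefix bit counts from this slide and reading "36 Mb" as 36·10^6 bits (an assumption about the unit):

```python
BITS_IPV4 = 68        # 32 + 5 + 5 + 6 + 20 bits per IPv4 virtual prefix
BITS_IPV6 = 164       # per-prefix bits for IPv6, as given on the slide

ON_CHIP_BITS = 36 * 10**6   # "36 Mb" of on-chip memory, read here as 36e6 bits

print(ON_CHIP_BITS // BITS_IPV4)   # ~529K prefixes -> "up to 530K" for IPv4
print(ON_CHIP_BITS // BITS_IPV6)   # ~219K prefixes -> "up to 220K" for IPv6
```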

IMPLEMENTATION

In our design, external SRAMs can be used to handle even larger routing tables, by moving the last stages of the pipelines onto external SRAM. The architecture can then support up to 16M prefixes for IPv4, or up to 880K prefixes for IPv6.

Experimental Setup


Performance Comparison

Candidates: trie-overlapping (A1), trie braiding (A2)

Time complexity:
- Our algorithm: O(N)
- A1: O(N log N)
- A2: O(N^2)

Memory efficiency:
- Our algorithm: 15 MB for 1.3M prefixes
- A1: 9 MB for 290K prefixes
- A2: 4.5 MB for 290K prefixes

Quick-update capability:
- A1 and A2: must reconstruct the entire lookup data structure
