Packet Classification Using Multidimensional Cutting Sumeet Singh (UCSD) Florin Baboescu (UCSD)...

Packet Classification Using Multidimensional Cutting

Sumeet Singh (UCSD)Florin Baboescu (UCSD)George Varghese (UCSD)

Jia Wang (AT&T Labs-Research)

Reviewed byMichela Becchi

Discussion LeaderHaoyu Song

Michela Becchi - 05/06/23

Outline

Introduction

Related works» HiCuts

HyperCuts

Evaluation

Conclusions

Packet Classification Rule-based packets’ handling

» Destination address» Source address» Protocol type» Destination and source port» TCP flags

Rules Destination Source Dest. Port Action

Rule1 * 128.13.34.42 25 Block

Rule2 128.12.120.1 * * Redirect

Applications Security QoS Network address translation Traffic shaping Monitoring …

Challenge Classify packets at packets’ processing speed

Increasing link speed » 14% links between core routers OC-768 (40 Gbps)» 21% links between edge routers OC-192 (10 Gbps)

Memory-time tradeoff

Terminology Classifier: N rules R1,R2,…,RN

Rule Rj: array of k values (fields, dimensions ) Rj[i]: value of the i-th header field of a packet

» Exact match: source address equal to 128.252.169.1 » Prefix match: destination address matches 128.252.*» Range match: destination port in range 0 to 255

actionj: action associated to Rj

E.g. R=(128.252.*,*,TCP,23,*), action=block» Pkt1=(128.252.169.16,128.111.41.101,TCP,23,1025)» Pkt2=(128.252.169.16,128.111.41.101,TCP,79,1025)

Memory-time tradeoff Time-memory tradeoff:

» O((log N)^(k-1)) time and linear space» Log N time and O(N^k) space

SRAM vs. DRAM

Hardware solutions: Ternary CAMs

Algorithmic solutions:» Linear search» EGT-PC» HiCuts

Note: Update complexity not considered for core routers

TCAMs Uses parallelism in hardware

Pros:» Low latency and high throughput» Simple on-chip management scheme

Cons:» Power scaling (parallel comparisons) » Density scaling (more board area)» Time scaling (highest match arbitration)» Rule Multiplication for ranges (prefix format)

=> Suitable for small classifiers

EGT-PCExtended Grid-Of-Tries with Path Compression

Idea: Regardless of database size, any packet matches only a few rules. This is true even when the rules are projected to only source or destination fields

Extend efficient two-field classification algorithm with linear search

Worst case search time ~ HiCuts optmized for speed

Memory requirement ~ HiCuts optmized for space

HiCutsHierarchical Intelligent Cutting

Decision-tree based algorithm

Linear search on leaves

Storage ~ depth of tree

Local optimization decisions at each node to test next dimension to cut» Limit amount of linear search» Limit amount of storage increase

Range checks => cut=hyperplane

HiCuts: an example

Field2

Field4 Field3R9R10R11

R8R9R10R11

R7R10R11

R3R7R10R11

R2R7R10R11

R4R7R10R11

R7R10R11

R0R5R6R10

R7R10R11

Field5R1R7R10R11

R0R5R6R7R10R11

R2R3R4R7R10R11

R0R1R5R6R7R10R11

0..3 4..7 8..11 12..15

Bucket size = 4

(0010,1101,00,01,TCP)

12..15

From HiCuts to Hypercuts

Multiple cuts per node possible» Reduce depth of the tree (memory)» Through array indexing one memory access per node

Hypercube instead of hyperspace

Hypercube

* Slide taken from S. Singh’s presentation

Building Decision Tree (1)Step1: Select dimensions to cut

Goal: Pick dimensions leading to the most uniform distribution of rules

Alternatives:» Largest number of unique elements» # unique elements > mean of unique elements» # unique elements / size of region

Idea: dimensions with highest entropia

Building Decision Tree (2)Step2: Select number of cuts

Goal: Create search tree with minimal memory requirement

Alternative 1:» Minimum number of rules in each child node» Maximum number of children limited by space

factor * sqrt(# rules in current node)

Alternative 2 (Greedy approach):» Determine local optimum nc(i) for each dimension» Determine iteratively best combination

Refinements (1) Node Merging:

nodes with same rules

Rule Overlap: overlapping rules and different priorities

Refinements (2) Region Compaction:

shrink the region of a node depending on its rules

Pushing Common Rule Subset Upwards: » rules to non-leaf nodes.» Bitmap in header to

avoid extra memory accesses

Search Algorithm

* Slide taken from S.Singh’s presentation

Search Algorithm

Evaluation Memory: up to an order of magnitude less than

HiCuts optimized for memory and EGT-PC

Time: 3 to 10 times faster than HiCuts

On ERs: HyperCuts ~ HiCuts (only IP source and destination specified => 2 dimensions)

On FWs: wildcard-rules on IP addresses make HyperCuts ouperform HiCuts

Synthetic databases: memory requirement grows linearly with number of rules (except for FWs – wildcards)

Conclusions Idea of cutting in more than one direction

» Improvement in memory requirement» Still one access per node

Refinements to reduce memory wasting

Evaluation on industrial firewall databases and synthetic databases

Limited depth of the tree: possible hardware implementation using pipelining and on-chip SRAM

Questions?

Evaluation Data (1)

Evaluation Data (2)

Packet Classification Using Multidimensional Cutting Sumeet Singh (UCSD) Florin Baboescu (UCSD)...

Documents

Roy varghese profile2424

The Multi-Armed Bandit Problem - CAE Usershomepages.cae.wisc.edu/~sumeet/files/banditsslides.pdfMotivation Model Algorithms The Multi-Armed Bandit Problem Sumeet Katariya Electrical

Automatic Test Packet Generation · 2013. 4. 3. · Automatic Test Packet Generation James Hongyi Zeng with Peyman Kazemian, George Varghese, Nick McKeown Stanford University, UCSD,

H EADER S PACE A NALYSIS : S TATIC C HECKING F OR N ETWORKS Peyman Kazemian (Stanford University) George Varghese (UCSD, Yahoo Labs) Nick McKeown (Stanford

PREPARED BY: TINTU VARGHESE ICU DEPARTMENT PREPARED BY: TINTU VARGHESE ICU DEPARTMENT

Reconciling Differences: towards a theory of cloud complexity George Varghese UCSD, visiting at Yahoo! Labs 1

Sumeet Rohatgi Elsevier Innovation Live

Algorithms for Network Security George Varghese, UCSD

electron diffraction sumeet

Sumeet vij enterprise_knowledge_graph

scanning electron microscope sumeet

HBM nCode DesignOpt+RobustDesignWebinar Sumeet 1

Network Algorithms, Lecture 1: Intro and Principles · 2011. 3. 28. · Intro and Principles George Varghese, UCSD . Course Goal • Network Algorithms: Algorithms for key bottlenecks

Automated Worm Fingerprinting - USENIX · Automated Worm Fingerprinting Sumeet Singh, Cristian Estan, George Varghese and Stefan Savage Department of Computer Science and Engineering

Automated Worm Fingerprinting - UCF Computer Scienceczou/CAP6133/AutomatedWormFingerprinting.pdf · Automated Worm Fingerprinting Paper By: Sumeet Singh, Cristian Estan, George Varghese

CS 443 Advanced OS Fabián E. Bustamante, Spring 2005 Automated Worm Fingerprinting Sumeet Singh, Cristian Estan, George Varghese and Stefan Savage Presenter:

Presented By: Dr Sumeet Kaur

Blog writing by sumeet narang

Sumeet Kapur - Frugal Innovation

Click to add Text Automated Worm Fingerprinting Sumeet Singh, Cristian Estan, George Varghese and Stefan Savage Department of Computer Science and Engineering