25
Multilevel Generalized Force-directed Multilevel Generalized Force-directed Method for Circuit Placement Method for Circuit Placement Tony Chan Tony Chan 1 , Jason Cong , Jason Cong 2 , , Kenton Sze Kenton Sze 1 1 UCLA Mathematics Department UCLA Mathematics Department 2 UCLA Computer Science Department UCLA Computer Science Department This work is partially supported by SRC, NSF, and ONR.

Multilevel Generalized Force-directed Method for Circuit Placement Tony Chan 1, Jason Cong 2, Kenton Sze 1 1 UCLA Mathematics Department 2 UCLA Computer

Embed Size (px)

Citation preview

Multilevel Generalized Force-directed Multilevel Generalized Force-directed Method for Circuit PlacementMethod for Circuit Placement

Tony ChanTony Chan11, Jason Cong, Jason Cong22,, Kenton Sze Kenton Sze11

11UCLA Mathematics DepartmentUCLA Mathematics Department22UCLA Computer Science DepartmentUCLA Computer Science Department

This work is partially supported by SRC, NSF, and ONR.

UCLA VLSICAD LAB 2

OutlineOutline A Brief History of mPL

Recent Progress in Analytical PlacementRecent Progress in Analytical Placement

Our new contributions and enhancements [mPL5]Our new contributions and enhancements [mPL5] Generalization of force-directed method (GFD)Generalization of force-directed method (GFD)

More accurate approximation of half-perimeter wirelengthMore accurate approximation of half-perimeter wirelength

More accurate computation of cell spreading forcesMore accurate computation of cell spreading forces

Systematic scaling of the cell spreading forcesSystematic scaling of the cell spreading forces

Multilevel implementation of GFDMultilevel implementation of GFD Overview of mPL multilevel frameworkOverview of mPL multilevel framework

mPL5 frameworkmPL5 framework

ConclusionsConclusions

UCLA VLSICAD LAB 3

Relative Wirelength

mPL 1.0 [ICCAD00]• Recursive ESC clustering• NLP at coarsest level• Goto discrete relaxation• Slot Assignment legalization• Domino detailed placement

year2000 2001 2002 2003 2004

A Brief History of mPL

mPL 1.1• FC-Clustering• added partitioning to legalization

mPL 2.0 • RDFL relaxation• primal-dual netlist pruning

mPL 3.0 [ICCAD 03]• QRS relaxation• AMG interpolation• multiple V-cycles• cell-area fragmentation

UNIFORM CELL SIZE

NON-UNIFORM CELL SIZE

mPL 4.0• improved DP• better coarsening • backtracking V-cycle

mPL 5.0• Multilevel Force-Directed

UCLA VLSICAD LAB 4

Recent Progress on Analytical PlacementRecent Progress on Analytical Placement Force-directed method [Eisenmann and Johannes 98]Force-directed method [Eisenmann and Johannes 98]

Efficient spreading force computation using a fast Poisson solverEfficient spreading force computation using a fast Poisson solver

Interleave with quadratic placementInterleave with quadratic placement

Limitations:Limitations:• Inaccurate objective functionInaccurate objective function• Require ad hoc tuning of forces for good convergence Require ad hoc tuning of forces for good convergence

Aplace [Kahng and Wang 04] Aplace [Kahng and Wang 04] More accurate approximation to half-perimeter wirelengthMore accurate approximation to half-perimeter wirelength

• Log-sum-exp [Naylor. et al 01]Log-sum-exp [Naylor. et al 01] Solving the non-linear optimization problem in a multilevel frameworkSolving the non-linear optimization problem in a multilevel framework

Limitations:Limitations:• Local smoothing of density functionsLocal smoothing of density functions• Penalty formulation lumps all constraints togetherPenalty formulation lumps all constraints together

UCLA VLSICAD LAB 5

Basic Formulation of Our ApproachBasic Formulation of Our Approach

Minimize the half-perimeter wirelength subject to even Minimize the half-perimeter wirelength subject to even

density constraint:density constraint:

)( min xW

,)( .. cxdts area. coreby divded area cells total where c

UCLA VLSICAD LAB 6

Choices of Wirelength Objective FunctionsChoices of Wirelength Objective Functions

HPWL

Log-Sum-Exp

Quadratic

Lp-norm

UCLA VLSICAD LAB 7

Bin based Density FormulationBin based Density Formulation

Average bin densityAverage bin density

Equality constraintEquality constraint

Average bin density = Average bin density = utilization ratioutilization ratio

However, density function is However, density function is

highly non-smoothhighly non-smooth

1

1

3

2

432

m

n

v6

v5

v4

v3

v2

v1v7

= a13(v7) = fractional area of cell v7 in bin B13

7

1

area)bin /()(k

kijij vaD

UCLA VLSICAD LAB 8

Smoothing Density FunctionSmoothing Density FunctionSmoothing operator:Smoothing operator:

Larger epsilonLarger epsilon

More local smoothingMore local smoothing

Slow convergenceSlow convergence

Smaller epsilonSmaller epsilon

More global smoothingMore global smoothing

Faster convergenceFaster convergence

dI )(

UCLA VLSICAD LAB 9

Smoothed Constrained WL Minimization ProblemSmoothed Constrained WL Minimization Problem

MMinimize smooth objective wirelength subject to smooth inimize smooth objective wirelength subject to smooth

density function:density function:

,)( .. xts)( min xW

.),()( where11cxdx

UCLA VLSICAD LAB 10

Solving Density Constrained WL MinimizationSolving Density Constrained WL Minimization

Using the Uzawa algorithm, we iteratively solveUsing the Uzawa algorithm, we iteratively solve

can be viewed as “generalized force”can be viewed as “generalized force”

Advantages:Advantages: Individual scaling factor at each binIndividual scaling factor at each bin

Systematic updates of these scaling factorsSystematic updates of these scaling factors

No Hessian inversion is requiredNo Hessian inversion is required

)()(1 kkk

W xλx

)()()(1

ijijkijkλλ

UCLA VLSICAD LAB 11

Summary of Generalized Force-directed (GFD) AlgorithmSummary of Generalized Force-directed (GFD) Algorithm

If initial solution not given:If initial solution not given: Use unconstrained quadratic minimizerUse unconstrained quadratic minimizer

Set stopping criterionSet stopping criterion

Iteratively solve:Iteratively solve: Poisson equation to get forcesPoisson equation to get forces

Updating the scaling factor (Lagrange multiplier) for forces based Updating the scaling factor (Lagrange multiplier) for forces based on the smoothed densityon the smoothed density

The nonlinear equation by stabilized fixed point iterationThe nonlinear equation by stabilized fixed point iteration

UCLA VLSICAD LAB 12

Important Ingredients of GFDImportant Ingredients of GFD

Use of accurate objective functionsUse of accurate objective functions

Optimization-based bin-density constraint formulationOptimization-based bin-density constraint formulation

Global smoothing of density functionGlobal smoothing of density function

Use of Uzawa algorithm enables:Use of Uzawa algorithm enables: Systematic bin-level adjustment of force-scaling factors Systematic bin-level adjustment of force-scaling factors

Convergence to a well defined solution via fixed-point iterationConvergence to a well defined solution via fixed-point iteration

Applying multilevel optimization can lead to better runtime Applying multilevel optimization can lead to better runtime

and wirelengthand wirelength

UCLA VLSICAD LAB 13

Overview of mPL multilevel frameworkOverview of mPL multilevel framework

Coarsening:Coarsening: build a hierarchy of problem approximations by build a hierarchy of problem approximations by

First ChoiceFirst Choice clustering clustering

Relaxation:Relaxation: improve the placement at each level by iterative improve the placement at each level by iterative

optimizationoptimization

Interpolation:Interpolation: transfer coarse-level solution to adjacent, finer transfer coarse-level solution to adjacent, finer

level (AMG declustering)level (AMG declustering)

Multilevel Flow:Multilevel Flow: multiple traversals over multiple hierarchies multiple traversals over multiple hierarchies

(V-cycle variations)(V-cycle variations)

UCLA VLSICAD LAB 14

mPL5 FrameworkmPL5 Framework

Level at which GFD is applied

Level 3

Level 2

Level 1

C

C

I

I

C+I

C+I

I

I

C Coasening

I Interpolation

Keep coarsening until # cells less than 500

UCLA VLSICAD LAB 15

Improvement by Our Multilevel FrameworkImprovement by Our Multilevel Framework

Improvement by multilevel GFD over flat GFDImprovement by multilevel GFD over flat GFD

CircuitCircuit % WL improved% WL improved % runtime reduced% runtime reduced

Ibm01Ibm01 13%13% 42%42%

Ibm05Ibm05 37%37% 43%43%

Ibm10Ibm10 23%23% 59%59%

Ibm15Ibm15 20%20% 67%67%

Ibm18Ibm18 31%31% 66%66%

AverageAverage 24.8%24.8% 55.4%55.4%

Experiments carried out on ISPD2004 FastPlace IBM benchmarks.

UCLA VLSICAD LAB 16

Comparison on Standard Cell DesignsComparison on Standard Cell Designs

Capo9.01.09, 2.29

Dragon3.011.03, 12.38

FastPlace1.01.08, 0.18

Fengshui5.01.06, 2.03mPL5

1,1mPL5-fast1.07, 0.30

123456789

10111213

0.98 1 1.02 1.04 1.06 1.08 1.1

Scaled w irelength

Sca

led

ru

nti

me

Experiments carried out on ISPD2004 FastPlace IBM benchmarks.

UCLA VLSICAD LAB 17

Scalability ComparisonScalability Comparison

y = 0.0001x1.2409

(mPL5-fast)

y = 5E-06x1.4995

(FastPlace1.0)

0

200

400

600

800

0 50000 100000 150000 200000

#Cells

Run

tim

e

FastPlace1.0 mPL5-fast

mPL5-fast is slightly more scalable than FastPlace1.0

UCLA VLSICAD LAB 18

Comparison on Mixed-Size Placement BenchmarksComparison on Mixed-Size Placement Benchmarks

mPL51, 1

Capo9.01.23, 0.97

Fengshui5.01.10, 0.62

0

0.2

0.4

0.6

0.8

1

1.2

0.95 1 1.05 1.1 1.15 1.2 1.25

Scaled Wirelength

Sca

led

Run

tim

e

mPL5 has 18% shorter wirelength than Capo 9.0

mPL5 has 9 % shorter wirelength than Fengshui 5.0

Experiments carried out on ICCAD2004 Mixed-size benchmarks.

UCLA VLSICAD LAB 19

Placement Plot of Placers on ICCAD2004 Mixed-size Placement Plot of Placers on ICCAD2004 Mixed-size IBM02IBM02

mPL5

Rel. WL = 1.00

Fengshui 5.0

Rel. WL = 1.11

Capo 9.0

Rel. WL = 1.17

UCLA VLSICAD LAB 20

Placement Plot of Placers on ICCAD2004 Mixed-size Placement Plot of Placers on ICCAD2004 Mixed-size IBM10IBM10

mPL5

Rel. WL = 1.00

Fengshui 5.0

Rel. WL = 1.15

Capo 9.0

Rel. WL = 1.28

UCLA VLSICAD LAB 21

Results on PEKO BenchmarksResults on PEKO Benchmarks

1.001.201.401.601.802.002.202.402.602.80

12506 27220 45639 68685 83709 182980

#Cells

Qualit

y r

atio

Capo9.0 Dragon3.01 Fengshui5.0

FastPlace1.0 mPL5 mPL5-fast

UCLA VLSICAD LAB 22

mPL5 placement on ICCAD2004 Mixed-size IBM02mPL5 placement on ICCAD2004 Mixed-size IBM02

UCLA VLSICAD LAB 23

Conclusions Conclusions

mPL5 is a highly scalable multilevel placer based on bin-mPL5 is a highly scalable multilevel placer based on bin-

density constrained optimization formulationdensity constrained optimization formulation

Provides a mathematically sound foundation for force-Provides a mathematically sound foundation for force-

directed methodsdirected methods

mPL5 produces the best wirelength with competitive mPL5 produces the best wirelength with competitive

runtime on both standard cell and mixed-size designs.runtime on both standard cell and mixed-size designs. 3% to 9% shorter WL on standard cell designs3% to 9% shorter WL on standard cell designs

9% to 18% shorter WL on mixed size designs9% to 18% shorter WL on mixed size designs

compared the best-known academic placerscompared the best-known academic placers

UCLA VLSICAD LAB 24

AcknowledgementAcknowledgement

Financially supported by SRC, NSF, and ONR.Financially supported by SRC, NSF, and ONR.

Thank Min Xie for implementation of detailed placementThank Min Xie for implementation of detailed placement

Thank Joseph Shinnerl and Min Xie for valuable Thank Joseph Shinnerl and Min Xie for valuable

discussionsdiscussions

Thank Chris Chu and Natarajan Viswanathan for providing Thank Chris Chu and Natarajan Viswanathan for providing

ISPD04 FastPlace IBM benchmarks.ISPD04 FastPlace IBM benchmarks.

UCLA VLSICAD LAB 25

End of the PresentationEnd of the Presentation

Thank you!Thank you!