18
Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion Pietro Babighian, Luca Be nini, Alberto Macii, Enric o Macii ISLPED’04

Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

  • Upload
    marty

  • View
    42

  • Download
    7

Embed Size (px)

DESCRIPTION

Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion. Pietro Babighian , Luca Benini , Alberto Macii , Enrico Macii ISLPED’04. Outline. Introduction Previous Work Algorithm Experimental Results Conclusion. Low Vt logic module. Virtual ground. sleep. - PowerPoint PPT Presentation

Citation preview

Page 1: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Post-Layout Leakage Power MinimizationBased on Distributed Sleep Transistor

Insertion

Pietro Babighian, Luca Benini, Alberto Macii, Enrico Macii

ISLPED’04

Page 2: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Outline

Introduction Previous Work Algorithm Experimental Results Conclusion

Page 3: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Introduction

Bellow .13 process leakage dominates power consumption– Leakage power = exp(-q*Vt / K*T)

Leakage reduction methods– Dual Vt partition

– MTCMOS– State assignment

Low Vt logic module

sleep Virtual ground

high Vt

Page 4: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Outline

IntroductionPrevious Work Algorithm Experimental Results Conclusion

Page 5: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

sleep

Previous Works

MTCMOS– Take a non-negligible amount of time to wake up

and re-activate sleep transistor. (long re-activation time)

Virtual ground

Low Vt logic module

Vdd

ONOFF

VDD-Vth 0DischargeRe-activation time

Stand by mode

Active mode

Page 6: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Previous Works

Distributed sleep transistor– Multiple sleep transistors are initiated.– A faster re-activation time

Most techniques presented at the logic and circuit level, and do not take placement information into account.

Cause severe wiring congestion

Page 7: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Outline

Introduction Previous WorkAlgorithm Experimental Results Conclusion

Page 8: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Sleep Transistor Insertion in row-based layout

Low Vt logic module

sleep Virtual ground

high Vt

Vdd

local wiring

Page 9: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Row Compaction & Area Penalty

Row Compaction

Area PenaltyAdd sleep transistor

Page 10: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Gate Clustering

Get Timing & floorplan Information from Layout

Select a sleep transistor

Check all rows?

Yes

No

Row Compaction

Select a cell

Update maximum current available at sleep transistor

Add cell to cluster

Timing violation?No

Yes

sleep Virtual ground

Gate1

Gate2

Gaten

available current at sleep transistor

According to available space

A gate by gate exploration of each row

Page 11: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

How to Select Cell?

sleepVirtual ground

Gate1

Gate2

Gaten

ON Re-activation time

If Arrival time > Re-activation time, zero re-activation delay overhead are paid.

From primary output to primary input

OFF

Vdd

Discharge

Check whether the cell can be power-gated?

2.Current? 3.Timing? RT>RT_OH?1.Leakage Power?

Page 12: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Sleep Transistor Sizing

sleep

Virtual ground

Gate1

Gate2

GateN

CL

Page 13: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Outline

Introduction Previous Work AlgorithmExperimental Results Conclusion

Page 14: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Experimental Results(1/2)

Delay overhead constraint is set to 5% Area overhead constraint is set to 5%

Benchmark

Orig Opt ∆

PL[m

W]

Pdyn[m

W]

Ptot[m

W]

PL[mW]

Pdyn[mW]

Ptot[mW]

PL [%]

Pdyn[%]

Ptot[%]

Block1 0.11 0.29 0.4 0.02 0.32 0.34 78.9 -9.0 15.0

Block2 0.19 0.220.4

10.04 0.24 0.28 80.0 -10.1 31.0

Block3 0.16 0.310.4

70.04 0.33 0.37 74.6 -8.8 21.2

Block4 0.26 0.60.8

60.05 0.63 0.68 82.7 -5.0 18.6

Block5 0.12 0.290.4

10.03 0.32 0.35 78.9 -9.7 12.5

Block6 0.46 0.881.3

40.09 0.98 1.07 83.5 -12.4 20.1

Avg.   79.7 -9.6 18.9

Page 15: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Experimental Results(2/2)

Area Penalty

Benchmark Gates SleepArea_Orig [µm2]

Area_Opt

[µm2]∆[%]

Block1 1852 14 64912 66794 2.9

Block2 1916 14 65210 66710 2.3

Block3 2215 22 65053 66550 2.3

Block4 2267 13 65412 66524 1.7

Block5 2302 26 65918 68159 3.4

Block6 2612 20 70298 71703 2.0

Page 16: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Experimental Results(3/3)

Delay penalty

CellNo Leak Control Leak Control

∆Power[%]

∆Delay[%]PLk[mW] Delay[ps] PLk[mW] Delay[ps]

G1 7.761 132.3 2.717 137.0 65.0 -3.5

G2 7.881 132.2 2.371 135.0 69.0 -2.1

G3 11.278 120.8 1.466 126.1 87.0 -4.3

G4 1.951 161.3 0.547 166.3 71.9 -3.1

G5 1.988 158.3 0.467 165.3 76.5 -4.4

G6 2.161 161.0 0.737 168.3 65.8 -4.5

G7 4.967 130.3 0.745 135.1 85.0 -3.6

G8 5.704 185.0 1.254 189.0 78.0 -2.1

G9 1.136 146.7 0.353 152.5 68.9 -3.9

G10 1.968 240.0 0.446 248.0 77.3 -3.3

G11 4.967 182.6 0.745 188.2 85.1 -3.0

G12 3.054 369.6 0.916 385.7 70.0 -4.3

Page 17: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Outline

Introduction Previous Work Algorithm Experimental ResultsConclusion

Page 18: Post-Layout Leakage Power Minimization Based on Distributed Sleep Transistor Insertion

Conclusion

Sleep Transistor Insertion :– Driven by a layout-aware cost function– Done with tunable performance and area penalty