26
Hassan Mostafa & M. Anis & M. Elmasry University of Waterloo, Ontario, Canada Comparative Analysis of Power Yield Improvement Under Process Variations of Sub-Threshold Flip-Flops

Hassan Mostafa & M. Anis & M. Elmasry University of Waterloo, Ontario, Canada

Embed Size (px)

DESCRIPTION

Comparative Analysis of Power Yield Improvement Under Process Variations of Sub-Threshold Flip-Flops. Hassan Mostafa & M. Anis & M. Elmasry University of Waterloo, Ontario, Canada. Outline. Introduction and Background Motivation and Objectives Simulation Procedure - PowerPoint PPT Presentation

Citation preview

Page 1: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

Hassan Mostafa & M. Anis & M. Elmasry

University of Waterloo, Ontario, Canada

Comparative Analysis of Power Yield Improvement Under Process Variations

of Sub-Threshold Flip-Flops

Page 2: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 2

Outline

Introduction and Background

Motivation and Objectives

Simulation Procedure

Results and Discussions

Conclusion

Introduction and Background

Page 3: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 3

Outline

Introduction and BackgroundVariability

Classification and Design MethodologiesSourcesVariability and YieldVariability and Sub-threshold logic

Motivation and Objectives

Simulation Procedure

Results and Discussions

Conclusion

Introduction and Background

Page 4: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 4

Variability Classification

Die-to-Die (D2D) Affects all devices on the chip in the same way

e.g., all devices on a chip have the same Vt

Within-Die (WID)Variations within a single chip

Affecting devices on the same chip differently

e.g., devices on the same chip have different Vt

Introduction and Background

Die index

D2D

WID

Page 5: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 5

Design Methodologies

Introduction and Background

Nominal

Corner-Based

Statistical

Design Methodology Performance Distribution Cost (e.g. power)

FAIL

FAIL

FAIL

50% Timing Yield

100% Timing Yield

>99.9% Timing Yield

www.nowpublishers.com

Frequency

(Overhead)

Page 6: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 6

T. C. Chen et al., ISSCC06

Process Variations SourcesRandom Dopant Fluctuations (RDF)

As CMOS devices are scaled, number of dopant atoms decreases

The number of dopant atoms has variations around its nominal value resulting in Vt variations

σVt α (WL)-0.5

As transistor area decreases with scaling, σVt increases

Channel Length VariationDifficulty to control the critical dimensions at sub-wavelength lithography

Large variations in the channel length (L)

Vt α exp(-L) due to short channel effects

A small variations in L results in large variations in Vt

Introduction and Background

Variations increase with technology scaling

1980 1990 2000 2020

100nm

1m

10nm

2010

193nm248nm

365nmLithographyWavelength

Generation

Gap

32nm

`

13nm EUV

Sub-wavelength lithography

65nm

90nm

130nm

45nm

180nm

[1]

S. Borkar et al., DAC04

Page 7: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 7

Variability and Yield

Delay

Delay distribution

Target delay

PASS FAIL

Process variations causes the device parameters to have fluctuations around their nominal values

The system parameters such as delay and power have a spread around their nominal values

This result in a percentage of the systems does not meet the required function or the required parameter constraint

Yield loss

Variability results in Yield loss

Introduction and Background

Page 8: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 8

Variability and Sub-threshold logic

Sub-threshold logic is adopted for low power consumption when performance is of secondary importance

The minimum energy occurs at the sub-threshold operating region

Variability increases in sub-threshold circuits due to the exponential relationship between the current and Vt

Variability increases in sub-threshold circuits

Introduction and Background

Super-threshold

Sub-threshold

VDDVt

Minimum Energy

Page 9: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 9

Outline

Introduction and BackgroundMotivation and Objectives

Simulation Procedure

Results and Discussions

Conclusion

Motivation and Objectives

Page 10: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 10

MotivationWith technology scaling:

Variability is getting worse resulting in yield loss (higher cost), especially, in sub-threshold circuits.

Flip-Flops

The continued demand for low power consumption leads to sub-threshold circuit design

There are several Flip-Flops topologies which have different power and performance trade-offs

Motivation and Objectives

Flip-Flops designers need some design guidelines on selecting the best Flip-Flop topology suitable for their application constraints considering power, performance, and yield

Page 11: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 11

Objectives

The power yield improvement of the sub-threshold flip-flops is of paramount importance, especially, in applications with strict power constraints. This power yield improvement results in excess delay and energy overheads.

The questions are:

Which flip-flop topology exhibits the largest overheads ?

Which flip-flop topology exhibits the lowest overheads?

Motivation and Objectives

Page 12: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 12

OutlineIntroduction and BackgroundMotivation and ObjectiveSimulation Procedure

Flip-Flops Selection and Design

Power Yield Improvement Using Gate Sizing

Results and Discussions

Conclusion

Simulation Procedure

Page 13: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 13

1. Master-Slave TopologyGood hold time & Relatively high latency delay

Selected Flip-Flops:Transmission-Gate Master-Slave Flip-Flop (TG-MSFF) : implemented in IBM PowerPC 603 processor and used in standard libraries (AMS) [1]

Modified-Clocked CMOS-Master-Slave Flip-Flop (MC2MOS-MSFF) : used in standard libraries (AMS) [2]

Flip-Flops Selection (3 topologies)

TG-MSFF M-C2MOS-MSFF[1] G. Gerosa et al., JSSC94; [2] V. Stojanovic et al., JSSC99

Simulation Procedure

Page 14: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 14

Flip-Flops Selection (3 topologies)2. Pulsed Flip-Flops

Samples the input data at a short transparency period around the clock edge

High performance

High power consumption & poor hold time

Selected Flip-Flop:Semi-Dynamic Flip-Flop (SD-FF) [1] :

One of the fastest flip-flops structures

The most convenient structure for high performance applications

SD-FF

[1] F. Klass, VLSI Dig. 98

Simulation Procedure

Page 15: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 15

Flip-Flops Selection (3 topologies)

3. Sense-amplifier based flip-flop topology (SA-FF)High performance at moderate power consumption

Represents a compromise between the MS-FF and the Pulsed-FF topologies

Uses differential pair architecture

Selected Flip-Flop:Strong Arm 110 Flip-Flop (SA-FF) [1]:

implemented in the high

performance WD21264 Alpha

processor

A comparative analysis among the selected flip-flops will aid the flip-flops designers to: Choose the best flip-flop topology based on their applications

power, performance, and yield constrains

[1]U. Ko et al.,ISLEPD96

Simulation Procedure

Page 16: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 16

Flip-Flops Design

Optimum Power Delay Product (PDP) DesignAll flip-flops sizes are optimized to achieve minimum EDP

The optimization process is performed by using the CFSQP (C-version Feasible Sequential Quadratic Programming) optimization package

Functional Yield Improvement Using Setup Time MarginMonte Carlo Analysis including D2D and WID variations is conducted using industrial STMicroelectronics 65-nm Technology node.

Due to variability, the setup time of some of the flip-flops samples is violated and these samples malfunction reducing the functional yield

A setup time margin is added to improve the functional yield

> 99.9%

Simulation Procedure

Page 17: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 17

Power Yield Improvement Using Gate SizingMain idea

Find the transistors sizes that shift the power distribution to achieve a power yield ≥ a given yield (Yo)

The power exhibits a log-normal distribution

Statistical gate sizing to shift the power distribution mean

Simulation Procedure

Page 18: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 18

Power Yield Improvement Using Gate SizingAlgorithm

Find the mean and standard deviation of the power distribution of the initially sized sub-threshold flip-flops (PO and σ )

Using PO and σ , find the equivalent normal distribution mean and standard deviation ( μln and σln )

Using μln and σln , find the geometric mean and standard deviation of the log-normal distribution (μg and σg )

Using μg and σg , find the new target power distribution mean (PO ‘) n = 3 for YO = 99.87%

Simulation Procedure

Page 19: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 19

Power Yield Improvement Using Gate SizingAlgorithm Flowchart

Power Constraint (PO) Power Yield Constraint (YO)

Initial gate sizing (obtained for optimum EDP)

Calculate the power mean and variability

(PO and σ)Calculate PO’

Calculate the gate sizing that match Po’ and minimize

the delay and energy overheads

Repeat if the power yield Constraint is not met

Simulation Procedure

Page 20: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 20

OutlineIntroduction and BackgroundMotivation and ObjectiveSimulation ProcedureResults and Discussions

Conclusion

Results and Discussions

Page 21: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 21

The delay, energy, and EDP Overheads Required for Power Yield Improvement

SA-FF exhibits the lowest overheads . Thus, the SA-FF is recommended for sub-threshold operation.

MC2MOS-FF exhibits the highest overheads. Therefore, the MC2MOS-FF is not recommended at all for sub-threshold operation.

TG-MSFF shows lower overheads than that of the SD-FF for power yield improvement.

Page 22: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 22

The delay, energy, and EDP Overheads Required for Power Yield Improvement

Scattered plots:

TG-MSFF M-C2MOS-MSFF

Page 23: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 23

The delay, energy, and EDP Overheads Required for Power Yield Improvement

Scattered plots:

SD-FF SA-FF

Page 24: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 24

Outline

Introduction and BackgroundMotivation and ObjectiveSimulation ProcedureResults and Discussions

Conclusion

Conclusions

Page 25: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 25

Conclusion

A comparative analysis among different sub-threshold flip-flops topologies is performed considering the required delay and energy overheads for power yield improvement .

This work demonstrates that the SA-FF is the recommended candidate for sub-threshold operation in the nanometer regime whereas the MC2MOS-MSFF is not recommended due to its large delay and energy overheads.

Conclusions

Page 26: Hassan Mostafa  & M. Anis  & M. Elmasry University of Waterloo, Ontario, Canada

ISCAS 2010 Slide 26

THANK YOU