20
System Co-Design and Co-Analysis Approach to Implementing the XDR™ Memory System of the Cell Broadband Engine™ Processor Realizing 3.2 Gbps Data Rate per Memory Lane in Low Cost, High Volume Production Wai-Yeung Yip, Scott Best, Wendem Beyene, Ralf Schmitt Rambus Inc Los Altos, CA, USA

System Co-Design and Co-Analysis Approach to … · System Co-Design and Co-Analysis Approach to Implementing the XDR™ Memory System of the Cell Broadband Engine™ Processor Realizing

  • Upload
    lydieu

  • View
    228

  • Download
    0

Embed Size (px)

Citation preview

System Co-Design and Co-Analysis Approach to Implementing the XDR™ Memory System of the Cell Broadband Engine™ Processor

Realizing 3.2 Gbps Data Rate per Memory Lane in Low Cost, High Volume Production

Wai-Yeung Yip, Scott Best, Wendem Beyene, Ralf SchmittRambus IncLos Altos, CA, USA

ASP-DAC 2007 Page 2

Agenda• Motivation• PLAYSTATION®3 (PS3™) Memory System• XIO (XDR™ I/O) Clock Generation• Power Distribution System Design & Analysis• Designing for Low Cost Manufacturing

Tolerances• Summary

* PLAYSTATION and PS3 are trademarks of Sony Computer Entertainment Inc.

ASP-DAC 2007 Page 3

Motivation• The Cell Broadband Engine™ (Cell B.E.),

designed for PS3, must be manufacturable at low cost and in high volume

• High speed interfaces have little margin to give• Bit time of the Cell B.E. XDR memory system is only

312.5 ps• Traditional system design and analysis approach may

not have worked in implementing such a system cost-effectively

* Cell Broadband Engine is a trademark of Sony Computer Entertainment Inc.

ASP-DAC 2007 Page 4

Traditional Design/Analysis Approach• Components may be designed and optimized

independent of each other• Components may be designed by teams across

multiple companies, at different design stages• Silicon may be designed independent of package &

PCB • DRAM may be designed independent of memory

controller physical layer (PHY)

ASP-DAC 2007 Page 5

System Co-Design/Co-Analysis Approach• Different components of the XDR memory

system were designed and analyzed simultaneously• Including XIO, DRAM device, clock chip, packages,

and PCB• System level analysis and optimization performed

taking into consideration interactions between components before the design of any component was finalized

ASP-DAC 2007 Page 6

Merits of Co-Design/Co-Analysis Approach• Allows best trade-off between system cost &

performance• Uncovers potential power integrity and signal

integrity issues at system level early in the design cycle• Allows resolution of such issues with a combination of

architectural, circuit, package, & PCB improvements• Results in solutions with the lowest system cost

ASP-DAC 2007 Page 7

PS3 Memory System Block Diagram

XDR XDR DRAMDRAM

XDR Clock XDR Clock GeneratorGenerator

XDR XDR DRAM DRAM

Cell B.E.Cell B.E.

System Clock

Term

inat

ion

Term

inat

ion

XIO

1XI

O1

Mem

ory

Mem

ory

cont

rolle

rco

ntro

ller

XDR XDR DRAMDRAM

DQ (Data) @3.2GbpsDQ (Data) @3.2Gbps

CFMCFMRQ (Command/Address)RQ (Command/Address)

Serial InterfaceSerial Interface

XDR XDR DRAMDRAM

XDR XDR DRAMDRAMCFMCFM

Serial InterfaceSerial Interface

XIO

XIO

00

Term

inat

ion

Term

inat

ion

XDR XDR DRAM DRAM

XDR XDR DRAM DRAM

XDR XDR DRAM DRAM

Note: XIO = XDR I/O CellTotal Memory BW = 25.6 GB/s

DQ (Data) @3.2GbpsDQ (Data) @3.2Gbps

CTM

RQ (Command/Address)RQ (Command/Address)

ASP-DAC 2007 Page 8

XIO Clock Generation Circuit

MUX

Mixer

Interp

DCC

MUX

Mixer

Interp

DCC

PLL

FF

MUX

Mixer

00h

CTM

receiveclocks <8:0>

Cycle-Count <1:0>

8

Interp

DCC

Div-by-4

MUX

Mixer

Interp

DCC

MUX

Mixer

Interp

DCC

MUX

Mixer

Interp

DCC

MUX

Mixer

Interp

DCC

transmitclocks <8:0>

FF

TX phasecontrol

RX phasecontrol

Per-Byte Block Diagram

ASP-DAC 2007 Page 9

XIO Clock Generation Circuit Features• Both TX (write) and RX (read) mixers for

precision phase control were implemented in the XIO• No phase mixers in the DRAM to keep its cost low

• Each DQ (data) bit has its own independent controls• Data-to-data PCB trace length matching not required

resulting in simpler PCB layout in smaller area

ASP-DAC 2007 Page 10

Power Distribution System Design/Analysis• Power Integrity considered from start of design

process• Power Integrity analyzed at system level

• Every component of the power distribution system included in the model

• Explored different trade-offs to arrive at most cost-effective power distribution system design

ASP-DAC 2007 Page 11

Optimally Accurate Analysis Approach

• Power distribution system consists of multiple components: VRM, PCB, package, chip

• Relative significance of a component in power integrity depends on the noise frequency

• Performed separate power integrity analyses, one for each frequency range

• In each frequency range, simplified models of components whose contribution to power supply noise is relatively small

ASP-DAC 2007 Page 12

Power Integrity Analyses• On-chip IR drop analysis

• Voltage drop due to resistance of on-chip wires• Package & PCB models removed due to relatively low resistance

• High-frequency noise analysis• Voltage noise due to high frequency switching of core & I/O• Package & PCB models simplified to an inductor

• Medium-frequency noise analysis• Voltage noise due to medium frequency switching of core & I/O• On-chip power distribution and decoupling capacitor model

simplified to a lumped RC network

ASP-DAC 2007 Page 13

Power Distribution System Components

Board Package On-Chip

⇐ ZPDN

VRM PCB

LPFVREF

+-

Note:PDN = Power distribution networkVRM = Voltage Regulator Module

ASP-DAC 2007 Page 14

Power Integrity Trade-off Analysis

4-Layer Board

Layer Board

4-Layer PCB

6-Layer PCB

1x On-chip caps2x On-chip caps

Z PD

N[Ω

]10

-210

-1

105 106 107 108 109 1010

Frequency [Hz]

• Resonance peak in medium frequency can be lowered in two different ways• Convert from 4 to 6-layer PCB, or• Double amount of on-chip capacitance

• Choose solution with a lower system cost

ASP-DAC 2007 Page 15

Low Cost Manufacturing Tolerances• Design parameters allowed to vary from their nominal

values within tolerances to minimize manufacturing costs• PCB impedance allowed to vary by ±15%• Package impedance allowed to vary by ±20%

• Such parameter variations result in variations in voltage and timing margins

• Signal integrity analysis to quantify such variations can be computationally expensive

ASP-DAC 2007 Page 16

Statistical Analysis Methods• Statistical signal integrity analysis methods

explored to find an accurate approach with low computational cost

• Taguchi-based method• Uses an orthogonal array to minimize amount of

simulations needed to explore design space• Generates accurate voltage & timing margin

distributions• Provides insight into parameter sensitivities

ASP-DAC 2007 Page 17

Orthogonal Arrays• An orthogonal array consists of columns

which are mutually orthogonal• For any pair of columns, all combinations of

factor levels occur an equal number of times

H

L

H

L

Factor 2

H

H

L

L

Factor 1

LSimulation 4HSimulation 3HSimulation 2LSimulation 1

Factor 3

ASP-DAC 2007 Page 18

Comparison of Statistical Methods

1210Simulation Time (hours)275000# of Simulations

0.4340.456Maximum0.2370.244Minimum

-0.0430.187Skewness-0.583-0.473Kurtosis0.0360.038Std. Deviation0.3340.341Mean

Taguchi BasedMonte CarloEye Height Statistics

• Taguchi-based method yields similar distribution as Monte Carlo at a much lower computational cost

ASP-DAC 2007 Page 19

Factor Effects Plots• Taguchi-based method yields factor effects plots

• Reveal sensitivity of system performance to individual design parameters

• Facilitate design trade-off to optimize performance at low overall system cost

100

110

120

130

140

150

160

L N H L N H L N H L N H

PCB impedance PKG impedance Termination Current drive

Volta

ge m

argi

n (m

V)

ASP-DAC 2007 Page 20

Summary• System Co-Design & Co-Analysis approach applied to

the implementation of the XDR memory system of the Cell B.E. processor• Allowed best trade-off between system cost and performance

• Precision phase control circuits implemented in the XIO to lower cost of DRAM and PCB design

• Optimally accurate approach applied to system-level power integrity analysis at low computational cost

• Taguchi-based statistical approach quantified signal integrity variations resulting from manufacturing tolerances at low computational cost