44
Chip-Package-Board Co-Design / Co-Verification Technology for DDR3 1.6G in Consumer Products Ji Zheng Director, Chip Package System Apache Design Solutions 2011 ASP-DAC Designer’s Forum January 27, 2011 Yokohama, Japan

Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

Chip-Package-Board Co-Design / Co-Verification

Technology for DDR3 1.6G in Consumer Products

Ji Zheng

Director, Chip Package System

Apache Design Solutions

2011 ASP-DAC Designer’s Forum

January 27, 2011 Yokohama, Japan

Page 2: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

Agenda

Challenges in DDR3 (consumer product)

Key aspects of DDR3 C-P-B Co-verification

View points of EDA tool provider

1/17/2011 ©2010 Apache Design Solutions, Confidential

Page 3: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

DDR3 1.6G Challenges

Increased clock frequency and reduced

voltage supply

Tighter timing and noise budget

Cost sensitivity (consumer products)

Shortened time window for sign-off

Debugging failure is more difficult

1/17/2011 ©2010 Apache Design Solutions, Confidential

Page 4: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

Chip-Package-Board View of DDR System

Page 5: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

Ecology of DDR3 Co-Design / Co-Verification

1/17/2011 ©2010 Apache Design Solutions, Confidential

DDR

Subsystem

Sign-off

DDR

Circuit

Design

Package

Design

RDL

DesignIO Ring

Place &

Route

PCB &

decaps

SoC Core

Physical

Design

Page 6: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

Key Aspects of Co-Verification

Handling of multiple banks of DDR interface

for realistic debugging

Early stage co-design / co-verification

IO ring parasitics and decaps

Impact of core switching noise

Accurate channel models from DC to 15GHz

1/17/2011 ©2010 Apache Design Solutions, Confidential

Page 7: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

1/17/2011, 7 (c)2008 Apache Design Solutions

On-Die PG Effect on Jitter

Red – Jitter simulation without on-die PG grid: 202ps

Blue – Jitter simulation with on-die PG grid: 150ps

Page 8: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

C4 bumps

PCB + Package

CHIP

Chip Power Model

Chip on-die Power Grid RLC

Transistor current /Cap/Esr

Open SPICE netlist format

Chip Power Model for C-P-B Analysis

1/17/2011, 8 ©2010 Apache Design Solutions - Company Confidential

Page 9: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

Expanding CPM for DDR - Chip Signal Model

Correlation with full transistor model

Models non-linear IO behavior & PG parasitics

Speed-up for DDR system simulation

Enables C-P-B co-verification of DDR

1/17/2011, 9 (c)2009 Apache Design Solutions

CSM

vsso

vddc

vddo

CSM

vddc vddo

vsso

Page 10: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

Need Accurate Channel Models

1/17/2011 ©2010 Apache Design Solutions, Confidential

Page 11: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

Conclusion

1/17/2011 ©2010 Apache Design Solutions, Confidential

DDR3 C-P-B co-verification needs a platform

that integrates and models all the

components in the DDR system

Early co-analysis is the key to the success

through model exchange among chip,

package and board design teams

New design methodologies suitable for co-

verification are needed to overcome

challenges from DDR3 1.6G

Page 12: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

Global Unichip Corp.ConfidentialSecurity C

Global Unichip Corp.ConfidentialSecurity B

DDR3 1600mbps

Global Unichip CorporationJen Tai HsuSenior Director, IP Division

ASP DAC 2011 Yokohama, Japan

Page 13: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 1

OutlinesDDR memory and trendDDR3 new feature introductionDDR3 high speed PHY designPI/SI analysisWire bond solution1866/2133mbps higher speed of DDR3Conclusion

Page 14: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 2

DDR GenerationDDR3 : transition happening but not complete yetDDR4: 2011 specification finalized

Page 15: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 3

DDR memory comparisonDDR feature comparison

* not finalize yet

Memory DDR2 DDR3 DDR4

Data rate (mbps) 400~1066 800~2133 2133~4266

System assumption 2 DIMM per channel 2 DIMM per channel single DIMM per channel

Vdd/Vddq 1.8v±0.1v 1.5v±0.075v 1.2v/1.1v *

Interface SSTL18 SSTL15

Package TSOP/BGA BGA BGA/TSV (Through-silicon Via)

System buses parallel parallel point to point

Page 16: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 4

DDR4 system

Source: PCWatch 後藤弘茂

Page 17: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 5

TSV in DDR4

Source: Denali MEMCON10

Page 18: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 6

DDR3 new feature (1)Write/ Read leveling

Source: Micron

Page 19: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 7

DDR3 new feature (2) Dynamic ODT (On Die Terminator)

ODT advantageDynamically turn on and off parallel terminatorSave power Extra components cause leakage

Save cost No extra components

ODT can change value on the fly without idle time in DDR3

It is applied in 2 DIMM module system For the non-active device during write, with low-impedance

terminator value For the active device during write, with high-impedance

terminator value

Page 20: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 8

DDR3 new feature (3) Master Reset

Improve system stabilityReduce controller burden to ensure no illegal command

Source: Micron

Page 21: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 9

DDR3 new feature (4) Self calibration

Through a ZQ pin to calibrate the accurate ODT (on die terminator) and OCD (off chip driver)

In DDR2, most of DDR chip use trimming to adjust OCD value

Page 22: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 10

DDR3 high speed PHY design (1)Block diagram in system

I/O CellPads

PhysicalInterface

(PHY)

PHY IP Macro

DDR2/DDR3

SDRAM

MemoryController(Soft IP)

Configurable

DFI

MemoryController IP

IP A

IP A

IP A

AHB/AXI

Page 23: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 11

DDR3 high speed PHY design (2)DDRPHY

IO (so called SSTL-15)PLL/DLL( source synchronous DQS/DQ)Logic (Ser/De-Ser, DFT…)PVT (calibration)

Page 24: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 12

DDR3 high speed PHY design (3)Timing budget

System designer need to break timing budget into individual parts in DDRPHY

tREF clock 1250

unit: ps jitter/skew setup time (ps) hold time (ps)

Transmitter Components

DFF skew

BUFFER skew

MUX skew

local clock tree skew

local clock tree jitter

dll jitter

DQ IO duty skew

DQ IO C2C jitter

SSO push out jitter

Total transmitter uncertainty

Inter-connection

PCB cross talk

ISI

Package and substrate skew

PCB skew

Receiver components (DRAM)

tDS/tDH (1600mpbs) base AC150

tDS/tDH (1600mpbs) delta

tDS/tDH (1600mpbs) total

Total Worst Case Jitter + Skews

Data UI

Margin Under Absolute WC

Margin Under typical case

Margin Under guard banded

Page 25: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 13

DDR3 high speed PHY design (4)To accommodate low cost and provide high performance DDR IP, per-bit de-skewing technology need to be used to eliminate the following skew and limit the total skew to 30ps

Internal Clock SkewInternal Circuit Routing SkewPackage SkewPCB Skew

Page 26: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 14

PI/SI (1)Power integrity and Signal integrity analysis is the must be items for DDR system designA DDRPHY claims it could run up to 1.6Gbps does not mean anythingPackage and PCB design impact the timing budget seriously

Package modelChip model

Board model

Page 27: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 15

PI/SI (2)PI (power integrity)

Decoupling capacitor adding could lower the impedance in PI simulation

Decoupling capacitor need to be put as close to circuit as possibleOn package decap Capacitance size larger but with substrate inductance from

dieOn die decap Capacitance size smaller but with very small inductance

How to optimize the usage of two kinds of decapWith flexible decap added on DDRPHY IP could optimize the package design flow in adding decap

Page 28: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 16

PI/SI (3)SI simulation Eye diagram

The final sign-off of the DDRPHY system will be eye-diagram, which contains the following information

SSOCross-talkISIIO dutyreflection

Page 29: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 17

PI/SI (4)SI simulation Eye diagram

Model correctness will decide the accuracy of simulation resultPCBPackageIO

Iteration on the correlation of simulation result and measurement takes a long cycle and timing consuming

Page 30: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 18

Wire bond solution (1)Wire bond package needs more careful package substrate designIncrease power/ground to IO ratio could lower the inductance of substrateGeneral speaking, power/ground to IO ratio = 1:2 is a good startingHowever, high power/ground to IO ratio will increase the DDRPHY area

Page 31: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 19

Wire bond solution (2)Proper arrangement on DDRPHY IO layout can increase power/ground to IO ratio without increase the size of PHYTri-tier bond pad arrangement could decrease 40% of wire inductance

PAD_O PAD_O

PAD_M

PAD_I

PAD_M

PAD_I

PAD_O PAD_O

PAD_M

PAD_I

PAD_M

PAD_I

PAD_O PAD_O

PAD_M

PAD_I

PAD_M

PAD_I

PAD_O PAD_O

PAD_M

PAD_I

PAD_M

PAD_I

PAD_O PAD_O

PAD_M

PAD_I

PAD_M

PAD_I

PAD_O PAD_O

PAD_M

PAD_I

PAD_M

PAD_O

Page 32: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 20

Higher speed in DDR3 (1)DDR3 maximum speed is 2133mbps

UI = 468.8ps compared with UI =625ps when operating in 1600mbps. Difference = 156.2ps

There is no design margin in such high speed operationHow to “squeeze” the source synchronous system for the extra 156ps margin, that is a big challenge

Page 33: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 21

Higher speed in DDR3 (2)DDR3 2133mbps

From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take UI(625ps) -418ps-25ps (dqs skew + jitter) = 182ps

SSOCross-talkISIIO dutyreflection

DDRPHY designer need to squeeze 100ps from 182ps and another 50ps from the other skew and jitter inside the PHY

Page 34: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 22

Higher speed in DDR3 (3)DDR3 2133mbps

Every components in the system need to be designed very carefully

PCBPackageIOImpedance matchingClock skewPLL/DLL jitter

Page 35: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

GUC Confidential & Proprietary Copyright © 2010 All rights reserved 23

ConclusionHigh speed DDR performance will strongly depends on

DDRPHY design and need DDR controller to provide logic function control of PHYPackage design

PI/SI simulation is to make sure the quality of package substrate

PCB designPCB layer and layout will impact the performance of DDRPHY a lot

Page 36: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

1

C-P-B Co-design/Co-verification for DDR3 1.6G in consumer products

CT Chiu, ASEJanuary 27, 2011

Page 37: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

2

Packaging Trend for Consumer Product

0.18 µm/FSG 0.13 µm/FSG 90 nm/Low-K 65 nm/Low-K 45 nm/Low-K

1995 2000 1990 2005 2010

P-DIP

QFP family

SOP family

BCC QFN family

Surface Mount Lead-frame PKG

Ball Array PKG

Stacked / MCM / SiP PKG

Hybrid PKG

Ultra thin PKG

BGA family

FBGA (mini BGA)

SiP BGA

PoP

PiP

WLCSP� High Density and Supply Chain Integration� Emerging Advanced Packages Solutions� High Electrical & Thermal Performance� Fast Time-to-market & Quality� Small, Light & Soft� Low Cost� Green

Stacked-die BGA

Page 38: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

3

Design Span Expanding for Total Performance

Ref.: GSA, SIA, ITRS (2007) & ASE Data

IC Design

IC Fab

IC Assy & Test

DFM

Featu

re Si

ze(1/

2 pitc

h)

20202015

Analog

Consumer Logic

DRAM

• CPU • GPU • NAND• DSP/BB

16 nm

32 nm

45 nm

65 nm

201020052000

Slowing Adoption of New IC Technology

IP/Library/DFM Design C

hain

Page 39: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

4

Package Performance Index for High Speed Signal

� Signal Integrity (SI)- Timing Skew, Impedance, X-talk, Jitter

� Power Integrity (PI)- PSN impedance, SSN, SSO

� Eletromagnetic Interference (EMI)- Near-field radiation, Shielding

Page 40: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

5

Package Modeling Transition

� Bonding wire

� Transmission line

� Via transient

� Plating bar

� PWR/GND effect

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5

Freq(GHz)

-18

-16

-14

-12

-10

-8

-6

-4

-2

0

S21

(dB

)

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5

Freq(GHz)

-70

-60

-50

-40

-30

-20

-10

0

S11

(dB

)1st order RLC modelS-parameter model

1st order RLC modelS-parameter model

Page 41: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

6

IC onlyIC+PKGIC+PKG+PCB

IC-Package-Board Co-Verification

Page 42: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

7

ASIC

DDR2

DDR2

DDR2 - 800Mbps : Wire Bond BGA Package

Resistor

PBGA 23mmx23mm32 BitDQS wire ~ 2.83mmDQ wire ~ 3.56mm4 Layer PCB

Page 43: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

8

Chip-Package-Board Co-Simulation

PCB GND shape voided (Original) PCB GND shape filled (Modified)

Layer2 Layer2

Page 44: Chip-Package-Board Co-Design / Co-Verification Technology ... · From the data eye simulation of 1600mbps, the setup time + hold time = 418ps, which means the following items take

9

� Package play as a carrier to offer highly density in 2D/3D integration which full fill consumer market demand in cost effective. Package design cope with system concept is essential in SiP.

� Package performance play significant part at overall product success. Chip-Packge-Board Co-Design/Co-Verification is crucial to products which operate at Gbps (or GHz).

� Package design optimization and modeling methodology transition ensure the quality signal integrity and power integrity performance to DDR2 which also been a basis for DDR3.

Summary