39
Phase Change Memory An Architecture and Systems Perspective Benjamin Lee Electrical Engineering Stanford University Stanford EE382 2 December 2009 Benjamin Lee 1 :: PCM :: 2 Dec 09

Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

  • Upload
    others

  • View
    8

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Phase Change MemoryAn Architecture and Systems Perspective

Benjamin LeeElectrical Engineering

Stanford University

Stanford EE3822 December 2009

Benjamin Lee 1 :: PCM :: 2 Dec 09

Page 2: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Memory Scaling

◦ ⇑ density, capacity; ⇓ cost-capability ratio

◦ Emerging challenges for prevalent technologies [ITRS07]

Benjamin Lee 2 :: PCM :: 2 Dec 09

Page 3: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Technology Alternatives

• Memory Scaling◦ ⇑ density, capacity; ⇓ cost◦ Challenges for prevalent technologies

• Charge Memory◦ Write data by capturing charge Q◦ Read data by detecting voltage V◦ Examples: Flash, DRAM

• Resistive Memory◦ Write data by pulsing current dQ/dt◦ Read data by detecting resistance R◦ Examples: PCM, STT-MRAM, memristor

Benjamin Lee 3 :: PCM :: 2 Dec 09

Page 4: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Limits of Charge Memory

◦ Difficult charge placement and control

◦ Flash: floating gate charge

◦ DRAM: capacitor charge, transistor leakage

Benjamin Lee 4 :: PCM :: 2 Dec 09

Page 5: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Towards Resistive Memory

• Scalable◦ Program cell with scalable mechanisms◦ Map resistance to logical state

• Non-Volatile◦ Set atomic structure of cell◦ Incur activation cost to alter properties

• Competitive◦ Achieve viable latency, power, endurance◦ Scale to improve performance metrics

Benjamin Lee 5 :: PCM :: 2 Dec 09

Page 6: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

TechnologyBenjamin Lee, Engin Ipek, Onur Mutlu, Doug Burger. “Architecting phase change

memory as a scalable DRAM alternative.” ISCA 2009.

Benjamin Lee 6 :: PCM :: 2 Dec 09

Page 7: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Phase Change Memory

◦ Store data within phase change material

◦ Set phase via current pulse

◦ Detect phase via resistance (amorphous/crystalline)

Benjamin Lee 7 :: PCM :: 2 Dec 09

Page 8: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

PCM Scalability

◦ Program with current pulses, which scale linearly

◦ PCM roadmap to 30nm [Raoux+08]

◦ Flash/DRAM roadmap to 40nm [ITRS07]

Benjamin Lee 8 :: PCM :: 2 Dec 09

Page 9: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

PCM Non-Volatility

• Atomic Structure◦ Program with current pulses◦ Melt material at 650 ◦C◦ Cool material to desired phase

• Activation Cost◦ Crystallize with high activation energy◦ Isolate thermal effects to target cell◦ Retain data for >10 years at 85 ◦C

Benjamin Lee 9 :: PCM :: 2 Dec 09

Page 10: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Technology Parameters

◦ Survey prototypes from 2003-2008 [ISSCC][VLSI][IEDM][ITRS]

◦ Derive parameters for F=90nm

Size◦ 9 - 12F2 using BJT

◦ 1.5× DRAM

Endurance◦ 1E+08 writes

◦ 1E-08× DRAM

Latency◦ 50ns Rd, 150ns Wr

◦ 4×, 12× DRAM

Energy◦ 40µA Rd, 150µA Wr

◦ 2×, 43× DRAM

Benjamin Lee 10 :: PCM :: 2 Dec 09

Page 11: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Technology Parameters

◦ Survey prototypes from 2003-2008 [ISSCC][VLSI][IEDM][ITRS]

◦ Derive parameters for F=90nm

Size◦ 9 - 12F2 using BJT

◦ 1.5× DRAM

Endurance◦ 1E+08 writes

◦ 1E-08× DRAM

Latency◦ 50ns Rd, 150ns Wr

◦ 4×, 12× DRAM

Energy◦ 40µA Rd, 150µA Wr

◦ 2×, 43× DRAM

Benjamin Lee 10 :: PCM :: 2 Dec 09

Page 12: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Technology Parameters

◦ Survey prototypes from 2003-2008 [ISSCC][VLSI][IEDM][ITRS]

◦ Derive parameters for F=90nm

Size◦ 9 - 12F2 using BJT

◦ 1.5× DRAM

Endurance◦ 1E+08 writes

◦ 1E-08× DRAM

Latency◦ 50ns Rd, 150ns Wr

◦ 4×, 12× DRAM

Energy◦ 40µA Rd, 150µA Wr

◦ 2×, 43× DRAM

Benjamin Lee 10 :: PCM :: 2 Dec 09

Page 13: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Technology Parameters

◦ Survey prototypes from 2003-2008 [ISSCC][VLSI][IEDM][ITRS]

◦ Derive parameters for F=90nm

Size◦ 9 - 12F2 using BJT

◦ 1.5× DRAM

Endurance◦ 1E+08 writes

◦ 1E-08× DRAM

Latency◦ 50ns Rd, 150ns Wr

◦ 4×, 12× DRAM

Energy◦ 40µA Rd, 150µA Wr

◦ 2×, 43× DRAM

Benjamin Lee 10 :: PCM :: 2 Dec 09

Page 14: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

PCM Deployment

◦ Deploy PCM on memory bus

◦ Begin by co-locating PCM, DRAM

Benjamin Lee 11 :: PCM :: 2 Dec 09

Page 15: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Price of Scalability

◦ 1.6× delay, 2.2× energy, 500-hour lifetime

◦ Replace DRAM with PCM in present architectures

Benjamin Lee 12 :: PCM :: 2 Dec 09

Page 16: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Architecture and ScalabilityBenjamin Lee, Engin Ipek, Onur Mutlu, Doug Burger. “Architecting phase change

memory as a scalable DRAM alternative.” ISCA 2009.

Benjamin Lee 13 :: PCM :: 2 Dec 09

Page 17: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Architecture Objectives

• DRAM-Competitive◦ Reorganize row buffer to mitigate delay, energy◦ Implement partial writes to mitigate wear mechanism

• Area-Efficient◦ Minimize disruption to density trends◦ Impacts row buffer organization

• Complexity-Effective◦ Encourage adoption with modest mechanisms◦ Impacts partial writes

Benjamin Lee 14 :: PCM :: 2 Dec 09

Page 18: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Buffer Organization

• On-Chip Buffers◦ Use DRAM-like buffer and interface◦ Evict modified rows into array

• Narrow Rows◦ Reduce write energy ∝ buffer width◦ Reduce peripheral circuitry, associated area

• Multiple Rows◦ Reduce eviction frequency◦ Improve locality, write coalescing

Benjamin Lee 15 :: PCM :: 2 Dec 09

Page 19: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Buffer Area Strategy

◦ Narrow rows :: fewer expensive S/A’s (44T)

◦ Multiple rows :: additional inexpensive latches (8T)

Benjamin Lee 16 :: PCM :: 2 Dec 09

Page 20: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Buffer Design Space

◦ Derive DRAM, PCM area model

◦ Explore space of area-neutral buffer designs

Benjamin Lee 17 :: PCM :: 2 Dec 09

Page 21: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Wear Reduction

• Wear Mechanism◦ Writes induce phase change at 650 ◦C◦ Contacts degrade from thermal expansion/contraction◦ Current injection is less reliable after 1E+08 writes

• Partial Writes◦ Reduce writes to PCM array◦ Write only stored lines (64B), words (4B)◦ Add cache line state with 0.2%, 3.1% overhead

Benjamin Lee 18 :: PCM :: 2 Dec 09

Page 22: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Partial Writes

◦ Derive PCM lifetime model

◦ Quantify eliminated writes during buffer eviction

Benjamin Lee 19 :: PCM :: 2 Dec 09

Page 23: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Scalable Performance

◦ 1.2× delay, 1.0× energy, 5.6-year lifetime

◦ Scaling improves energy, endurance

Benjamin Lee 20 :: PCM :: 2 Dec 09

Page 24: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Systems and Non-VolatilityJeremy Condit, Edmund Nightingale, Christopher Frost, Engin Ipek, Benjamin Lee,

Doug Burger, Derrick Coetzee. “Better I/O through byte-addressable, persistent

memory.” SOSP 2009.

Benjamin Lee 21 :: PCM :: 2 Dec 09

Page 25: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Storage Systems

◦ Persistent data in slow, non-volatile memory

◦ Buffered data in fast, volatile memory

Benjamin Lee 22 :: PCM :: 2 Dec 09

Page 26: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Storage System Trade-offs

• Design Objectives◦ Safety :: secure against crashes◦ Consistency :: correctness in non-volatile memory◦ Performance :: buffering in volatile memory

• Byte-addressable Persistence (BPRAM)◦ Narrows gap between volatile/non-volatile memory◦ Addressable like DRAM◦ Persistent like disk, flash

Benjamin Lee 23 :: PCM :: 2 Dec 09

Page 27: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Byte-addressable Persistent File System (BPFS)

• Safety◦ Use PCM as DRAM alternative◦ Reflect writes to PCM in O(ms), not O(s)

• Consistency◦ Enforce atomicity, ordering in hardware◦ Support shadow paging, copy-on-write

• Performance◦ Use short-circuit shadow paging◦ Exploit addressability for small, in-place writes

Benjamin Lee 24 :: PCM :: 2 Dec 09

Page 28: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Tree-Based File System

Benjamin Lee 25 :: PCM :: 2 Dec 09

Page 29: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Disks & Journaling

◦ Write to journal before write to file system

◦ Requires twice the writes

Benjamin Lee 26 :: PCM :: 2 Dec 09

Page 30: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Disks & Journaling

◦ Write to journal before write to file system

◦ Requires twice the writes

Benjamin Lee 26 :: PCM :: 2 Dec 09

Page 31: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Disks & Journaling

◦ Write to journal before write to file system

◦ Requires twice the writes

Benjamin Lee 26 :: PCM :: 2 Dec 09

Page 32: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Disks & Shadow Paging

◦ Copy-on-write up to file system root

◦ Requires recursion up file system tree

Benjamin Lee 27 :: PCM :: 2 Dec 09

Page 33: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Disks & Shadow Paging

◦ Copy-on-write up to file system root

◦ Requires recursion up file system tree

Benjamin Lee 27 :: PCM :: 2 Dec 09

Page 34: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Disks & Shadow Paging

◦ Copy-on-write up to file system root

◦ Requires recursion up file system tree

Benjamin Lee 27 :: PCM :: 2 Dec 09

Page 35: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

PCM & Short-Circuit Shadow Paging

◦ Update in-place for small writes

◦ Lowers copying overhead

Benjamin Lee 28 :: PCM :: 2 Dec 09

Page 36: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Hardware Support

• Atomicity◦ BPFS assumes atomic 64-bit pointer updates◦ PCM writes atomically into memory array◦ Capacitors guard against power failures

• Ordering◦ Caches, memory controller reorder writes◦ Epochs define barrier-delimited BPFS writes◦ Enforce ordering across epochs◦ Allow reordering within epochs

Benjamin Lee 29 :: PCM :: 2 Dec 09

Page 37: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

BPFS Evaluation

◦ Improved safety, consistency

◦ Improved performance

Benjamin Lee 30 :: PCM :: 2 Dec 09

Page 38: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Conclusions

• Scaling Challenges◦ Fundamental limits in charge memory◦ Transition towards resistive memory

• Architecture and Scalability◦ Scalable, non-volatile, DRAM-competitive◦ Efficient buffers mitigate latency, energy◦ Partial writes reduce wear

• Systems and Non-Volatility◦ BPFS changes storage system trade-offs◦ Short-circuit shadow paging, hardware support◦ Improves durability, performance

Benjamin Lee 31 :: PCM :: 2 Dec 09

Page 39: Phase Change Memory: An Architecture and Systems …Architecture and Scalability Scalable, non-volatile, DRAM-competitive Efficient buffers mitigate latency, energy Partial writes

Phase Change MemoryAn Architecture and Systems Perspective

Benjamin LeeElectrical Engineering

Stanford University

Stanford EE3822 December 2009

Benjamin Lee 32 :: PCM :: 2 Dec 09