35
Glenn K. Lockwood, Ph.D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond -1- June 28, 2018

Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Glenn K. Lockwood, Ph.D.+ 28 others

Planning for the Future of Storage for HPC: 2020, 2025, and Beyond

- 1 -

June 28, 2018

Page 2: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC: the mission HPC facilityfor the U.S. Department of Energy Office of Science

- 2 -

7,000 users800 projects700 applications~2000 publications per year

Page 3: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Cori – NERSC’s Cray XC-30 (2015)

Compute• 9,688 Intel KNL nodes• 2,388 Intel Haswell nodesStorage• 30 PB, 700 GB/s scratch– Lustre (Cray ClusterStor)– 248 OSSes⨉ 41 HDDs ⨉ 4 TB– 8+2 RAID6 declustered parity• 1.8 PB, 1.5 TB/s burst buffer– Cray DataWarp– 288 BBNs ⨉ 4 SSDs ⨉ 1.6 TB– RAID0

- 3 -

Page 4: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Wang Hall – NERSC’s new data center (2015)

• 12.5 MW in 2015• 20 MW planned for 2020• 1,850 m2 floor space• PUE < 1.1• Free cooling – no chillers!– 4x 3.4 MW cooling towers– < 25°C water to racks– 4x ~0.5 MW air handlers– 30% - 70% humidity

- 4 -

Page 5: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC’s storage infrastructure (2015)

- 5 -

Performance

Capacity

Burst Buffer

Scratch

Campaign

Archive

Page 6: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Burst Buffer

Campaign

Archive

Scratch

More capabilities, more problems

- 6 -

Performance

Capacity

Page 7: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Burst Buffer

Scratch

Campaign

Archive

More capabilities, more problems

- 7 -

Performance

Capacity

Tactical challenges• Partly inverted pyramid• Wang Hall could not

host tape archives• Tape technology was

discontinued by vendor

Page 8: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Burst Buffer

Scratch

Campaign

Archive

More capabilities, more problems

- 8 -

Performance

Capacity

Strategic challenges• Exascale problems are

massive data problems• New instruments have huge

data analysis needs• Storage tiers are hard to

use and manage• Hardware landscape is

rapidly changing

Page 9: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Storage 2020: NERSC’s strategic plan for storage

- 9 -

Storage 2020 goals:• Define roadmap for NERSC storage

infrastructure for the next ten years• Define architectures for specific

NERSC milestones:– 2020: NERSC-9 deployment– 2025: NERSC-10 deployment

• Define blueprint for HPC storage planning

Page 10: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC’s approach to strategic planning

- 10 -

WorkloadAnalysis

UserRequirements

TechnologyTrends

NERSCStrategy

Page 11: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

User requirements: Workflows

- 11 -

APEX w orkflow s w hite paper - https://w w w .nersc.gov/assets/apex-w orkflow s-v2 .pdf

Survey findings:• Data re-use is uncommon

• Significant % of working set must be retained forever

Insight:

• Read-caching burst buffers require prefetching

• Need large archive

• Need to efficiently move data from working space to archive

Page 12: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

User requirements: Exascale

Large working sets: “Storage requirements are likely to be large; they are already at the level of 10 PB of disk storage, and they are likely to easily exceed 100 PB by 2025.” (HEP)

High ingest rates: “Next generation detectors will double or quadruple these rates in the near term, and rates of 100 GB/sec will be routine in the next decade.” (BES)

- 12 -DOE Exascale Requirements Reviews - https://www.exascaleage.org/

Page 13: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Workload analysis: Read/write ratios

- 13 -

Burst Buffer: 4:6Scratch: 7:5Archive: 4:6• Checkpoint/restart

is not the whole picture

• Read performance is very important!

Lockw ood et al., TO KIO for C lusterStor. Proceedings of 2018 Cray U ser G roup.Stockholm , SE. M ay 2018.

Page 14: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Workload analysis: File interactions

- 14 -

Metadata ops issued in a year to scratchFile size distribution on project

Page 15: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Technology trends: tape

• Industry is consolidating• Revenue is shrinking• Tape advancements are

driven by profits, not tech!– Re-use innovations in HDD– Trail HDD bit density by 10 yr

• Refresh cadence will slow• $/GB will no longer keep

up with data growth

- 15 -

LTO market trends; Fontana & Decad, MSST 2016

Page 16: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Technology trends: magnetic disk

- 16 -

• Bit density increases slowly (10%/yr)

• !"# ∼ ⁄&'(# ')*

• HDDs for capacity, not performance

Page 17: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Technology trends: flash

• NAND $/GB dropping fast

• Performance limited by PCIe and power

• O($0.15/GB) by 2020

- 17 -

Actuals from Fontana & Decad, Adv. Phys. 2018

Page 18: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Technology trends: flash

- 18 -

Perfo

rman

ce

Cap

acity

, End

uran

ce

SCM

Fast3D NAND

MLC & TLC3D NAND

Five-level cell 3D NAND

3DXP, CrossbarO(10 GB/s), O(100 DWPD)

Trash flashO(100 MB/s), O(100 DW)

Z-NANDGB/s, O(10 DWPD)

Plain flashGB/s, O(1 DWPD)

Page 19: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Technology trends: software

- 19 -

• Workloads demand perf at small I/O sizes

• Don’t pay for POSIX consistency

• Object stores are a flexible foundation

Page 20: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Design goals derived from requirements

• Tiering and data movement– Fix the storage pyramid– Collapse tiers for simplicity

• POSIX and file systems– Maintain POSIX API support– ...but look beyond POSIX semantics

• Hardware– Capitalize on falling costs of flash– Tape will not be business as usual

- 20 -

Page 21: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – 2015, 2020, and 2025

- 21 -

Page 22: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – Tiering and data movement

- 22 -

• Target 2020– Collapse burst buffer

and scratch into a single logical tier

Page 23: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – Tiering and data movement

- 23 -

• Target 2020– Collapse burst buffer

and scratch into a single logical tier

• Target 2025– Collapse longer-term

disk and tape into a single logical tier

Page 24: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – POSIX and file systems

- 24 -

• Target 2020– Provide object

interfaces to file-based storage systems

Page 25: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – POSIX and file systems

- 25 -

• Target 2020– Provide object

interfaces to file-based storage systems

• Target 2025– Provide file interfaces

into object-based storage systems

Page 26: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – Hardware

- 26 -

• Target 2020– Collapse burst buffer and

scratch into all-flash scratch– Invest in large disk tier for

capacity– Tape isn’t dead yet, so invest in

it to minimize long-term costs• Target 2025

– Use single namespace to manage tiers of SCM and flash for scratch

– Use single namespace to manage tiers of disk and tape for long-term repository

Page 27: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – Hardware

- 27 -

• Target 2020– Collapse burst buffer and

scratch into all-flash scratch– Invest in large disk tier for

capacity– Tape isn’t dead yet, so invest in

it to minimize long-term costs • Target 2025

– Use single namespace to manage tiers of SCM and flash for scratch

– Use single namespace to manage tiers of disk and tape for long-term repository

Page 28: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – 2020, 2025, and beyond

- 28 -

All-flash parallel file system feasible for

NERSC-9

Page 29: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – 2020, 2025, and beyond

- 29 -

All-flash parallel file system feasible for

NERSC-9

> 100 PB disk-based file system

Page 30: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – 2020, 2025, and beyond

- 30 -

All-flash parallel file system feasible for

NERSC-9

> 100 PB disk-based file system

> 350 PB HPSS archive w/

IBM TS4500+Integrated Cooling

Page 31: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – 2020, 2025, and beyond

- 31 -

All-flash parallel file system feasible for

NERSC-9

> 100 PB disk-based file system

> 350 PB HPSS archive w/

IBM TS4500+Integrated Cooling

Performance object store w/ SCM+NAND on NERSC-10 (DAOS? Mero? Others?)

Page 32: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

NERSC roadmap – 2020, 2025, and beyond

- 32 -

All-flash parallel file system feasible for

NERSC-9

> 100 PB disk-based file system

> 350 PB HPSS archive w/

IBM TS4500+Integrated Cooling

Performance object store w/ SCM+NAND on NERSC-10 (DAOS? Mero? Others?)

Archival object store w/ HDD+tape(GHI+HPSS? Versity? Others?)

Page 33: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Unresolved issues requiring investment

• Automatic data movement– Need better metadata capabilities to inform policy– Need better software integration between tiers

• Moving beyond POSIX– Only implement the parts of POSIX people use– Limit unnecessary performance penalties (locking, etc)

• Everything must be tunable to workload of the center

- 33 -

Page 34: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Following the Storage 2020 blueprint

• 36-page Storage 2020 report online: https://escholarship.org/uc/item/744479dp

• Be inclusive and get all staff involved!• Keep up with industry & research

– HPC-IODC– Massive Storage Systems and Technology conference– Flash Memory Summit– Large Tape User Group– PDSW-DISCS

- 34 -

Page 35: Planning for the Future of Storage for HPC: 2020, 2025 ... · Glenn K. Lockwood, Ph. D. + 28 others Planning for the Future of Storage for HPC: 2020, 2025, and Beyond-1-June 28, 2018

Thank you!

- 35 -