15
IBM General Parallel File System (GPFS™) 3.5 and GSS Introduction IBM Confidential Karl Hansen, Nordic HPC/Technical Computing Sales Manager IBM Systems & Technology Group A New Era in Technical Computing: Powerful. Comprehensive. Intuitive.

IBM general parallel file system - introduction

Embed Size (px)

DESCRIPTION

Presentation from the HPC event at IBM Denmark - September 2013, Copenhagen

Citation preview

Page 1: IBM general parallel file system - introduction

IBM General Parallel File System (GPFS™) 3.5and GSS Introduction

IBM Confidential

Karl Hansen, Nordic HPC/Technical Computing Sales Manager

IBM Systems & Technology GroupA New Era in Technical Computing: Powerful. Comprehensive. Intuitive.

Page 2: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

Extreme Scalability Proven Reliability Manageability

The IBM General Parallel File SystemTM (GPFSTM)Shipping since 1998

© 2012 IBM Corporation

File system

� 263 files per file system

� Maximum file system

size: 299 bytes

� Production 19PB file

system

Number of nodes

� 1 to 8192

Extreme Scalability

� No special nodes

� Add/remove nodes

and storage on the fly

� Rolling upgrades

� Administer from any

node

� Data replication

� Snapshots

� File system journaling

Proven Reliability

� Integrated tiered storage

� Storage pools

� Quotas

� Policy-Driven automation

� Clustered NFS

� SNMP monitoring

� TSM / HPSS (DMAPI)

Manageability

Page 3: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

IBM General Parallel File System (GPFS)

IBM General Parallel File System (GPFS) is a scalable high-performance file

A highly available cluster architecture

Concurrent shared disk access to

© 2012 IBM Corporation3

performance file management

infrastructure for AIX®, Linux® and

Windows™ systems.

Concurrent shared disk access to a global namespace

Capabilities for high performanceparallel workloads

Page 4: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

File data infrastructure optimization

GPFS enables:

� A global namespace across platforms

� High performance

common storage

ConnectionsSAN TCP/IPInfiniBand

Management

Databases

File servers

© 2012 IBM Corporation4

common storage

� Eliminating copies

of data

� Improved storage utilization

� Simplified file

management

AvailabilityData Migration

ReplicationBackup

ManagementCentralizedMonitoringAutomated File Mgmt

Application servers

Backup and

archive

Page 5: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

How is GPFS different?

GPFS

Centrally deployed, managed,

Massive namespace support

© 2012 IBM Corporation5

All features are included. All software features: snapshots, replication and multi-site connectivity are included in the GPFS license. With no license keys except for client and server to add on, you get all of the features up front.

SAN

Centrally deployed, managed,

backed up and grown.

Seamless capacity and

performance scaling

Page 6: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

Network-based block input output

LAN

NSD clients

NSD GPFS

Application data access on network

attached nodes is exactly the same as

a SAN attached node. GPFS

transparently sends the block level IO

request over a TCP/IP network.

© 2012 IBM Corporation6

Why?� Enable virtually seamless multi-site operations

� Reduce costs for data administration

� Provide flexibility of file system access

� Establish highly scalable and reliable data storage

� Future protection by supporting mixed technologies

SAN

NSD servers

SAN

GPFS

Page 7: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

IBM General Parallel File System (GPFS™) – History & Evolution

HPC

GPFS General File

Serving

Virtual Tape Server (VTS)

Linux® Clusters (Multiple architectures)

GPFS 2.1-2.3

HPC

Research

Visualization

Digital media

Seismic

Weather

Life sciences

32 bit /64 bit

GPFS 3.1-3.2

First called GPFS

GPFS 3.4

Enhanced

Windows cluster

support

- Homogenous

Windows server

Performance and

scaling

improvements

GPFS 3.3

Restricted

admin functions

Improved

installation

New license

model

Improved

Ease of administration

Multiple-

Information lifecycle management (ILM)

� Storage pools

� File sets

� Policy engine

GPFS 3.5

Caching via

Active File

Management

(AFM)

GSS - GPFS

Storage Server

GPFS File

Placement

© 2012 IBM Corporation

2006200520021998

Serving� Standards� Portable

operating system interface (POSIX) semantics-Large block

� Directory and small file perf

� Data management

architectures)

IBM AIX® Loose Clusters

Inter-op (IBM AIX

& Linux)

GPFS Multicluster

GPFS over wide

area networks

(WAN)

Large scale

clusters

thousands of

nodes

2009

Enhanced

migration and

diagnostics

support

2010

snapshot and

backup

Improved ILM

policy engine

2012

Multiple-networks/ RDMA

Distributed token management

Windows 2008

Multiple NSD servers

NFS v4 support

Small file performance

Placement

Optimizer (FPO)

Page 8: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

� New Storage Solution fulfilled exclusively through IBM Intelligent Cluster

� High capacity, High performance, High Value offering

Product importance

� Single, integrated, fully supported IBM solution

A Disruptive HPC Play - GPFS Storage Server (GSS) At a GlanceThe New High Capacity, High PerformanceStorage Solution

© 2012 IBM Corporation8

� Single, integrated, fully supported IBM solution

� Built to leverage a strong GPFS software market

� High capacity, scalable building-block approach - performance and capacity increases as you add multiple building blocks

� Cost competitive

� Extreme data integrity and reduced latency with faster rebuild times

Page 9: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

GPFS Storage Server - Product DescriptionGSS is a new storage solution fulfilled exclusively through the IBM Intelligent Cluster

� Two x3650 servers combined with either four or six JBODs

� Two models: GSS 24 and GSS 26

– GSS 24 (Entry): 4 JBODs – starts at nearly 500TB of storage space

– GSS 26 (Main): 6 JBODs – starts at over 700TB of storage

space

Data striped across

all disks

© 2012 IBM Corporation

� 2 and 3 TB options

� 10GbE or FDR Infiniband interconnects, or both!

� Scalable Building Block approach to HPC Storage - performance and capacity increase as you add multiple building blocks

� Complete Storage Solution with no Storage controllers

� De-Clustered RAID Techniques

� Built on GPFS software

� Industry standard components – Leverages standard components including x3650s, NetApp

JBODs, LSI SAS cards and lots of HDDs, and Intelligent

Cluster fulfillment as a single, integrated, fully supported IBM

GSS 24 Model

Page 10: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

x3650 M4 Server

Storage solution includes Data Servers,

Disk (2TB or 3TB NL-SAS, SSD), Software,

InfiniBand / Ethernet with no Storage Controllers

IBM System x GPFS Storage Server provides a comprehensive storage solution with a scalable building block approach

© 2012 IBM Corporation

JBODDisk Enclosure

GSS 24: Light and Fast2 3650 servers + 4 JBOD 20U rack

10 GB/Sec

GSS 26: HPC Workhorse2 3650 servers +

6 JBOD Enclosures, 28U

12 GB/sec

High-Density HPC Option6 3650 servers + 18 JBOD2 - 42U Standard Racks

36 GB/sec

Page 11: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

Why you should care…

Cost competitive� Fewer parts means lower cost

� Leverages System x servers and Commercial JBODs

Affordably Scalable Building Block

approach to HPC Storage

� Performance and capacity increases as you add multiple buidling blocks

� Start Small and Scale via incremental additions

� Add capacity AND bandwidth

• Fast rebuild times and industry-leading performance

© 2012 IBM Corporation11

Extreme data integrity and reduced latency

• Fast rebuild times and industry-leading performance• Better sustained performance• Industry-leading throughput using efficient De-

Clustered RAID Techniques

Built on GPFS• The Infrastructure for Global Technical Computing

Data Management

Fully integrated, fully supported

�Single, integrated, fully supported IBM solution

�Complete Storage Solution with no Storage controllers

�Easy to order through Intelligent Cluster

Page 12: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

A data management portfolio for Technical Computing

Focus: Ease of UseReliability

IBM Data Management Leadership

Focus:Managed Building

Block

Focus:Raw Raw Performance

I/O Bandwidth

Government

High EndResearch

Petroleum

Financial Media/Ent.

GPFS Storage Server

© 2012 IBM Corporation

Financial Services

Smaller Installations

Higher End University

Media/Ent.

CAE

Bio/Life Science

IBM Tape, Tivoli Storage Manager, and HPSS

Direct Attached(DS3500 + V3700)

SONAS

DCS3700DCS3700+

Page 13: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

NSD File Server 1

Clients

FDR IB

10 GbE

NSD File Server 1x3650

Clients

What makes this different

File/Data Servers

© 2012 IBM Corporation

JBOD Disk Enclosures

NSD File Server 2

Servers!

Migrate RAID

and Disk

Management to

Standard File

Servers!

Custom Dedicated

Disk Controllers

JBOD Disk Enclosures

x3650

NSD File Server 2GPFS Native RAID

GPFS Native RAID

File/Data Servers

Page 14: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

System x

Smarter Systems for a Smarter Planet

© 2012 IBM Corporation14

Page 15: IBM general parallel file system - introduction

Technical Computing: Powerful. Comprehensive. Intuitive

For more information ibm.com/systems/software/gpfs

© 2012 IBM Corporation

Email [email protected] or contact your IBM Representative