IBM Platform Computing
Elastic Storage

Gord Sissons, Platform Symphony Product Marketing
Scott Campbell, Platform Symphony Product Manager
Rohit Valia, Director, Product Marketing

© 2014 IBM Corporation
Traditional Storage

[Diagram: two standalone filers (Filer 1, Filer 2)]
Traditional Storage

[Diagram: storage sprawl grows to eight standalone filers (Filer 1 through Filer 8)]
Solution: Global workload sharing, resource-balanced storage

Global Namespace with automated storage tiering

[Diagram: four filers unified under a single global namespace]
Elastic Storage provides massively parallel, scale-out storage

Global Namespace with automated storage tiering
Elastic Storage – Key features

Extreme Scalability
• Maximum file system size: 1 million yottabytes
• 2^64 files per file system
• Maximum file size equals file system size
• Customers running 18 PB file systems
• IPv6
• Future-proof
• Commodity hardware

Proven Reliability
• Snapshots, replication
• Built-in heartbeat, automatic failover/failback
• Add/remove nodes on the fly
• Rolling upgrades
• Administer from any node
• Commodity hardware

High Performance
• Parallel file access
• Distributed, scalable, high-performance metadata
• Flash acceleration
• Automatic tiering
• Over 400 GB/s
• Commodity hardware
Supported storage hardware

In addition to IBM Storage, IBM General Parallel File System (GPFS™) supports storage hardware from these vendors:
• EMC
• Hitachi
• Hewlett Packard
• DDN

GPFS supports many storage systems, and the IBM support team can help customers using storage hardware solutions not on this list of tested devices.
Supported server hardware

General Parallel File System (GPFS™) for x86 Architecture™ is supported on both Linux® and Windows Server 2008, on multiple x86 and AMD-compatible systems:
• IBM Intelligent Cluster
• IBM iDataPlex®
• IBM System x® rack-optimized servers
• IBM BladeCenter® servers
• Non-IBM x86 and AMD-compatible servers

GPFS for IBM POWER Systems™ is supported on both IBM AIX® and Linux®, on multiple IBM POWER platforms:
• IBM System p®
• BladeCenter servers
• IBM Blue Gene®
Sharing Data Across an Organization

• 1993: GPFS introduced concurrent file system access from multiple nodes.
• 2005: Multi-cluster expanded the global namespace by connecting multiple sites.
• 2011: AFM took the global namespace truly global by automatically managing asynchronous replication of data.
Global Namespace

Clients of every cluster access the same six paths, /global/data1 through /global/data6. Each file system holds some filesets locally and caches the rest (a conceptual sketch follows below):

• File system store1 – local filesets: /data3, /data4; cache filesets: /data1, /data2, /data5, /data6
• File system store2 – local filesets: /data1, /data2; cache filesets: /data3, /data4, /data5, /data6
• File system store3 – local filesets: /data5, /data6; cache filesets: /data1, /data2, /data3, /data4

• See all data from any cluster
• Cache as much data as required, or fetch data on demand
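A conceptual sketch (not GPFS code) of the behavior above, assuming the fileset layout shown: each cluster serves its local filesets directly and faults cached ones in from their home cluster on first access.

```python
# Which filesets are "home" on each file system (from the table above).
LOCAL = {
    "store1": {"/data3", "/data4"},
    "store2": {"/data1", "/data2"},
    "store3": {"/data5", "/data6"},
}

def read(cluster, fileset):
    if fileset in LOCAL[cluster]:
        return f"{fileset}: served from a local fileset on {cluster}"
    # Cache fileset: fetch from the home cluster on demand, then serve locally.
    home = next(c for c, fs in LOCAL.items() if fileset in fs)
    return f"{fileset}: cache fileset on {cluster}, fetched on demand from {home}"

print(read("store2", "/data1"))   # local
print(read("store2", "/data5"))   # cached; home cluster is store3
```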
Elastic Storage Data Life Cycle Management

Use Elastic Storage filesets and ILM policies to control data placement, deletion, and movement across storage tiers (pools); a sketch of what such rules express follows below.

[Diagram: a single namespace (CIFS access) spanning SSD, SAS, and SATA pools in one file system, with external pools on TSM, LTFS, and HPSS]
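A minimal sketch of what placement and migration rules express, with hypothetical pool and fileset names; real GPFS policies are written in its SQL-like policy language and run by the policy engine.

```python
def placement_pool(fileset, name):
    """Placement rule: pick the pool a new file is created in."""
    if fileset == "scratch":
        return "sata"                    # bulk data straight to the capacity tier
    if name.endswith((".log", ".tmp")):
        return "sas"
    return "ssd"                         # hot data lands on flash

def migration_target(days_since_access):
    """Migration rule: pick where a cooling file should move."""
    if days_since_access > 90:
        return "tape"                    # external pool (TSM / LTFS / HPSS)
    if days_since_access > 30:
        return "sata"
    return None                          # stay put

print(placement_pool("projects", "model.dat"))   # -> ssd
print(migration_target(120))                     # -> tape
```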
A Typical Hadoop HDFS Environment

[Diagram: users submit jobs to a MapReduce cluster backed by HDFS, alongside NFS filers]

• Uses disk local to each server
• Aggregates the local disk space into a single, redundant shared file system
• HDFS is the open-source standard file system used with Hadoop MapReduce
Hadoop MapReduce Environment Using Elastic Storage FPO

[Diagram: the same MapReduce cluster, with Elastic Storage FPO in place of HDFS]

• Uses disk local to each server
• Aggregates the local disk space into a single, redundant shared file system
• Designed for MapReduce workloads
• Unlike HDFS, GPFS-FPO is POSIX compliant, so data maintenance is easy (see the sketch below)
• Intended as a drop-in replacement for open-source HDFS (the IBM BigInsights product may be required)
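Because GPFS-FPO presents a POSIX file system, ordinary tools and libraries operate directly on MapReduce data. A small sketch, assuming a hypothetical /gpfs/fpo mount point:

```python
import shutil

# Standard POSIX file I/O against the shared file system -- no special
# hdfs commands or connectors needed to inspect job output:
line_count = 0
with open("/gpfs/fpo/results/part-00000") as f:
    for _ in f:
        line_count += 1
print(line_count, "lines")

# Ingest is just a copy (or an NFS write, rsync, scp, ...):
shutil.copy("/staging/events.csv", "/gpfs/fpo/input/events.csv")
```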
The Vision

[Diagram: Elastic Storage serving analytics, media, file storage, and data-ingest workloads through POSIX, NFS, MapReduce, and object interfaces; data placed across solid state, spinning disk, tape, and ESS, with world-wide data distribution]

• Single namespace, no matter where the data resides
• Data in the best location, on the best tier (performance and cost), at the right time
• Multi-tenancy
• All in software

Cloud Tier (ICStore)
• IBM Public Cloud
• Amazon S3
• MS Azure
• Private Cloud
Architecture
Elastic Storage Cluster Models

[Diagram: two deployment models. In the NSD server model, application nodes reach NSD servers over a TCP/IP or InfiniBand (RDMA) network, and the NSD servers attach to the storage over a dedicated storage network. In the direct-attached model, application nodes connect to the storage themselves over a TCP/IP or InfiniBand network.]
Features
Elastic Storage key features in more detail

Basics
• Distributed, journaled file system; scalable, high-performance metadata
• AIX, Linux, and Windows
• Single namespace
• Parallel file access
• Built-in heartbeat, automatic failover/failback, quorum
• Administer from any node
• Add/remove servers or disks on the fly
• Rolling upgrades

Standard
• SNMP (running on a Linux node)
• Snapshots, backup, replication
• Filesets, quotas
• Active/active dual site with synchronous replication
• Multi-cluster
• Server internal disks (FPO)
• Flash acceleration (LROC, Linux)
• File clones
• Automatic tiering (ILM), even to tape with HSM software
• Geographic asynchronous caching (AFM)
• Clustered NFS servers (cNFS, Linux) to give access beyond the Elastic Storage cluster

Advanced
• Native encryption
• Secure deletion
Elastic Storage Manages the Full Data Lifecycle Cost Effectively

Right data, right place, right time, right performance, right cost.

• Policy-driven automation and tiered storage management
• Match the cost of storage to the value of data
• Storage pools create tiers of storage:
  ‒ High-performance SSD
  ‒ High-speed SAS drives
  ‒ High-capacity NL-SAS drives
• Integrated with IBM Tivoli Storage Manager (TSM) and IBM LTFS Enterprise Edition (EE):
  ‒ Elastic Storage handles all metadata processing, then hands the data to TSM or LTFS EE for storage on tape
  ‒ Data is retrieved from the external storage pool on demand, for example when an application opens a file
  ‒ Policies move data from one pool to another without changing the file's location in the directory structure (see the sketch below)
• Tape migration bottom line: cuts storage costs by up to 90%

[Diagram: application servers in front of an Elastic Storage Server or commodity hardware, with auto-tiering and migration to a tape library]
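A minimal sketch of the behavior a threshold-driven migration pass automates, assuming illustrative occupancy thresholds of 80% (trigger) and 60% (target); a real GPFS policy expresses this declaratively and the policy engine selects and moves the files.

```python
# Toy model of a threshold-driven migration pass (not GPFS code).
# Assumption: each migrated file frees roughly one percentage point of the
# fast pool; a real policy engine works from actual file sizes.

def migration_candidates(pool_used_pct, files_by_age):
    """files_by_age: list of (path, days_since_access) tuples."""
    if pool_used_pct < 80:            # trigger threshold not reached
        return []
    moved = []
    for path, age in sorted(files_by_age, key=lambda f: -f[1]):
        moved.append(path)            # coldest files go first
        pool_used_pct -= 1
        if pool_used_pct <= 60:       # stop at the target occupancy
            break
    return moved                      # handed to the external (tape) pool

print(migration_candidates(85, [("/gpfs/a.dat", 400),
                                ("/gpfs/b.dat", 2),
                                ("/gpfs/c.dat", 120)]))
```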
Flash Local Read Only Cache (LROC)

• Inexpensive SSDs placed directly in client nodes
• Accelerates I/O performance up to 6x by reducing the time CPUs wait for data
• Also decreases the overall load on the network, benefiting performance across the board
• Improves application performance while maintaining all the manageability benefits of shared storage
• Cache consistency ensured by standard tokens
• Data is protected by checksum and verified on read
• Elastic Storage handles the flash cache automatically, so data is transparently available to your application with very low latency and no code changes (a toy illustration follows below)
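A toy read-through cache illustrating the idea, with invented names; in GPFS the client daemon manages LROC transparently and uses its token mechanism for consistency.

```python
import zlib

# block id -> (checksum, data); stands in for the local SSD cache
flash_cache = {}

def read_block(block_id, read_from_nsd):
    """Serve a read from local flash when possible, verified by checksum;
    fall back to shared storage on a miss or a checksum mismatch."""
    if block_id in flash_cache:
        crc, data = flash_cache[block_id]
        if zlib.crc32(data) == crc:
            return data                    # fast path: served from local SSD
        del flash_cache[block_id]          # corrupt cached copy: discard it
    data = read_from_nsd(block_id)         # slow path: fetch over the network
    flash_cache[block_id] = (zlib.crc32(data), data)
    return data

# Example: first read goes to "storage", the second is served from cache.
fetch = lambda bid: f"contents of {bid}".encode()
print(read_block("blk-7", fetch))
print(read_block("blk-7", fetch))
```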
Elastic Storage: Tiering to tape with LTFS EE

• Automatic migration to tape
• The file user does not see where a file is stored (a user-space way to spot a migrated stub is sketched below)
• Scales by adding tape drives or nodes
• Load is balanced across nodes and drives
• Tapes can be exported and imported
• Redbook: IBM Linear Tape File System Enterprise Edition V1.1 Installation and Configuration Guide, SG24-8143

[Diagram: users and applications see one global namespace over GPFS file systems (user data and metadata); GPFS nodes running LTFS EE move user data to a TSxxxx tape library]
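One hedged way to observe this from user space: an HSM-migrated file typically keeps its logical size in the namespace but occupies no data blocks until it is recalled; opening it triggers a transparent recall from tape. Exact stub behavior varies by product, so treat this as an assumption.

```python
import os

def looks_migrated(path):
    """Heuristic: nonzero logical size but zero allocated blocks suggests
    the data has been migrated to an external pool (e.g., tape)."""
    st = os.stat(path)
    return st.st_size > 0 and st.st_blocks == 0

print(looks_migrated("/gpfs/archive/scan-2013.tar"))  # hypothetical path
```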
File Placement Optimizer
(GPFS-FPO)
Elastic Storage – FPO

• Uses disk local to each server
• All nodes are both NSD servers and NSD clients
• Designed for MapReduce workloads
Elastic Storage: advanced storage for Hadoop

Hadoop HDFS                                       | IBM GPFS-FPO advantages
--------------------------------------------------|------------------------------------------------------------------------
HDFS NameNode is a single point of failure        | No single point of failure; distributed metadata
Large block sizes; poor support for small files   | Variable block sizes, suited to multiple types of data and access patterns
Non-POSIX file system; obscure commands           | POSIX file system, easy to use and manage
Difficult to ingest data; special tools required  | Policy-based data ingest
Single-purpose, Hadoop MapReduce only             | Versatile, multi-purpose
Not recommended for critical data                 | Enterprise-class advanced storage features
OpenStack
OpenStack Delivers a Massively Scalable Cloud Operating System

OpenStack mission: to produce the ubiquitous open-source cloud computing platform that will meet the needs of public and private cloud providers regardless of size, by being simple to implement and massively scalable.
OpenStack Key Components

• Horizon (dashboard)
• Nova (compute)
• Cinder (block storage)
• Swift (object storage)
• Neutron (networking)
• Keystone (identity)
• Glance (images)
OpenStack GPFS Cinder Driver

• The OpenStack Havana release includes a GPFS Cinder driver
  ‒ Giving architects access to the features and capabilities of the industry's leading enterprise scale-out software-defined storage
• With OpenStack on GPFS, all nodes see all data
  ‒ Copying data between services, such as Glance to Cinder, is minimized or eliminated
  ‒ Speeding instance creation and conserving storage space
• Rich set of data management and information lifecycle features
  ‒ Volume placement: on GPFS storage pools, or FPO-based placement
  ‒ Resilience: per-volume replication level, DIO volumes
  ‒ Storage migration: transparent or user-directed migration of volumes between GPFS storage pools, GPFS nodes, or to other Cinder back ends
  ‒ Glance integration: convert a volume to an image, or an image to a volume, through a copy-on-write (COW) mechanism; a fast mechanism for instance provisioning and capture (see the sketch below)
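A hedged sketch using python-cinderclient: with the GPFS driver configured as the Cinder back end, creating a bootable volume from a Glance image can be a copy-on-write clone inside GPFS rather than a byte-for-byte copy. The credentials, endpoint, image UUID, and the 'gpfs' volume type name here are assumptions, not part of the driver.

```python
from cinderclient import client

# Connect to Cinder (v2 API); all connection details are placeholders.
cinder = client.Client('2', 'admin', 'secret', 'demo',
                       'http://controller:5000/v2.0')

# Create a volume from a Glance image; on a GPFS back end configured for
# copy-on-write image sharing, this clones rather than copies the image.
vol = cinder.volumes.create(size=10,
                            name='boot-vol',
                            imageRef='IMAGE_UUID',    # Glance image to clone
                            volume_type='gpfs')       # assumed GPFS-backed type

print(vol.id, vol.status)   # status moves from 'creating' to 'available'
```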
Competition
IBM GPFS vs. Competitors

Business problem                          | IBM GPFS | Lustre                                  | EMC Isilon | IBRIX Fusion | HDFS                          | MapR
------------------------------------------|----------|-----------------------------------------|------------|--------------|-------------------------------|-----
POSIX interface                           | Yes      | Yes                                     | Yes        | Yes          | No                            | Yes
Multi-OS support                          | Yes      | Linux only                              | N/A        | No           | No                            | No
Hadoop FS API or location-aware connector | Yes      | No                                      | Yes        | Yes          | Yes                           | –
Lifecycle management, tape archival       | Yes      | No                                      | No         | No           | No                            | –
Global namespace                          | Yes      | No                                      | Yes        | No           | –                             | –
Distributed metadata                      | Yes      | No                                      | Yes        | Yes          | –                             | –
Expand capacity online                    | Yes      | No                                      | –          | –            | –                             | –
WAN caching / replication                 | Yes      | No                                      | No         | No           | –                             | –
File system snapshots                     | Yes      | –                                       | –          | –            | –                             | –
Quotas                                    | Yes      | –                                       | –          | –            | –                             | –
Open source                               | No       | Yes                                     | No         | –            | Yes                           | No
Commercial support                        | Yes      | Yes (Oracle, Cray, Bull, SGI & others)  | Yes        | Yes (HP)     | Yes (Cloudera, IBM & others)  | Yes
Elastic Storage – Editions
Elastic Storage – New Pricing Structure

Socket-based licensing, with Server and Client licenses per socket
• Simpler: no more PVUs

Express Edition
• gpfs.base (no ILM, AFM, or cNFS)
• gpfs.docs
• gpfs.gpl
• gpfs.msg
• gpfs.gskit

Standard Edition
• Adds gpfs.ext

Advanced Edition
• Adds gpfs.crypto

Platforms
• zLinux
• Ubuntu

Feature                               | Express Edition | Standard Edition | Advanced Edition
--------------------------------------|-----------------|------------------|-----------------
Basic GPFS functionality              | Yes             | Yes              | Yes
ILM: storage pools, policy, mmbackup  |                 | Yes              | Yes
Active File Management (AFM)          |                 | Yes              | Yes
Clustered NFS (cNFS)                  |                 | Yes              | Yes
Encryption                            |                 |                  | Yes
Elastic Storage Cluster Models – licensing

[Diagram: the same two cluster models annotated with license types: Server licenses on the NSD servers and directly attached nodes, Client licenses on application nodes that access data through NSD servers, and FPO licenses on nodes contributing internal disks; networks are TCP/IP or InfiniBand (RDMA)]
Elastic Storage Server
Elastic Storage Server

Replaces a specialized hardware controller with software.

Delivers extreme data integrity
‒ 2- and 3-fault-tolerant erasure codes
‒ End-to-end checksum
‒ Protection against lost writes
‒ Fastest rebuild times using declustered RAID

Breakthrough performance
‒ Declustered RAID reduces application load during rebuilds
‒ Up to 3x lower overhead to applications
‒ Built-in SSDs and NVRAM for write performance
‒ Faster than alternatives today – and tomorrow!

Lowers TCO
‒ 3 years maintenance and support
‒ General-purpose servers
‒ Off-the-shelf SBODs
‒ Standardized in-band SES management
‒ Standard Linux
‒ Modular upgrades
Elastic Storage Server GL Models

• Model GL2 (Analytics Focused): 2 enclosures, 12U; 116 NL-SAS + 2 SSD; 5+ GB/s
• Model GL4 (Analytics and Cloud): 4 enclosures, 20U; 232 NL-SAS + 2 SSD; 10+ GB/s
• Model GL6 (PetaScale Storage): 6 enclosures, 28U; 348 NL-SAS + 2 SSD; 12+ GB/s

• High-capacity storage for analytics and cloud serving
• Uses 4U, 60-drive storage enclosures
• 2 TB or 4 TB drives
• A client-ready petabyte in a single rack!

Common configuration (machine type 5146):
• Power S822L servers, 20 cores each
• 1818-80e expansion chassis
• Red Hat 7
• Graphical user interface
• Management server and HMC
• Elastic Storage software
• Elastic Storage Native RAID
• xCAT or Platform Cluster Manager (optional)
• 10 Gb / 40 Gb Ethernet, FDR InfiniBand
• From 116 to 348 spinning disks
• 3 years maintenance
• Building-block approach to growth
Elastic Storage Server GS Models

• Model GS1: 24 SSD; 6 GB/s
• Model GS2: 46 SAS + 2 SSD, or 48 SSD; 2 GB/s (SAS), 12 GB/s (SSD)
• Model GS4: 94 SAS + 2 SSD, or 96 SSD; 5 GB/s (SAS), 16 GB/s (SSD)
• Model GS6: 142 SAS + 2 SSD; 7 GB/s

• Smaller configurations for high-velocity ingest, or a lower-cost entry point
• Uses 2U, 24-drive storage enclosures (FC 5887)
• 400 GB or 800 GB SSDs, or 1.2 TB SAS drives
• Highest "performance per U" delivered to clients
• Deployable alone, or as the "platinum" tier of an ESS configuration

Common configuration (machine type 5146):
• Power S822L servers, 20 cores each
• Power expansion chassis
• Red Hat 7
• Graphical user interface
• Management server and HMC
• Elastic Storage software
• Elastic Storage Native RAID
• xCAT or Platform Cluster Manager (optional)
• 10 Gb / 40 Gb Ethernet, FDR InfiniBand
• 3 years maintenance
• Building-block approach to growth
Elastic Storage Ensures End-to-end Data Availability, Reliability, and Integrity

• GPFS Elastic Storage Native RAID (declustered RAID)
  ‒ Data and parity stripes are uniformly partitioned and distributed across the array
  ‒ Rebuilds that take days on other systems take minutes on Elastic Storage (see the back-of-envelope sketch below)
• 2-fault and 3-fault tolerance
  ‒ Reed-Solomon parity encoding, 2-fault or 3-fault tolerant
  ‒ 3- or 4-way mirroring
• End-to-end checksum and dropped-write detection
  ‒ From the disk surface to the Elastic Storage user/client
  ‒ Detects and corrects off-track and lost/dropped disk writes
• Asynchronous error diagnosis while affected I/Os continue
  ‒ If a media error: verify and restore if possible
  ‒ If a path problem: attempt alternate paths
• Supports live replacement of disks
  ‒ I/O operations continue for tracks whose disks are removed during service
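A back-of-envelope sketch of the rebuild-time argument, assuming illustrative disk counts and per-disk rebuild bandwidth rather than ESS specifications: because a failed disk's stripes are spread over all remaining disks, every survivor rebuilds a small slice in parallel instead of one hot spare absorbing all the writes.

```python
def rebuild_hours(disk_tb, disks, per_disk_mb_s):
    """Hours to re-create one failed disk's contents."""
    data_mb = disk_tb * 1e6
    classic = data_mb / per_disk_mb_s / 3600   # one spare absorbs all writes
    declustered = classic / (disks - 1)        # slices rebuilt in parallel
    return classic, declustered

# Illustrative numbers: 4 TB disks, a 58-disk array, 50 MB/s per disk.
classic_h, declustered_h = rebuild_hours(disk_tb=4, disks=58, per_disk_mb_s=50)
print(f"classic: ~{classic_h:.0f} h   declustered: ~{declustered_h * 60:.0f} min")
```

With these assumptions, a rebuild that ties up a spare for roughly a day finishes in tens of minutes when declustered, which is the qualitative claim above; real rebuild times also depend on ongoing application load and the erasure-code width.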