Upload
doanthien
View
224
Download
1
Embed Size (px)
Citation preview
Alfred Freudenberger
North America Power Systems
Sales Executive
512-659-8059
IBM Power Systems for SAP Environments
Blog: SAPonPower.wordpress.com
© 2014 IBM Corporation2
Production Node 3
512GB
Production Node 1
512GB
shared f ilesystem - GPFS
HDD f lash
data01 log01
DB partition 1
- index server- statistic server- SAP HANA® studio
- SAP HANA® DB
worker node
data01(1)
DB partition 2
HDD f lash
DB partition 3
f lash
- index server- statistic server
- SAP HANA® DB
worker node
data03 log03
Hot Standby512GB
HDD f lash
- index server- statistic server
- SAP HANA® DB
standby node
HDD replica
primary data
SAP HANA® DB
data01(2) data01(3)data02(1) data02(2)data02(3)
data03(1)data03(2) data03(3)
log01(1)
Log replica
log01(2) log01(3)
log02(1) log02(2)log02(3)log03(1)log03(2) log03(3)
HDD replica
data01(2) data01(3)data02(1) data02(2)data02(3)
data03(1)data03(2)
Log replica
log01(2) log01(3)
log02(1) log02(2)log02(3)log03(1)log03(2)
Production Node 2
512GB
HDD flash
- index server- statistic server
- SAP HANA® DB
worker node
data02 log02
data01(1)
data03(3)
log01(1)
log03(3)
DB partition 2
HANAHANA
Node1
GPFS
Node2
GPFS
Node3
GPFS
Node4
GPFS
NodeN
GPFS
IMDB IMDB IMDB IMDB IMDB
App1G1
App10G1
App10G2App1G2
4 5
6
8
7
9
0 1
4 5
0 1
4 5
0 1
4 5
0 1
4 5
0 1
BI
mgmtBI
2 3
IMM IMM IMM IMM IMM
2 3 2 3 2 3 2 3
ISL: 2x 10GigE
mgmt
ISL: 2x 10GigE
SAP HANA
© 2014 IBM Corporation3
SAP HANA Deployment Options for Different Use Cases
e.g. SAP ERP: CO-PA e.g. SAP BW -7.3 SP5-on HANA, Business Suite
e.g. Demographics & Google Maps
e.g. SAP ERP: Operational reportingHANA Live
© 2014 IBM Corporation4
SAP HANA on IBM POWER
� SAP HANA on Power is targeting enterprise customers requiring an SAP HANA-based
solution on IBM Power Systems servers
� IBM intention is not to offer it as an appliance, but in a flexible form combining the HANA
license from SAP and IBM Power Systems servers, middleware and services.
+
© 2014 IBM Corporation5
SAP HANA on IBM Power – Expected Customer Value
• Intended for mission critical 7X24 Enterprise customer operations
– Not an Appliance, running on traditional Power 8 and Power 7+ servers
– Best Reliable, Available, Serviceable (RAS) in the market
– On-Demand Capacity
– Can be integrated into and tailored to a customer’s environment
• Throughput per core
– Significant Power SMT throughput advantages versus Intel x86
– Value: fewer cores, fewer footprints, and lower operating costs
• Virtualization out of the box
– Lower virtualization layer overhead on multi-threaded HANA workloads
– Value: fewer cores, fewer footprints, and lower operating costs
• Price performance
© 2014 IBM Corporation6
Power Enterprise Systems
6
Core
L2
Core
L2
Core
L2
Core
L2
Core
L2
Core
L2
Core
L2
Core
L2
Core
L2
Core
L2
Core
L2
Core
L2
L3 Cache & Chip Interconnect
8M L3
Region
Mem. Ctrl.Mem. Ctrl.
SM
P L
inks
Accelerato
rsS
MP
Lin
ksP
CIe
✓22nm SOI - Single Chip Module✓8, 10 or 12 cores per socket✓4 – 4.35 GHz✓Up to 8 threads per core✓Integrated eDRAM L3, L4 Cache✓Improved SMP topology✓Dual memory controllers✓Dual on-chip PCIe Gen3 I/O controllers
Open Innovation to put data to work
© 2014 IBM Corporation7
POWER8 Memory Cards
POWER8 Memory Cards
10 chips per rank for double chipkill
� POWER8 2U and 4U systems � Capacity:
• 16GB / 32 GB / 64 / 128 GB
� 1600 MHz� Memory Sparing� 8 Cards per socket � Low Profile
� POWER8 Enterprise systems � Capacity:
• 16GB / 32 GB / 64 / 128 GB
� 1600 MHz� Memory Sparing� 16 Cards per socket
© 2014 IBM Corporation8
MemoryBuffer
DRAMChips
DDR Interfaces
POWER8
Link
Scheduler &
Management
16MB
Memory
Cache
Intelligence Moved into Memory• Scheduling logic, caching structures• Energy Mgmt, RAS decision point
– Formerly on Processor– Moved to Memory Buffer
Processor Interface• 9.6 GB/s high speed interface• More robust RAS•“ On-the-fly” lane isolation/repair
Performance Value• End-to-end fastpath and data retry (latency)• Cache � latency/bandwidth, partial updates• Cache � write scheduling, prefetch, energy
POWER8 Memory Buffer Chip
© 2014 IBM Corporation1010
POWER8 Integrated PCI Gen 3
POWER7
I/OBridge
GXBus
PCIe G2
PCIDevice
PCIe G3
PCIDevice
Native PCIe Gen 3 Support• Direct processor integration• Replaces proprietary GX/Bridge• Low latency• Gen3 x16 bandwidth (16 Gb/s)
POWER8
x16x16
© 2014 IBM Corporation11
Scale-out Systems built with POWER8
• Leadership Scale out systems designed for data
• Complemented by a full commitment to support an open stack of software - Ubuntu, SUSE, RedHat, PowerKVM, and Open Stack
• New Capabilities (4Q2014)- 2 TB memory option in S824- Linux SUSE SLES12 (LE) enabling easier application portability- Mixed Endian VM support of a single PowerKVM host- S824L w/ NVidia GPU acceleration- NEBS option for Telco
S812L S822L S822 S814 S824L S824
• 1-socket, 2U• Linux only
• 2-socket, 2U• Up to 24 cores• 1 TB memory• 9 PCI Gen3 slot• Linux only• PowerVM &
PowerKVM
• NEBs option
• 2-socket, 2U• Up to 20 cores• 1 TB memory• 9 PCIe Gen 3• AIX & Linux• PowerVM
• NEBs option
• 1-socket, 4U• Up to 8 cores• 512 GB memory• 7 PCIe Gen 3• AIX, IBM i, Linux• PowerVM• 4 core/P05 for
IBM i
• 2-socket, 4U• Up to 24 cores• Linux
• NVidia GPU
• Annc/GA in
4Q14
• 2-socket, 4U• Up to 24 cores• 11 PCIe Gen 3• AIX, IBM i, Linux• PowerVM• 1 TB memory
• 2 TB memory
option
© 2014 IBM Corporation12
IBM Power 822L pricing comparison ($US) – vs. Ivy Bridge
Comparable TCA
Linux on Intel
Ivy Bridge +
VMware
Vs.
Linux on POWER8
+ with PowerVM
Dell PowerEdgeR720
HP ProLiant DL380 G8
IBM Power 822L
$28,366 $29,829 $29,264
Server list price*-3-year warranty, on-site
$12,605 $14,068 $14,895
Virtualization
- OTC + 3yr. 9x5 SWMA
$10,064
VMware vSphere Enterprise 5.1
$ 10,064
VMware vSphere Enterprise 5.1
$9,880
PowerVM for IBM PowerLinux
Linux OS list price
- RHEL, 2 sockets, unlimited
guests, 9x5, 3 yr. sub./ supp.
$5,697Red Hat subscription and Red Hat
support
$5,697Red Hat subscription and Red Hat
support
$4,489Red Hat subscription and IBM
support
Total list price:
(Total cost of acquisition)$28,366 $29,829 $29,264
Server model Dell R720 HP Proliant DL380p G8 IBM Power 822L
Processor / cores Two 2.7 GHz , E5-2697, Ivy Bridge, 12-core processors Two 3.4 GHz POWER8, 10-core
Configuration 64 GB memory, 2 x 300GB 15k HDD, 10 Gb two port Same memory, HDD, NIC
* Based on US pricing for Power S822L announcing on April 28, 2014 matching configuration table above. Source: hp.com, dell.com, vmware.com
© 2014 IBM Corporation13
Full Landscape Comparison of x86 vs. Power for SAP
13
VMware
HA Database
ECC 6,000 DB/24,000 App SAPS
DataECC App
Data
x86
Database
HA Database
BW 12,500 DB/12,500 App SAPS
DataDatabase
Prod
60,000
SAPS
Req’d
14 nodes, 29 OS instances,28 SAN ports, 28 Network ports
VMware
ECC
BW HA
PowerVM
MDM, PI, SM
Power
ECC, BW, MDM, PI Dev/Test
ECC HA
BW
PowerVM
Data
ECC, BW, MDM, PI, SM Sbx
ECC QA
BW QA
MDM, PI QA
MDM, PI, SM HA
VIO
S
VIO
S
VIO
S
VIO
S
2 Power S822s, 14 OS instances4 SAN ports, 4 Network ports
ECC HA QA
DB/App
DB/App
DB/App
DB/App
DB/App
DB/App
DB/App
WPAR
WPAR
WPAR
DB/App
DB/App
DB/App
DB/App
DB/App
DB/AppWPAR
WPAR
DB/AppBW HA QA
MDM, PI HA QA
BW App
MDM DB/App
ECC App
BW App
PI DB/App
VMware
ECC App
SM DB/AppMDM, PI, SM
2-tier 5,000 SAPS total
QA
Dev/Test, Sandbox,
ECC, BW, MDM, PI
VMware
DataNon-
Prod
15,000
SAPS
Req’d
Data
HA DatabaseData
Database
HA DatabaseData
Database
VMware
ECC App
BW App
PI DB/App
VMware
MDM App
SM DB/App
ECC
BW
Mobility only within clusterMobility only within cluster
No Mobility
No Mobility
Full Mobility
More to manageMore to manage
Less to manage
Good Reliability
Good Reliability Mission
Critical Reliability
Estimated street price $190K Estimated street price $106K
App
41,500 SAPS
2699v3 36c
90,120 SAPS @
5% overhead,
45% utilization
= 38,526 SAPS
© 2014 IBM Corporation14
Combining the architectural strengths of Power 795with the modularity & efficiencies of Power 770/780
*All statements regarding IBM's future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
Introducing the New Power Enterprise Servers
High performance compute nodes • Fastest processor in the industry
• Up to 48-cores* and 4 TB per node
Modular structure delivers efficiency and flexibility• Flexible growth up to 4 compute nodes *
• Improved space and energy efficiency
System design for High Availability and Serviceabilty• Isolated, fully redundant System Control Unit
• Enhanced Serviceability
• Enhanced memory and system reliability
Economic efficiency with Capacity on Demand flexibility and IFLs• Elastic Capacity on Demand
• Mobile Capacity for Power Enterprise Pools
© 2014 IBM Corporation15
Power E870 & E880 servers
• 8 to 128 cores @ 4.35 GHz • Up to 192 cores* in 2015 • 256 to 16TB Memory • 1 to 4 nodes (5U) per system • Built-in initial Elastic CoD days
*Statement of Direction to support up to 8TB memory on E870 and up to 4 PCIe I/O Expansion Drawers per node on Power E870 & E880 in 2015.
Initial GA in 4Q14 supports 4TB maximum on E870 and 0 or 2 PCIe I/O drawers per node on E870 & E880.
�Increased performance and scale
�Built-in PowerVM Enterprise Edition
�Enterprise RAS
�System Control Unit (2U)
�Built-in Active Memory Mirroring for Hypervisor
�More performance per-watt
�8 PCIe adapter slots per node
�Up to 4 PCIe I/O drawers per Node*
�Share resources in Power Enterprise Pool
�Medium Software tier
�24x7 Warranty
Power E880Power E870
• 8 to 80 cores @ 4.19 GHz • 8 to 64 cores @ 4.0 GHz• 256 to 4TB Memory (8TB SOD*)• 1 or 2 nodes (5U) per system
© 2014 IBM Corporation16 16
688,630
436,100
384,330
311,720
202,180
139,220 138,900115,870
90,12055,970
35,680
0
100,000
200,000
300,000
400,000
500,000
600,000
700,000
IBMPower 795256-core
DB2
IBMPower E870
80-coreDB2
IBMPower 795128-core
DB2
IBMPower 780
96-coreDB2
IBMPower 780
64-coreDB2
IBMPower 760
48-coreDB2
DellR920
60-coreASE
IBMPower S824
24-coreDB2
DellR730
36-coreASE
DellR720
24-coreASE
CiscoB200 M316-core
ASE
2-T
ier
SA
PS
SAP SD 2-tier Benchmark
Source: http://www.sap.com/benchmark/ All results are as of 10/10/2014
© 2014 IBM Corporation1717
Enterprise POWER8 system structure
CEC Node(s)System Control Unit
� 2U Form factor
� One per system (required)
� External FSP
� Clock / Oscillator Support
� Optional DVD
� 19” Form Factor
� Connect multiple nodes
� Required on all systems
PCIe I/O Expansion Drawer(s)
EXP24S I/O Drawer(s)
� 4U Form factor
� 12 PCIe Expansion Slots
� Each connects via 2 PCIe adapter slots in a node
� 5U form factor in an industry-standard 19” rack
� 8 Low-profile PCIe Gen3 I/O slots / node
� 32 DIMM slots / node
� 2U Form factor
� 24 SFF SSD/HDD
� Each connects via 2 SAS adapters
© 2014 IBM Corporation18
POWER8 Enterprise Systems
192 cores*1 – 4 node*, 4-16S* (12c)
Up to 16 TB Memory
80 cores @ 4.19 GHz1 – 2 node, 4 - 8S (10c)
Up to 8TB* Memory
64c @ 4 GHz1 – 2 node, 8S (8c)Up to 8TB* Memory
128* cores @ 4.35 GHz1 – 4 node*, 4-16S (8c)
Up to 16 TB Memory
Power E8809119-MHE
Power E8709119-MME
1 E880 with 3 or 4 nodes, 96 or 128 cores, respectively, will GA in 2Q 2015. *Statement of Direction. All statements regarding IBM's future direction and intent are subject to change or withdrawal withoutnotice, and represent goals and objectives only.
© 2014 IBM Corporation19
Accelerating Availability of POWER8 Enterprise Systems
Power E8809119-MHE
Power E8709119-MME
GA = 4Q14
64-cores8 TB
1 E880 with 3 or 4 nodes, 96 or 128 cores, respectively, will GA in 2Q 2015. *Statement of Direction. All statements regarding IBM's future direction and intent are subject to change or withdrawal withoutnotice, and represent goals and objectives only.
192 cores*1 – 4 node, 4-16S (12c)*
Up to 16 TB Memory
80 cores @ 4.19 GHz1 – 2 node, 4 - 8S (10c)
Up to 8TB* Memory
64c @ 4.02 GHz1 – 2 node, 4-8S (8c)Up to 8TB Memory
128 cores @ 4.35 GHz1 – 4 node, 4-8S (8c) 1
Up to 16 TB Memory1
© 2014 IBM Corporation20
POWER8 Enterprise Systems
Power E8809119-MHE
Power E8709119-MME
1H15
64-cores8 TB
Expand to 128 cores
1 E880 with 3 or 4 nodes, 96 or 128 cores, respectively, will GA in 2Q 2015. *Statement of Direction. All statements regarding IBM's future direction and intent are subject to change or withdrawal withoutnotice, and represent goals and objectives only.
192 cores*1 – 4 node, 4-16S (12c)*
Up to 16 TB Memory
80 cores @ 4.19 GHz1 – 2 node, 4 - 8S (10c)
Up to 8TB* Memory
64c @ 4.02 GHz1 – 2 node, 4-8S (8c)Up to 8TB Memory
128 cores @ 4.35 GHz1 – 4 node, 4-8S (8c) 1
Up to 16 TB Memory1
© 2014 IBM Corporation21
Active Memory Mirroring for Hypervisor
21
Standard on every Power E870 & E880
Hypervisor
Copy B
Hypervisor
Copy A
Memory• Maintains two identical copies of the system hypervisor in memory at all times
• Both copies are simultaneously
updated with any changes
• In the event of a memory failure
on the primary copy, the second
copy will be automatically invoked and a notification sent to IBM via
the Electronic Service Agent
(ESA)
Eliminates system outages due to uncorrectable errors in memory
© 2014 IBM Corporation22
Elastic COD credit days are built into every E880 server
22
• All new Power E880 (9119-MHE) systems come included with Elastic COD
processor and memory day credits• Fifteen (15) processor days for every active and inactive processor ordered
• Two hundred & forty (240) GBs of On/Off Memory days for every processor ordered
• Only available at time of initial ship or upgrade
• Elastic COD processor and memory days will be kept as a credit by IBM
• Usage of Elastic COD resources• No restrictions apply to how client uses the Elastic CoD resources
• TCoD contract must be signed prior to receiving TCoD Enablement code
• TCoD Enablement feature code is ordered via an MES
• Reports of usage of Elastic COD resources will be sent to IBM on monthly basis
• Usage will be debited to clients account on a quarterly basis and a report will be sent providing the COD account status
• Normal billing occurs when all the client’s COD processor and memory days credits are exhausted
Example : Purchase a Power E880 w/128 cores
Receive – 1,920 (15 days x 128 cores) processors days
Receive – 30,720GB (240GBs x 128 cores) memory days
Equals:
30 days of usage of all 64 inactive cores -or-
60 days of usage of 32 inactive cores - or -
120 days of usage of 16 inactive core , etc
- all with 16GBs of memory per day per core
© 2014 IBM Corporation23 Embargoed Until Announced
Open Innovation is redefining the art of the possible
© 2014 IBM Corporation24 Embargoed Until Announced
The OpenPOWER Foundation creates a pipeline of continued innovation and extends POWER8 capabilities
Analytics extracting patterns from large amounts of data run 8X faster leveraging NVIDIA
GPU technology borne of the OpenPOWER Foundation
• Opening the architecture and innovating across the full hardware & software stack
• Driving an expansion of enterprise-class hardware and software
• Building a complete sever ecosystem delivering maximum client flexibility
© 2014 IBM Corporation25
IBM and NVIDIA deliver new acceleration capabilities for analytics, big data, and Java
�Runs pattern extraction analytic workloads
faster
�Provides new acceleration capability for
analytics, big data, Java, and other technical
computing workloads
�Delivers faster results and lower energy
costs by accelerating processor intensive
applications
Power System S824L
•Up to 24 POWER8 cores
•Up to 1 TB of memory
•Up to 2 NVIDIA K40 GPU Accelerators
•Ubuntu Linux running bare metal
© 2014 IBM Corporation26
26
Coherent Accelerator Processor Interface (CAPI) Overview
CAPP PCIe
POWER8 Processor
FPGA
Fu
nctio
n n
Fu
nctio
n 0
Fu
nctio
n 1
Fu
nctio
n 2
CAPI
IBM Supplied POWER Service Layer
Typical I/O Model Flow
Flow with a Coherent Model
Shared Mem. Notify Accelerator
AccelerationShared Memory
Completion
DD CallCopy or PinSource Data
MMIO NotifyAccelerator
AccelerationPoll / Int
CompletionCopy or Unpin
Result DataRet. From DDCompletion
Advantages of Coherent Attachment Over I/O Attachment
� Virtual Addressing & Data Caching (significant latency reduction)
� Easier, Natural Programming Model (avoid application restructuring)
� Enables Apps Not Possible on I/O (Pointer chasing, shared mem semaphores, …)
© 2014 IBM Corporation27
• Eliminates 97% of instruction path
• Saves 10 Cores per 1M IOPs
Pin buffers, Translate, Map DMA, Start I/O
Application
LVM
Disk & Adapter DD
Read/WriteSyscall
strategy() iodone()
FileSystem
strategy() iodone()
Interrupt, unmap, unpin,Iodone scheduling
< 500Instructions
20K Instructions
ApplicationPosix Async
I/O Style API
User Library
Shared Memory Work Queue
aio_read()
aio_write()
Attach flash memory to POWER8 via CAPI coherent Attach
Possible Example: CAPI Attached Flash Optimization
© 2014 IBM Corporation29
Boards / Systems
I/O / Storage / Acceleration
Chip / SOC
System / Software / Services
Implementation / HPC / Research
OpenPOWER: Today