View
214
Download
0
Category
Preview:
Citation preview
1
Intel® Xeon® Processor E7-8800/4800/2800 Product Families
2
Which Intel® Xeon® Processor Family is Best for Which Workload?
Mid
/S
mall
er
Wo
rklo
ad
s
Larg
er
Wo
rklo
ad
s
More/Larger Instances/Users
Fewer/Smaller Instances/Users
Xeon E7 4S
Servers
2S Volume Servers
Workload / Usage
Business Processing (DB, ERP, CRM, Batch)1
Decision Support (Data Warehouse, Business Intelligence)
Large-Scale Consolidation (Including Virtualization & Multi-tier)
Application Development
High Performance Computing
Collaboration
Web Infrastructure
IT Infrastructure
Development/Quality Assurance
1 For directional guidance only. This is not a server selection guide. Actual server sizing is a relatively complex effort involving workload characterization including such considerations as type of application, size of workload, number of users, type of transaction, SLA response times, targeted utilization, level and estimation accuracy of workload baseline/peak/growth, physical or managerial constraints, need to maintain a single state at all times to ensure all users see the same results at any given time, cost to migrate to a scale-out alternative, and system availability requirements.
3
Intel® Xeon® Processor E7 Family Delivers More Performance, Expandability and RAS
Performance to handle any workload Expandable & Scalable across processor, memory & I/O Increased Efficiency through processor & memory energy saving technologies Data Protection & Reliability to support mission critical applications
CORE 1 CORE 2
CORE 3 CORE 4
CORE 5 CORE 6
CORE 7 CORE 8
CORE 9 CORE 10
Xeon E7
CACHE
RAM
4
2-socket 4-socket 8-socket
Intel® Xeon® Processor E7 Family Scalability to Support 2S, 4S, 8S and Beyond
Xeon® CPU Socket
Memory
I/O Hub
Intel QuickPath Interconnect
3rd partry Node Controller (non-Intel)
OEM interconnect ...
1 Additional Configurations via OEM-specific scaling technologies (up to 256-sockets)
>8S1
(3rd party node controller)
5
Intel® Xeon® Processor E7 Family Server Refresh Benefits
1 Source: Intel measurements as of March 2011 of Xeon E7-4800 and dual 4-socket systems. Performance comparison using SPECint_rate_base2006. Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. For detailed calculations, configurations and assumptions refer to the legal information slide in backup.
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.
2011 2006
18 Racks of Intel® Xeon® Dual Core
Servers
Up to 18X Performance1
or Up to 93% Estimated
Annual Energy Reduction
AND Latest Features: Intel Virtualization, Security,
Power Management, Reliability…
1 Rack of Intel® Xeon®
Processor E7-4800 Product Family
Servers
5
6
Memory Power Efficiency Integrated Power Gates1
Intel® Xeon® Processor E7 Family Energy Efficiency Enhancements
Enables idle cores to go to near zero power independently
LV-DIMM support
Lower power memory buffers
Automatic Operation or Manual Core Disable2
Significant Power Savings on Large Memory Systems3
Core0 Core2 Core10
Voltage (cores)
Voltage (rest of processor)
NEW!
Core1 .... E7-4800 E7-4800
E7-4800 E7-4800
NEW!
NEW!
1. Integrated power gates (C6) requires OS support
2. Requires BIOS setting change and system reboot
3. Depending on the workload and the configuration being run @ 100% SPECpower load it can save ~0.8W for 4GB DIMM DRx8
4. Based on SQL Server* database benchmark performance testing - http://attachments.wetpaintserv.us/jD7st0SRn16bZhoLImoZoQ505396.
Up to 27% System Idle Power Consumption & 50% PPW Improvement4
7
Intel® Xeon® Processor E7 Family Faster, Stronger Security
Intel® Trusted Execution Technology (Intel® TXT)
Hypervisor
VM VM
TXT Platform
VM
Malicious Software Detection
Intel® Advanced Encryption Standard New Instructions
(Intel® AES-NI)
Decrypt 3.6X Faster
with Intel® AES-NI
http://www.oracle.com/ocom/groups/public/@oco
m/documents/webcontent/452957.html?iframe=tr
ue&width=670&height=410
8
Intel® Xeon® Processor E7 Family Complete Portfolio of RAS Support for Highly Available Deployments
Reduce circuit-level errors
Detect and correct data errors
Contain uncorrected errors
• Parity Checking And ECC
• Memory Thermal Throttling
• Memory Demand & Patrol Scrubbing
• Corrupt Data Containment Mode
• Viral Mode • Intel QPI Protocol Protection Via CRC (8bit Or 16bit
Rolling)
Maintain partitions instead of systems
Proactively replace failing components
Heal failing data connections
Support redundancy and failover for key system components
Recover from uncorrected data errors
Help predict failures before they happen
• MCA Recovery With SW layer Support
• Intel® SMI Lane Failover
• Intel® SMI Clock Fail Over
• Intel® SMI & QPI Packet Retry
• QPI Clock Fail Over
• QPI Self-healing
• Enhanced DRAM Single Device Data Correction
• Enhanced DRAM Double Device Data Correction
• Fine-Grained Memory Mirroring
• Memory DIMM And Rank Sparing
• Dynamic CPU And Memory Migration
• Electronically Isolated (Static) Partitioning
• MCA Error Logging (CMCI) With OS Predictive Failure Analysis
• Memory Board Hot Add
• OS Memory On-lining*
• CPU Board Hot Add At QPI
• OS CPU On-lining
9
HW Un-correctable Errors
Machine Check Architecture Recovery Allows Recovery From Otherwise Fatal System Errors
Normal Status With Error Prevention
System Recovery with SW
Error Corrected
Error Detected1
Error Contained
HW Correctable Errors Un-correctable Errors
System works in conjunction with OS,
VMM, or DBMS to recover or restart
processes and continue normal operation
Bad memory location flagged so data will not
be used by OS or applications
Error information passed to SW
layer
MCA Recovery
1 Errors detected using Patrol Scrub or Explicit Write-back from cache
10
Machine Check Architecture Recovery Extensible Error Recovery Architecture with Growing
Software Industry Support
1Errors detected using Patrol Scrub or Explicit Write-back from cache
Uncorrectable data errors isolated and corrected by OS1
Affected application may
require restart
System remains up and running
Window Server* 2008 R2 +
RHEL* 6 +
U8+
SLES11* SP1 +
Uncorrectable data error isolated within the DBMS buffer pool1
Affected (non-critical) buffer is transparently
reloaded from disk
System and DBMS remain up and running
HANA IMDB 1.0 SP3+
* Other names and brands may be claimed as the property of others
vSphere* 5.0+
RHEL* 6 + - KVM
SLES11* SP1 +
Window Server* Hyper-V
Microsoft SQL-Server 2012
Uncorrectable data error isolated to a single VM /
guest OS1
Affected VM may require restart
System and all other VMs remain up and
running
11
Dramatically Shifting In-Memory Computing SAP HANA* & Intel® Xeon® Processor E7 Family
SAP HANA* 1.0 Performance Comparison: Intel® Xeon® Processor E7 Family vs
Intel Xeon Processor 7500 Series
SAP HANA*
Real-time business insight
Better decisions more quickly
Greater analytic flexibility
Near-perfect scaling on Intel
Xeon
processor E7 family
50
500
5000
1 2 4 8
Ru
nn
ing
tim
e (
s)
Socket Count
SAP HANA* Scalability
Customer Workload
Ideal
8S Glueless
0
50,000
100,000
150,000
200,000
250,000
300,000
288,873
210,471
Ru
nn
ing
Tim
e [
ms]
Running time [ms]
Intel Xeon Processor
7500 Series
Intel Xeon Processor E7 Family
1.37x Faster
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using
specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests
to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. Source: http://www.intel.com/content/www/us/en/high-
performance-computing/high-performance-computing-xeon-e7-analyze-business-as-it-happens-with-sap-hana-software-brief.html
12
Intel® Xeon® Processor E7 Family New Era of Mission Critical Computing
As of June 7, 2012: 4-socket server results on the TPC-H *1000GB non-clustered benchmark is basis for best published SPARC* T4-4 comparison to Intel Xeon processor E7 family. 4-socket server results on the SPECjbb*2005 benchmark is basis for best published POWER* 750 Express comparison to Intel Xeon processor E7 family. Configuration details in the Notes Page and Backup slides. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance.
Reduce Costs with Interoperable, Industry-standard Intel Xeon Processors
SPARC* T4-4
POWER 750 Express*
Up to
2X Performance
Up to
1.12X Performance
at
19% Price/Performance
at
19% System Cost
4-Socket Intel Xeon
E7-8837
4-Socket Intel Xeon
E7-4870
Intel Xeon E7 vs. SPARC*
Intel Xeon E7 vs. Power*
13
Introducing New DCG Processor Numbering
Intel® Xeon® processor E# – # # # # v#
Processor SKU (i.e. 10, 20, 30, etc…)
Prod Line (E3, E5, E7)
Wayness, maximum number of CPUs in a node (1, 2, 4, 8)
Version (v2, v3, v4, etc)
Brand
Socket/segment designation (2, 4, 6, 8)
Alpha Suffix
Description
L Low Power
‘Low Power’ SKUs (after 4 digit numeric set):
Prod Family
2011 Example above (socket changes over time, designator does not)
Designator Actual Socket
8 LS (Westmere EX)
6 R (Sandy Bridge)
4 B2 (Sandy Bridge)
2 H2 (Sandy Bridge)
14
Best Performance (Top Bin)
E7-2860 2.26GHz / 24M / 6.4GT/s
E7-2830 2.13GHz / 24M / 6.4GT/s
E7-4820 2GHz / 18M / 5.86GT/s
Low Voltage
E7-4830 2.13GHz / 24M / 6.4GT/s
Ad
van
ced
S
tan
dard
E7-4807 1.86GHz / 18M / 4.8GT/s
Freq Optimized
E7-2870
2.4GHz / 30M / 6.4GT/s E7-8870
2.4GHz / 30M / 6.4GT/s
E7-8860 2.26GHz / 24M / 6.4GT/s
E7-2803 1.73GHz / 18M / 4.8GT/s
E7-2820 2GHz / 18M / 5.86GT/s
2S only. Not scalable
E7-8837 2.67GHz / 24M / 6.4GT/s
E7-2850 2GHz / 24M / 6.4GT/s
E7-4850 2GHz / 24M / 6.4GT/s
X6550 2Ghz / 18M / 6.4GT/s
E6540 2Ghz / 18M / 6.4GT/s
X7560 2.26Ghz / 24M / 6.4GT/s
X7550 2Ghz / 18M / 6.4GT/s L7555
1.86Ghz / 24M / 5.86GT/s
E7530 1.86Ghz / 12M / 5.8GT/s
E7520 1.86Ghz / 18M / x4.8GT/s
X7542 2.66Ghz / 12M / 5.86GT/s
L7545 1.86Ghz / 18M / 5.86GT/s
E7540 2Ghz / 18M / 6.4GT/s
E6510 1.73Ghz / 12M / 4.8GT/s
10 cores
8 cores
6 cores
4 cores
2S only. Not scalable
Basic
E7-8867L 2.13GHz / 30M / 6.4GT/s
E7-4800 Product Family
E7-8800 Product Family
E7-2800 Product Family
Xeon 7500 Series
Xeon 6500 Series
E7-8800 Product Family
Xeon 7500 Series
Xeon 7500 Series
E7-x800 Product Family
E7-4870
2.4GHz / 30M / 6.4GT/s
E7-4860 2.26GHz / 24M / 6.4GT/s
E7-8850 2GHz / 24M / 6.4GT/s
E7-8830 2.13GHz / 24M / 6.4GT/s
E7-8800 Product Family
Xeon 7500 Series
Product Families Transition Matrix Intel® Xeon® Processor 7500 Series Intel Xeon Processor E7 Family
15
Advanced Data Protection and
Reliability
World-Class Economics
Performance Leadership
Intel® Xeon® Processor E7 Family Delivers More Performance, Expandability and RAS
Intel “inside” SAP
17 17
Intel® Xeon® Processor E7-8800/4800/2800 Product Families
Building on Xeon® 7500 Leadership Capabilities
• More performance within same max CPU TDP as Xeon 7500
• Lower partial active & idle power via Intel Intelligent Power Technology2
• Support for Low Voltage-DIMMs3
• Reduced power memory buffers4
More Efficient
• Supports 32GB DDR3 DIMMs (2TB per 4-socket system)1
More Expandable
More Security & RAS
• 10 cores / 20 threads
• 30MB of last level cache
More Performance
E7-4800 E7-4800
E7-4800 E7-4800
SECURITY
• Intel® Advanced Encryption Standard-New Instructions
• Intel® Trusted Execution Technology (TXT)
RELIABILITY, AVAILABILITY, SERVICEABILITY
• Enhanced DRAM Double Device Data Correction
• Fine Grained Memory Mirroring
8.6x for 1->10 cores
21% by HT
7% by NUMA
Memory Error
Recovery
1TB certified
on 8s
4% by Turbo Mode
6.33x for 1s->8s
18 18
Performance and Feature Enabling of exisiting and future
server platforms
Adaptation of ISA extension like SSE4.x, AVX2, AVX3
Scalability improvements for various usage scenarios
Intel Decimal Floating-Point Library
Drive co-innovation by partnering during u-arch, platform,
and Storage Product Definition cycles
Leverage Xeon-EX RAS feature for DB memory-error recovery
SAP is silicon definition partner for AVX2 and AVX3
Develop and deploy Intel Tools product portfolio w/ scaling
into E-prise segment
Development of Intel Performance Counter Monitor (Intel PCM)
Adoption of Intel VTune Amplifier XE
Intel Engineering Engagement Co-Engineering since 2005 (TREXBIAHANA)
19 19
Intel Engineering: MC Enabling Xeon E7 is reference platform for HANA
Incl. launch proof-point with 1.37x speed-up over prev. gen
Original implementation did not scale to 40 cores ≠ power of 2
1.21x due to HT
Great scalability on 8-socket glue-less reference design
Scaling 6.33x for 1s->8s
after SSG optimization
50
500
5000
1 2 4 8
Ru
nn
ing
tim
e [
s]
# of sockets
SAP HANA Scalability customer workload
8s glueless
perfect
20 20
Intel Engineering: SSE and µ-arch Adaptation of state-of-art
microprocessor ISA extension like SSE4.x
filtering of a result set by a selection benefits 2x speedup
7x faster hash function
3.5x faster implementation of bitvector operations
PoC using IPP features 1.15x (short strings) 10x (long strings)
Pre-enabling of Haswell new instructions
Mutual benefit w/ strategic roadmap alignment
Application traces on future Intel microprocessor architecture
5 micro-kernel ported to AVX2 and beyond
Pre-silicon performance simulations
Invaluable input for IA instructions set POR
0,00
1,00
2,00
3,00
4,00
5,00
6,00
Sp
eed
-Up
Bitvector Function
Bitvector Optimizations
21 21
Memory Error Recovery Xeon-EX platform RAS feature for HA in HANA 1.0 production code:
HW patrolling mechanism allows to detect memory failures and application to recover from (“early kill”)
Memory MTBF becomes increasingly important with xTB In Memory DB
EX feature, previously seen only in RISC, mainframe, and Itanium-based systems
Application-level recovery unique in industry on Xeon platform
Demos at IDF, ISMC, WSM launch, YouTube,...
Memory-error recovery when app is accessing corrupted data (“late kill”)
• Challenge: repair data in memory that is in use
– “changing tires while the car is driving“
• Integrated in HANA 1.5
Memory error recovery demo GUI
22 22
Intel Performance Counter Monitor
Intel Performance Counter Monitor
(Intel PCM)
Developed by SAP onsite team
Monitors fundamental performance metrics in real-time
Cache pressure incl. L3, inter-socket and memory traffic (unique on Intel Xeon 5500 and E7 processor series)
Developed API and GUI on Linux+Windows
Deployment and Scaling with:
Integrated and shipping with NewDB product
Published on Intel Software Network
Countless requests by other ISVs and OEMs
Intel Amplifier VTune standard tool for NewDB dev
– Scale perf tuning to all NewDB developers
Intel PCM plugs in Windows PerfMon
Intel PCM showing mem and QPI traffic
23 23
Ресурсы
• Онлайн:
– ark.intel.com
– ru.intel.com
• Оффлайн:
– Презентации
– Дополнительные материалы
• Прямой контакт:
– Планы выпуска (roadmap)
– Консультации
– Конкурентный анализ
Recommended