27
Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden [email protected]

Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden [email protected]

Embed Size (px)

Citation preview

Page 1: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Reporting and Analysis of IBM’sCPU Measurement Facility Data

Philadelphia CMGSeptember 14, 2012

Bill [email protected]

Page 2: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

• Introduction• CPU Measurement Facility• Using the SMF 113 records

• Data from a 2097-706 (z10)• Relative Nest Intensity• Matching LSPR Workloads (LOW, AVG, HIGH)

• Data from a 2818-Y02 (z114)• MIPS Reasonability Check• CPU Capacity Planning Checklist• Conclusions• PerfTechPro Offer

Topics

Page 3: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

• CPU MF - 2012 Update and WSC Experiences for SHARE Atlanta Final.pdf

• CPU MF Webinar Part 1 May 2 Final.pdf• CPU MF Webinar Part 2 June 14 Final.pdf• Peter.Enrico.Exploring.SMF113.and.CPU.Counters.pdf• SA23-2260-02 The Load-Program-Parameter and the

CPU-Measurement Facilities.pdf• SA23-2261-01 IBM The CPU-Measurement Facility

Extended Counters Definition for z10 and z196.pdf

References

Page 4: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

• Introduced in z10 and later processors • New facility that provides hardware instrumentation data for production

systems• Two Major components

• Counters• Cache and memory hierarchy information • SCPs supported include z/OS and z/VM

• Sampling• Instruction time-in-CSECT

• New z/OS HIS started task• Gathered on an LPAR basis• Writes SMF 113 records

• New z/VM Monitor Records• Gathered on an LPAR basis – all guests are aggregated• Writes new Domain 5 (Processor) Record 13 (CPU MF Counters) records

• Minimal overhead

Introduction - CPU Measurement Facility

Page 5: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

• Counters are cumulative• Need to compute deltas between records

• Cut on the SMF accounting interval• Not the RMF interval

• Records pertain to a specific LPAR

Introduction – Using the SMF113 records

Page 6: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Data from a 2097-706 (z10)

Page 7: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Cycles Per Instruction

Page 8: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Level 1 Miss %

Page 9: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

% of L1 Misses Sourced from Memory

Page 10: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Instruction Sourcing Profile (of L1 Misses)

Page 11: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Relative Nest Intensity

Page 12: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

• Some Definitions:• L2LP = % from Level 2 Local cache (same book)• L2RP = % from Level 2 Remote cache (different book)• L3P = % from Level 3 cache• L4LP = % from Level 5 Local cache (same book)• L4RP = % from Level 4 Remote cache (different book) • MEMP = % sourced from Memory

• RNI - Reflects the distribution and latency of sourcing from shared caches and memory• For z10 EC and BC:

• RNI = (1.0*L2LP + 2.4*L2RP + 7.5*MEMP) / 100 • For z196 / z114:

• RNI = 1.6*(0.4*L3P + 1.0*L4LP + 2.4*L4RP + 7.5*MEMP) / 100

Relative Nest Intensity

Page 13: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

L1MP < 3% RNI>0.75 AVERAGERNI<0.75 LOW

3%<L1MP<6% RNI>1.0 HIGH0.6<RNI<1.0 AVERAGERNI<0.6 LOW

L1MP>6% RNI>0.75 HIGHRNI<0.75 AVERAGE

RNI-based LSPR Workload Decision Table

Page 14: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Matching LSPR Workloads (LOW, AVG, HIGH)

Page 15: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Data from a 2818-Y02 (z114)

Page 16: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Cycles Per Instruction

Page 17: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Level 1 Miss %

Page 18: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

% of L1 Misses Sourced from Memory

Page 19: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Instruction Sourcing Profile (of L1 Misses)

Page 20: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Matching LSPR Workloads (LOW, AVG, HIGH)

Page 21: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

RNI may be Load dependent

0:291:29

2:293:29

4:295:29

6:297:29

8:299:29

10:2911:29

0.00

0.50

1.00

1.50

2.00

2.50

3.00

3.50

Relative Nest IntensitiesSYSB (2818-Y02) SYSA (2097-706)

0:001:30

3:004:30

6:007:30

9:0010:30

12:0013:30

15:0016:30

18:0019:30

21:0022:30

0

200

400

600

800

1000

1200

MIPS UsedSYSB (2818-Y02) SYSA (2097-706)

Page 22: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

Relative Nest Intensity

Page 23: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

MIPS Reasonability Check

Page 24: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

• From CPU Table (LOW RNI)• GPP ‘MIPS’ = 5045.60• ‘MIPS’/GPP = 5045.60 / 6 = 841 ‘MIPS’/GPP

• For 9:59 AM which is a LOW RNI interval• 3.54 Cycles per Instruction• 4.4 Cycles per nanosecond (per GPP)• 4.4 / 3.54 = 1.24 Instructions per Nanosecond• 1.24 x 1,000,000,000 Instructions per Second• 1.24 x 1000 = 1240 MIPS/GPP

MIPS Reasonability Check

Page 25: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

• Collect CPU Measurement Facility Data for all ‘important’ LPARs

• For each LPAR:• Determine RNI-based LSPR workload mix

among LOW, AVG or HIGH• Determine MIPS values for current and

contemplated CPUs based on RITRs for RNI-based LSPR workload mix just determined• Use IBM’s zPCR to account for specific LPAR

configuration

CPU Capacity Planning Checklist

Page 26: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

• Great insights into hardware and workload relationship

• RNI May be load dependent• More accurate CPU capacity planning

Conclusions

Page 27: Reporting and Analysis of IBM’s CPU Measurement Facility Data Philadelphia CMG September 14, 2012 Bill Shelden Bill.Shelden@PerfTechPro.com

• Send us some of your data:• 24 hours of data• RMF 70, 71, 72, 73, and 75s• SMF 113s from one LPAR

• We will:• Return a set of charts in a PDF• Discuss your results on a WebEx

• Contact Bill Hart at [email protected]

PerfTechPro Offer