15
Presented by Performance Evaluation and Analysis Consortium (PEAC) End Station Patrick H. Worley Computational Earth Sciences Group Computer Science and Mathematics Division

Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

Presented by

Performance Evaluation and AnalysisConsortium (PEAC) End Station

Patrick H. WorleyComputational Earth Sciences Group

Computer Science and Mathematics Division

Page 2: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

2 Worley_PEAC_SC07

The PEAC End Station provides the performance evaluation andperformance tool developer communities access to the LeadershipComputing Facility (LCF) systems.

Overview

Consortium goals

• Validate the effectiveness of performancemodeling methodologies

• Modify methodologies as necessary to improve their utility forpredicting resource requirements for production runs on LCFsystems

Performancemodeling

• Port performance tools to LCF systems and make themavailable to National Center for Computational Sciences (NCCS)users

• Further develop the tools to take into account the scaleand unique features of LCF systems

Performancetools

• Evaluate the performance of LCF systems using standardand custom micro-, kernel, and application benchmarks

Systemevaluation

Page 3: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

3 Worley_PEAC_SC07

All of this must be accomplished while adhering tothe “Golden Rules” of the performance community:

Overview (continued)

Consortium goals (continued)

• Provide access to other performance researchers who areinterested in contributing to the performance evaluation of theLCF systems or in porting complementary performance tools ofuse to the NCCS user community

• Provide access to application developers who wish to evaluatethe performance of their codes on LCF systems

Performanceandapplicationcommunitysupport

• Analyze performance• Help optimize current and candidate LCF application codes

Applicationanalysis andoptimization

• Low visibility (no production runs!)• Open and fair evaluations• Timely reporting of results

Page 4: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

4 Worley_PEAC_SC07

32 active users,39 active projects:

Status as of 8/28/07

Consuming: Contributing to:

• 13 applicationperformance analysisand optimization

• 8 system evaluation

• 8 tool development

• 6 infrastructuredevelopment

• 4 application modeling

• XT4: 1,168,000processor hours(exceeding 1,000,000processor-hourallocation)

• 1 refereed journalpaper

• 1 invited journal paper

• 6 refereed proceedingspapers

• 10 proceedings papers

• 2 book chapters

• Numerous oralpresentations

Page 5: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

5 Worley_PEAC_SC07

System evaluation

Application benchmarks from Chemistry (GAMESS), High EnergyPhysics (MILC), Seismic Processing (SEISMIC), Weather (WRF)

Computation, memory, and interprocess comm. benchmarksPurdueUniv.

APEX-MAP system characterization benchmark

Application benchmarks from Astrophysics (Cactus), Fluid Dynamics(ELBM3D), High Energy Physics (BeamBeam3D, MILC), Fusion (GTC),Materials Science (PARATEC), AMR Gas Dynamics (HyperCLaw)

Application benchmarks from Astrophysics (Chimera), Climate (CAM,CLM, FMS, POP), Combustion (S3D), Fusion (AORSA, GTC, GYRO,XGC), Molecular Dynamics (NAMD)

Lattice-Boltzman kernels and mini applications

Subsystem probes for system characterization needed for convolution-based performance modeling

SDSC

Computation, memory, interprocess comm., and I/O benchmarksORNL

Memory, interprocess communication, and I/O benchmarksLBNL

Page 6: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

6 Worley_PEAC_SC07

Performance tools

Performance model specification and verification frameworkModelingassertions

Scalable performance tool infrastructureMRNet

Performance analysis systemTAU

Performance analysis systemSvPablo

Scalable trace collection and analysis toolScalasca

Performance data collection infrastructurePAPI

MPI profiling infrastructurempiP

Tool suite for profile-based performance analysisHPCToolkit

Page 7: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

7 Worley_PEAC_SC07

Application performance analysis andoptimization

Combustion codePNEWT

Nuclear codeDeCart

Subsurface flow modelPFLOTRAN

Ab initio aeroacoustic simulations of jet and airfoil flowsPDNS3D/SBLI

Neutron transport codeNEWTRNX

Shape plasma version of GTC gyrokinetic turbulence codeGTC_s

Computational cosmologyGADGET

Framework application for core-edge transport simulationFACETS

AMR gas dynamics modelChombo

Page 8: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

8 Worley_PEAC_SC07

Application code scaling, optimization,and/or performance evaluation

Combustion modelS3D

Lattice-Boltzman applications-

Nuclear codeTDCC-9d

Coastal ocean modelPOLCOMS

Page 9: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

9 Worley_PEAC_SC07

System infrastructure

Reduction-based communication library-

Extension of C designed for high-performance computing onlarge-scale parallel systemsUPC

PVFS file system implementation on native Portals interfacePVFS/Portals

Toolset for numerical solution of PDEsPETSc

Runtime networking layer for UPC and Titanium compilersGASNet

Co-array Fortran compiler for distributed-memory systemscafc

Page 10: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

10 Worley_PEAC_SC07

Performance modeling

Genetic algorithm-based modeling of memory-boundcomputationsPMAC

LogGP model for MPI communication on the XT4

Reusable analytic model for wavefront algorithms, appliedto NPB-LU, SWEEP3D, and ChimaeraUniv. of Wisconsin

GTC fusion codeTexas A&M Univ.

NAS parallel benchmarks; HYCOM ocean codeORNL

Page 11: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

11 Worley_PEAC_SC07

I/O performance characterization (LBL)Dual vs. single core performance

evaluation using APEX-MAP (LBL)

Identifying performance anomalies (ANL) MPI performance characterization (ORNL)

Subsystem evaluations

7.066.735.565.47XT4 SN

13.713.010.710.7XT4

9.419.088.17.5XT3

1.811.46BG/L, VN

2.012.24BG/L

PeriodicPeriodic

8 Neighbors4 neighborsSystem

Ratio of time for all processes sending in halo updateto time for a single sender

Page 12: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

12 Worley_PEAC_SC07

Performance sensitivities (SDSC)Scalability optimizations (ORNL)

Application analyses and benchmarks

Porting andoptimizing new

applications(RENCI/NCSA)

Processing of genomesinto domain maps: needimproved load balancingthat takes into accountscale-free nature of thegraphs.

Page 13: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

13 Worley_PEAC_SC07

SvPablo sourcecode-correlatedperformance analysis (RENCI)

SCALASCAtrace-basedperformanceanalysis(FZ-Jülich,UTenn)

Tooldevelopment

mpiP callsiteprofiling(LLNL/ORNL)

Page 14: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

14 Worley_PEAC_SC07

Co-principal investigators

David BaileyLeonid Oliker

LawrenceBerkeleyNational

LaboratoryWilliam Gropp

ArgonneNational

Laboratory

Jeffrey VetterPatrick Worley (PI)

Oak RidgeNational

Laboratory

Bronisde Supinski

LawrenceLivermoreNational

Laboratory

Allan Snavely

University ofCalifornia–San Diego

Daniel Reed

Universityof NorthCarolina

JeffreyHollingsworth

Universityof Maryland

JohnMellor-

Crummey

RiceUniversity

Kathy Yelick

University ofCalifornia–Berkeley

Jack Dongarra

University ofTennessee

Barton Miller

University ofWisconsin

Allen Malony

Universityof Oregon

Page 15: Performance Evaluation and Analysis Consortium (PEAC) End ...GTC_s Shape plasma version of GTC gyrokinetic turbulence code ... BG/L, VN 1.46 1.81 BG/L 2.24 2.01 Periodic Periodic System

15 Worley_PEAC_SC07

Contact

Patrick H. WorleyComputational Earth Sciences GroupComputer Science and Mathematics Division(865) [email protected]

15 Worley_PEAC_0711

Barbara Helland

DOE Program ManagerOffice of Advanced Scientific Computing ResearchDOE Office of Science