26
ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein, Maxime Martinasso, ExCL team @ ORNL, ... ADAC Workshop February 15, 2018

ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

ADAC Federated TestbedCreating a Blueprint for Portable

EcosystemsSadaf Alam, Jeffrey Vetter, Mark Klein,

Maxime Martinasso, ExCL team @ ORNL, ...

ADAC WorkshopFebruary 15, 2018

Page 2: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Motivation and Use CasesADAC 1-4 workshops

January, 2016 June, 2016 January, 2017 July, 2017

Performance modeling

Performance tools

Emerging technologies

Emerging technologies & testbeds

Page 3: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

ExCL and CSCS experimental testbeds

TokyoTech, ORNL LCCF and CSCS evaluation systems

Page 4: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Bridging Gaps & Accelerating Innovations

Computer science research topics e.g. kernel, runtime, tools, etc.

T1T2

Production platforms

ADAC Experimental Testbeds

ADAC Benchmarking Testbeds

Applications, libraries, and algorithm development and tuning

Why?No

Now what?

Page 5: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Summary: Motivation & Use Cases● Enable evaluation of full ecosystem for emerging technologies

○ Including experiments requiring privileged access○ Hardware level testing○ Memory and storage hierarchies

● Share testbed resources○ Inventory of resources○ Federated access○ Coordinating installation of new hardware

● Share benchmarking and evaluation results

○ Portfolio of results○ Collection of tools with support matrix

Monthly calls + topic related discussions

Page 6: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Inventory(Work in Progress)

Page 7: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,
Page 8: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Inventory: CSCS Experimental Testbed● Existing infrastructure

○ Traditional with ○ Multiple generation Xeon based systems with GPUs○ InfiniBand and OPA interconnect and HCAs○ Power 8 ○ ARM8 nodes

● Upcoming○ AMD CPUs and GPUs○ Intel FPGAs○ SSD storage such as ○ To be announced ...

Page 9: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Inventory: Evaluation and Benchmarking Platforms● ORNL NCCS Open Resources (2-factor authentication)

a. ARM1 system https://www.olcf.ornl.gov/computing-resources/arm1/b. HPE-ARM system (system name unknown still)c. Cray XC40 Knights Landing (Percival) 168 nodesd. IBM Minsky (P8+/Pascal) (Summitdev) 54 node

● TokyoTecha. TSUBAME KFC

Page 10: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Challenges: Technical & Policy

Page 11: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Challenges● Policy: Governance and management

○ Application process○ Authentication and authorization○ Access policies

● Technical: System provisioning and scheduling (isolation and security)○ Innovative hardware ecosystem○ Virtualization○ Metal-as-a-service○ Access to external environments e.g. quantum systems thru cloud

Page 12: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

1

Application level benchmarking

3

Changes to installed drivers, low level power measurements

5Hardware & firmware changes

2

Changes to installed compilers and user tools.

4

Changes to OS, Kernel level experimentation

Management of Privileged Access

Privileged access and control

(managed by testbed providers)Monitoring and reporting

(managed by testbed engineers)

Page 13: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Lightweight Application Process for Accessing Testbeds

Page 14: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,
Page 15: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

ORNL XCAMS & CSCS account application is similar

Page 16: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

ArchitectureAddressing Experimental (incl. NDA) and

Evaluation Platform needs

Page 17: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Login Node

Bare-Metal Node Type A

Shared Node Type AShared Node Type B

Bare-Metal Node Type ABare-Metal Node Type ABare-Metal Node Type BBare-Metal Node Type CBare-Metal Node Type D

GatewayGatewayGateway

Special HW 1

Special HW 2

Special HW External

Exclusive Access Cluster

Management Server

Compute Nodes

Exclusive access to machines in this cluster. These nodes are only available once a VM has been launched from the web portal

Dedicated login node. Creates the idea of an integrated system

Hardware where virtualization is not possible. Only accessible from gateways

Hardware not located at ORNL. ie. quantum system

Virtual login nodes on top of VMs (not bare-metal). Only to ensure that 1 user is going to access other HW from here at a time. Gateway machines can have associated metadata to make them unique.

Same type of HW available in bare-metal, but shared. No VM needed. Direct access from login node. Multiple concurrent users

Web portal for bare-metal and gateway VMs management

ExCL 2.0 (ORNL)

IP/key based restricted

access

Page 18: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

CSCS Testbed ArchitectureOpenStack based bare-metal deployments Operation default for most

nodes is shared access (slurm w/container support).

Allowed users can re-provision a server into a dedicated resource for various test cases: VMs, Containers, or Bare Metal deployments. Clean up on release & return to shared mode.

Users to be able to provide own images for dedicated modes, with some pre-made available for ease of use.

NDA hardware can be hidden from certain users/groups (not operated in Shared mode, only available for Dedicated Provisioning)

Hardware connected to Monitored PDUs (for power measurements)

Page 19: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Proposal for Federated AAIEase of Access & Sharing of Resources

Page 20: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Concept Similar to OpenID Connect

Page 21: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

ADAC Federation with Trusted Identities● ADAC User and Project DB

○ ORNL ExCL and CSCS users initially○ Proof of concept for web based technologies

● Identity brokering with KeyCloak

ORNL registered user

CSCS OpenStack Platform

1. Access

2. Select ORNL IdP for authentication3. Redirected to ORNL

ORNL IdP

4. Authorized to access CSCS service

Page 22: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Status & Next Steps● Finalize federated AAI and central services decisions

○ Testing and implementation of Keycloak○ ADAC user and proj DB central management

● Introduce a landing page for ADAC testbeds ○ Links to other sites○ Inventory

● Tune testbed implementation for use cases○ Enable HPC features○ Metal-as-a-service○ Verify functionality of OpenStack services

■ Bare Metal (Ironic)■ VMs (Nova)■ Container Orchestration (Magnum)■ Resource Reservation (Blazar)

Vendors engagement is critical

Page 23: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

DOE Workshop on Extreme Heterogeneity

23-25 Jan 2018

Page 24: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,
Page 25: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Status● Gov shutdown forced cancellation of physical meeting● Moved to virtual meeting

○ Kept to original agenda (with some minor changes for timezones)○ Approximately 200 participants viewed plenary sessions!!

● Breakout groups converged on priority research directions

Page 26: ADAC Federated Testbed February 15, 2018 ADAC Workshop · 2020. 8. 4. · ADAC Federated Testbed Creating a Blueprint for Portable Ecosystems Sadaf Alam, Jeffrey Vetter, Mark Klein,

Status● Initial Priority Research Directions (Categories)

○ Programmability and Software Development Productivity○ Managing Execution, Scheduling○ Correctness, Debugging, Reproducibility○ Modeling and Simulation for Performance, Power, Resiliency

● Deliverables○ Slides w/ PRDs be the end of the workshop / Done○ Brief report for HQ by Mar 1○ Final public report by May 1