11
www. chameleoncloud.org MARCH 10, 2015 1 CHAMELEON: A LARGESCALE, RECONFIGURABLE EXPERIMENTAL ENVIRONMENT FOR CLOUD RESEARCH Principal Investigator: Kate Keahey Co-PIs: J. Mambretti, D.K. Panda, P. Rad, W. Smith, D. Stanzione 2 nd Joint Lab for Extreme Scale Computing (JLESC) Workshop November 24, 2014, Chicago, IL

CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

MARCH 10, 2015 1

CHAMELEON:    A  LARGE-­‐SCALE,  RECONFIGURABLE  EXPERIMENTAL  ENVIRONMENT  FOR  CLOUD  RESEARCH    Principal Investigator: Kate Keahey

Co-PIs: J. Mambretti, D.K. Panda, P. Rad, W. Smith, D. Stanzione

2nd Joint Lab for Extreme Scale Computing (JLESC) Workshop November 24, 2014, Chicago, IL

Page 2: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

SCALING  TO  THE  CHALLENGE  

Big Compute A wide range of data analytics

Big Data Data volume, velocity, and

variety

Big Instruments Cyber-Physical

Systems, Observatories

Programmable networks cheap, ubiquitous sensors and other emergent trends

Large  Scale  

Reconfigurability   Connectedness  

Engagement  

Page 3: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

CHAMELEON:  A  POWERFUL  AND  FLEXIBLE  EXPERIMENTAL  INSTRUMENT  � Large-­‐scale  instrument  

�  TargeFng  Big  Data,  Big  Compute,  Big  Instrument  research  �  ~650  nodes  (~14,500  cores),  5  PB  disk  over  two  sites,  2  sites  connected  with  

100G  network  � Reconfigurable  instrument  

�  Bare  metal  reconfiguraFon,  operated  as  single  instrument,  graduated  approach  for  ease-­‐of-­‐use  

� Connected  instrument  � Workload  and  Trace  Archive  �  Partnerships  with  producFon  clouds:  CERN,  OSDC,  Rackspace,  Google,  and  

others  �  Partnerships  with  users  

� Complementary  instrument  �  ComplemenFng  GENI,  Grid’5000,  and  potenFally  other  testbeds  

 

Page 4: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

TEAM  

Kate Keahey Chameleon PI Science Director

Joe Mambretti Programmable networks

DK Panda High-performance networks

Dan Stanzione Facilities Director

Warren Smith Director of Operations

Paul Rad Industry Liason

Page 5: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

CHAMELEON  HARDWARE  

SCUs  connect  to  core  and  fully  connected  to  each  other  

Heterogeneous  Cloud  Units  

Alternate  Processors  and  Networks  

Switch  Standard  Cloud  Unit  42  compute    4  storage  x10  

Chicago  

To UTSA, GENI, Future Partners

AusFn  Chameleon  Core  Network  

100Gbps  uplink  public  network  (each  site)  

Core  Services  3  PB  Central  File  

Systems,  Front  End  and  Data  Movers  

Core  Services  Front  End  and  Data  

Mover  Nodes  

504  x86  Compute  Servers  48  Dist.  Storage  Servers  102  Heterogeneous  Servers  16  Mgt  and  Storage  Nodes  

Switch  Standard  Cloud  Unit  42  compute    4  storage  x2  

Page 6: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

CAPABILITIES  AND  SUPPORTED  RESEARCH  

VirtualizaFon  technology  (e.g.,  SR-­‐IOV,  accelerators),  systems,  networking,  infrastructure-­‐level  resource  management,  etc.  

Repeatable  experiments  in  new  models,  algorithms,  pla`orms,  auto-­‐scaling,  high-­‐availability,  cloud  federaFon,  etc.    

Development  of  new  models,  algorithms,  pla`orms,  auto-­‐scaling  HA,  etc.,  innovaFve  applicaFon  and  educaFonal  uses  

Isolated  par,,on,  full  bare  metal  reconfigura,on    

Isolated  par,,on,  pre-­‐configured  environments  

Persistent,  reliable,  shared  clouds  

Page 7: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

 SOFTWARE:  CORE  CAPABILITIES  

Pre-configured Image Catalog Bare metal images

User-Deployed Clouds Persistent Clouds

OpenStack

Provisioning, Network, Scheduling and Orchestration

KaDeploy, KaVLAN, OAR2, (Grid’5000) Ironic, Neuron, Nova (OpenStack, Rackspace OnMetal) Orchestration: Nimbus, Interactive Experiment Management

Page 8: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

EXPERIMENT  WORKFLOW    � User  interface:  log  in,  manage  profile  � Find  Resources  

� Machine-­‐parsable,  fine-­‐grained  descripFon  � Versioning  (hardware  upgrades,  etc.)  � VerificaFon  (maintenance,  failures,  etc.)  

� Reserve  Resources  (browsing  vs  matching)  � Reconfigure  testbed    � Shape  experimental  condiFons  � Monitoring  and  metrics  

� Including  fine-­‐grain  and  energy  monitoring  � IntegraFon  with  workload  generators,  simulaFon,  etc.    

Page 9: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

OUTREACH  AND  ENGAGEMENT  

� Early  User  Program  � Commibed  users,  driving  and  tesFng  new  capabiliFes,  enhanced  level  of  support  

� Chameleon  Workshop  � Annual  workshop  to  inform,  share  experimental  techniques  soluFons  and  pla`orms,  discuss  upcoming  requirements,  and  showcase  research  

� Advisory  Bodies  � Research  Steering  Commibee:  advise  on  capabiliFes  needed  to  invesFgate  upcoming  research  challenges  

� Industry  Advisory  Board:  provide  synergy  between  industry  and  academia    

Page 10: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

PROJECT  SCHEDULE  

� Fall  2014:  FutureGrid  resources  at  UC  and  TACC  (Hotel  and  Alamo)  available  as  OpenStack  clouds    

� Spring  2015:  IniFal  bare  metal  reconfiguraFon  capabiliFes  available  on  FutureGrid  UC&TACC  resources  for  Early  Users  

� Summer  2015:  New  hardware:  large-­‐scale  homogenous  parFFons  available  to  Early  Users  

� Fall  2015:  Large-­‐scale  homogenous  parFFons  and  bare  metal  reconfiguraFon  generally  available    

� 2015/2016:  Refinements  to  experiment  management  capabiliFes,  higher  level  capabiliFes  

� Fall  2016:  Heterogeneous  hardware  available    

Page 11: CHAMELEON:** A*LARGE … · 2019. 9. 6. · cheap, ubiquitous sensors and other emergent trends LargeScale& Reconfigurability& Connectedness ... Mover*Nodes* 504x86ComputeServers

www. chameleoncloud.org

PARTING  THOUGHTS  

�  Large-­‐scale,  responsive  experimental  testbed  �  TargeFng  criFcal  research  problems  at  scale  �  Evolve  with  the  community  input  

�  Reconfigurable  environment  �  Support  use  cases  from  bare  metal  to  producFon  clouds  �  Support  for  repeatable  and  reproducible  experiments  

�  One-­‐stop  shopping  for  experimental  needs  �  Trace  and  Workload  Archive,  user  contribuFons,  requirement  discussions  

�  Engage  the  community  �  Network  of  partnerships  and  connecFons    with  scienFfic  producFon  testbeds  and  

industry  �  Partnerships  with  exisFng  experimental  testbeds  �  Outreach  acFviFes  

�  Come  visit  us  at  www.chameleoncloud.org!