28
Cooling and thermal efficiently in the datacentre the datacentre George Brown HPC Systems Engineer

Cooling and thermal efficiently in the datacentre · 2012-11-29 · Cooling and thermal efficiently in the datacentre George Brown ... • Only twelve other institutions in the UK

  • Upload
    lyminh

  • View
    217

  • Download
    1

Embed Size (px)

Citation preview

Cooling and thermal efficiently in

the datacentrethe datacentre

George BrownHPC Systems Engineer

Viglen Overview

• Viglen Overview

• Products and Technologies

• Looking forward• Looking forward

Company Profile

• IT hardware manufacture, reseller & solution provider

• Public sector focus

• Intel Server Country Leader, Intel Cluster Ready Certified

• Supermicro Premier Partner

• NVIDIA Tesla Preferred Partner

• Xenon Networks - Accredited to HE National Maintenance Agreement for on-site maintenance (Xenon HQ in Altrincham)

• Consistently profitable, 35 years• Consistently profitable, 35 years

• Direct business model

• Quality Management System accredited to ISO 9001; 2008

• Infrastructure Library ISO20000 Accreditation

• March 2010 ISO 14001 Certification for our Environmental Management

System (controlling and improving our environmental performance)

• Annual growth of just under 30% (Sept 10)

BSI ISO

• Quality Management System accredited to ISO 9001

• Infrastructure Library ISO20000 Accreditation

• Viglen is the only PC manufacturer to achieve ISO 20000 certification (ITIL)

• Service Management Standards

• Prince 2 Methodologies adopted for Project Management

• Only twelve other institutions in the UK have reached this standardstandard

• Amber Valley Housing Ltd

• Cable & Wireless

• HBOS

• Lloyds TSB Group

• Logicalis Computing Ltd

• Logicalis UK Ltd

• London Borough of Newham

• Mitel Networks Limited

• Pirean Limited

• Samsung SDS Co., Ltd.

• Tamworth Borough Council

• University of Gloucestershire

• Viglen Ltd VHQ

National Approvals• Accredited on all UK higher/further education purchasing consortia

for desktop products including Inter-Regional Desktop Agreement

and Crescent Purchasing Consortium. Supply agreements with 100

HE/FE institutions.

• Number 2 position for total volume sales to IRDA

• Accredited to HE National Maintenance Agreement Framework

• Approved by the Treasury to the premier Catalist £multi-billion IT

public sector supply directory (formerly GCAT)• Client Devices• Client Devices• Peripherals• Network Infrastructure…• Peripherals/Operating systems/Office Productivity Tool/ Miscellaneous

Applications & database software/Network Software/Security & Maintenance Development/ Design & Testing Tools

• Supplier to Welsh Assembly & Scottish Parliament

• CPC – Laptops and desktops

• National Server Storage Agreement (NSSA)

• National Procurement Centre of Expertise

Intel Cluster Ready Production• Offer certified Intel Cluster Ready components that adhere to the

common Intel Cluster Ready Specification

• Rigorously tested for interoperability with other components

• Confidence that your cluster will work just as it should, right out of the box and will operate with registered Intel Cluster Ready Applications.

• Viglen provide detailed performance reports of all components, for all customer clusters (of more than 4 nodes) which include:

• CPU MFLOPS Performance (mflops_intel_mkl tests)

• Memory Bandwidth Stream Performance (memory_bandwidth_stream tests)

• Disk Cache and Read speeds (hdparm tests)• Disk Cache and Read speeds (hdparm tests)

• Network Latency and Throughput Performance (imb_pingpong_intel_mpi tests)

• Cluster Performance (hpcc tests)

• Remote Access on request.

Viglen Cluster Centre• Remote access for key customers

• Latest Generation Intel/AMD Quad Core HPC systems.

• GPU nodes with Tesla cards. (GPU Test Drive: AMBER, NAMD)

• Infiniband solutions based on Qlogic and Mellanox technologies.

• Storage technologies for IO sensitive application benchmarking.

• Early access to the latest technologies and the Viglen HX and HS HPC product range.

• Software Environments• Confirm codes/applications run smoothly on our cluster software, minimise the • Confirm codes/applications run smoothly on our cluster software, minimise the

transition phase to our systems and iron out problems before purchase.

• Cluster Toolkits enabling customers to optimise their codes, perform profiling and

make informed decisions on their desired cluster configuration.

• Windows/Linux Dual Boot Environment, Viglen GDD Power on/off nodes

• HPC Support Engineers• Experts from various scientific fields on hand to assist with code migration and

optimisation.

Projects and Services

• Viglen Update

• Projects and Services

• Looking forward

HPC Services

• ConsultancyAdvice and guidance on hardware, connectivity, software and setup.

• Viglen Cluster CentreTest and benchmark remotely on our HPC product range and software.

• Project ManagementManagement and control of your HPC project from order receipt to delivery and installation

HPC-Technology Configuration Centre • HPC-Technology Configuration Centre (ICR Production)Nodes stress tested and configured before deployment to ensure stability

• Installation and DeploymentHardware and software installation and configuration

• Warranty and SupportFlexible support and warranty options

Lancaster Framework

• 4 Year Framework with Lancaster. • Sole Supplier for 4 year to provide their high end computing facility.

• £1.2m overall worth, 800k initial spend.

• ~240 Compute Nodes

• Muti-Tiered Storage• Muti-Tiered Storage

• Fast Parallel Tier1

• Higher Capacity Tier2

• Platform Cluster Manager

University Of East Anglia

Site University of East Anglia

Project Goal 4 Year Frameworks to develop centralised HPC Facilities

HPC as a Service

UEA: 2000 2.66Ghz Cores with QDR IB Interconnect

Service Platform Cluster Manager

Failover login nodes

Windows HPC 2008 dual boot (UEA)Windows HPC 2008 dual boot (UEA)

Dynamic Dual Boot Platform/Viglen First:

Dynamically reprovison nodes with Windows/Linux based

on the demand in the queue

Accounting/Billing Monthly statistics provided on how many cpu seconds

UEA aiming to start running the cluster as a charable

service through priority queues/access.

GDD Green Data Centre: Beta site for Platform GDD. Shutdown

and power up nodes based on demand. All balanced

across power phases

EPCC - University of Glasgow

Department EPCC

Solution Dual boot cluster capable of dynamically allocating

Windows and Linux nodes.

Compute nodes; quad socket AMD 6276 with Compute nodes; quad socket AMD 6276 with

Chelsio 10Gbe Interconnect.

Room to expand computing power with GPUs in the

future.

Software Windows HPC, IBM Platform HPC, LSF

Gaia project – Hadoop Cluster

Department Astrology department

Solution Headnode HX425Hi

Data nodes HX412i

Qlogic QDR interconnectQlogic QDR interconnect

1 Petabyte of storage

Software Platform Cluster Manager

Platform Symphony

Turnkey cluster solutions

• Platform Cluster Manager

• LSF

• IBM acquisition completed earlier this year

Power in the data centre

• Power in the data centre

• Cooling solutions

• Accelerators and ARM• Accelerators and ARM

Power in the Datacentre

• Green Datacentre Daemons (GDD)

• Built in collaboration with UEA and IBM Platform

• Automatically power down nodes that are not in use

Power in the Datacentre

• Power management is becoming more intelligent

• Power aware schedulers

• CPU frequency scaling• CPU frequency scaling

• Sleep states

• TDP is still going up

Power in the Datacentre

80

100

120

140

160

E5-26xx

56xx

0

20

40

60

80 56xx

55xx

Power in the data centre

• Power in the data centre

• Accelerators and ARM

• Cooling solutions• Cooling solutions

Accelerators

• High performance per watt

• AMD 1.48 TFLOPS /375W = 3.9GFLOPS/W

• Nvidia K20 1.31 TFLOPS/235W = 5.57GLOPS/W5.57GLOPS/W

• Still has to be cooled along side traditional architecture

• Porting of code required

ARM

• Low power ~5 watts

• Performance improving

• 64 bit and advancements in double precision performance

• Facilitator to accelerators• Facilitator to accelerators

Looking forward

• Power in the data centre

• Accelerators and ARM

• Cooling solutions• Cooling solutions

Liquid Blade

Liquid Blade

• Fully immersed cooling with “Core Coolant”. Non- conductive, inert and x1350 the cooling capacity of air

• Capable of cooling “workstation only” processorsprocessors

• Cools components with greater efficiency than traditional solutions

• 8 blades in 5U

Liquid Blade

• CDU – Can pipe cooling fluid to radiator within rack, easily fitting into existing infrastructure

• External cooling – Can be connected into external cooling systems if available.

• External cooling – Can be connected into external cooling systems if available.

Liquid Blade

Liquid Blade

Thank You!

Web: http://www.viglen.co.uk/hpc

Email: [email protected]