20
CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept CERN This document produced by Members of the Helix Nebula consortium is licensed under a Creative Commons Attribution 3.0 Unported License. Permissions beyond the scope of this license may be available at http://helix-nebula.eu/. The Helix Nebula project is co-funded by the European Community Seventh Framework Programme (FP7/2007-2013) under Grant Agreement no 312301

CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

CERN’s Scientific Programme and the need for

computing resources

Bob Jones

Head of openlab

IT dept

CERN

This document produced by Members of the Helix Nebula consortium is licensed under a Creative Commons Attribution 3.0 Unported License.

Permissions beyond the scope of this license may be available at http://helix-nebula.eu/. The Helix Nebula project is co-funded by the European Community Seventh Framework

Programme (FP7/2007-2013) under Grant Agreement no 312301

Page 2: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

2

CERN founded 1954: “Science for Peace”

~ 2300 staff

~ 1050 other paid personnel

~ 11000 users

Budget (2012) ~1000 MCHF

20 Member States: Austria, Belgium, Bulgaria, the Czech Republic, Denmark, Finland, France, Germany, Greece, Hungary, Italy, Netherlands, Norway, Poland, Portugal, Slovakia, Spain, Sweden, Switzerland and the United Kingdom

1 Candidate for Accession: Romania

8 Observers to Council: India, Israel, Japan, the Russian Federation, the United States of America, Turkey, the European Commission and UNESCO

Page 3: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

Community collaboration on an international scale

Tim Bell, CERN 3 OpenStack Conference, Boston 2011

Page 4: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

Accelerating Science and Innovation 4

Page 5: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

5

200-400 MB/sec

Data flow to permanent storage: 4-6 GB/sec

1.25 GB/sec

1-2 GB/sec

1-2 GB/sec

Page 6: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

CERN IT Department CH-1211 Genève 23

Switzerland www.cern.ch/it

6

The LHC Data Challenge

• The accelerator will run for 20 years

• Experiments are producing more than 20 Million Gigabytes of data each year

• LHC data analysis requires a computing power equivalent to ~100,000 of today's fastest PC processors

• Requires many cooperating computer centres, as CERN can only provide <20% of the capacity

The LHC Computing Model - Bob Jones June 2012

Page 7: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

• A distributed computing infrastructure to provide the production and analysis environments for the LHC experiments

• Managed and operated by a worldwide collaboration between the experiments and the participating computer centres

• The resources are distributed

– for funding and sociological reasons

• Our task was to make use of the resources available to us – no matter where they are located

May 2012 - Frédéric Hemmer

WLCG – what and why?

Tier-0 (CERN):

• Data recording

• Initial data reconstruction

• Data distribution

Tier-1 (11 centres):

•Permanent storage

•Re-processing

•Analysis

Tier-2 (~130 centres):

• Simulation

• End-user analysis

7

Page 8: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

0

100000000

200000000

300000000

400000000

500000000

600000000

700000000

800000000

900000000

1E+09

Jan-10

Feb-10

Mar-10

Apr-10

May-10

Jun-10

Jul-10

Aug-10

Sep-10

Oct-10

Nov-10

Dec-10

Jan-11

Feb-11

Mar-11

Apr-11

May-11

Jun-11

Jul-11

Aug-11

Sep-11

Oct-11

Nov-11

Dec-11

LHCb

CMS

ATLAS

ALICE

WLCG in 2011

22 PB data written in 2011 More than 6 GB/s to tape during HI run

Dec. 11

‣ CMS has used close to 100% of the Tier-1 pledge over all of 2011

‣ Mostly Reprocessing of Data and Simulation

‣ Expect the trend to continue in 2012

6

0%

20%

40%

60%

80%

100%

120%

140%

160%

1 2 3 4 5 6 7 8 9 10 11

Pe

rce

nta

ge

of

Utilize

d H

ou

rs

Month 2010

Usage Tier-1 Only ALICE Percentage Use

ATLAS Percentage Use

CMS Percentage Use

LHCb Percentage Use

Total Percentage Use

Status Data Re-Processing and MC Production12/01/11 6

Era Pass Events Total Size [TB]

Run2010A+B Apr21

Run2011A Apr13

Run2011A Apr22

Run2011A May3

Run2011A May7

Run2011A May10

Run2011A May13

Run2011A 16Jun2011

Run2011A 02Jul2011

Run2011A 05Jul2011

Run2011A 16Jul2011

Run2011A 05Aug2011

Run2011A 31Aug2011

Run2011A 16Sep2011

Run2011A 20Sep2011

Run2011A 03Oct2011

Run2011A 06Oct2011

Run2011A 21Oct2011

Run2011A 02Nov2010

Run2011A 08Nov2011

Run2011B 08Nov2011

Run2011A 15Nov2011

Run2011B 15Nov2011

Run2011B 19Nov2011

Run2011A 26Nov2011

Run2011B 26Nov2011

1,420,077,332 388.89

40,729,454 2.74

10,767,224 9.59

12,808,958 9.13

29,085,527 21.57

476,783,419 257.65

18,604,900 5.59

463,744 0.36

144,560,884 130.85

180,655,731 161.52

56,168,211 8.55

123,381,634 110.63

273,081 0.65

52,384,460 28.10

3,347,184 0.00

59,822,214 69.45

13,730 0.00

12,657,332 8.19

4,053,868 0.64

1,238,155,918 626.75

5,652,294 5.46

2,865,513 0.00

2,305,064 0.00

184,445,390 228.60

1,475,622 0.27

2,476,862 0.61

Total

Total Run2010A+B

Total Run2011A+B

4,084,015,550 2,075.79

1,420,077,332 388.89

2,663,938,218 1,686.90

0

750,000,000

1,500,000,000

2,250,000,000

3,000,000,000

Events

Total Run2010A+B Total Run2011A+B

0

500

1,000

1,500

2,000

Total Size [TB]

Total Run2010A+B Total Run2011A+B

In progress,

CMSSW_4_4

In progress,

CMSSW_4_4

In progress,

Higgs

Status: 11/30/2011

Dat

a R

epro

cess

ing

2011

Grid resources close to full occupation (plot shows use vs. pledges)

109 HEPSPEC-hours/month (~150 k CPU continuous use)

CPU Use continues to grow… even during technical stops

Page 9: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

How to evolve WLCG?

A distributed computing infrastructure to provide the production and analysis environments for the LHC experiments

• Collaboration - The resources are distributed and provided “in-kind”

• Service - Managed and operated by a worldwide collaboration between the experiments and the participating computer centres

• Implementation - Today general grid technology with high-energy physics specific higher-level services

Evolve the Implementation while preserving the collaboration & service

9

Page 10: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

CERN openlab in a nutshell

• A science – industry partnership to drive R&D

and innovation with over a decade of success

• Evaluate state-of-the-art technologies in a

challenging environment and improve them

• Test in a research environment today what will

be used in many business sectors tomorrow

• Train next generation of engineers/employees

• Disseminate results and outreach to new

audiences

CONTRIBUTOR (2012)

10

Page 11: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

A European Cloud Computing Partnership

big science teams up with big business

11

Email:[email protected] Twitter: HelixNebulaSC Website: http://www.helix-nebula.eu/

Page 12: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

Strategic Plan for a scientific Cloud Computing Infrastructure in Europe

Establish a sustainable multi-tenant cloud computing infrastructure in Europe

Initially based on the needs for the European Research Area & space agencies

Integrate commercial services from multiple IT industry providers

http://cdsweb.cern.ch/record/1374172/

Page 13: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

13

A Collaboration Initiative

European Cloud Computing

Strategy

User organisations

Demand-side

Commercial Service

Providers

Supply-side

European Commission & relevant projects

Bringing together all the stakeholders to establish a public-private partnership

Page 14: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

Timeline

• Deploy flagships

• Analysis of functionality,

performance &

financial model

• Success Stories

Set-up (2011)

Pilot phase (2012-2014)

Full-scale cloud service market

(2014 … )

• Agree strategic plan

• Select flagships use cases

• Identify service providers

• Define governance model

• More applications

• More services

• More users

• More service providers

co-funded by EC under grant 312301 with 1.8M€

Page 15: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

Pilot Phase Explore / push a series of perceived barriers to Cloud adoption via deployment of selected flagship applications:

Security: Unknown or low compliance and security standards

Reliability: Availability of service for business critical tasks

Data privacy: Moving sensitive data to the Cloud

Scalability / Elasticity: Will the Cloud scale-up to our needs

Network performance: Data transfer bottleneck; QoS

Integration: Hybrid systems with in-house / legacy systems

Vendor lock-in: Vendor dependency once data & applications are transferred to the Cloud

Business Models: Finding a win-win model for the suppliers and users

Legal concerns: liability, jurisdiction, intellectual property

Transparency: Clarity of conditions, terms and pricing

Page 16: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

Initial flagship use cases

Scientific challenges with societal impact

Sponsored by user organisations

Stretch what is possible with the cloud today

16

ATLAS High Energy

Physics Cloud Use

Genomic Assembly

in the Cloud

SuperSites Exploitation

Platform

To support the computing

capacity needs for the

ATLAS experiment

A new service to simplify

large scale genome

analysis; for a deeper

insight into evolution and

biodiversity

To create an Earth

Observation platform,

focusing on earthquake

and volcano research

Page 17: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

• Proof of Concept deployments within the Pilot Phase started January 2012

• Each flagship has been deployed with a series of providers independently

CERN, EMBL and ESA succeeded in deploying scientific applications each involving tens of thousands of jobs running at data centres

operated by Atos, CloudSigma and T-Systems

Flagship deployments

First results

17

Page 18: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

CERN-ATLAS flagship configuration

Monte Carlo jobs (lighter I/O)

10s MB in/out

~6-12 hours/job

Ran ~40,000 CPU days

Ramón Medrano Llamas,Fernando Barreiro, Dan van der Ster (CERN IT), Rodney Walker (LMU Munich)

Difficulties overcome

Different vision of clouds

Different APIs

Networking aspects

Page 19: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

Open to new members

Users

Service Providers

Adopters

Interested Parties

19 http://www.helix-nebula.eu

Page 20: CERN’s Scientific Programme and the need for computing ...€¦ · CERN’s Scientific Programme and the need for computing resources Bob Jones Head of openlab IT dept ... Tim Bell,

Accelerating Science and Innovation 20