24
Dan Nae Dan Nae California Institute of Technology California Institute of Technology The US LHCNet Project The US LHCNet Project ICFA Workshop, Krakow ICFA Workshop, Krakow October 2006 October 2006

Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Embed Size (px)

Citation preview

Page 1: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Dan Nae Dan Nae

California Institute of Technology California Institute of Technology

The US LHCNet ProjectThe US LHCNet Project

ICFA Workshop, KrakowICFA Workshop, KrakowOctober 2006October 2006

Page 2: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Who We AreWho We Are

A transatlantic network designed to support the A transatlantic network designed to support the LHCLHC and the U.S. HEP community and the U.S. HEP community

Funded by the US Funded by the US DoEDoE and and CERNCERN and managed and managed by by CaltechCaltech in collaboration with CERN in collaboration with CERN

Evolved from a network between US and CERN Evolved from a network between US and CERN which dates back to 1985which dates back to 1985

Our mission is to deliver a reliable network Our mission is to deliver a reliable network service to support the upcoming service to support the upcoming LHCLHC experiments at CERNexperiments at CERN

Designed to support the LHC three-tiered model Designed to support the LHC three-tiered model and to deliver data directly to/from the and to deliver data directly to/from the US Tier US Tier 1’s1’s ( (FNALFNAL and and BNLBNL))

Page 3: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

LHCnet ServicesLHCnet Services

We offer Layer 3 services (IP) and we peer with all We offer Layer 3 services (IP) and we peer with all major US research networks major US research networks

Layer 2 dedicated paths between Layer 2 dedicated paths between CERNCERN and the US and the US Tier1’sTier1’s

Layer 1 protected services coming soonLayer 1 protected services coming soon Redundant services using multiple paths across the Redundant services using multiple paths across the

AtlanticAtlantic Many layers of redundancy (equipment redundancy, Many layers of redundancy (equipment redundancy,

path diversity, collaborations with other research path diversity, collaborations with other research networks)networks)

Integrated monitoring with MonALISA *Integrated monitoring with MonALISA *

Page 4: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

US LHCNet Working Methods US LHCNet Working Methods

Production NetworkDevelop and build next generation networks

Networks for ResearchD0, CDF, BaBar, CMS, Atlas

GRID applications PPDG/iVDGL, OSG, WLCG,

DISUN

LHCOPN

Interconnection of US and EU Grid domains

VRVS/EVO

LHCNetLHCNet

High performanceHigh bandwidthReliable network

HEP & DoE Roadmaps

Testbed for Grid Development

Pre-ProductionN x 10 Gbps transatlantic testbed

New Data transport protocols

Interface and kernel setting

HOPI / UltraScience Net / Ultralight / CHEPREO /

LambdaSation

Lightpath technologies

Vendor Partnerships

Page 5: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

US LHCNetUS LHCNet

ALBALBATLATL

HUB

Aus.

Europe

SDGSDG

AsiaPac SEASEA

Major DOE Office of Science SitesHigh-speed cross connects with Internet2/Abilene

SNVSNV

Europe

Japan

USNet 10 Gbps circuit based transport. (DOE funded project)Major international

Production IP ESnet core, 10 Gbps enterprise IP traffic

Japan

Aus.

MANRings

10Gb/s

NYCNYC

CHICHI

LHCNet Data Network (10 Gb/s)

DCDC

GEANT2SURFNet

FNAL

BNL

≥ 2.5 Gb/s Connections to ESnet Hubs in New-York and Chicago Redundant “light-paths” to BNL and FNAL Redundant 10 Gbps peering with Abilene Access to USNet/HOPI for R&D

CERN

CMS

ATLAS

ESNET

Page 6: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

LHCNet configuration (October 2006)LHCNet configuration (October 2006)

Co-operated by Caltech and CERN engineering teams

Force10 platforms, 10GE WANPHY

New PoP in NY since Sept. 2005

10 Gbps path to BNL since April 2006

Connection to US Universities via UltraLight (NSF & university funded) backbone

Page 7: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

RSTP Backup and Load SharingRSTP Backup and Load Sharing

STP Root for NY VLANs

STP Root for Chicago VLANs

• Fast recovery time (enough to keep the CERN BGP peerings from reseting during a failure)

• But not so fast (sometimes they do)

• Cannot deal with flapping links

Page 8: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Future backbone topologyFuture backbone topology

LHCNet 10 Gb/s circuit (existing)LHCNet 10 Gb/s circuit (FY2007)

“Other” Circuits (IRNC, Gloriad, Surfnet)

Amsterdam

Geneva

GVA-CHI-NY triangleGVA-CHI-NY triangle New PoP in Amsterdam New PoP in Amsterdam

GEANT2 circuit between GVA and AMSGEANT2 circuit between GVA and AMS Access to other transatlantic circuits Access to other transatlantic circuits backup paths and additional backup paths and additional

capacitycapacity Connection to Netherlight, GLIF (T1-T1 traffic and R&D)Connection to Netherlight, GLIF (T1-T1 traffic and R&D)

New-York Chicago

Page 9: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

New Topology deploymentNew Topology deployment

Atlantic Ocean

NY 111 8th

Pottington

AC-2

NY60 Hudson

Highbridge (UK)

VSNL North

AMS-SARAAC-1 South

Whitesands

GVA-CERNFrankfort

VSNL

WAL

London Global Crossing

Qwest

Colt

GEANT

NY-MANLAN

CHI-Starlight

November 1st

In Production

In Production

January 2007

January 2007

Page 10: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Multiple Fiber Paths: Multiple Fiber Paths: Reliability Through DiversityReliability Through Diversity

CHI-Starlight

NY 111 8th

Bude (UK)Lyon-Paris

AC-2

NY - 60 Hudson Str

Highbridge (UK)

GVA-CERN

VSNL North

Basel-Frankfurt

CHI

AMS-SARA

AC-1South

Whitesands (UK)

NY-MANLAN

Atlantic Ocean

Buffalo-Cleveland Global Crossing

ColtGEANTCanarie or Qwest

Unprotected circuits (lower cost) Service availability from provider’s offers:

Colt Target Service Availability is 99.5% Global Crossing guarantees Wave Availability at 98%

Canarie and GEANT: No Service Level Agreement (SLA)

LCG Availability LCG Availability requirement: 99.95%requirement: 99.95%

Page 11: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Next Generation LHCNet:Next Generation LHCNet:Add Optical Circuit-Oriented ServicesAdd Optical Circuit-Oriented Services

Based on CIENA “Core Director” Based on CIENA “Core Director” Optical MultiplexersOptical Multiplexers Robust fallback, at the optical layerRobust fallback, at the optical layer Circuit-oriented services: Guaranteed Bandwidth Ethernet Private Line (EPL)Circuit-oriented services: Guaranteed Bandwidth Ethernet Private Line (EPL) Sophisticated standards-based software: Sophisticated standards-based software: VCAT/LCASVCAT/LCAS. .

Page 12: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

USLHCNet NOCUSLHCNet NOC The CERN Network Operation Center (NOC) The CERN Network Operation Center (NOC)

Delivers the first level support 24 hours a day, 7 days a week.Delivers the first level support 24 hours a day, 7 days a week. Watch out for alarms Watch out for alarms A problem not resolved immediately is escalated to the Caltech network engineering A problem not resolved immediately is escalated to the Caltech network engineering

team. team. USLHCnet engineers “on call” 24x7USLHCnet engineers “on call” 24x7

On site (at CERN) in less than 60 minOn site (at CERN) in less than 60 min Remote hand service at MANLAN and StarLight is available on a 24x7 basis with a four Remote hand service at MANLAN and StarLight is available on a 24x7 basis with a four

hour response time.hour response time.

Page 13: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Monitoring: http://monalisa.caltech.eduMonitoring: http://monalisa.caltech.edu

Operations & management Operations & management assisted by agent-based assisted by agent-based software (MonALISA)software (MonALISA)

500 TB of data sent from 500 TB of data sent from CERN to FNAL over the CERN to FNAL over the last two months last two months

Page 14: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

LHCNet Utilization LHCNet Utilization during Service Challengeduring Service Challenge

CERN-FNAL traffic during SC3 (April 2006)

Disk-to-Disk

Service challengeService challenge Achieving the goal of a production quality world-wide Grid that Achieving the goal of a production quality world-wide Grid that

meets the requirements of LHC experimentsmeets the requirements of LHC experiments Prototype the data movement services Prototype the data movement services Acquire an understanding of how the entire system performs Acquire an understanding of how the entire system performs

when exposed to the level of usage we expect during LHC when exposed to the level of usage we expect during LHC runningrunning

Page 15: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Circuit failure during SC2 Circuit failure during SC2

Page 16: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Additional SlidesAdditional Slides

Page 17: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

LHCNet configuration (2007)LHCNet configuration (2007)

Page 18: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

LHCNet connection to Proposed ESnet LHCNet connection to Proposed ESnet Lambda Infrastructure Based on National Lambda Infrastructure Based on National

Lambda Rail: FY09/FY10Lambda Rail: FY09/FY10

NLR wavegear sites

NLR regeneration / OADM sites

ESnet via NLR (10 Gbps waves)LHCNet (10 Gbps waves)

Denver

Seattle

Su

nn

yva

le

LA

San Diego

Chicago

PittsWash DC

Raleigh

Jacksonville

Atlanta

KC

Baton Rouge

El Paso - Las Cruces

Phoenix

Pensacola

Dallas

San Ant.Houston

Albuq. Tulsa

New YorkClev

Boise

CERN (Geneva)

LHCNet: To ~80 Gbps by 2009-10Routing + Dynamic managed

circuit provisioning

Page 19: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

UltraLightUltraLight

Page 20: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Pre-Production ActivitiesPre-Production Activities Prototype data movement servicesPrototype data movement services between CERN and the US between CERN and the US High speed disk-to-disk throughput developmentHigh speed disk-to-disk throughput development

New end-systems (PCI-e; 64 bit cpu; New 10 GE NICs)New end-systems (PCI-e; 64 bit cpu; New 10 GE NICs) New data transport protocols (FAST and others)New data transport protocols (FAST and others) Linux kernel patches; RPMs for deploymentLinux kernel patches; RPMs for deployment

Monitoring, Command and Control ServicesMonitoring, Command and Control Services (MonALISA) (MonALISA) ““Optical control plane” developmentOptical control plane” development

MonALISA services available for photonic switchesMonALISA services available for photonic switches GMPLS (Generalized MPLS); G.ASONGMPLS (Generalized MPLS); G.ASON Collaboration with Cheetah and Dragon projectsCollaboration with Cheetah and Dragon projects

Note: Equipment loans and donations; exceptional discountsNote: Equipment loans and donations; exceptional discounts

Page 21: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Milestones: 2006-2007Milestones: 2006-2007 May to September 2006:May to September 2006: Service Challenge 4 - Service Challenge 4 - completedcompleted August 2006:August 2006: Selection of telecom provider(s)Selection of telecom provider(s) from among from among

those responding to the call for tender - completedthose responding to the call for tender - completed October 2006:October 2006: Provisioning of new transatlantic circuits Provisioning of new transatlantic circuits Fall 2006:Fall 2006: Evaluation of CIENA platforms Evaluation of CIENA platforms

Try and buy agreementTry and buy agreement End 2006:End 2006: 1 1stst Deployment of Next-generation US LHCNet Deployment of Next-generation US LHCNet

Transition to new circuit-oriented backbone, Transition to new circuit-oriented backbone, based on optical multiplexers. based on optical multiplexers.

Maintain full switched and routed IP service Maintain full switched and routed IP service for a controlled portion of the bandwidth for a controlled portion of the bandwidth

Fall:Fall: Start of LHC operations Start of LHC operations

Page 22: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Primary Milestones for 2007-2010Primary Milestones for 2007-2010 Provide a robust network service without service Provide a robust network service without service

interruptions, throughinterruptions, through Physical diversity of the primary linksPhysical diversity of the primary links Automated fallback at the optical layerAutomated fallback at the optical layer Mutual backup with other networks Mutual backup with other networks

(ESnet, IRNC, CANARIE, SURFNet etc.)(ESnet, IRNC, CANARIE, SURFNet etc.) Ramp up the bandwidth, supporting an increasing number Ramp up the bandwidth, supporting an increasing number

of 1-10 Terabyte-scale flowsof 1-10 Terabyte-scale flows Scale up and increase the functionality of the network Scale up and increase the functionality of the network

management services providedmanagement services provided Gain experience on policy-based network resource Gain experience on policy-based network resource

management, together with FNAL, BNL, and the management, together with FNAL, BNL, and the US Tier2 organizations US Tier2 organizations

Integrate with the security (AAA) infrastructures of Integrate with the security (AAA) infrastructures of ESnet and the LHC OPNESnet and the LHC OPN

Page 23: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

Additional Technical Milestones Additional Technical Milestones for 2008-2010for 2008-2010

Targeted at large scale, resilient operation with Targeted at large scale, resilient operation with a relatively small network engineering teama relatively small network engineering team

2008: Circuit-Oriented services2008: Circuit-Oriented services Bandwidth provisioning automated (through the use of Bandwidth provisioning automated (through the use of

MonALISA services working with the CIENAs, for MonALISA services working with the CIENAs, for example) example)

Channels assigned to authenticated, authorized Channels assigned to authenticated, authorized sites and/or user-groupssites and/or user-groups

Based on a policy-driven network-management services Based on a policy-driven network-management services infrastructure, currently under developmentinfrastructure, currently under development

2008-2010: The Network as a Grid resource (2008-2010)2008-2010: The Network as a Grid resource (2008-2010) Extend advanced planning and optimization into the Extend advanced planning and optimization into the

networking and data-access layers. networking and data-access layers. Provides interfaces and functionality allowing physics Provides interfaces and functionality allowing physics

applications to interact with the networking resources applications to interact with the networking resources

Page 24: Dan Nae California Institute of Technology The US LHCNet Project ICFA Workshop, Krakow October 2006

ConclusionConclusion

US LHCNet: An extremely reliable, cost-effectiveUS LHCNet: An extremely reliable, cost-effective High Capacity Network High Capacity Network A 20+ Year Track RecordA 20+ Year Track Record

High speed inter-connections with High speed inter-connections with the major R&E networks and US T1 centers the major R&E networks and US T1 centers

Taking advantage of rapidly advancing network Taking advantage of rapidly advancing network technologies to meet the needs of the technologies to meet the needs of the LHC physics program at moderate costLHC physics program at moderate cost

Leading edge R&D projects as required,Leading edge R&D projects as required,to build the next generation US LHCNetto build the next generation US LHCNet