Moving Data Over Networks - nersc.gov · PDF fileMoving Data Over Networks Network-Based Data...

Preview:

Citation preview

MovingDataOverNetworksNetwork-BasedDataTransferatNERSC

EliDart,NetworkEngineerESnetScienceEngagementLawrenceBerkeleyNaAonalLaboratory

NERSCUsersGroupTraining

Berkeley,CA

February24,2016

Outline

•  Context•  ScienceDMZoverview

•  DataTransferNodes•  HandofftoShreyasCholia

2 – ESnet Science Engagement (engage@es.net) - 2/24/17

ScienceNetworksforScience

•  TheglobalResearch&EducaAon(R&E)networkecosystemiscomprisedofhundredsofinternaAonal,naAonal,regionalandlocal-scalenetworks–eachindependentlyownedandoperated.

•  ThesenetworksarepartofandconnectedtotheInternet,butareengineeredspecificallyforhigh-performancescienAficapplicaAons

February24,2017 33 – ESnet Science Engagement (engage@es.net) - 2/24/17

•  DatatransfersbetweenresourcesconnectedtoR&Enetworkscandomuchbe\erthandatatransferswhichusethecommodityInternet–  Terabytesarenoproblem–  Petabytesarefeasible

•  Justneedtomakesurewedoacoupleofthings–  LongdistanceporAonsworkwellingeneral–  Large-scalecompuAngcentersworkwellingeneral–  LocalconfiguraAonisreallyimportant

•  NERSChashigh-performancedataresources–  Fastnetworks–  Fastsystemsandfilesystems

•  ThistalkwilldescribewhatyoucandotointerfacewithNERSCeffecAvely

Effec4veHighPerformanceDataTransfer

4 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

•  NetworksareanessenAalpartofdata-intensivescience–  Connectdatasourcestodataanalysis–  Connectcollaboratorstoeachother–  Enablemachine-consumableinterfacestodataandanalysisresources(e.g.portals),automaAon,scale

•  PerformanceiscriAcal–  ExponenAaldatagrowth–  Constanthumanfactors–  Datamovementanddataanalysismustkeepup

•  EffecAveuseofwidearea(long-haul)networksbyscienAstshashistoricallybeendifficult

•  Someofthisisforyoursystemadministrator–  Pointyoursysadmintoh\p://fasterdata.es.net/formoreinfo–  Feelfreetofollowupwithmelater–engage@es.net

Mo4va4on

5 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

TheCentralRoleoftheNetwork

•  Theverystructureofmodernscienceassumessciencenetworksexist:highperformance,featurerich,globalscope

•  Whatis“TheNetwork”anyway?–  “TheNetwork”isthesetofdevicesandapplicaAonsinvolvedintheuseofaremoteresource•  Thisisnotaboutsupercomputerinterconnects•  Thisisaboutdataflowfromexperimenttoanalysis,betweenfaciliAes,etc.

–  Userinterfacesfor“TheNetwork”–portal,datatransfertool,workflowengine–  Therefore,serversandapplicaAonsmustalsobeconsidered

•  Whatisimportant?Orderedlist:1.  Correctness2.  Consistency3.  Performance

6 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

TCP–UbiquitousandFragile

•  NetworksprovideconnecAvitybetweenapplicaAonsrunningonhosts–  FromanapplicaAon’sperspecAve,theinterfaceto“theotherend”isasocket

–  HostoperaAngsystemkernelprovidessocketinterface,kernelimplementsTCPwheretheapplicaAoncan’tsee

–  CommunicaAonisbetweenapplicaAons–mostlyoverTCP

•  TCP–thefragileworkhorse–  TCPis(forverygoodreasons)Amid–packetlossisinterpretedascongesAon

–  Likeitornot,TCPisusedforthevastmajorityofdatatransferapplicaAons(morethan95%ofESnettrafficisTCP)

–  PacketlossinconjuncAonwithlatencyisaperformancekiller

7 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

A small amount of packet loss makes a huge difference in TCP performance

MetroArea

Local(LAN)

Regional

ConAnental

InternaAonal

Measured (TCP Reno) Measured (HTCP) Theoretical (TCP Reno) Measured (no loss)

With loss, high performance beyond metro distances is essentially impossible

8 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

WorkingWithTCPInPrac4ce

•  FareasiertosupportTCPthantofixTCP–  PeoplehavebeentryingtofixTCPforyears–limitedsuccess–  Likeitornotwe’restuckwithTCPinthegeneralcase

•  PragmaAcallyspeaking,wemustaccommodateTCP–  SufficientbandwidthtoavoidcongesAon–  Zeropacketloss– Verifiableinfrastructure

•  Networksarecomplex•  Mustbeabletolocateproblemsquickly•  Smallfootprintisahugewin–smallnumberofdevicessothatproblemisolaAonistractable

9 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

PuIngASolu4onTogether

•  EffecAvesupportforTCP-baseddatatransfer–  Designforcorrect,consistent,high-performanceoperaAon–  DesignforeaseoftroubleshooAng

•  EasyadopAoniscriAcal–  LargelaboratoriesanduniversiAeshaveextensiveITdeployments–  DrasAcchangeisprohibiAvelydifficult

•  Cybersecurity–defensiblewithoutcompromisingperformance

•  BorrowideasfromtradiAonalnetworksecurity–  TradiAonalDMZ

•  Separateenclaveatnetworkperimeter(“DemilitarizedZone”)•  SpecificlocaAonforexternal-facingservices•  CleanseparaAonfrominternalnetwork

–  Dothesamethingforscience–ScienceDMZ

10 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

DedicatedSystemsforData

Transfer

NetworkArchitecture

PerformanceTesAng&

Measurement

DataTransferNode•  Highperformance•  Configuredspecifically

fordatatransfer•  Propertools

ScienceDMZ•  Dedicatednetwork

locaAonforhigh-speeddataresources

•  Appropriatesecurity•  Easytodeploy-noneed

toredesignthewholenetwork

perfSONAR•  EnablesfaultisolaAon•  VerifycorrectoperaAon•  WidelydeployedinESnet

andothernetworks,aswellassitesandfaciliAes

TheScienceDMZDesignPa\ern

11 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

AbstractorPrototypeDeployment

•  (ThissecAonisforyoursystemadministrator–sendthemtome,useengage@es.net)

•  Add-ontoexisAngnetworkinfrastructure–  Allthatisrequiredisaportontheborderrouter–  Smallfootprint,pre-producAoncommitment

•  Easytoexperimentwithcomponentsandtechnologies–  DTNprototyping–  perfSONARtesAng

•  LimitedscopemakessecuritypolicyexcepAonseasy–  Onlyallowtrafficfrompartners–  Add-ontoproducAoninfrastructure–lowerriskthanrebuildingexisAnginfrastructure

12 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

ScienceDMZDesignPaNern(Abstract)

10GE

10GE

10GE

10GE

10G

Border Router

WAN

Science DMZSwitch/Router

Enterprise Border Router/Firewall

Site / CampusLAN

High performanceData Transfer Node

with high-speed storage

Per-service security policy control points

Clean, High-bandwidth

WAN path

Site / Campus access to Science

DMZ resources

perfSONAR

perfSONAR

perfSONAR

13 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

LocalAndWideAreaDataFlows

10GE

10GE

10GE

10GE

10G

Border Router

WAN

Science DMZSwitch/Router

Enterprise Border Router/Firewall

Site / CampusLAN

High performanceData Transfer Node

with high-speed storage

Per-service security policy control points

Clean, High-bandwidth

WAN path

Site / Campus access to Science

DMZ resources

perfSONAR

perfSONAR

High Latency WAN Path

Low Latency LAN Path

perfSONAR

14 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

ModularArchitecture–Mul4pleScienceDMZs

Dark Fiber

DarkFiber

10GE

DarkFiber

10GE

10G

Border Router

WAN

Science DMZSwitch/Routers

Enterprise Border Router/Firewall

Site / CampusLAN

Project A DTN(building A)

Per-project securitypolicy

perfSONAR

perfSONAR

Facility B DTN(building B)

Cluster DTN(building C)

perfSONARperfSONAR

Cluster(building C)

15 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

SupercomputerCenterDeployment

•  High-performancenetworkingisassumedinthisenvironment–  Dataflowsbetweensystems,betweensystemsandstorage,widearea,etc.–  GlobalfilesystemoqenAesresourcestogether

•  PorAonsofthismaynotrunoverEthernet(e.g.IB)•  ImplicaAonsforDataTransferNodes

•  “ScienceDMZ”maynotlooklikeadiscreteenAtyhere–  BytheAmeyougetthroughinterconnecAngalltheresources,youendupwithmostofthenetworkintheScienceDMZ

–  Thisisasitshouldbe–thepointisappropriatedeploymentoftools,configuraAon,policycontrol,etc.

•  Officenetworkscanlooklikeanaqerthought,buttheyaren’t–  Deployedwithappropriatesecuritycontrols–  Officeinfrastructureneednotbesizedforsciencetraffic

16 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

HPCCenter

©2014,EnergySciencesNetwork17 – ESnet Science Engagement (engage@es.net) - 2/24/17

Routed

Border Router

WAN

Core Switch/Router

Firewall

Offices

perfSONAR

perfSONAR

perfSONAR

Supercomputer

Parallel Filesystem

Front endswitch

Data Transfer Nodes

Front endswitch

HPCCenterDataPath

©2014,EnergySciencesNetwork18 – ESnet Science Engagement (engage@es.net) - 2/24/17

Routed

Border Router

WAN

Core Switch/Router

Firewall

Offices

perfSONAR

perfSONAR

perfSONAR

Supercomputer

Parallel Filesystem

Front endswitch

Data Transfer Nodes

Front endswitch

High Latency WAN Path

Low Latency LAN Path

CommonThreads

•  Twocommonthreadsexistinalltheseexamples

•  AccommodaAonofTCP–  WideareaporAonofdatatransferstraversespurpose-builtpath–  Highperformancedevicesthatdon’tdroppackets

•  Abilitytotestandverify–  Whenproblemsarise(andtheyalwayswill),theycanbesolvediftheinfrastructureisbuiltcorrectly

–  Smalldevicecountmakesiteasiertofindissues–  MulApletestandmeasurementhostsprovidemulApleviewsofthedatapath•  perfSONARnodesatthesiteandintheWAN•  perfSONARnodesattheremotesite

19 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,EnergySciencesNetwork

DedicatedSystems–DataTransferNode

•  TheDTNisdedicatedtodatatransfer•  Setupspecificallyforhigh-performancedatamovement

–  Systeminternals(BIOS,firmware,interrupts,etc.)–  Networkstack–  Storage(globalfilesystem,Fibrechannel,localRAID,etc.)–  Highperformancetools–  Noextraneoussoqware

•  Limita.onofscopeandfunc.onispowerful–  NoconflictswithconfiguraAonforothertasks–  SmallapplicaAonsetmakescybersecurityeasier–keypoint

20 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,TheRegentsoftheUniversityofCalifornia,throughLawrenceBerkeleyNaAonalLaboratoryandislicensedunderCCBY-NC-ND4.0

DataTransferToolsForDTNs

•  Parallelismisimportant–  ItisoqeneasiertoachieveagivenperformancelevelwithfourparallelconnecAonsthanoneconnecAon

–  Severaltoolsofferparalleltransfers,includingGlobus/GridFTP

•  LatencyinteracAoniscriAcal–  WideareadatatransfershavemuchhigherlatencythanLANtransfers–  ManytoolsandprotocolsassumeaLAN

•  WorkflowintegraAonisimportant

•  Keytools:GlobusOnline,HPN-SSH•  ESnettestDTNs:h\p://fasterdata.es.net/performance-tesAng/DTNs/

21 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,TheRegentsoftheUniversityofCalifornia,throughLawrenceBerkeleyNaAonalLaboratoryandislicensedunderCCBY-NC-ND4.0

DataTransferToolComparison•  InaddiAontothenetwork,usingtherightdatatransfertooliscriAcal•  DatatransfertestfromBerkeley,CAtoArgonne,IL(nearChicago).RTT=53ms,networkcapacity=10Gbps.

Tool Throughput SCP: 140Mbps HPNpatchedSCP: 1.2GbpsFTP 1.4Gbps GridFTP,4streams 5.4Gbps GridFTP,8streams 6.6Gbps

•  NERSCDTNshavebothHPN-SSHandGlobus•  Keypoint–yourlocalDTNandnetworkconnecAonsignificantlyaffectyourabilitytomovedatainandoutofNERSC

22 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,TheRegentsoftheUniversityofCalifornia,throughLawrenceBerkeleyNaAonalLaboratoryandislicensedunderCCBY-NC-ND4.0

8.9 Gbps

10.7 Gbps

7.5 Gbps

6.3 Gbps

7.6 Gbps 9.8 Gbps

11.4 Gbps

9.2 Gbps

21.9 Gbps

15.6 Gbps

26.1 Gbps

2.8 Gbps

DTN

DTN

DTN

DTN

alcf#dtn_miraALCF

nersc#dtnNERSC

olcf#dtn_atlasOLCF

ncsa#BlueWatersNCSA

Data set: L380Files: 19260Directories: 211Other files: 0Total bytes: 4442781786482 (4.4T bytes)Smallest file: 0 bytes (0 bytes)Largest file: 11313896248 bytes (11G bytes)Size distribution:

1 - 10 bytes: 7 files10 - 100 bytes: 1 files100 - 1K bytes: 59 files1K - 10K bytes: 3170 files10K - 100K bytes: 1560 files100K - 1M bytes: 2817 files1M - 10M bytes: 3901 files10M - 100M bytes: 3800 files100M - 1G bytes: 2295 files1G - 10G bytes: 1647 files10G - 100G bytes: 3 files

October 2016L380 Data Set

PerformanceBetweenCompu4ngFacili4es

23 – ESnet Science Engagement (engage@es.net) - 2/24/17

HandofftoShreyasCholia

•  Thanks!

24 – ESnet Science Engagement (engage@es.net) - 2/24/17 ©2015,TheRegentsoftheUniversityofCalifornia,throughLawrenceBerkeleyNaAonalLaboratoryandislicensedunderCCBY-NC-ND4.0

Recommended