Computer Science 37 Lecture 31

8/4/2019 Computer Science 37 Lecture 31

http://slidepdf.com/reader/full/computer-science-37-lecture-31 1/15

Lecture31

Multiprocessors

Question: Whatdoesitmeantocompute?

TuringMachine

program

y' !d ag

Perhaps:manipulateandtransformdata.

Question: WhywouldIwanttohavemorethanone

computerworkatthesameproblematthesametime?

Theideais:ifittakestimeTtofinishataskusingonecomputer,itwilltaketimeT/Ntoaccomplishthesame

taskusingNcomputers.Right?Well,kindof.

Amdahl’sLaw:TheLawofDiminishingReturns

t improvemenof Amount

t improvemenbyaffected time Executiont improvemenafter time Execution =

unaffected time Execution+

Conclusion: Inordertosee speedup equaltothenumberof

processorsusedtoexecuteanapplicationinparallel,this

applicationmustbehavenosequentialcomponentatall.

Sometimes,whenthesizeofproblemgrowsvery

large,thefractionofexecutiontimewhichcanbe

affectedbyimprovementgrowsmuchfasterthanthe

executiontimethatisunaffected.Inthosecases,parallelcomputingwillyieldgreatgains.

t improvemenof Amount

t improvemenbyaffected time Executiont improvemenafter time Execution =

unaffected time Execution+

Littleroadmapfortherestofthelecture:

Quickglanceatafewproblemsthatariseinmultiprocessing

Categoriesofmultiprocessorssystems

Panoramiclookatmultiprocessorcomputers

Awordortwoonprogrammingmultiprocessors

processor

singlebus

processor

Memory

Question: Whataretheproblemswiththispicture?

Enterthecachecoherency protocols…

processor

interconnectionnetwork

processor

memory memory memory

Question: Whataretheproblemswiththispicture?

ClassificationAccordingtoMemoryAccessTimes

Singleaddressspace:UMA:

Uniform

Memory

Access

Non-Uniform

Memory

Access

Sametime,nomatterwhich

processor,nomatterwhat

addressisaccessed(SMP).

Timedependsonwhichprocessor

isaskingforthedataandwhere

thedataisinmemory.

Multipleaddressspaces: distributedmemory,messagepassing.

ClassificationAccordingtoProcessingModel

Singleinstructionstream

Singledatastream

Multipleinstructionstreams

Singledatastream

Multipledatastreams

SIMDComputers:TheMASPAR

ACU: arraycontrolunit;issues

instructionstoallthePEs (RISC).

PEs: clustersof32-bitALUs;64KBmemory,

6432-bitregisters.

Topology: gridconnection.

Scalability: 1024,2048,4096,8192or16384

processors.

Target: greatfordataparallelapplications.

SIMDComputers:TheConnectionMachineCM-2

A5feettallcubeformedof

smallercubes,representinga

12-dimensionalhypercube

structureofthenetworkthat

connectedtheprocessors

together.

“Thishardgeometricobject,black,thenon-colorofsheer,staticmass,was transparent,filledwithasoft,constantlychangingcloudoflightsfromtheprocessor

chips,red,thecoloroflifeandenergy.Itwasthearchetypeofanelectronicbrain,

aliving,thinkingmachine.”

MIMDComputers:TheSGIOrigin2000

Expandableandflexiblerackdesign:addprocessorsasneedsgrow.Usescc-NUMAbuildingblockstoscalethesingle

shared-memorysystemfrom2to16processorsinasingle

EachmodulesupportstwotoeightMIPS®processorsandup

to16GBofmemoryandprovidesI/Obandwidthof6.24GB

persecond.

“Capableofconnectingwithmultiplerackstoscaleto64processorsina

single-systemimageutilizingtherevolutionaryNUMAlink TM interconnect, ahigh-speed,scalableinterconnectfabricthatprovidesincrementalbandwidth

whilemaintainingtheshared-memorymodelofanSMPserver.”

MIMDComputers:TheSunEnterprise6500

KeySpecifications: Upto30CPUs,maximummemoryof60GB(SMPstyle

sharedmemory),RAIDdisks.

KeyBenefits:Ahighlyexpandablesystemthatoffersmission-criticalperformanceand

availability.

MIMDComputers:Beowulf-typeClusters

Grendel (ClemsonUniversity): anexperimentalparallelcomputerbuiltfromcommoditycomponents.

Apile-of-PCsof18machines,eachwiththefollowing:150MHzPentiumCPU,

64MBEDODRAM,2GBIDEdisk,2FastEthernetcards.

OperatingSystem: RedHatLinux (kernel>=v2.0)

Themachinesaretiedtogetherwithtwonetworks.Thefirstisa busnetwork

usingastackof100Mb/shubs.Thesecondisafull-duplexswitchednetwork

usingaFastEthernetswitch.Defines2 nodes forinteractionwiththesystem,and

usestheother16asdedicatedcomputeandI/Oservers.Theconceptincludesnot

onlycommodityoff-the-shelf(COTS)hardware,butalsotheuseoffreely

availableoperatingsystemssuchasLinux,messagepassingsoftwaresuchasPVMandMPI,andothersoftwareoftencontributedbyBeowulfusers.

Cost: Canitgetanylower???

Woo-hoo!

DoesMultiprocessingAloneSolveThePerformanceProblem?

It’sbeendecadessinceresearchonparallelprocessingstarted,programmingamultiprocessorisstillahardtask.

Loop:{

Readdata;

Processdata;

Writedata;

Loop:{

Readdata;

Processdata;

Writedata;

ProcessorA ProcessorB

Problem: Communication(itstakestimetotransferdataaround).

Problem: Synchronization(dowehavetoagreeontime?).

Problem: Attherootofitall:DATADEPENDENCIES.

Computer Science 37 Lecture 31

Documents

Lecture 37. Autonomous Nonlinear Systems and Stability · Lecture 37. Autonomous Nonlinear Systems and Stability April 18, 2012 Konstantin Zuev (USC) Math 245, Lecture 37 April 18,

Computer Science 37 Lecture 25

R&AC Lecture 37

Lecture 21 37

Computer Science 37 Lecture 32

Lecture 37 of 42

Lecture 37

PHYS16 – Lecture 36 & 37

Computer Science 37 Lecture 4

ME 322: Instrumentation Lecture 37

Lecture 37: Proxy Pattern

ENG101- English Comprehension- Lecture 37

Computer Science 37 Lecture 8

Computing & Information Sciences Kansas State University Lecture 37 of 42CIS 636/736: (Introduction to) Computer Graphics Lecture 37 of 42 Monday, 28 April

Lecture 37

Computer Science 37 Lecture 20

Computer Science 37 Lecture 5

CS#61C:#Great#Ideas#in#Computer# Review# …cs61c/sp13/lec/37/2013Sp... · 2013. 4. 26. · CS#61C:#Great#Ideas#in#Computer# Architecture#(Machine#Structures)# Lecture'37:'IO'Interrupts'and

CSC 37 – COMPUTER NETWORKS

Computer Science 37 Lecture 19