Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
1
Abstract ...
Why does it take millions of transistors to realise the broadcast radio receiver done in five in the 70's? Why are there millions of lines of software in products that are not programmable? And why do we throw things away before they break? The last 40 years saw the exponential growth of silicon capacity and the technology lead markets that ensued. Complexity was never an obstacle whilst design cost p y gwas 2nd order, but now that limits are upon us it is also apparent that people buy products not technology ... so we strive to deliver elevated expectation despite Diminishing Returns.
My intent is to give context to subsequent Multicore discussions in and beyond this event. I will do this by looking at 'Efficiency' in a context of increasingly Diminishing Returns. This will lead quite naturally to Multicore (CMP); its rationale and its role But will also raise questions about the way(s) forward as
1
and its role. But will also raise questions about the way(s) forward ... as technology moves out of the market driving seat.
Prof. Ian PhillipsPrincipal Staff Eng’r,
ARM Ltd
Progress:Despite the Law of
1v0
Visiting Prof. at ...
Contribution to Industry Award 2008
Despite the Law of Diminishing Returns
ORWhatever happened to the 6 transistor radio?
M lti P W k h
2
Multi-Processor Workshop
5sep11
2
Things Haven’t Always Things Haven’t Always Been Like This ...!Been Like This ...!
3
A Moment of Retrospection
I learned all I know in the past!... I just have more of it than most people!
I’ll take you back 36yrs to 1975 ...N D H J b Wif B b C Ph New: Degree, Home, Job, Wife, Baby, Car, Phone ...
4
... More-or-less the same today!Mk1: Ian Phillips
C1975
Mk1: Human baby C1975
Aka Dylan-Paul Phillips
Mark’N’: Mud-Hut C1975
Aka: 5 Manor Close.
3
1975: Technology Products
Vauxhall Viva HB SL90
5
GPO Type 706 Telephone
... Recognisably the same, but very different today.
1975: Semiconductor Electronics
Domestically we had... Portable Radio
Pocket Calculator
Hi-Fi (Partial)
C l TV (P ti l) Colour TV (Partial)
... That’s about all!
Professionally we had... Computers & First PCs
Radio Receivers
Satellites
Under Sea Cables
BeoVision 3500 c1975
TI SR 51 Calculator c1975
6
Under-Sea Cables
Transmitters (Partial)
TV Cameras (Partial)
Telephony (Partial) Stuart 5 Transistor Radio 1975
IBM 220PX c1975
4
2010: Electronics Everywhere but Nowhere
7
Confusing Technology Confusing Technology with Products ...with Products ...
8
5
The Computer ... Or Is It?
9
Computer: A Machine for Computing ...
Computing ...
... A general term for algebraic (mathematical) manipulation of data ...
N t d P d D t /
... State and Time are factors in this.
It can include phenomena ranging from human thinking to calculationswith a narrower meaning. Wikipedia
y=F(x,t,s)NumeratedPhenomena
IN (x)
Processed Data/Information
OUT (y)
10
Usually used it to exercise analogies (models) of real-world situations; Frequently in real-time.
... No mention of Implementation Technology in this!
6
Planet Motion Computer – Orrery c1700
MechanicalTechnology
11
• Inventor: George Graham (1674-1751)
• Single-Task, Continuous Time, Analogue Mechanical Computing (With backlash!)
Babbage's Difference Engine 1837
The difference engine consists of a number of columns, numbered from 1 to N. Each column is able to store one decimal number. The only operation the engine can do is add the value of a column n + 1 to column n to produce the new value of n. Column N can only store a constant, column 1 displays (and possibly prints) the value of the calculation on the current iteration.
(Re)construction
c2000
MechanicalTechnology
12
Computer for Calculating Tables: A Basic ALU Engine
7
Enigma ~1940
MechanicalTechnology
13
Data Encryption/Decryption Computer
Colossus Computer 1944
Valve/MechanicalTechnology
14
Code-Breaking Computer: A Data Processor
8
Digital Computer – Baby 1947 (Reconstruction)
Valve/SoftwareTechnology
15
General Purpose, Quantised Time and Data, (Digital) Electronic Computing
Analogue Computer – AKAT c1960
Transistor Technology
16
General Purpose, Continuous Time, Approximate (Analogue) Electronic Computing
9
Evolution of Radio
Bush Radio
Evoke DAB Radio100 M Transistors
2-3 Embedded Processors
c2005
BTH Crystal Set
1 Diode
c1925
Tele-Verta Radio4 Valves
1 Rectifier Valve
c1945
17
Bush Radio7 Transistors
1 Diode
c1960
c2005
Ian’s ‘Span’
Vi
Radio as Computation ...
ValveTechnologyTransistor Technology
Integrated CircuitTechnology
Vrf=Vi*100
Vrf
Vif=Vrf*Vlo
Vlo
Vif
Vro='Bandpass'(Vif*1000)
Vro
18
Vlo=Cos(t*1^6)
Single-Task, Continuous Time, Approximate (Analogue) Electronic Computing
10
Products Make Business
21c Businesses have to be Operations and Competition is Global and so are Investors Nationality has little meaning
Business needs End-Customers buy Products, not Technology
Technologies enable Product Options
Business-Models make Money
New Products are Design is a Cost/Risk to be Minimised
New Technology increases Cost/Risk ... ... But does not always increase Value
19
HW, SW, Mechanics, Optics, etc are (just) means to an end!
... New Technology ≠ Market Success (Any More)
High Performance Computing (HPC)
The exponential progression of Moores Law has enabled thefantastic computation power we now take for grantednow take for granted ... A Few Tens of headline-grabbing
A Few Hundred Million visible
... BUT...... BUT...
Tens of Billions of invisible
20
ubiquitous
... Gives impression that General Purpose Digital Computation is what it is all about.
11
Products are Solutions; Products are Solutions; Not The OtherNot The Other--Way Way Round Round
21
Round ...Round ...
Embedded Computing Is ...
Entertainment
Remote Control
Security
Televisions Televisions
ID Cards
Memory
Logistics
Transport
Banking
Manufacturing
22
Manufacturing
Energy
Communications
Medical
... etc, etc, etc.
12
High Performance (Embedded) Computing
Obvious:
Business Model Aesthetics
PerformanceB d & I
Less-Obvious:
Infrastructure ... Manufacture and Distribution
Road network, Fuel supply, Tyres
Service network & Training Brand & Image
Finance schemes
Dealership
Warranty, etc
Sales and Marketing, etc
Technologies Internal Combustion Engine ...
Bearings, Casting, Metal forming, Paint, Aerodynamics, Glass, Rubber, Suspension ...
Manufacturing, Reliability, Quality ...
Electronic Systems
23
(Us!) – appeared 35,000 yr ago ‘Developed’ from Homo-Sapien (Wise Human) 100,000 yr ago
Primary Objective: Survive Nature (1,000 generations)
- appeared ~2 000 yr ago
The Evolution of Customer(kind) Universe – 13.6ByrEarth – 4.5Byr
- appeared ~2,000 yr ago Pythagoras, Socrates, Plato, Aristotle, Archimedes, ...
Objective: Understand Nature (100 gen.)
- appeared ~1,000 yrs ago Galileo, Descartes, (1000 ad)
Electricity - William Gilbert (1600ad)
Objective: Manipulate Nature (50 gen.)
- just 260 yrs ago
Pythagoras
Galileo
24
just 260 yrs ago Industrial Revolution (1750: 8 gen’n)
Year 0: Science Meets Exploitation
Objective: Exploit Nature (10 gen.)
... Remember: Real (Cro-Magnon) Customers Don't Buy Technology!
Brunell
13
The iConic Must-Have Product ...
25
... Cool Design(California) & Manufacture(China) !
26
14
... Most Design Never Noticed or Valued
27
The Threshold of Magic 1: Clarke: Any sufficiently advanced technology is indistinguishable from magic.
Everybody has a threshold, beyond which Functionality is Indistinguishable From Magic1!
Chemical Systems
Biological Systems
Economic Systems
Electronic Systems
The Incandescent Light:is the for most non-scientific, but well educated people!
28
... Its not a crime, to Not Understand Technology!
... The crime is not realising people don’t when you are the one who suffers as a result!
15
AllAll Technologies Are Technologies Are Important ...Important ...
29
Inside the Case
Exciting Technology ... At the Module
iPhone 4's vibrator motor. rear-facing 5 MP camera with 720p video at 30 FPS, tap to p , pfocus feature, and LED flash.
30 Source ... http://www.ifixit.com
16
Exciting Technology ... At the Module
The Control Board.
Inside the Case
31 Source ... http://www.ifixit.com
Exciting Technology ... Inside the Module
Inside The Control Board (a-side)
Visible Design-Team Members...
A4 Processor, specified by Apple, designed and manufactured by Samsung ...
The central unit that provides the iPhone 4 with its GP computing power.
GPS
Inc. ARM A8 600 MHz CPU (also other ARM CPUs and IP?)
ST-Micro (3 axis gyroscope) - (ARM Partner)
Broadcom (Wi-Fi, Bluetooth, and GPS) - (ARM Partner)
Skyworks (GSM)
Triquint (GSM PA)
Infineon (GSM Transceiver) - (ARM Partner)
32
Bluetooth, EDR &FM
Source ... http://www.ifixit.com
17
Exciting Technology ... Inside the Module
Inside The Control Board (b-side)
Visible Design-Team Members ... Samsung (flash memory) - (ARM Partner)
Cirrus Logic (audio codec) - (ARM Partner)
AKM (Magnetic Sensor)
33 Source ... http://www.ifixit.com
Texas Instruments (Touch Screen Controller and mobile DDR) - (ARM Partner)
Invisible Design-Team Members ... OS & Drivers, GSM Security; Graphics, Video and Sound ...
Manufacturing, Assembly, Test, Certification ...
Exciting Technology ... Inside ‘The Chip’
2 Memory Dies
Memory ‘Package’
The A4 SIP Package (Cross-section) The processor is the centre rectangle The silver circles beneath it are solder balls
Processor SOC DieGlue
4-Layer Platform Package’
34
The processor is the centre rectangle. The silver circles beneath it are solder balls.
Two rectangles above are RAM die, offset to make room for the wirebonds.
Putting the RAM close to the processor reduces latency, making RAM faster and cuts power.
Unknown Mfr (Memory)
Samsung/ARM (Processor)
Unknown (SIP Technology)
Source ... http://www.ifixit.com
18
The Phone: Hetrogeneous Computation ...
• About 20 Chips in a Smart-Phone
• Processing:
• Audio, Video, RF, T h T tTouch, Temperature, Orientation, G-Force, Magnetism, Power
• Core Functions:
• GSM, GPS, WiFi, 3/4G Net, BlueTooth
• Application Functions:
• Applets, Games, Mail, Diary Address-book
35
... Partitioning: The difference between a good and bad Product!
Diary, Address book, etc.
... Multi-Processing before we open the ‘App’n Processor’!
Commodity HMP In Qual. Today...
Block-Diagram for a typical 40nm Mobile Computing & Smart-Phone Platform Chip
Pocket ‘Super-Computer’ ...
10 Programmable Processors 4 x A9 Processors (2x2): ~10,000 MIP 4 x MALI 400 Fragment Proc: ~1Gp/s 1 x MALI 400 Vertex Processor1 x MALI 400 Vertex Processor 1 x MALI Video CoDec
Plus Dedicated Processors Smart MMUs Smart Interrupt Controllers Smart DMA Engines Smart QoS and Power Mgt Smart Cache & Memory Repair
Plus ...
36
Customer Additions/Peripherals!
... ~15 Proc., ~2GHz, 1-2W... Strong Application Focus
19
30nm Target Architectures
About 50MTr
About 50KTr
37
... Delivering ~5x speed (Architecture + Process + Clock)
There’s Design; There’s Design; and there’s and there’s
Technical Design Technical Design
38
Technical Design ...Technical Design ...
20
Partition and Refine until every Thread has identified an Established (Reuse) path to Physical Implementation
Then Construct and Verify ...
Design A Part-Formalised Process ...
Known-Links from Model-to-Reality (Reuse)
y o
f R
eu
se
Hig
h
AAA cell
Concept Phone
TFT-LCD
Electronics
Baseband
GaAs Front End
Std Radio Chip
Actual Phone
39
Lo
w
Hie
rarc
hy
ARM CPU
Signal Processing
MALI MPU
HW Support
FP Engine
Gates/MC.Code
F1F2
F3
F4
F5
Functional Analysis
Thr
ead
(F1) (F3)
(F5)(F2)
40
Execution Platform
HW1 HW2 HW3 HW4
Hardware Interface
RTOS/Drivers
Bus(es) Processor(s)
(F1) (F3)
21
The Real-Time Execution of Models
It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional and Non-Functional needs.
Design: A hierarchical mathematical process of Model Refinement and Verification; based on (Heuristic) Architectural decisions.
Implementation: Process of ‘bringing up’ and Validating the Functional-Model on the Physical Execution-Platform.
A good Solution is one that ...
1. Meets a valued human need
2. Is Manufacturable to support a Competitive Price/Biz Model
3. Works at least as well as your Competitors
41
4. Scores well on Aesthetic (Non-Functional) criteria
A bad Solution is one where the Technology Shows!
And so to And so to MultiMulti--Processors ...Processors ...
42
22
The Argument for (C)MP
Potentially Much Better Power Efficiency than Large/Fast Uni-Processor
Power is a Major Problem ... On Die and In System.
But 10x-100x improvement required!
Potential to deliver Higher Performance than Uni-Processorg
Can Amdahl’s Law be broken?
For GP applications difficult to see improvement after 3/4 processors
Potential to handle Redundancy Schemes
Many Processor ‘Tiles’ with NoC Connectivity
Potentially a small % malfunction can be ‘re-routed’
Potential to offer a Scalable, Standard Implementation
43
Reduces Chip Design/Masks/Qual’n/Production Cost by ~90%
Needs a new (tbd!) GP Software Methodology
Must work with Legacy (90% of designs are inherited).
... Can CMP actually deliver any (let alone all) of this?
Amdahl’s Law is Alive and Well
100% parallel
28
32
Speedup on parallel processors is limited by the
sequential portion of the program
95% parallel12
16
20
24
Max
imum
spe
edup
sequential portion of the program
Sequential portion need not be large
to significantly constrain speedup!
44
90% parallel
75% parallel50% parallel
0
4
8
0 4 8 12 16 20 24 28 32Number of cores
23
Transputer (Inmos 1978)
Highly Parallel Apps (Graphics)
Lots of history; no success ...
Pixelfusion –
1,536 processors/chip
Many (C)MP Technologies ... even in the UK
p p
Clearspeed –
192 full 64 bit arch/chip
Picochip –
Aimed at 3G Pico-Cells
Spinnaker –
18 processors (Scal. to 1^6)
18,000 neurons (Scal. To 10^9)
XMOS –
45
New version of Transputer
Occam
HandleC – HW synthesys (See also SystemC)
OpenCL – Smartphone's / Tablets (GPGPU)
... Success(?) requires a End-Product and Market of appropriate Scale.
OpenCL Enables Heterogeneity
Plug-in architecture
Mali-T604 Vithar GPU
ARM/NEON
Custom Device
Video Decoder
Discovery of computational units
Scheduling of work
46
... It does not automatically solve “which computation where”
24
Architecture: A Viable Mix of Technology
YES: Power is a Major Concern ...
Power-Efficiency the way of recovering it.
Away from.. ..towards.. ..wherever possible.
But so is Productivity, NRE Cost, TTM, Quality ...But so is Productivity, NRE Cost, TTM, Quality ...
Reuse: As much as possible: Mech, Elect, SW, Acoustic, RF, Stacks, OS, Displays, Keyboards, etc.
Teams: Use people who know how to do the work (duh!)
Use External Expertise: It is seldom a differentiating factor in your Product.
Producible: Make something that can be economically made (duh!)
Performance: Competitive; don’t push the bounds of possibility.
New Technology: As little as possible
47
New Technology: As little as possible.
And so are Aesthetics ...
Colour, Style, Package, Availability, Quality, Business Model, etc ...
... Remember: The Product is the way to deliver a Compelling End-Customer Experience.
Conclusions
Multi-Processing makes sense in lots of Products today...
It will seldom be entire solutions (ie: Small markets)
It will seldom be homogeneous
If the Work-Load and Programming Models are good.
Physical Concurrency makes sense in lots of Products...
Mechanical, CPU/GPU, Optical, RF, MEM, SAW, etc
Simplifies Productivity, Design, Qualification, Quality and Reuse
Few Products (None) have the luxury of a Clean-Sheet Design...
Legacy is unavoidable
CPU is not the answer for everything...
48
‘Software’ is amongst the least energy efficient technology
DSP, Video HW and GPU can be (much) better
But Analogue and Mechanical are best
... Products can be Enabled or Disabled MP Technology
25
The END ...The END ...Thanks for ListeningThanks for Listening
49
f gf g
Reading & References
The Innovator's Dilemma: When New Technologies Cause Great Firms to Fail (Disruptive Tech.) by Clayton M. Christensen: HBS Press, 1997
Open Innovation: The New Imperative for Creating and Profiting from Technology (Research in 21C) by Henry William Chesbrough : HBS Press, 2003
The World Is Flat (Globalisation) by Thomas L. Friedman: Penguin, 2005
Staying Power (Business) by Michael Cusumano: Oxford, 2010
A Short History of Nearly Everything (A different view on what we know) by Bill Bryson: Black Swan, 2003
The Voyages of the Beagle (Scientific Observation)
50
By Charles Darwin,1860
An Essay on the Principles of Population (Natural Competition) By Thomas Malthus,1789