25
1 Abstract ... Why does it take millions of transistors to realise the broadcast radio receiver done in five in the 70's? Why are there millions of lines of software in products that are not programmable? And why do we throw things away before they break? The last 40 years saw the exponential growth of silicon capacity and the technology lead markets that ensued. Complexity was never an obstacle whilst design cost was 2nd order, but now that limits are upon us it is also apparent that people buy products not technology ... so we strive to deliver elevated expectation despite Diminishing Returns. My intent is to give context to subsequent Multicore discussions in and beyond this event. I will do this by looking at 'Efficiency' in a context of increasingly Diminishing Returns. This will lead quite naturally to Multicore (CMP); its rationale and its role But will also raise questions about the way(s) forward as 1 and its role. But will also raise questions about the way(s) forward ... as technology moves out of the market driving seat. Prof. Ian Phillips Principal Staff Eng’r, ARM Ltd Progress: Despite the Law of 1v0 ARM Ltd [email protected] Visiting Prof. at ... Contribution to Industry Award 2008 Despite the Law of Diminishing Returns OR Whatever happened to the 6 transistor radio? M lti P W kh 2 Multi-Processor Workshop 5sep11

Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

1

Abstract ...

Why does it take millions of transistors to realise the broadcast radio receiver done in five in the 70's? Why are there millions of lines of software in products that are not programmable? And why do we throw things away before they break? The last 40 years saw the exponential growth of silicon capacity and the technology lead markets that ensued. Complexity was never an obstacle whilst design cost p y gwas 2nd order, but now that limits are upon us it is also apparent that people buy products not technology ... so we strive to deliver elevated expectation despite Diminishing Returns.

My intent is to give context to subsequent Multicore discussions in and beyond this event. I will do this by looking at 'Efficiency' in a context of increasingly Diminishing Returns. This will lead quite naturally to Multicore (CMP); its rationale and its role But will also raise questions about the way(s) forward as

1

and its role. But will also raise questions about the way(s) forward ... as technology moves out of the market driving seat.

Prof. Ian PhillipsPrincipal Staff Eng’r,

ARM Ltd

Progress:Despite the Law of

1v0

ARM [email protected]

Visiting Prof. at ...

Contribution to Industry Award 2008

Despite the Law of Diminishing Returns

ORWhatever happened to the 6 transistor radio?

M lti P W k h

2

Multi-Processor Workshop

5sep11

Page 2: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

2

Things Haven’t Always Things Haven’t Always Been Like This ...!Been Like This ...!

3

A Moment of Retrospection

I learned all I know in the past!... I just have more of it than most people!

I’ll take you back 36yrs to 1975 ...N D H J b Wif B b C Ph New: Degree, Home, Job, Wife, Baby, Car, Phone ...

4

... More-or-less the same today!Mk1: Ian Phillips

C1975

Mk1: Human baby C1975

Aka Dylan-Paul Phillips

Mark’N’: Mud-Hut C1975

Aka: 5 Manor Close.

Page 3: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

3

1975: Technology Products

Vauxhall Viva HB SL90

5

GPO Type 706 Telephone

... Recognisably the same, but very different today.

1975: Semiconductor Electronics

Domestically we had... Portable Radio

Pocket Calculator

Hi-Fi (Partial)

C l TV (P ti l) Colour TV (Partial)

... That’s about all!

Professionally we had... Computers & First PCs

Radio Receivers

Satellites

Under Sea Cables

BeoVision 3500 c1975

TI SR 51 Calculator c1975

6

Under-Sea Cables

Transmitters (Partial)

TV Cameras (Partial)

Telephony (Partial) Stuart 5 Transistor Radio 1975

IBM 220PX c1975

Page 4: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

4

2010: Electronics Everywhere but Nowhere

7

Confusing Technology Confusing Technology with Products ...with Products ...

8

Page 5: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

5

The Computer ... Or Is It?

9

Computer: A Machine for Computing ...

Computing ...

... A general term for algebraic (mathematical) manipulation of data ...

N t d P d D t /

... State and Time are factors in this.

It can include phenomena ranging from human thinking to calculationswith a narrower meaning. Wikipedia

y=F(x,t,s)NumeratedPhenomena

IN (x)

Processed Data/Information

OUT (y)

10

Usually used it to exercise analogies (models) of real-world situations; Frequently in real-time.

... No mention of Implementation Technology in this!

Page 6: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

6

Planet Motion Computer – Orrery c1700

MechanicalTechnology

11

• Inventor: George Graham (1674-1751)

• Single-Task, Continuous Time, Analogue Mechanical Computing (With backlash!)

Babbage's Difference Engine 1837

The difference engine consists of a number of columns, numbered from 1 to N. Each column is able to store one decimal number. The only operation the engine can do is add the value of a column n + 1 to column n to produce the new value of n. Column N can only store a constant, column 1 displays (and possibly prints) the value of the calculation on the current iteration.

(Re)construction

c2000

MechanicalTechnology

12

Computer for Calculating Tables: A Basic ALU Engine

Page 7: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

7

Enigma ~1940

MechanicalTechnology

13

Data Encryption/Decryption Computer

Colossus Computer 1944

Valve/MechanicalTechnology

14

Code-Breaking Computer: A Data Processor

Page 8: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

8

Digital Computer – Baby 1947 (Reconstruction)

Valve/SoftwareTechnology

15

General Purpose, Quantised Time and Data, (Digital) Electronic Computing

Analogue Computer – AKAT c1960

Transistor Technology

16

General Purpose, Continuous Time, Approximate (Analogue) Electronic Computing

Page 9: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

9

Evolution of Radio

Bush Radio

Evoke DAB Radio100 M Transistors

2-3 Embedded Processors

c2005

BTH Crystal Set

1 Diode

c1925

Tele-Verta Radio4 Valves

1 Rectifier Valve

c1945

17

Bush Radio7 Transistors

1 Diode

c1960

c2005

Ian’s ‘Span’

Vi

Radio as Computation ...

ValveTechnologyTransistor Technology

Integrated CircuitTechnology

Vrf=Vi*100

Vrf

Vif=Vrf*Vlo

Vlo

Vif

Vro='Bandpass'(Vif*1000)

Vro

18

Vlo=Cos(t*1^6)

Single-Task, Continuous Time, Approximate (Analogue) Electronic Computing

Page 10: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

10

Products Make Business

21c Businesses have to be Operations and Competition is Global and so are Investors Nationality has little meaning

Business needs End-Customers buy Products, not Technology

Technologies enable Product Options

Business-Models make Money

New Products are Design is a Cost/Risk to be Minimised

New Technology increases Cost/Risk ... ... But does not always increase Value

19

HW, SW, Mechanics, Optics, etc are (just) means to an end!

... New Technology ≠ Market Success (Any More)

High Performance Computing (HPC)

The exponential progression of Moores Law has enabled thefantastic computation power we now take for grantednow take for granted ... A Few Tens of headline-grabbing

A Few Hundred Million visible

... BUT...... BUT...

Tens of Billions of invisible

20

ubiquitous

... Gives impression that General Purpose Digital Computation is what it is all about.

Page 11: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

11

Products are Solutions; Products are Solutions; Not The OtherNot The Other--Way Way Round Round

21

Round ...Round ...

Embedded Computing Is ...

Entertainment

Remote Control

Security

Televisions Televisions

ID Cards

Memory

Logistics

Transport

Banking

Manufacturing

22

Manufacturing

Energy

Communications

Medical

... etc, etc, etc.

Page 12: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

12

High Performance (Embedded) Computing

Obvious:

Business Model Aesthetics

PerformanceB d & I

Less-Obvious:

Infrastructure ... Manufacture and Distribution

Road network, Fuel supply, Tyres

Service network & Training Brand & Image

Finance schemes

Dealership

Warranty, etc

Sales and Marketing, etc

Technologies Internal Combustion Engine ...

Bearings, Casting, Metal forming, Paint, Aerodynamics, Glass, Rubber, Suspension ...

Manufacturing, Reliability, Quality ...

Electronic Systems

23

(Us!) – appeared 35,000 yr ago ‘Developed’ from Homo-Sapien (Wise Human) 100,000 yr ago

Primary Objective: Survive Nature (1,000 generations)

- appeared ~2 000 yr ago

The Evolution of Customer(kind) Universe – 13.6ByrEarth – 4.5Byr

- appeared ~2,000 yr ago Pythagoras, Socrates, Plato, Aristotle, Archimedes, ...

Objective: Understand Nature (100 gen.)

- appeared ~1,000 yrs ago Galileo, Descartes, (1000 ad)

Electricity - William Gilbert (1600ad)

Objective: Manipulate Nature (50 gen.)

- just 260 yrs ago

Pythagoras

Galileo

24

just 260 yrs ago Industrial Revolution (1750: 8 gen’n)

Year 0: Science Meets Exploitation

Objective: Exploit Nature (10 gen.)

... Remember: Real (Cro-Magnon) Customers Don't Buy Technology!

Brunell

Page 13: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

13

The iConic Must-Have Product ...

25

... Cool Design(California) & Manufacture(China) !

26

Page 14: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

14

... Most Design Never Noticed or Valued

27

The Threshold of Magic 1: Clarke: Any sufficiently advanced technology is indistinguishable from magic.

Everybody has a threshold, beyond which Functionality is Indistinguishable From Magic1!

Chemical Systems

Biological Systems

Economic Systems

Electronic Systems

The Incandescent Light:is the for most non-scientific, but well educated people!

28

... Its not a crime, to Not Understand Technology!

... The crime is not realising people don’t when you are the one who suffers as a result!

Page 15: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

15

AllAll Technologies Are Technologies Are Important ...Important ...

29

Inside the Case

Exciting Technology ... At the Module

iPhone 4's vibrator motor. rear-facing 5 MP camera with 720p video at 30 FPS, tap to p , pfocus feature, and LED flash.

30 Source ... http://www.ifixit.com

Page 16: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

16

Exciting Technology ... At the Module

The Control Board.

Inside the Case

31 Source ... http://www.ifixit.com

Exciting Technology ... Inside the Module

Inside The Control Board (a-side)

Visible Design-Team Members...

A4 Processor, specified by Apple, designed and manufactured by Samsung ...

The central unit that provides the iPhone 4 with its GP computing power.

GPS

Inc. ARM A8 600 MHz CPU (also other ARM CPUs and IP?)

ST-Micro (3 axis gyroscope) - (ARM Partner)

Broadcom (Wi-Fi, Bluetooth, and GPS) - (ARM Partner)

Skyworks (GSM)

Triquint (GSM PA)

Infineon (GSM Transceiver) - (ARM Partner)

32

Bluetooth, EDR &FM

Source ... http://www.ifixit.com

Page 17: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

17

Exciting Technology ... Inside the Module

Inside The Control Board (b-side)

Visible Design-Team Members ... Samsung (flash memory) - (ARM Partner)

Cirrus Logic (audio codec) - (ARM Partner)

AKM (Magnetic Sensor)

33 Source ... http://www.ifixit.com

Texas Instruments (Touch Screen Controller and mobile DDR) - (ARM Partner)

Invisible Design-Team Members ... OS & Drivers, GSM Security; Graphics, Video and Sound ...

Manufacturing, Assembly, Test, Certification ...

Exciting Technology ... Inside ‘The Chip’

2 Memory Dies

Memory ‘Package’

The A4 SIP Package (Cross-section) The processor is the centre rectangle The silver circles beneath it are solder balls

Processor SOC DieGlue

4-Layer Platform Package’

34

The processor is the centre rectangle. The silver circles beneath it are solder balls.

Two rectangles above are RAM die, offset to make room for the wirebonds.

Putting the RAM close to the processor reduces latency, making RAM faster and cuts power.

Unknown Mfr (Memory)

Samsung/ARM (Processor)

Unknown (SIP Technology)

Source ... http://www.ifixit.com

Page 18: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

18

The Phone: Hetrogeneous Computation ...

• About 20 Chips in a Smart-Phone

• Processing:

• Audio, Video, RF, T h T tTouch, Temperature, Orientation, G-Force, Magnetism, Power

• Core Functions:

• GSM, GPS, WiFi, 3/4G Net, BlueTooth

• Application Functions:

• Applets, Games, Mail, Diary Address-book

35

... Partitioning: The difference between a good and bad Product!

Diary, Address book, etc.

... Multi-Processing before we open the ‘App’n Processor’!

Commodity HMP In Qual. Today...

Block-Diagram for a typical 40nm Mobile Computing & Smart-Phone Platform Chip

Pocket ‘Super-Computer’ ...

10 Programmable Processors 4 x A9 Processors (2x2): ~10,000 MIP 4 x MALI 400 Fragment Proc: ~1Gp/s 1 x MALI 400 Vertex Processor1 x MALI 400 Vertex Processor 1 x MALI Video CoDec

Plus Dedicated Processors Smart MMUs Smart Interrupt Controllers Smart DMA Engines Smart QoS and Power Mgt Smart Cache & Memory Repair

Plus ...

36

Customer Additions/Peripherals!

... ~15 Proc., ~2GHz, 1-2W... Strong Application Focus

Page 19: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

19

30nm Target Architectures

About 50MTr

About 50KTr

37

... Delivering ~5x speed (Architecture + Process + Clock)

There’s Design; There’s Design; and there’s and there’s

Technical Design Technical Design

38

Technical Design ...Technical Design ...

Page 20: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

20

Partition and Refine until every Thread has identified an Established (Reuse) path to Physical Implementation

Then Construct and Verify ...

Design A Part-Formalised Process ...

Known-Links from Model-to-Reality (Reuse)

y o

f R

eu

se

Hig

h

AAA cell

Concept Phone

TFT-LCD

Electronics

Baseband

GaAs Front End

Std Radio Chip

Actual Phone

39

Lo

w

Hie

rarc

hy

ARM CPU

Signal Processing

MALI MPU

HW Support

FP Engine

Gates/MC.Code

F1F2

F3

F4

F5

Functional Analysis

Thr

ead

(F1) (F3)

(F5)(F2)

40

Execution Platform

HW1 HW2 HW3 HW4

Hardware Interface

RTOS/Drivers

Bus(es) Processor(s)

(F1) (F3)

Page 21: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

21

The Real-Time Execution of Models

It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional and Non-Functional needs.

Design: A hierarchical mathematical process of Model Refinement and Verification; based on (Heuristic) Architectural decisions.

Implementation: Process of ‘bringing up’ and Validating the Functional-Model on the Physical Execution-Platform.

A good Solution is one that ...

1. Meets a valued human need

2. Is Manufacturable to support a Competitive Price/Biz Model

3. Works at least as well as your Competitors

41

4. Scores well on Aesthetic (Non-Functional) criteria

A bad Solution is one where the Technology Shows!

And so to And so to MultiMulti--Processors ...Processors ...

42

Page 22: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

22

The Argument for (C)MP

Potentially Much Better Power Efficiency than Large/Fast Uni-Processor

Power is a Major Problem ... On Die and In System.

But 10x-100x improvement required!

Potential to deliver Higher Performance than Uni-Processorg

Can Amdahl’s Law be broken?

For GP applications difficult to see improvement after 3/4 processors

Potential to handle Redundancy Schemes

Many Processor ‘Tiles’ with NoC Connectivity

Potentially a small % malfunction can be ‘re-routed’

Potential to offer a Scalable, Standard Implementation

43

Reduces Chip Design/Masks/Qual’n/Production Cost by ~90%

Needs a new (tbd!) GP Software Methodology

Must work with Legacy (90% of designs are inherited).

... Can CMP actually deliver any (let alone all) of this?

Amdahl’s Law is Alive and Well

100% parallel

28

32

Speedup on parallel processors is limited by the

sequential portion of the program

95% parallel12

16

20

24

Max

imum

spe

edup

sequential portion of the program

Sequential portion need not be large

to significantly constrain speedup!

44

90% parallel

75% parallel50% parallel

0

4

8

0 4 8 12 16 20 24 28 32Number of cores

Page 23: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

23

Transputer (Inmos 1978)

Highly Parallel Apps (Graphics)

Lots of history; no success ...

Pixelfusion –

1,536 processors/chip

Many (C)MP Technologies ... even in the UK

p p

Clearspeed –

192 full 64 bit arch/chip

Picochip –

Aimed at 3G Pico-Cells

Spinnaker –

18 processors (Scal. to 1^6)

18,000 neurons (Scal. To 10^9)

XMOS –

45

New version of Transputer

Occam

HandleC – HW synthesys (See also SystemC)

OpenCL – Smartphone's / Tablets (GPGPU)

... Success(?) requires a End-Product and Market of appropriate Scale.

OpenCL Enables Heterogeneity

Plug-in architecture

Mali-T604 Vithar GPU

ARM/NEON

Custom Device

Video Decoder

Discovery of computational units

Scheduling of work

46

... It does not automatically solve “which computation where”

Page 24: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

24

Architecture: A Viable Mix of Technology

YES: Power is a Major Concern ...

Power-Efficiency the way of recovering it.

Away from.. ..towards.. ..wherever possible.

But so is Productivity, NRE Cost, TTM, Quality ...But so is Productivity, NRE Cost, TTM, Quality ...

Reuse: As much as possible: Mech, Elect, SW, Acoustic, RF, Stacks, OS, Displays, Keyboards, etc.

Teams: Use people who know how to do the work (duh!)

Use External Expertise: It is seldom a differentiating factor in your Product.

Producible: Make something that can be economically made (duh!)

Performance: Competitive; don’t push the bounds of possibility.

New Technology: As little as possible

47

New Technology: As little as possible.

And so are Aesthetics ...

Colour, Style, Package, Availability, Quality, Business Model, etc ...

... Remember: The Product is the way to deliver a Compelling End-Customer Experience.

Conclusions

Multi-Processing makes sense in lots of Products today...

It will seldom be entire solutions (ie: Small markets)

It will seldom be homogeneous

If the Work-Load and Programming Models are good.

Physical Concurrency makes sense in lots of Products...

Mechanical, CPU/GPU, Optical, RF, MEM, SAW, etc

Simplifies Productivity, Design, Qualification, Quality and Reuse

Few Products (None) have the luxury of a Clean-Sheet Design...

Legacy is unavoidable

CPU is not the answer for everything...

48

‘Software’ is amongst the least energy efficient technology

DSP, Video HW and GPU can be (much) better

But Analogue and Mechanical are best

... Products can be Enabled or Disabled MP Technology

Page 25: Prof. Ian Phillips Despite the Law of Diminishing Returns...The Real-Time Execution of Models It is about creating a Functional-Model and an Execution-Platform for it, to meet Functional

25

The END ...The END ...Thanks for ListeningThanks for Listening

49

f gf g

Reading & References

The Innovator's Dilemma: When New Technologies Cause Great Firms to Fail (Disruptive Tech.) by Clayton M. Christensen: HBS Press, 1997

Open Innovation: The New Imperative for Creating and Profiting from Technology (Research in 21C) by Henry William Chesbrough : HBS Press, 2003

The World Is Flat (Globalisation) by Thomas L. Friedman: Penguin, 2005

Staying Power (Business) by Michael Cusumano: Oxford, 2010

A Short History of Nearly Everything (A different view on what we know) by Bill Bryson: Black Swan, 2003

The Voyages of the Beagle (Scientific Observation)

50

By Charles Darwin,1860

An Essay on the Principles of Population (Natural Competition) By Thomas Malthus,1789