91
1 Instruction Set Principles and Examples 游游游

1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

Embed Size (px)

Citation preview

Page 1: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

1

Instruction Set Principles and Examples

游象甫

Page 2: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

2

Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands Operations in the instruction set Instructions for control flow Encoding an instruction set The Role of compilers The MIPS architecture

Page 3: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

3

Brief Introduction to ISA Instruction Set Architecture: a set of

instructions Each instruction is directly executed

by the CPU’s hardware How is it represented?

By a binary format since the hardware understands only bits

Concatenate together binary encoding for instructions, registers, constants, memories

From 柯皓仁 , 交通大學

Page 4: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

4

Brief Introduction to ISA (cont.) Options - fixed or variable length formats

Fixed - each instruction encoded in same size field (typically 1 word)

Variable – half-word, whole-word, multiple word instructions are possible

Typical physical blobs are bits, bytes, words, n-words

Word size is typically 16, 32, 64 bits today

From 柯皓仁 , 交通大學

Page 5: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

5

An Example of Program Execution

Command Load AC from

Memory Add to AC from

memory Store AC to

memory Add the

contents of memory 940 to the content of memory 941 and stores the result at 941

Fetch Execution

From 柯皓仁 , 交通大學

Page 6: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

6

A Note on Measurements We’re taking the quantitative approach BUT measurements will vary:

Due to application selection or application mix Due to the particular compiler being used Also dependent on compiler optimization

selection And the target ISA

Hence the measurements we’ill talk about Are useful to understand the method Are a typical yet small sample derived from

benchmark codes

From 柯皓仁 , 交通大學

Page 7: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

7

Instruction Set Design

timeCycleCPIICTimeCPU _**_

The instruction set influences everything

From 柯皓仁 , 交通大學

Page 8: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

8

Characteristics of Instruction Set

Page 9: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

9

Classifying Instruction Set Architectures By the type of internal storage in a

processor

Page 10: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

10

By the Type of Internal Storage - Stack

Push A

Push B

Add

Pop C

Page 11: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

11

By the Type of Internal Storage – Accumulator

Load A

Add B

Store C

Page 12: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

12

By the Type of Internal Storage – Register-Memory

Load R1, A

Add R3, R1, B

Store R3, C

Page 13: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

13

By the Type of Internal Storage – Register (load-store)

Load R1, A

Load R2, B

Add R3, R1, R2

Store R3, C

Page 14: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

14

Pro’s and Con’s of Stack, Accumulator, Register Machine

From 柯皓仁 , 交通大學

Page 15: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

15

Classifying Instruction Set Architectures (cont.) A load-store architecture survived

because Registers are faster than memory Registers are more efficient for a

compiler to use (A * B) - (B * C) – (A * D) -> evaluated in

any order Hold variables

Page 16: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

16

Instruction Set Characteristics of General-Purpose Register (GPR) Architectures Whether an ALU instruction has two or

three operands In the three-operand format, the instruction

contains one result operand and two source operands

In the two-operand format, one of the operands is both a source and a result for the operation

How many of the operands may be memory addresses in ALU instructions

Page 17: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

17

Combinations of Number of Memory Addresses and Operands Allowed

Number of memory address

Maximum number of operands allowed

Types of architecture

Examples

0 3 Register-register

Alpha, ARM, MIPS, PowerPC, SPARC, SuperH, Trimedia TM5200

1 2 Register-memory

IBM 360/370, Intel 80x86, Motorola 68000, TI TMS320C54x

2 2 Memory-memory

VAX

3 3 Memory-memory

VAX

Page 18: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

18

Compare Three Common General -Purpose Register Computers

From 柯皓仁 , 交通大學where (m,n) means m memory operands and n total operands

Page 19: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

19

Instruction Characteristics Memory Addressing Type and Size of Operands Operations in the Instruction Set Instructions for Control Flow Encoding an Instruction Set

Page 20: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

20

Memory Addressing

Page 21: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

21

Memory Addressing How memory addresses are

interpreted Endian order Alignment

How architectures specify the address of an object they will access Addressing modes

Page 22: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

22

Memory Addressing (cont.) All instruction sets discussed in this

book are byte addressed The instruction sets provide access for

bytes (8 bits), half words (16 bits), words (32 bits), and even double words (64 bits)

Two conventions for ordering the bytes within a larger object Little Endian Big Endian

Page 23: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

23

Little Endian The low-order byte of an object is stored in

memory at the lowest address, and the high-order byte at the highest address. (The little end comes first.)

For example, a 4-byte object (Byte3 Byte2 Byte1 Byte0) Base Address+0 Byte0 Base Address+1 Byte1 Base Address+2 Byte2 Base Address+3 Byte3

Intel processors (those used in PC's) use "Little Endian" byte order.

Dr. William T. Verts An Essay on Endian Order, http://www.cs.umass.edu/~verts/cs32/endian.html, April 19, 1996

Page 24: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

24

Big Endian The high-order byte of an object is

stored in memory at the lowest address, and the low-order byte at the highest address. (The big end comes first.)

For example, a 4-byte object (Byte3 Byte2 Byte1 Byte0) Base Address+0 Byte3 Base Address+1 Byte2 Base Address+2 Byte1 Base Address+3 Byte0

Dr. William T. Verts An Essay on Endian Order, http://www.cs.umass.edu/~verts/cs32/endian.html, April 19, 1996

Page 25: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

25

Endian Order is Also Important to File Data Adobe Photoshop -- Big Endian BMP (Windows and OS/2 Bitmaps) -- Little

Endian DXF (AutoCad) -- Variable GIF -- Little Endian JPEG -- Big Endian PostScript -- Not Applicable (text!) Microsoft RIFF (.WAV & .AVI) -- Both, Endian

identifier encoded into file Microsoft RTF (Rich Text Format) -- Little Endian TIFF -- Both, Endian identifier encoded into file

Dr. William T. Verts An Essay on Endian Order, http://www.cs.umass.edu/~verts/cs32/endian.html, April 19, 1996

Page 26: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

26

Memory Addressing (cont.) Alignment restrictions

Accesses to objects larger than a byte must be aligned

An access to an object of size s bytes at byte address A is aligned if A mod s = 0

A misaligned access takes multiple aligned memory references

See Fig. 2.5

Page 27: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

27

Addressing Modes Addressing modes can significantly

reduce instruction counts but add the complexity of building a computer and may increase the average CPI

How architectures specify the address of an object they will access? Constants Register Locations in memory

Page 28: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

28

Example for Addressing Modes

Addressing mode

Example instruction

Meaning When used

Register Add R4,R3 Regs[R4] <- Regs[R4] + Regs[R3]

When a value is in a register

Immediate Add R4,#3 Regs[R4] <- Regs[R4] + 3

For constants

Displacement Add R4,100(R1)

Regs[R4] <- Regs[R4] + Mem[100 + Regs[R1]]

Accessing local variables (+ simulates register redirect, direct addressing modes)

Register indirect

Add R4, (R1) Regs[R4] <- Regs[R4] + Mem[Regs[R1]]

Accessing using a pointer or a computed address

Page 29: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

29

Example for Addressing Modes (cont.)

Addressing mode

Example instruction

Meaning When used

Indexed Add R3,(R1+R2)

Regs[R3] <- Regs[R3] + Mem[Regs[R1] + Regs[R2]]

Sometimes useful in array addressing: R1 = base of array; R2= index amount

Direct or absolute

Add R1,(1001)

Regs[R1] <- Regs[R1] + Mem[1001]

Sometimes useful for accessing static data; address constant may need to be large

Memory indirect

Add R1,@(R3)

Regs[R1] <- Regs[R1] + Mem[Mem[Regs[R3]]]

If R3 is the address of a pointer p, the mode yields *p

Page 30: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

30

Example for Addressing Modes (cont.)

Addressing mode

Example instruction

Meaning When used

Autoincrement

Add R1,(R2)+

Regs[R1] <- Regs[R1] + Mem[Regs[R2]]Regs[R2] <- Regs[R2] + d

Useful for stepping through arrays within a loop. R2 points to start of array; each reference increments R2 by size of an element, d

Autodecrement

Add R1,-(R2) Regs[R2] <- Regs[R2] – dRegs[R1] <- Regs[R1] + Mem[Regs[R2]]

Same use as autoincrement. Autodecrement/-increment can also act as push/pop to implement a stack

Scaled Add R1,100(R2)[R3]

Regs[R1] <- Regs[R1] + Mem[100 + Regs[R2] + Regs[R3] * d]

Used to index arrays. May be applied to any indexed addressing mode in some computers

Page 31: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

31

Summary of Use of Memory Addressing Mode

displacement, immediate, and register indirect addressing modes represent 75% to 99% of the addressing mode usage

For VAX architecture

Page 32: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

32

Displacement Addressing Mode What’s an appropriate range of the

displacements?

For Alpha architecture

The size of address should be at least 12-16 bits, which capture 75% to 99% of the displacements

Page 33: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

33

Immediate or Literal Addressing Mode Does the mode need to be supported

for all operations or for only a subset?

Page 34: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

34

Immediate Addressing Mode (cont.) What’s a suitable range of values for

immediates?

For Alpha architecture

The size of the immediate field should be at least 8-16 bits, which capture 50% to 80% of the immediates

Page 35: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

35

Addressing Modes for Signal Processing DSPs deal with infinite, continuous

streams of data, they routinely rely on circular buffers Modulo or circular addressing mode

For Fast Fourier Transform (FFT) Bit reverse addressing 0112 1102

Page 36: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

36

Frequency of Addressing Modes for TI TMS320C54x DSP

From 柯皓仁 , 交通大學

Page 37: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

37

Type and Size of Operands

Page 38: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

38

Type and Size of Operands How is the type of an operand designated?

Encoding in the opcode For an instruction, the operation is typically specified

in one field, called the opcode By tag (not used currently)

Common operand types Character

8-bit ASCII 16-bit Unicode (not yet used)

Integer One-word 2’s complement

Page 39: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

39

Common Operand Types (cont.)

Single-precision floating point One-word IEEE 754

Double-precision floating point 2-word IEEE 754

Packed decimal (binary-coded decimal)

4 bits encode the values 0-9 2 decimal digits are packed into one byte

Page 40: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

40

Distribution of Data Access

For SPEC benchmarks

Page 41: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

41

Operands for Media and Signal Processing Vertex

(x, y, z) + w to help with color or hidden surfaces

32-bit floating-point values Pixel

(R, G, B, A) Each channel is 8-bit

Page 42: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

42

Special DSP Operands Fixed-point numbers

A binary point just to the right of the sign bit Represent fractions between –1 and +1 Need some registers that are wider to guard

against round-off error Round-off error

a computation by rounding results at one or more intermediate steps, resulting in a result different from that which would be obtained using exact numbers

Page 43: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

43

Fixed-point Numbers (cont.)

2‘ complement number Fixed-point numbers

Douglas L. Jones, http://cnx.org/content/m11930/latest/

Page 44: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

44

Example Give three 16-bit patterns:0100 0000 0000 00000000 1000 0000 00000100 1000 0000 1000What values do they represent if they are two’s

complement integers? Fixed-point numbers? AnswerTwo’s complement: 214, 211, 214 + 211 + 23

Fixed-point numbers: 2-1, 2-4, 2-1 + 2-4 + 2-12

Page 45: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

45

Operand Type and Size in DSP

From 柯皓仁 , 交通大學

Page 46: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

46

Operations in Instruction Sets

Page 47: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

47

What Operations are Needed Arithmetic and Logical

Add, subtract, multiple, divide, and, or Data Transfer

Loads-stores Control

Branch, jump, procedure call and return, trap

System Operating system call, virtual memory

management instructionsAll computers provide the above operations

Page 48: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

48

What Operations are Needed (cont.) Floating Point

Add, multiple, divide, compare Decimal

Add, multiply, decimal-to-character conversions

String move, compare, search

Graphics pixel and vertex operations,

compression/decompression operationsThe above operations are optional

Page 49: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

49

Top 10 Instructions for the 80x86 load: 22% conditional branch: 20% compare: 16% store: 12% add: 8% and: 6% sub: 5% move register-register:

4% call: 1% return: 1%

The most widely executed instructions are the simple operations of an instruction set

The top-10 instructions for 80x86 account for 96% of instructions executed

Make them fast, as they are the common case

From 柯皓仁 , 交通大學

Page 50: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

50

Operations for Media and Signal Processing Partitioned add

16-bit data with a 64-bit ALU would perform four 16-bit adds in a single clock cycle

Single-Instruction Multiple-Data (SIMD) or vector

Paired single operation Pack two 32-bit floating-point

operands into a single 64-bit register

Page 51: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

51

Operations for Media and Signal Processing (cont.) Saturating arithmetic

If the result is too large to be represented, it is set to the largest representable number, depending on the sign of the result

Several modes to round the wider accumulators into the narrower data words

Multiply-accumulate instructions a <- a + b*c

Page 52: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

52

Instructions for Control Flow

Page 53: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

53

Instructions for Control Flow Jump

The change in control is unconditional Branch

The change is conditional Procedure call Procedure return

Page 54: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

54

Distribution of Control Flows

Page 55: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

55

Addressing Modes for Control Flow Instructions How to get the destination address of a

control flow instruction? PC-relative

Supply a displacement that is added to the program counter (PC)

Position independence Permit the code to run independently of where it is

loaded

A register contains the target address The jump may permit any addressing mode

to be used to supply the target address

Page 56: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

56

Usage of Register Indirect Jumps Case or switch statements Virtual functions or methods High-order functions or function

pointers Dynamically shared libraries

Page 57: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

57

How Far are Branch Targets from Branches?

For Alpha architecture

The most frequent in the integer? programs are to targets that can be encoded in 4-8 bitsAbout 75% of the branches are in the forward direction

Page 58: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

58

How to Specify the Branch Condition?

From 柯皓仁 , 交通大學

Program Status Word

Page 59: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

59

Frequency of Different Types of Compares in Branches

Page 60: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

60

Procedure Invocation Options The return address must be saved

somewhere, sometimes in a special link register or just a GPR

Two basic schemes to save registers Caller saving

The calling procedure must save the registers that it wants preserved for access after the call

Callee saving The called procedure must save the registers it

want to use

Page 61: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

61

Encoding an Instruction Set

Page 62: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

62

Encoding an Instruction Set How the instructions are encoded into a

binary representation for execution? Affects the size of code Affects the CPU design

The operation is typically specified in one field, called the opcode

How to encode the addressing mode with the operations Address specifier Addressing modes encoded as part of the

opcode

Page 63: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

63

Issues on Encoding an Instruction Set Desire for lots of addressing modes and

registers Desire for smaller instruction size and

program size with more addressing modes and registers

Desire to have instructions encoded into lengths that will be easy to handle in a pipelined implementation Multiple bytes, rather than arbitrary bits Fixed-length

Page 64: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

64

3 Popular Encoding Choices Variable

Allow virtually all addressing modes to be with all operations

Fixed A single size for all instructions Combine the operations and the addressing modes

into the opcode Few addressing modes and operations

Hybrid Size of programs vs. ease of decoding in the

processor Set of fixed formats

Page 65: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

65

3 Popular Encoding Choices (Cont.)

Page 66: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

66

Reduced Code Size in RISCs More narrower instructions Compression

Page 67: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

67

Summary – Encoding the Instruction Set Choice between variable and fixed

instruction encoding Code size than performance ->

variable encoding Performance than code size -> fixed

encoding

Page 68: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

68

Role of Compilers

Page 69: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

69

Compiler vs. ISA Almost all programming is done in

high-level language (HLL) for desktop and server applications

Most instructions executed are the output of a compiler

So, separation from each other is impractical

Page 70: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

70

Goals of a Compiler Correctness Speed of the compiled code Others

Fast compilation Debugging support Interoperability among languages

Page 71: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

71

Structure of Recent Compilers

Page 72: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

72

Structure of Recent Compilers (cont.) Multi-pass structure

Easy to write bug-free compilers Make assumptions about the ability of later

steps to deal with certain problems Phase-ordering problem

Ex. 1 choose which procedure calls to expand inline before they know the exact size of the procedure being called

Ex. 2 Global common sub-expression elimination Find two instances of an expression that compute

the same value and saves the result of the first one in a temporary

Assume a register, rather than memory, will be allocated to save the result

Page 73: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

73

Optimization Types High level optimizations

Done on the source Local optimizations

Done on basic sequential block (straight-line code)

Global optimizations Extend the local optimizations across

branches and loops

Page 74: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

74

Optimization Types (Cont.) Register allocation

Use graph coloring (graph theory) to allocate registers

NP-complete Heuristic algorithm works best when

there are at least 16 (and preferably more) registers

Processor-dependent optimizations

Page 75: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

75

Major Types of Optimizations and Example in Each Class

From 柯皓仁 , 交通大學

Page 76: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

76

Change in IC Due to Compiler Optimization

Level 1: local optimizations, code scheduling, and local register allocation

Level 2: global optimization, loop transformation (software pipelining), global register allocation

Level 3: procedure integration

Page 77: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

77

Optimization Observations Hard to reduce branches Biggest reduction is often memory

references Some ALU operation reduction happens but

it is usually a few % Implication:

Branch, Call, and Return become a larger relative % of the instruction mix

Control instructions are the hardest to speed up

From 柯皓仁 , 交通大學

Page 78: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

78

Impact of Compiler Technology on the Architect’s Decisions

Important questions How are variables allocated and

addressed? How many registers will be needed?

An example Variable alias on register allocation

p= &aa = …*p = ……a…

Page 79: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

79

How can Architects Help Compiler Writers Provide Regularity

Address modes, operations, and data types should be orthogonal (independent) of each other

Simplify code generation especially multi-pass Counterexample: restrict what registers can be used

for a certain classes of instructions Provide primitives, not solutions

Special features that match a HLL construct are often un-usable

What works in one language may be detrimental to others

From 柯皓仁 , 交通大學

Page 80: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

80

How can Architects Help Compiler Writers (Cont.) Simplify trade-offs among alternatives

How to write good code? What is a good code?

Metric: IC or code size (no longer true) caches and pipeline…

Help compiler writers understand the costs of alternatives

Provide instructions that bind the quantities known at compile time as constants

Page 81: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

81

Short Summary An ISA has at least 16 GPR (not counting for

FP registers) to simplify allocation of registers Orthogonality suggests all supported

addressing modes apply to all instructions that transfer data

Other advices Provide primitives instead of solutions Simplify trade-offs between alternatives Don’t bind constants at run time

Counterexample – Lack of compiler support for multimedia instructions

From 柯皓仁 , 交通大學

Page 82: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

82

The MIPS Architecture

Page 83: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

83

MIPS64 A Simple load-store instruction set Design for pipelining efficiency,

including a fixed instruction set encoding

Efficiency as compiler target

Page 84: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

84

Register for MIPS 32 64-bit integer GPRs (or integer

registers) R0, R1, ... R31, R0= 0 always

32 FPRs for single (32 bits) or double precision (64

bits) F0, F1, ... , F31

Extra status registers Ex, floating-point status register

Page 85: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

85

Data Types for MIPS 8-bit bytes, 16-bit half words, 32-bit

words, and 64-bit double words for integer data

32-bit single precision and 64-bit double precision for FP

MIPS64 operations work on 64-bit integer and 32- or 64-bit floating point Bytes, half words, and words are loaded into

the GPRs with zeros or the sign bit replicated to fill the 64 bits of the GPRs

Page 86: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

86

Addressing Modes for MIPS Data Transfers Immediate and displacement

With 16-bit field Displacement

Add R4, 100(R1)Regs[R4] <- Regs[R4] + Mem[100 +

Regs[R1]] Register-indirect

Placing 0 in the displacement field Add R4, (R1)Regs[R4] <- Regs[R4] + Mem[Regs[R1]]

Page 87: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

87

Addressing Modes for MIPS Data Transfers (cont.) Absolute addressing

Using R0 as the base register Add R1, (1001)Regs[R4] <- Regs[R4] + Mem[1001]

MIPS memory Byte addressable with 64-bit address Mode selection for Big Endian or Little Endian All references between memory and either

GPRs or FPRs are through loads and stores

Page 88: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

88

MIPS Instruction Format Encode

addressing mode into the opcode

All instructions are 32 bits with a 6-bit primary opcode

Page 89: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

89

MIPS Instruction Format (Cont.)

opcode rs rt Immediate

6 5 5 16I-Type Instruction

Loads and Stores LW R1, 30(R2) S.S F0, 40(R4) ALU ops on immediates DADDIU R1, R2, #3

rt <-- rs op immediate Conditional branches BEQZ R3, offset

rs is the register checked rt unused immediate specifies the offset

Jump registers, jump and link register JR R3 rs is target register rt and immediate are unused but = 011

From 柯皓仁 , 交通大學

Page 90: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

90

MIPS Instruction Format (Cont.)

R-Type Instruction

J-Type Instruction: Jump, Jump and Link, Trap and return from exception

opcode Offset added to PC

6 26

Register-register ALU operations: rdrs funct rt DADDU R1, R2, R3

Function encodes the data path operations: Add, Sub...

read/write special registers

Moves

opcode rs rt shamt

6 5 5 5 5 6

rd func

From 柯皓仁 , 交通大學

Page 91: 1 Instruction Set Principles and Examples 游象甫. 2 Outline Introduction Classifying instruction set architectures Memory addressing Type and size of operands

91

Homework 2.3, 2.5, 2.6, 2.11