21
DSP Architecture s Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai – 600 036 [email protected]

DSP Architectures Additional Slides

  • Upload
    chul

  • View
    41

  • Download
    2

Embed Size (px)

DESCRIPTION

DSP Architectures Additional Slides. Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –600 036 [email protected]. Figure 4.3(a) Block diagram of a barrel shifter. Figure 4.3(b) Implementation of a 4-bit, shift-right barrel shifter. - PowerPoint PPT Presentation

Citation preview

Page 1: DSP Architectures Additional Slides

DSP Architecture

sAdditional Slides

Professor S. SrinivasanElectrical Engineering Department

I.I.T.-Madras, Chennai –600 [email protected]

Page 2: DSP Architectures Additional Slides

Figure 4.3(a) Figure 4.3(a) Block diagram of a barrel shifterBlock diagram of a barrel shifter

Page 3: DSP Architectures Additional Slides

Figure 4.3(b) Figure 4.3(b) Implementation of a 4-bit, shift-right Implementation of a 4-bit, shift-right barrel shifterbarrel shifter

Page 4: DSP Architectures Additional Slides

Figure 4.5 Figure 4.5 A MAC unit with accumulator guard bitsA MAC unit with accumulator guard bits

Page 5: DSP Architectures Additional Slides

Figure 4.6 Figure 4.6 A schematic diagram of the A schematic diagram of the saturation logicsaturation logic

Page 6: DSP Architectures Additional Slides

Figure 4.7 Figure 4.7 Block diagram of an arithmetic logic Block diagram of an arithmetic logic unitunit

Page 7: DSP Architectures Additional Slides

Figure 4.9 Figure 4.9 Register pointer updating algorithm for Register pointer updating algorithm for circular buffer addressing mode: SAR circular buffer addressing mode: SAR = = start start address register contents, EAR address register contents, EAR = = end address end address

register contents, PNTR register contents, PNTR = = pointerpointer

Page 8: DSP Architectures Additional Slides

Figure 4.10 Figure 4.10 Different cases that arise in Different cases that arise in updating the pointer in circular buffer updating the pointer in circular buffer

addressing modeaddressing mode

Page 9: DSP Architectures Additional Slides

Figure 4.10 ContinuedFigure 4.10 Continued

Page 10: DSP Architectures Additional Slides

Figure 4.11 Figure 4.11 Block diagram of an address Block diagram of an address generation unitgeneration unit

Page 11: DSP Architectures Additional Slides

Bit-reversal Hardware

Page 12: DSP Architectures Additional Slides

Figure 4.12 Figure 4.12 A conceptual diagram of a program A conceptual diagram of a program sequencersequencer

Page 13: DSP Architectures Additional Slides

Instruction Level Parallelism

VLIW architecture

• Each instruction specifies several operations to be done in parallel

• Advantages : Simple hardware

compilers can spot ILP easily

• Disadvantages : Little compatibilty between generations

Explicit NOPs bloat code size

Page 14: DSP Architectures Additional Slides

Super scalar architecture

• Hardware responsible for finding ILP in a sequential program

• Advantage : Compatibility between generations

• Disadvantage : Very complex hardware

Page 15: DSP Architectures Additional Slides

Explicitly Parallel Instruction Computing (EPIC)

• Combines VLIW and super scalar architectures

• Instructions are grouped into 3 operating blocks and a template block

• Template block tells hardware if instructions can be executed in parallel

• Also gives information whether the block can be executed in parallel

Page 16: DSP Architectures Additional Slides

ILP versus Power

Increasing instructions / cycle

Requires fewer cycles to execute a task

Uses longer clock for same performance

Uses lower supply voltage

And hence uses less power

However, too many functional units and too many transitions per clock cycle increase power consumption.

Page 17: DSP Architectures Additional Slides

Low Power architecture

Power consumed by additional circuits vs. ability to lower clock rate while maintaining performance

Circuits must be highly used

Move complexity into software

Voltage scaling : Reduce Vdd

Clock gating : Turn off clock when chip is not in use ( applies to

sub-modules of chip also)

Page 18: DSP Architectures Additional Slides

VLIW is more suitable than super scalar for low power

- VLIW is smaller for same number of functional units

- Compiler is better at finding parallelism than hardware

Put multiple processors on chip rather than lots of functional units in one processor

Helps in running independent tasks

Page 19: DSP Architectures Additional Slides

General Purpose Microprocessor 2000 GHz clock speed 32-bit address or more 32-bit bus, 128-bit instructions Complex MMU Super scalar CPU MMX instructions On chip cache Single cycle execution 32-bit floating point ALU on board Very expensive 10s of watts of power

Page 20: DSP Architectures Additional Slides

DSP in 2000

Clock 100 ~ 200 MHz

16-bit floating point or 32-bit floating point

16-24 bits address space

Large on-chip and off-chip memories

Single cycle execution of most instructions

Harvard architecture

Lots of special DSP instructions

50 mw to 2w power

Cheap

Page 21: DSP Architectures Additional Slides

Future of DSP Microprocessor

Sufficiently unique for an independent class of applications (HDD, cell phone)

Low power consumption, low cost

High performance within power, cost

constraints (MIPS/mw, MIPS/$)

Fixed point & floating point

Better compilers - but users must be informed

Hybrid DSP/ GP systems