7
PES PES Institute of Technology & Institute of Technology & Management Management Department of Electronics &Communication Department of Electronics &Communication Engineering Engineering SYNOPSIS SYNOPSIS Of VLSI Design and Implementation of Low Power MAC Unit with Block Enabling TechniqueBy 1. Miss. Shreedevi S.N 2. Miss. Medha B 3. Miss.Rakheeshree L.R 4. Miss.Bhagyalakshmi S.M 1

VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique

Embed Size (px)

Citation preview

Page 1: VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique

PESPES Institute of Technology &Institute of Technology & ManagementManagement

Department of Electronics &CommunicationDepartment of Electronics &Communication EngineeringEngineering

SYNOPSISSYNOPSIS

Of

“VLSI Design and Implementation of Low Power MAC Unit with Block

Enabling Technique”

By

1. Miss. Shreedevi S.N

2. Miss. Medha B

3. Miss.Rakheeshree L.R

4. Miss.Bhagyalakshmi S.M

Project Guide HOD Mr.Praveen J Dr.Ravi M. Yadahalli

1

Page 2: VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique

Assistant professor E&C Department E&C Department PESITM

VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique

INTRODUCTION

In the majority of digital signal processing (DSP) applications the critical operationsare the multiplication and accumulation. Real-time signal processing requires high speedand high throughput Multiplier-Accumulator (MAC) unit that consumes low power, whichis always a key to achieve a high performance digital signal processing system. Thepurpose of this work is, design and implementation of a low power MAC unit with blockenabling technique to save power. Firstly, a 1-bit MAC unit is designed, with appropriategeometries that gives optimized power, area and delay. The delay in the pipeline stages inthe MAC unit is estimated based on which a control unit is designed to control the dataflow between the MAC blocks for low power. Similarly, the N-bit MAC unit is designedand controlled for low power using a control logic that enables the pipelined stages atappropriate time. The adder cell designed has advantage of high operational speed, smalltransistor count and low power. The MAC is implemented on a 0.18um CMOS technologyusing CADENCE VIRTUOSO tool. This work also investigates on various architectures ofmultipliers and adders which are suitable for implementation of high throughput signalprocessing and at the same time to achieve low power consumption.

Present problems in MAC architecture:In the majority of digital signal processing (DSP) applications the

critical operations usually involve many multiplications and/or accumulations. For real-time signal processing, a high speed and high throughput Multiplier-Accumulator (MAC) is always a key to achieve a high performance digital signal processing system. In the last few years, the main consideration of MAC design is to enhance its speed. This is because, speed and throughput rate is always the concern of digital signal

2

Page 3: VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique

processing system. But for the epoch of personal communication, low power design also becomes another main design consideration. This is because, battery energy available for these portable products limits the power consumption of the system.

Solution to overcome the problem:The main motivation of this work is to investigate various VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique ,pipelined multiplier/accumulator architectures and circuit design techniques which are suitable for implementing high throughput signal processing algorithms and at the same time achieve low power consumption. A conventional MAC unit consists of multiplier and an accumulator that contains the sum of the previous consecutive products. The function of the MAC unit is given by the following equation:

F = Σ Ai Bi

Multiplier and Accumulator Unit:

figure 1: Basic structure of MAC

MAC is composed of an adder, multiplier and an accumulator. Usually adders implemented are Carry-Select or Carry-Save adders, as speed is of utmost importance in DSP. One implementation of the multiplier could be as a parallel array multiplier. The inputs for the MAC are to be fetched from memory location and fed to the multiplier block of the MAC, which will perform multiplication and give the result to adder which will accumulate the result and then will store the result into a memory location. This entire process is to be achieved in a single clock cycle.

Wallace tree multiplier is used instead of conventional multiplier because Wallace tree multiplier can increase the MAC unit design speed. Ripple Carry Adder (RCA) is used as

3

Page 4: VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique

an accumulator in this design. Apparently, together with the utilization of Wallace tree multiplier approach, carry save adder in the final stage of the Wallace tree multiplier and Ripple Carry adder as the accumulator, this MAC unit design is not only reducing the standby power consumption but also can enhance the MAC unit speed so as to gain better system performance. The operation of the designed MAC unit is as in the above equation. The product of Ai X Bi is always fed back into the Ripple Carry accumulator and then added again with the next product Ai x Bi. This MAC unit is capable of multiplying and adding with previous product consecutively up to as many as N times. The total design area is also being inspected by observing the total count of transistors. Power delay product is calculated by multiplying the power consumption result with the time delay.

Wallace tree MultiplierThe design analysis starts with the analysis of elementary algorithm

for multiplication by Wallace tree multiplier which makes use of CSA ( Carry Save Adder). Methodology Used:

Block Enabling Technique:In any MAC unit, data flows from the input register to the output

register through multiple stages such as, multiplier stage, adder stage and the accumulator stage as shown in figure . In block enabling technique, we find the delay of each stage. Every block gets enabled only after the expected delay. For the entire duration until the inputs are available,the successive blocks are disabled, thus saving power.

General Block Diagram of a Pipeline MAC with block enabling Technique

4

Page 5: VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique

General Block Diagram of a Pipeline MAC with block enabling Technique

VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique

Conclusion:

A full-adder circuit based on MUX is used for MAC architecture. Compared to other full-adder circuits, the MUX based full adder has the highest operational speed and less transistor count. The basic building blocks for the MAC unit are identified and each of the blocks is analyzed for its performance. Power and delay is calculated for the blocks. 1-bit MAC unit is designed with enable to reduce the total power consumption based on blockenable technique. Using this block, the N-bit MAC unit is constructed and the total power consumption is to be calculated. By using power reduction techniques adopted in this work, power is saved. The MAC unit designed in this work can be used in filter realizations for High speed DSP Application.

5

Page 6: VLSI Design and Implementation of Low Power MAC Unit with Block Enabling Technique

References[1] S.J. Jou, C.Y.Chen, E.C. Yang, and C.C.Su(1995), “A pipelined Multiplier-accumulator using a high speed, low power static and dynamic full adder design”, IEEE Custom Integrated circuit conference, 1995, pp. 593-5961[2] Anantha. P. Chandrakasan, Samuel Sheng, Robert W. Brodersen, “Low-Power CMOS digital design(1992),” IEEE Journal of Solid-State Circuits, Vol 27, No. 4, April, 1992[3] Neil H.E. Weste ,and David Harris, CMOS VLSI Design: a circuits and systems perspective, Addison-Wesley Publishing Company, 3rd ed.[4] S.J. Jou, C.Y.Chen, E.C. Yang, and C.C.Su(1997), “A pipelined multiplier-accumulator using a high speed, low power static and dynamic full adder design”, IEEE journal of Solid-state circuits, vol.32, no.1, Jan.1997,pp.114-118[5] M.Suzuki, N.Ohkubo, T.Shinbo, T.Yamanaka, A.Shimizu, K.Sasaki, and Y. Nakagome(1993), “A 1.5ns 32-bit CMOS ALU in Double pass-transistor logic”, IEEE Journal of Solid state circuits, vol.28, no. 11, November 1993, pp.1145-1151[6] F. Lu and H. Samulei(1993), “A 200-MHz CMOS pipelined multiplier- accumulator using a quasi-domino dynamic full adder cell design”, IEEE J. Solid state circuits, vol.28, pp.123- 132,

6