Upload
others
View
5
Download
0
Embed Size (px)
Citation preview
Copyright © 2007 Altera Corporation
Vaughn Betz and Sanjay Rajput
Learn to Design with Stratix III FPGAs’ Programmable Power Technology
and Selectable Core Voltage
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 2
Agenda
The power challenge
Stratix® III power innovations
− Programmable power
− Selectable core voltage
− Quartus® II power optimization
− IO power: On-chip dynamic termination and DDR3
Device selection and competitive overview
Wrap up
Copyright © 2007 Altera Corporation
The Power Challenge
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 4
Altera’s End CustomersAltera’s End CustomersCommunications
ConsumerBroadcast
Automotive
Test, Measurementand Medical
Computer and Storage
Military and Industrial
Cellular
Basestations
Wireless LAN
Switches
Routers
Optical
Metro
Access
Broadband
Audio/video
Video display
Studio
Satellite
Broadcasting
Medical
Test equipment
Manufacturing
Card readers
Control systems
ATM
Navigation
Entertainment
Secure comm.
Radar
Guidance and control
Wireless
Networking
Wireline
Entertainment
Broadcast
Automotive
Instrumentation Military
Security and Energy Management
Servers
Mainframe
RAID
SAN
Copiers
Printers
MFP
Computers
Storage
Office Automation
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 5
The Challenge: System Design Trend
High-End
System
Version 1.0
Next Generation Design
New System RequirementsStratix III FPGAs
Higher processing performance
Customizable capabilities
Integration of more functions
Re-programmability
Similar physical constraints
High-End
System
Version 2.0
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 6
The Challenge: System Design Trend
High-End
System
Version 1.0
Next Generation Design
New System RequirementsStratix III FPGAs
Higher processing performance
Customizable capabilities
Integration of functions
Re-programmability
Similar physical constraints
Same or Lower Power Budget
High-End
System
Version 2.0
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 7
Meeting the Power ChallengePower
Density
1.5M Gates 3.0M Gates
Increased Performance
Increased Density
65 nm (Increased Leakage)
Stratix II
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 8
Meeting the Power ChallengePower
Density
1.5M Gates 3.0M Gates
Increased Performance
PowerBudget
Increased Density
65 nm (Increased Leakage)
Stratix III FPGAs cut power by 50% vs. 90 nm
Stratix III
Stratix II
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 9
Benefits of Lower Power
Stay within a fixed power budget− Chassis limits (heat, space, current)− Outside power budget not an option
Reduce system cost− Fewer/smaller heat sinks and fans− Smaller power supplies
Increase reliability− No fans no moving parts− Lower system temperature
Reduce design time and effort to meet power and thermal constraints
Copyright © 2007 Altera Corporation
Stratix III FPGAs: Lower Power, Higher Performance
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 11
Industry-Leading Low-Power Technology
Stratix III FPGA Power Reduction Technique
Lower Static Power
Lower Dynamic
Power
Silicon Process Optimizations
Programmable Power Technology
Selectable Core Voltage
(0.9 V or 1.1 V)
Power Optimized DDR Memory Interface
Quartus II Software PowerPlay Power Analysis and Optimization
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 12
Leading Edge Process TechnologyIncreased Performance, Reduced Power
Advanced 65-nm process− 15% capacitance reduction reduces dynamic power 15%
− Lower voltage reduces dynamic power another 16%
Multiple-gate oxide thicknesses (triple oxide) − Trade-off static power vs. speed per transistor
Multiple-threshold voltages− Trade-off static power vs. speed per transistor
Low-k inter-metal dielectric− Reduces dynamic power, increases performance
Strained silicon− Increased performance
Copper interconnect− Increased performance, reduced IR drop
Copyright © 2007 Altera Corporation
Programmable Power
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 14
Design-Specific Power Optimization Only a small fraction of logic is performance critical
Slack Histogram
PerformanceCritical
Not Performance Critical
0
1,000
2,000
3,000
4,000
5,000
6,000
7,000
8,000
0-10%
10-20%
20-30%
30-40%
40-50%
50-60%
60-70%
70-80%
80-90%
90-100%
Slack %
Nu
mb
er
of
Co
nn
ec
tio
ns
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 15
Programmable Power TechnologyLogic Array
High-Speed Logic
Timing Critical Path
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 16
Programmable Power TechnologyLogic Array
High-Speed Logic
Timing Critical Path
Low-Power Logic
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 17
Programmable Power Technology
High performance where you need it, lowest power everywhere else
Logic Array
High-Speed Logic
Timing Critical Path
Low-Power Logic
Unused Low-Power Logic
* Power mapping fully automated by Quartus II based on timing constraints
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 18
High Speed/Low Power
Low-power mode for a tile results in− 60% reduction in static power
− 5% reduction in dynamic power
− ~20% increase in delay
Quartus II CAD system doesn’t use on critical paths
No impact on system speed
Tiles can be− Pair of Logic Array Blocks (LABs)
− RAM block
− DSP block
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 19
High-Resolution Power Control
Speed of the fastest LABs
power of the slowest
Only a smallpercentage of
high-speed tiles required to
maintain design performance
Stratix III FPGA
(EP3SL340) has 8,050 Tiles for very high
resolution power/performance
optimization
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 20
Most Tiles Are Low PowerAll Clocks At Maximum Speed (Worst Case)
Average: 19% High Speed
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
1 7 13 19 25 31 37 43 49 55 61 67 73 79 85 91
Complete Customer Design
Hig
h S
pe
ed
Tile
s/D
evic
e T
iles
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 21
Static Power Tamed (85°C)
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
0 50K 100K 150K 200K 250K 300K
Number of LEs
Co
re S
tati
c P
ow
er
(W)
All Low-Power Tiles (1.1V)
Typical High-Performance Design (1.1V)
Stratix II Stratix III
350K
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 22
Real Customer Designs
Stratix II Advantage
Stratix III Advantage
FM
AX
Ra
tio
: S
tra
tix
III v
s.
Str
ati
x II
Low PowerWithout
SacrificingPerformance
Average: 25%Increase
Speed: Stratix III vs. Stratix II
0.75
1.00
1.25
1.50
1.75
Designs
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 23
Using Programmable Power
Circuit board requirements: none!
− Stratix III FPGAs create low-power tiles using on-chip circuitry
− No extra power supplies, no extra board components
Design changes: none!
− Quartus II software automatically uses high-speed tiles where needed for timing
− All unused tiles set to low power
− All tiles with timing margin set to low power
− Failed timing constraints: all tiles not on critical paths set to low power
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 24
Programmable Power Controls
High-speed tile usage always optimized
Extra effort lowers high-speed usage (by a few %)− Also reduces dynamic power (average 15% vs. off)
− At ~20% compile time cost
Advanced options under Fitter | More Settings
Copyright © 2007 Altera Corporation
Selectable Core Voltage
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 26
Selectable Core Voltage
Customer selects the FPGA core voltage
− 1.1 V for maximum performance
− 0.9 V for minimum power
I/O and PLL voltages unaffected
− Still get maximum I/O interface speed
− Crucial, since I/O bandwidth limits many systems
Nominal Voltage
Min. Regulator
vOUT
Max. Regulator
vOUT
Slow Timing Model
Fast Timing Model
Power Model
1.1 V 1.05 V 1.15 V
0.9 V 0.86 V 0.94 V
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 27
Selectable Core Voltage
Variable Voltage Region
Delay-LockedLoops (DLLs)
I/Os and Interface Circuitry
M9K Blocks
DSP Blocks
Phase-LockedLoops (PLLs)
ALMs
M144K Blocks
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 28
Power and Timing Impact
Core Voltage
Dynamic Power
Reduction From Stratix II FPGAs
Static Power Reduction From Stratix II FPGAs
Performance Gain Over Stratix II
FPGAs
1.1 V 33% 52% 25%
0.9 V 55% 64% 0%
More choice to meet your power
and performance budgets
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 29
Even Lower Static Power (85°C)
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
0 50K 100K 150K 200K 250K 300K
Number of LEs
Co
re S
tati
c P
ow
er
(W)
All low-power tiles (0.9 V)
All low-power tiles (1.1 V)
Typical high-performance design (1.1V)
Stratix II Stratix III
350K
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 30
Using Selectable Core Voltage
Provide 0.9 V and 1.1 V supplies to FPGA
− Risk mitigation: provide two supplies on board if concerned about power budget
− Set voltage regulator output to 0.9 V if power budget exceeded at 1.1 V
Quartus II software:
− Select -4L speed grade
− Select 0.9 V operation
Copyright © 2007 Altera Corporation
Power Optimized DDR Memory Interface
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 32
Stratix III On-Chip Termination (OCT)
Both parallel (Rt=50Ω) and series (Rs=50Ω)
termination
50 Ω
50 Ω
VTT
OE
25 Ω
50 Ω
OE
Zo=50 Ω
VTT
Memory Chip*Stratix III FPGA
(*) DDR 1/2/3, RLDRAMII, QDR II/II+ support
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 33
50 Ω
VTT
OE
Stratix III FPGA (TX) Memory Chip
50 Ω
OE
Zo=50 Ω
VTT
50 Ω
VTT
OE
Stratix III FPGA (RX) Memory Chip
50 Ω
OE
Zo=50 Ω
VTT
Write Read
Stratix III FPGA Dynamic OCT
Write: Rs on, Rt off → Matching line impedance
Read: Rs off, Rt on → Terminating far end
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 34
Benefits of Dynamic OCT
1. Power significantly reduced vs. traditional parallel OCT
Saves 1.6 W of DC power on 72-bit DDR2
bus
2. Proper line termination and impedance matching on bidirectional busses
Enhanced signal integrity
3. No need for on-board termination resistors
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 35
Stratix III FPGAs Support DDR3
Stratix III: The only FPGA that supports DDR3
DDR3 is 30% lower power than DDR2
− DDR2: 1.8V
− DDR3: 1.5V
Example system:
− 72-pin 200MHz memory interface, with on-chip termination
Conventional FPGA DDR2 power: 3.9 W
Stratix III (dynamic OCT) DDR2 power: 2.3W
Stratix III (dynamic OCT) DDR3 power: 1.6W
− Total savings of 2.3 W
Copyright © 2007 Altera Corporation
Quartus II Power Optimization
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 37
PowerPlay: Automatic Optimization
DesignEntry]
Constraints
Speed Area Power
Place and Route
Optimize Power
PowerPlay Power
Analyzer
Power-Optimized Design
Synthesis
Optimize Power
Set compiler settings
to focus on reducing power
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 38
Automatic Programmable Power
Place and Route
Synthesis
Timing Analyze
Tiles with Timing Slack Low Power
Unused Tiles Low Power
Done?No Yes
All High-Speed Tiles
Mostly Low-Power Tiles
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 39
Power-Optimized RAM Mapping
Power Efficient Option
32
2:4Decoder
Four 512x32 M9K RAMs
Default Option
32
Four 2Kx8 M9K RAMs
2K X 32 RAM
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 40
Power-Driven Place and Route Minimize capacitance of high-toggling signals
Without violating timing constraints
Power Optimize
100 Million Toggle/s20 Million Toggle/s
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 41
Clock Shut Down Hardware
Blue: Clock
Required
Only Red
Partsof Clock
NetworkToggle
Stratix III FPGAs: Can shut down clock at 3 levels of tree− Top-level: Shut down 1/16 of clock tree
− Next-level: 1 / 500 of clock tree
− Bottom-level: 1 / 10,000 of clock tree
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 42
Placement to Reduce Clock Power
Power Optimize
Group Clocks for Maximum Shutdown
Clocking Legal, Timing Optimized
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 43
PowerPlay Power Optimization
Automatic, but less accurate
Requires testbench,
more accurateRTL
Simulation
Power-Driven Fit
Design
Power Report
VectorlessEstimation
Power Analyzer
Normalor
Extra Effort
Power-DrivenSynthesis
Estimate Toggle Rates
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 44
Dynamic Power Optimization
0%
5%
10%
15%
20%
25%
30%
35%
40%
1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 65 69 73 77
Design
Dyn
am
ic P
ow
er
Re
du
ctio
n v
s.
Min
imu
m E
ffo
rt
Extra Effort
Normal Effort
Copyright © 2007 Altera Corporation
Device Selection and Competitive Overview
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 46
Device Selection for Power
Use Altera’s Early Power Estimator
Accurate
Allows what-if analysis of different voltages
Best option if no code has been written
After implementation, Quartus II PowerPlay Power Analyzer provides the best estimate
− Knows exact design utilization, block configurations, routing, signal behavior, etc.
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 47
Buyer Beware
Not all power estimators are accurate− Especially for dynamic power
Xilinx XPE – XPower estimator− Register power greatly underestimated
− IO power greatly underestimated
Example: estimating memory interface power
− Does not report any clock power – not modeled
− LUT fan-out unrealistically low (2)
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 48
How Good Are the Estimates?
Compare estimates to silicon measurements− Import design info from FPGA CAD tool to estimator
− Obtain dynamic power estimate with correct toggle rates
− Compare to silicon measurement of dynamic power
DesignStratix II EPE
Ver. 6.1 % Error
Virtex-5 XPEVer. 9.1 % Error
counter_16x1024 9% -40%
grey6 22% -56%
des3_6 -3% -81%
rijndael_iter -20% -31%
tessierbeamform_12beams 28% -36%
mult_18x18_32copies 14% -60%
ram_8192dx64wx1 13% -32%
Logic
Full Design
DSP Blocks
RAM Blocks
Negative number = Underestimation (Bad)
Positive number = Overestimation (Safe)
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 49
40% Power Advantage for Stratix III
0
Re
lati
ve
To
tal
Po
we
r
1.0
0.8
0.6
0.4
0.2
1.1
23%40%
Virtex-5 Stratix III 1.1V Stratix III 0.9V
Total Power For Typical Design
* Total power (Dynamic + Static + IO), 85C junction temp., equivalent density devices, average customer usage of logic, memory, multipliers, and IO.
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 50
Performance Benchmarking
Fastest speed grades are compared
Full timing constraints for each design
− Tight fMAX constraint for each clock domain
− I/O constraint on all pins
Best effort (true FPGA performance)
− Multiple compilations to get best result
See “How Fast is the Fastest FPGA” net seminar for more details
Compare QoR
Source HDL (Customer Designs)
Quartus II V.6.1 ISE V. 9.1
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 51
0.80
0.90
1.00
1.10
1.20
1.30
1.40
1.50
1.60
1.70
Customer Designs
Altera Advantage
Xilinx Advantage
Fm
ax R
atio
Str
atix I
II (
Fast)
and V
irte
x-5
(F
ast)
Stratix III (fast) vs. Virtex-5 (fast)
Stratix III FPGAs Average
One SpeedGrade Faster than
Virtex-5
Relative Core Performance Comparison
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 52
Summary
Stratix III FPGAs− Meet the power challenge of next generation designs− With the highest performance AND lowest power
Power reduction− Excellent process technology and engineering− Programmable power− Selectable core voltage− Power-efficient DDR interface− Quartus II software power optimization
Lower power and higher performance than any
competitive FPGA
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 53
More Resources
Stratix III website - www.altera.com/stratix3
− Click on “Programmable Power Technology” to learn more about this and other Stratix III power related features
Programmable Power White Paper
− http://www.altera.com/literature/wp/wp-01006.pdf
Stratix III EPE – power estimation spreadsheet
− http://www.altera.com/support/devices/estimator/pow-powerplay.jsp
− User guide available on this page
Download Quartus II software and start designing with Stratix III FPGAs today
− www.altera.com/download
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 54
Quartus II v6.1 Supports Stratix III FPGAs
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 55
Quartus II v6.1 Supports Stratix III FPGAs
Quartus II web edition v6.1
− Supports Stratix III devices: EP3SE50, EP3SL70
Quartus II subscription edition v6.1
− Supports all Stratix III devices
Copyright © 2007 Altera CorporationCopyright © 2007 Altera Corporation 56
Additional Stratix III Net Seminars
Overview of Altera’s 65-nm Stratix III FPGAs (10 minutes)
Using Stratix III FPGAs to Achieve Higher Performance
Systems with Lower Power (60 minutes)
How Fast is the Fastest FPGA? Stratix III Performance Capabilities (60 minutes)
Upcoming Stratix III Net Seminar (June 2006): How to Maximize Performance with Stratix III FPGAs Using Quartus II Software (60 minutes)