30
DESIGNING TOMORROW’S CHIPS Yatin Hoskote Director, Soc Technology and IP Intel labs MEMOCODE Oct 2013

EC Q2 Communication

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: EC Q2 Communication

DESIGNING TOMORROW’S CHIPSYatin Hoskote

Director, Soc Technology and IP

Intel labs

MEMOCODE Oct 2013

Page 2: EC Q2 Communication

MEMOCODE 2013 2

The number of devices is exploding

What are compute devices going to look like in the100 billion device market?

Page 3: EC Q2 Communication

MEMOCODE 2013

Wearable Computing

Mobile

Computing

Invisible Computing

Intelligent Computing Everywhere

3

Page 4: EC Q2 Communication

MEMOCODE 2013

IoT Revolution

4

Car

Work

Home

My Devices

Retail

Healthcare

Buildings

Transportation

Resources

Cities

Personal Cloud

Energy

Fully Evolved IoT = Fully Interconnected and Interoperable “System of Systems”

Page 5: EC Q2 Communication

MEMOCODE 2013

Specialized Algorithms

5

Gesture Recognition

Hand Gestures / Body Pose

Speech Recognition

Continuous Speech Recognition

Interest Point Detection

Descriptor Generation

Match (L1/L2 Distance)

GMM (Gaussian Mixture Model)

HMM (Hidden Markov Model)

Neural Networks, WFST, etc

Feature Extraction

HMM (Hidden Markov Model)

Need power-efficient, real-time implementations

Need better accuracy, higher levels of cognition

Page 6: EC Q2 Communication

MEMOCODE 2013

High Level of Integration

• Communications, compute, sensors, actuators etc

6

AmbientLight

Sensor

Accelerometer

Gyrometer(Gyro)

(Gyroscope)

Magnetometer(Compass)

Inclinometer(Tiltmeter)

DeviceOrientation

GPS WiFi BT LE

NFC Piezoelectricactuator

ImagesensorMicrophone Haptics

Fingerprintsensor

Proximitysensor

Page 7: EC Q2 Communication

MEMOCODE 2013

Ultra Low Power

• Long battery life, always on, always connected, low power communications, aggressive power management

• Low cost

7

Power/ Battery Life Challenge

>10X gap!

1.5Whr battery has 2hrs life capturingaudio/video

Page 8: EC Q2 Communication

MEMOCODE 2013

Rapid Design Environment

• Need smaller focused teams with domain expertise working in a design environment that enables rapid prototyping at multiple levels

8

Systems engineering

+

Platform based design

Page 9: EC Q2 Communication

MEMOCODE 2013

Model-based Top-down Design Flow

9

MRD

UX prototyping

Reference modeling

Algorithmic refinement

System modeling

Design space exploration

HW/SW Co-design

Emulation

Silicon and FFRD

Refinem

ent

Feedback

Lib

rary

of buildin

g b

locks

Test

corp

us

Page 10: EC Q2 Communication

MEMOCODE 2013

UX Prototyping

• Mock ups

• Focus groups

• Smoke and mirror prototypes

• User experiences and usages

• Suite of workloads and expected results

• Decide on quality metrics

10

Cessna TTx cockpit mockup

Page 11: EC Q2 Communication

MEMOCODE 2013

Personal Billboard UX

11

A design research prototype that explores new ways for people to

express themselves in real-time, in the same physical location.

The Billboard app allows people to display text and images on the cover

screen of a dual-screen Ultrabook, and for others to interact with this content by “liking”, voting on, or “peeling” a copy of the displayed content

Page 12: EC Q2 Communication

MEMOCODE 2013

Reference Modeling

• Functional models and executable spec

• Models of sensors, actuators, peripherals

• Reuse and enhance suite of UX workloads

12

Face

Iris

Voice

Location

Phone interface

Camera Trace

Mic.Trace

GPS Trace

Bluetooth Trace

Gesture

Authen.

control

Page 13: EC Q2 Communication

MEMOCODE 2013

Algorithmic Refinement

Edge-detectionAlgorithm

• Key step, requiring domain expertise

13

Page 14: EC Q2 Communication

MEMOCODE 2013

Design Space Exploration: Steps x Levels

Define appropriate architecture and deploy given applications onto it

Applications

ConstraintsE.g. Power for given workloads

Implementation

Architecturetemplates

DSE SoC IP

Define

Analyze

Optimize

14

Page 15: EC Q2 Communication

MEMOCODE 2013

System Modeling

• Functional and data partitioning

• Models for all sensors, actuators, peripherals

• Reuse and enhance workloads and test corpus

15

CPU SRAM Face Iris Gesture

Voice GPS BT LEDMA

Fabric

SW +

Tests

FW

Initial architecture model

camera miclocation

Page 16: EC Q2 Communication

MEMOCODE 2013

Single Source System Model

• Consistency among models and with golden model• Models are at multiple different levels of abstraction

16

Golden HLM

VPHLSPower

estimationArchitectureexploration

Functionalvalidation

Post-Sidebug

Performanceverification

Page 17: EC Q2 Communication

MEMOCODE 2013

Example: System Power Modeling

• Display power profile for real workloads

17

Expected results

Time

Power

Sys_off

Always Available (AAV)

Audio Log Video LogTake Pic

States include single or multiple function (e.g. image capturing, storing, button pressing etc.)

State

Page 18: EC Q2 Communication

MEMOCODE 2013

Video Subsystem

Example: Functional Validation

• System model constructed from functional model(s), 3rd party IP, and firmware

• Common trace points ( ) identified for system model and RTL model

18

Raw

RGB

RGB

YCbCr

JPEG Encoder

PacketizerRaw

CameraData

DMA

Shared

Memory

Bus

Host core

InstructionSet

Simulator

Firmware

Page 19: EC Q2 Communication

MEMOCODE 2013

IP Implementation

19

Codesign Memory analysis Code refactoring

Page 20: EC Q2 Communication

MEMOCODE 2013

HW/SW Codesign

TLM Untimed

TLM LT/AT

Synthesizable

Cycle Accurate

HLS

RTL SystemVerilog

Synthesis

Netlist

C/C++

Compiler

Firmware

Binary

HW/SW partitioning

Cycle Approximate

Cycle Accurate

Untimed

Perf

orm

ance

/Co

st F

eed

bac

k

SystemC

Reference ModelC++/Matlab

20

Modelrefinement

Page 21: EC Q2 Communication

MEMOCODE 2013

HLS Flow

TLM

Block Level Synthesis

Synthesizable HLS Models

Virtual SystemPrototypes

FPGA

Interface IPGenerator

Memory IP Generator

HL Synthesis

Control Datapath IP

RTL

Optimizer

ASIC

Control and Data Flow Graphs

21

IP

Standard Cells

Memory

compiler

Page 22: EC Q2 Communication

MEMOCODE 2013

3

Interface

Inner loop RAM optimization

630

4

Long Path Optimization

5

Pipelined design20

10

30

6

1.3ns clk

50

40

60

2d 3d 4d 5d 6d1day 2weeks 6months

Exploring the Design Space

145

7d 8d 9d 10d

Dual Input Read

Compiled C to DSP Core1

Handcrafted RTL

7

1

2 Memory Buffer

7

Development Time

Dela

y *

are

a

Viterbi Decoder with Soft Outputs

22

Page 23: EC Q2 Communication

MEMOCODE 2013

Next Generation HLS

23

Abstr

action

Micro-Architecture

& Control

Scheduled

Transistor

Post-Timed

Gate

Pre-Timed

Timed

Physical

Unscheduled = No clock

Scheduled = Clocks

Design Complexity

Power(cpf/upf)Test (ctl)

Floor Plan(def)

Sine

Macro-Architecture

Unscheduled

RTL(IP)

Scheduled

+

Page 24: EC Q2 Communication

MEMOCODE 2013

Programmable Accelerators

• Domain specific instruction processors

• Architecture Description Language ADL

• Complete tool suite

– Base architecture template

– Design tools to go from source code to RTL

– Programming tools for firmware development

• Fast development loop

24

Architecture

Description

Instruction

Gen

App

src

Which way to go further

HW

modification

Simulator

Metrics

Algorithm/

SW

modification

Compiler

RTL

Page 25: EC Q2 Communication

MEMOCODE 2013

IP Library ToolKit

Parametric

Algorithmic Pyramid

25

RTL

Drivers/FW

Test

Debug

ESL

Parameters

Workloads

IP Stack

Configurable

Vertical IP

Page 26: EC Q2 Communication

MEMOCODE 2013

Software Development

• Software development cost is outpacing hardware

• Hybrid virtual platforms

• Single source model

• Validation

26

App

proc

VP

ProcessorSimulator+ wrapper

FPGA

Page 27: EC Q2 Communication

MEMOCODE 2013

Emulation

• Path to FPGA from high level models using high level synthesis

• Functional validation and SW development

• Highly useful as a demonstration vehicle

27

Page 28: EC Q2 Communication

MEMOCODE 2013

Summary

28

UX prototyping

Reference modeling

Algorithmic refinement

System modeling

Design space exploration

HW/SW Co-design

Emulation

Silicon and FFRD

Lib

rary

of buildin

g b

locks

Test

corp

us

Page 29: EC Q2 Communication

MEMOCODE 2013

Call to Action

• Need a formal refinement methodology

– Equivalence verification

– Automatic generation of models

– Compositional verification for constraints and behavior

• Design support

– Feedback to designer at each step for quick iterations

– Debug assistance

• Embedded software analysis

– Firmware code generation / driver synthesis

– Firmware validation

• IP library and standardization

– Analog and mixed signal IP

29

Page 30: EC Q2 Communication

THANK YOU

30