Building Autonomous Machines with JETSON - Inria

NVIDIA TEGRA K1 Mar 2, 2014

NVIDIA Confidential

Francois Courteille Senior Solution Architect, Accelerated Computing

Building Autonomous Machines with JETSON: Machine Learning, Vision, and GPUs

AGENDA

JETSON™ EMBEDDED PLATFORM Deep learning for next-generation intelligent, autonomous machines

Background on NVIDIA

Deep Learning & GPUs

NVIDIA Embedded Platform

Deep Learning & Autonomy

THE WORLD LEADER IN VISUAL COMPUTING

GAMING ENTERPRISE TECHNOLOGY HPC & CLOUD AUTO

CREDENTIALS BUILT OVER TIME

300K CUDA Developers, 4x Growth in 4 years

Majority of HPC Applications are GPU-Accelerated, 410 and Growing

100% of Deep Learning Frameworks are Accelerated

[Chart: # of Applications by year, 2011 to 2016]

Academia, Games, Finance, Manufacturing, Internet, Oil & Gas, National Labs, Automotive, Defense

THEANO

MATCONVNET

PURINE MOCHA.JL

MINERVA MXNET*

BIG SUR TENSORFLOW

WATSON CNTK

END-TO-END TESLA PRODUCT FAMILY

HYPERSCALE HPC: Tesla M4, M40. Hyperscale deployment for DL training, inference, video & image processing.

MIXED-APPS HPC: Tesla K80. HPC data centers running a mix of CPU and GPU workloads.

STRONG-SCALING HPC: Tesla P100. Hyperscale & HPC data centers running apps that scale to multiple GPUs.

FULLY INTEGRATED DL SUPERCOMPUTER: For customers who need to get going now with a fully integrated solution.

THE BIG BANG IN MACHINE LEARNING

“ Google’s AI engine also reflects how the world of computer hardware is changing. (It) depends on machines equipped with GPUs… And it depends on these chips more than the larger tech universe realizes.”

DNN GPU BIG DATA

BLISTERING PACE OF INNOVATION FOR AI

[Chart: Deep Learning Training Performance, Caffe AlexNet, 2013 to 2016. Bars: K40, K80 + cuDNN1, M40 + cuDNN4, P100 + cuDNN5. Y-axis: speed-up vs K40.]

AlexNet training throughput based on 20 iterations. CPU: 1x E5-2680v3 12 Core 2.5GHz, 128GB System Memory, Ubuntu 14.04. M40 bar: 8x M40 GPUs in a node. P100: 8x P100 NVLink-enabled.

HOW FAR HAS AUTONOMY COME? ImageNet classification accuracy

[Chart: ImageNet classification accuracy by year, 2010 to 2015. Surviving data labels: 72%, 74%, 88%, 93%. Human: 94.9%. Annotation: Deep Learning on GPUs.]

Object Classification

Localization/Mapping

Collision Avoidance

3D Reconstruction

Segmentation

DEEP LEARNING AND AUTONOMY

DEEP LEARNING FOR AUTONOMOUS VEHICLES

Jetson TX1

JETSON TX1

GPU: 1024 GFLOPS, 256-core Maxwell
CPU: 4x 64-bit ARM A57 CPUs | 1.6 GHz
Memory: 4 GB LPDDR4 | 25.6 GB/s
Video decode: 4K 60Hz, H.264 / H.265
Video encode: 4K 30Hz, H.264 / H.265
CSI: Up to 6 cameras | 1400 Mpix/s
Display: 2x DSI, 1x eDP 1.4, 1x DP 1.2/HDMI
Wi-Fi: 802.11 2x2 ac
Networking: 1 Gigabit Ethernet
PCI-E: Gen 2, 1x1 + 1x4
Storage: 16 GB eMMC, SDIO, SATA
Other: 3x UART, 3x SPI, 4x I2C, 4x I2S, GPIOs
Power: 10-15W, 6.6V-19.5VDC
Size: 50mm x 87mm
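
As a rough sanity check on the spec sheet above, the GPU throughput and memory bandwidth figures can be reproduced from first principles. The sketch below is only a back-of-the-envelope estimate. The slide does not state the GPU clock, the precision behind the 1024 GFLOPS figure, or the memory bus configuration, so the ~1.0 GHz clock, the two-wide FP16 execution, and the 64-bit LPDDR4 interface at 3200 MT/s are all assumptions.

```python
# Back-of-the-envelope check of two Jetson TX1 headline figures.
# ASSUMPTIONS (not stated on the slide): ~1.0 GHz GPU clock, FP16 executed
# two-wide per core, and a 64-bit LPDDR4 interface at 3200 MT/s.

CUDA_CORES = 256             # from the slide
GPU_CLOCK_GHZ = 1.0          # assumption
FLOPS_PER_FMA = 2            # a fused multiply-add counts as two operations
FP16_LANES = 2               # assumption: two half-precision values per core per cycle

BUS_WIDTH_BITS = 64          # assumption
TRANSFERS_PER_SEC = 3200e6   # assumption: LPDDR4 at 3200 MT/s


def peak_gflops(cores, clock_ghz, lanes):
    """Peak throughput in GFLOPS: cores x clock x FLOPs per FMA x lanes."""
    return cores * clock_ghz * FLOPS_PER_FMA * lanes


def bandwidth_gb_per_s(bus_width_bits, transfers_per_sec):
    """Peak bandwidth in GB/s: bytes per transfer x transfers per second."""
    return (bus_width_bits / 8) * transfers_per_sec / 1e9


fp16_gflops = peak_gflops(CUDA_CORES, GPU_CLOCK_GHZ, FP16_LANES)
fp32_gflops = peak_gflops(CUDA_CORES, GPU_CLOCK_GHZ, 1)
mem_bw = bandwidth_gb_per_s(BUS_WIDTH_BITS, TRANSFERS_PER_SEC)

print(f"FP16 peak: {fp16_gflops:.0f} GFLOPS")     # FP16 peak: 1024 GFLOPS
print(f"FP32 peak: {fp32_gflops:.0f} GFLOPS")     # FP32 peak: 512 GFLOPS
print(f"Memory bandwidth: {mem_bw:.1f} GB/s")     # Memory bandwidth: 25.6 GB/s
```

Under these assumptions the 1024 GFLOPS figure corresponds to half precision, with single precision at roughly half of that.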

JETSON AT THE LEADING EDGE

Powering the Next Generation of Autonomous Machines

Jetson TX1 Developer Kit

Jetson TX1 Module

Developer Board

5MP Camera

€630 / £450 retail

€350 / £250 academic discount

JETSON SDK: THE DETAILS

CUDA OpenGL

Linux for Tegra

Compute

Jetson TX1

Vision Machine Learning

cuBLAS

cuSolver

cuSPARSE

cuRAND

Thrust

CUDA Math Library

Graphics

NVTX NVIDIA Tools eXtension

Source code editor

Debugger

Profiler

System Trace

Vertically Integrated Packages

libjpeg

TRAIN, THEN DEPLOY

YOUR NEURAL NETWORKS Programming Approaches

Classified Object!

Trained Deep Neural Net Model

Camera Inputs

Network

Solver

Data Scientist

Training: Server, GRID, DIGITS

Inference: GIE, Embedded (Jetson)

Your Application (dog breed detector, for example)
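
The train-then-deploy split can be illustrated with a toy example: a model is fitted on one side, exported as a plain artifact, and then loaded for inference only on the other side. The sketch below is a minimal stand-in in pure Python, not the DIGITS or GIE API. The single perceptron, the JSON export format, and every function name are hypothetical choices made for illustration.

```python
import json

# Toy stand-in for the train-then-deploy split. Nothing here is the DIGITS
# or GIE API; all function names and the JSON format are hypothetical.


def train_perceptron(samples, labels, epochs=20, lr=1.0):
    """Training side: fit a single perceptron with the classic update rule."""
    weights = [0.0] * len(samples[0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            activation = sum(w * xi for w, xi in zip(weights, x)) + bias
            error = y - (1 if activation > 0 else 0)
            weights = [w + lr * error * xi for w, xi in zip(weights, x)]
            bias += lr * error
    return {"weights": weights, "bias": bias}


def export_model(model):
    """Serialize the trained parameters so they can be shipped to a device."""
    return json.dumps(model)


def load_model(blob):
    """Deployment side: restore the parameters; no training code is needed."""
    return json.loads(blob)


def infer(model, x):
    """Deployment side: forward pass only."""
    activation = sum(w * xi for w, xi in zip(model["weights"], x)) + model["bias"]
    return 1 if activation > 0 else 0


# "Server" side: learn logical AND from four labeled samples.
train_x = [(0, 0), (0, 1), (1, 0), (1, 1)]
train_y = [0, 0, 0, 1]
blob = export_model(train_perceptron(train_x, train_y))

# "Embedded" side: load the exported model and classify inputs.
deployed = load_model(blob)
predictions = [infer(deployed, x) for x in train_x]
print(predictions)  # [0, 0, 0, 1]
```

The point of the split is that the deployment side only needs the exported parameters and a forward pass, which is what makes a small embedded target practical.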

CUDA-accelerated Deep Learning library

Supports industry-standard frameworks:

Out-of-the-box speedups of neural networks:

For both Inference and Training:

Jetson TX1 // Tesla/Titan/GRID

Frameworks

DIGITS

Process Data, Configure DNN, Visualization, Monitor Progress

Interactive Deep Learning GPU Training System

NVIDIA DIGITS

OBJECT DETECTION New in DIGITS 4

ADVANCED DRIVER ASSISTANCE SYSTEMS (ADAS)

REMOTE SENSING

MEDICAL DIAGNOSTICS

INTELLIGENT VIDEO ANALYTICS

developer.nvidia.com/digits

GPU INFERENCE ENGINE (GIE) High-performance deep learning inference for production deployment

developer.nvidia.com/gie

40x more inference perf/watt with GIE

EMBEDDED: Jetson TX1
DATA CENTER: Tesla M4
AUTOMOTIVE: Drive PX

[Chart: GoogLeNet images per second per watt by batch size, CPU-Only vs Tesla M4 + GIE]

CPU-Only vs Tesla M4 + GIE on dual-socket Haswell E5-2698 v3 @ 2.3GHz, HT on

END-TO-END DEEP LEARNING FOR SELF-CONTROL

Motor PWM

Sensory Inputs

Perceptron

Recognition

Inference

Goal/reward function

user application

Long-t

PC GAMING

ONE ARCHITECTURE — END-TO-END AI

CUDA + Linux throughout the stack.

NVIDIA DGX-1 WORLD’S FIRST DEEP LEARNING SUPERCOMPUTER

170 TFLOPS FP16

8x Tesla P100 16GB

NVLink Hybrid Cube Mesh

Accelerates Major AI Frameworks

Dual Xeon

7 TB SSD Deep Learning Cache

Dual 10GbE, Quad IB 100Gb

3RU – 3200W
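
The headline DGX-1 numbers above are consistent with simple multiplication across the eight GPUs. The following sketch is a rough consistency check only. The per-GPU peak of 21.2 TFLOPS FP16 is an assumption that does not appear on the slide, and the efficiency figure divides peak throughput by the full system power rating, so it is an estimate and not a measured value.

```python
# Rough consistency check of the DGX-1 headline numbers.
# ASSUMPTION (not on the slide): 21.2 TFLOPS peak FP16 per Tesla P100.

NUM_GPUS = 8               # from the slide
GPU_MEMORY_GB = 16         # from the slide
SYSTEM_POWER_W = 3200      # from the slide
P100_FP16_TFLOPS = 21.2    # assumption

total_tflops = NUM_GPUS * P100_FP16_TFLOPS
total_gpu_memory_gb = NUM_GPUS * GPU_MEMORY_GB
gflops_per_watt = total_tflops * 1000 / SYSTEM_POWER_W

print(f"Aggregate FP16 peak: {total_tflops:.1f} TFLOPS")     # 169.6, quoted as 170
print(f"Aggregate GPU memory: {total_gpu_memory_gb} GB")     # 128 GB
print(f"Peak efficiency: {gflops_per_watt:.0f} GFLOPS/W")    # 53 GFLOPS/W
```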

THANK YOU! Q&A: WHAT CAN I HELP YOU BUILD?
