15
Marc Hamilton Vice President, Solution Architecture and Engineering NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED ANALYTICS

NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

Marc Hamilton

Vice President, Solution Architecture and Engineering

NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED ANALYTICS

Page 2: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

2

AI IS EVERYWHERE

“Find where I parked my car” “Find the bag I just saw in this magazine”

“What movie should I watch next?”

Page 3: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

3

TOUCHING OUR LIVES

Bringing grandmother closer to family by bridging language barrier

Predicting sick baby’s vitals like heart rate, blood pressure, survival rate

Enabling the blind to “see” their surrounding, read emotions on faces

Page 4: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

4

FUELING ALL INDUSTRIES

Increasing public safety with smart video surveillance at airports & malls

Providing intelligent services in hotels, banks and stores

Separating weeds as it harvests, reduces chemical usage by 90%

Page 5: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

5

DEEP LEARNING DEMANDS NEW CLASS OF HPC

TRAINING INFERENCING

Data / Users

ScalablePerformance

Throughput+ Efficiency

Billions of TFLOPS per training run

Years of compute-days on Xeon CPU

GPU turns years to days

Billions of FLOPS per inference

Seconds for response on Xeon CPU

GPU for instant response

Page 6: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

6

DATA DELUGE TO DATA HUNGRY

INCREASING DATA VARIETY

Search Marketing

Behavioral Targeting

Dynamic Funnels

User Generated Content

Mobile Web

SMS/MMS

Sentiment

HD Video

Speech To Text

Product/Service Logs

Social Network

Business Data Feeds

User Click Stream

Sensors Infotainment Systems

Wearable Devices

CyberSecurity Logs

ConnectedVehicles

Machine Data

IoT Data

Dynamic Pricing

Payment Record

Purchase Detail

Purchase Record

Support Contacts

Segmentation

Offer Details

Web Logs

Offer History

A/B Testing

BUSINESS PROCESS

PETABYTES

TERABYTES

GIG

ABYTES

EXABYTES

ZETTABYTES

Streaming Video

Natural Language Processing

WEB

DIGITAL

AI

Page 7: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

7

GPU ACCELERATION OVERCOMES THE CHALLENGES OF SLOW COMPUTE ON ANALYTICS

ASK QUESTIONS YOU DON’T KNOW THE ANSWERS TO

Issuing iterative queries becomes wearisome

EXPLORE FURTHER

Analyst creativity is impaired

GO BEYOND WHAT’S BEING ASKED

Long response timeconstrains questions asked

Page 8: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

8

GPU-ACCELERATION HAS NO LIMITS

MapDMapD is 55x to 1,000x faster than

comparable CPU databases on billion+

row datasets

Kinetica

Hardware costs that are 1⁄10 that of

standard in-memory databases

BlazeGraph

200-300x speed-up

Graphistry

See 100x more data at millisecond

speed

SQream

The supercomputing powers of the GPU combined with SQream’s patented

technology, results in up to 100 times faster analytics performance on terabyte-

petabyte scale data sets

Page 9: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

9

NVIDIA ACCELERATED ANALYTICSGPUs in the Data Center

AI-ACCELERATEVISUALIZEANALYZE

Page 10: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

10

NVIDIA DGX-1AI Supercomputer-in-a-Box

170 TFLOPS | 8x Tesla P100 16GB | NVLink Hybrid Cube Mesh

2x Xeon | 8 TB RAID 0 | Quad IB 100Gbps, Dual 10GbE | 3U — 3200W

24

Page 11: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

11

“FIVE MIRACLES”

16nm FinFETPascal Architecture CoWoS with HBM2 New AI AlgorithmsNVLink

Page 12: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

12

0X

4X

8X

12X

16X

GeForce® GTX TITAN X GeForce® GTX 1080 Tesla® P100 DIGITS™ DevBox (4X GeForce® GTX Titan X)

Quadro® VCA (8X Quadro®M6000)

DGX-1™ (8X Tesla® P100)

Rela

tive T

rain

ing P

erf

orm

ance

ResNet Inception v3 AlexNet vgg MSR

DGX-1 — A LEAGUE OF ITS OWN

NVIDIA CONFIDENTIAL. PRELIMINARY NUMBERS. NOT FOR DISTRIBUTION.

Caffe on DeepMark. GeForce TITAN X and GTX 1080 system: Intel Core i7-5930K @ 3.5 GHz, 64 GB System Memory | Tesla P100 (SXM2) system: Dual CPU server, Intel E5-2698 v4 @ 2.2 GHz, 256 GB System Memory

1X

GeForce GTX TITAN X GeForce GTX 1080 Tesla P100 DIGITS DevBox(4X GeForce GTX TITAN X)

Quadro VCA(8X Quadro M6000)

DGX-1(8X Tesla P100)

Page 13: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

13

Instant productivity — plug-and-play, supports every AI framework and accelerated analytics software applications

Performance optimized across the entire stack

Always up-to-date via the cloud

Mixed framework environments — baremetal and containerized

Direct access to NVIDIA experts

DGX STACKFully integrated Analytics and Deep Learning platform

Page 14: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

14

AI massive opportunity

Data Scientist productivity is vital

NVIDIA is the choice for Deep Learning and AI-accelerated analytics

DGX-1 is fast, instantly productive

NVIDIA DGX-1The Essential Tool for

Data Scientists

Page 15: NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED … · 2016. 10. 12. · and accelerated analytics software applications Performance optimized across the entire

NVIDIA DGX-1: INTEGRATING THE POWER OF DEEP LEARNING AND ACCELERATED ANALYTICS