16
Storage Class Memory in Scalable Cognitive Systems Balint Fleischer Chief Research Officer

Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Embed Size (px)

Citation preview

Page 1: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Storage Class Memory in Scalable Cognitive Systems

Balint Fleischer Chief Research Officer

Page 2: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

The impact of NVM on the Application/Data architecture

2

• Accelerated demanding applications OLTP, Big Data, Etc.

• Changing scaling economics Social Networks, Search, HPC

• Improved operational characteristics Notebooks, tablets, Power efficiency

• Enabling new use cases/architectures Hyperconvergent systems, Cold Storage, Streaming platforms, Data Virtualization systems, Etc.

Flash Memory Summit 2016 Santa Clara, CA

Page 3: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

NVM journey

3

What comes next?

>20 Years

>10 Years

>5 Years

Flash Memory Summit 2016 Santa Clara, CA

?

Page 4: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Evolving IT focus

4

Enterprise Automation

Online Transactions Cognitive Computing

“Killer” use cases OTLP ERP Email

eCommerce Messaging

Social Networks Content Delivery

Discovery of solutions, capabilities Risk Assessment

Improving customer experience Comprehending sensory data

Key functions RDBMS BI

Fraud detection

Databases Social Graphs

SQL and ML Analytics Streaming

Natural Language Understanding Object Recognition

Probabilistic Reasoning Content Analytics

Data Types Structured Transactional

Structured Unstructured Transactional

Streaming Mixed

Graphs, Matrices

Storage Types Enterprise Scale Standards driven

SAN/NAS, etc

Cloud Scale Open source File/Object

Application Scale Highly Optimized

Intelligent

Flash Memory Summit 2016 Santa Clara, CA

Page 5: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Cognitive Computing

Augmen'ng human exper'se

&

Transforming human <-> Computer interac'on

Business Benefits Supporting Functionality

5 Flash Memory Summit 2016 Santa Clara, CA

Page 6: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Cognitive Computing use cases

6

Cloud based Cognitive Computing

Teaching Assistants Elderly Companions

Service Robots Personal Social Robots

Smart Cities

Robot Drones

Smart Homes

Intelligent vehicles

Personal Assistants (bots)

Sense, Learn, Infer and Interact Understand meaning and objective

Adaptive Skill enablement New skill/updates Shared Learning

Power/Skill execution tradeoffs

Smart Enterprise

Flash Memory Summit 2016 Santa Clara, CA

X

5G will be a major catalyst

K.I.T.T.

Page 7: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Cognitive Application Platform (CPaaS)

Overview Complex SW stack

Executed as Dataflow Optimized for fast response time

Highly scalable Extensive use of Advanced Algorithms Historical & Real Time Data processing

Proximity to data is key

Flash Memory Summit 2016 Santa Clara, CA

Target Applications Cloud based Assist Bots

Intelligence “booster” for Robots Enhancing awareness context

Shared knowledge/learning Robot Power optimization

Robot Cost reduction Robot re targeting (SDR)

Others

CPaaS

Bots

7

Page 8: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Running Cognitive Applications

8

Cognitive Domain

Streaming Domain

Application Domain

RESPONSE INGEST

Sensing, Detecting, Identifying Image, Voice

Action E.g.. Traffic routing, Alerting

Cognitive Platform (CPaaS)

Flash Memory Summit 2016 Santa Clara, CA

Fully Containerized architecture Static Containers

Dynamic Containers Ephemeral Data Containers Persistent Data Containers

Implemented on a Scale out Cluster

(Commodity Servers, Large Memory, Fast, Low latency Interconnect and Fast NVMe drives)

Applications, Skills

Page 9: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Context Data Store Language & semantic models, Skills

Planning & Interaction history Low Latency, Object access support Direct Data Access by applications

Pipeline & Sensor Data Store Supports Data Parallel & Model Parallel modes

Working Data for Execution Engines Objects, Graphs, Matrices Assist

Very low latency, Random Access support Direct Data Access by applications

Server Node

Server Node

Server Node

Server Node

Optimized Storage for Cognitive Computing

Global Data Store Historical, reference and collective learning data

Scale out, Network Attached NVM based Storage Multi-Tenancy support Resilient and Elastic Flash Memory Summit 2016

Santa Clara, CA

Cognitive Storage

9

Page 10: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Implementing Cognitive Storage

10

Partitioned into Node vs Cluster shared Application Optimized (semantics consistent with data types)

Application Direct Accessed (use space IO) Intelligent Data placement for scalability

Connection to Data center fabric

Node

CPU

NVMe

In Memory Data Store

Near Memory Data Store NIC

Mapped into Main Memory

Partitioned across

Memory and MVME

Flash Memory Summit 2016 Santa Clara, CA

Graphs &

Matrices

Blocks &

Objects

Cognitive Storage

Page 11: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Technology for Cognitive Storage

11

CPU

Physical Memory

App/OS/Heap/Temp

Buffer Pool

In Memory Data Store

Near Memory Data Store NVMe Volume SCM is very promising Almost as fast as DRAM

Higher capacity vs. DRAM Envisioned to be less expensive

Read Performance biased Selectable Attributes (Persistency, etc)

SCM latency highlights the need

to further reduce overhead

“Memory Speed Data Store” (MSDS) would be better name to reflect new functionality

Huawei SCM SW stack

Flash Memory Summit 2016 Santa Clara, CA

Page 12: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Optimizing for applications Reducing IO Overhead

Connection to Data center fabric

Flash Memory Summit 2016 Santa Clara, CA

CPU

NVMe Closely coupled to NVM media Built in High Speed networking

Low latency link to CPU Programmable Data Engines

Data Management Security

Replication/Resiliency support CODEC

Data Placement Scanning, etc

Messaging/IPC engine

NVMe

12

Application Processing

IO Processing &

Media

Application Specific IO semantics

Page 13: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Architecture for Next gen Applications General Purpose

Computing

Connection to Data center fabric

Node

Flash Memory Summit 2016 Santa Clara, CA

CPU

NVMe

In Memory Data Store

Near Memory Data Store

Application Accelerator (Deep Learning, Etc.)

APU

NVMe

13

IO Accelerator (Data Mgmt, Etc.)

Page 14: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

CPU CPU

Data Coprocessor

CPU CPU

Data Coprocessor

Adaptable Cluster

SCM

CPU CPU

Data Coprocessor

Data Center Fabric

Flash Memory Summit 2016 Santa Clara, CA

Huawei OceanStor Dorado V3

All Flash Array >100TB, 500KIOPS in 2U

Huawei ES 3000

NVMe SSD >3TB,

14

Page 15: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Summary

§  By extending, scaling and accelerating human expertise, Cognitive

Computing is rapidly becoming the most important next gen application

§  SCM based Very Large Capacity, Low Latency, processor attached Data

Store is one of the key ingredient to make Cognitive Computing ubiquitous

§  Advanced Optimization of NVM storage for applications will be key to

achieve Performance and TCO objectives

Flash Memory Summit 2016 Santa Clara, CA

15

Page 16: Storage Class Memory in Scalable Cognitive Systems re targeting (SDR) Others CPaaS Bots 7 . Running Cognitive Applications 8 Cognitive Domain Streaming Domain Application ... Huawei

Thank you

16