20
Microservices based AI framework stack Srinivasa Addepalli ([email protected]) Intel (Being done in LF/ONAP project)

Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Microservices based AI framework stack Srinivasa Addepalli ([email protected]) Intel (Being done in LF/ONAP project)

Page 2: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

• Closed loop - Introduction

• Data analytics framework – Needs & Requirements

• Data analytics framework – An opinionated stack

• Roadmap

Agenda

Page 3: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Introduction

Page 4: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Big Picture – Analytics continuum

Reactive Pro-active

Predictive

Need AI (ML/DL) analytics Need Big data framework

Page 5: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Prediction examples

Intrusion Detection

Traffic Classification

Anomaly detection

Fault Prediction

Path prediction

Resource prediction

Traffic prediction Others

Page 6: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Edge Site/Data Center Edge Site/Data Center

Compute node

E2E Architecture & Closed loops

VIM (Openstack/K8S)

Compute nodes/Switches/PNFs Telemetry Action Controller

Analytics Analytics

Central Orchestrator

Service Orchestrator

Element Management Systems

Other controllers

APP Layer

Analytics

Analytics Applications

Page 7: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Big Data Analytics stages

Distribution Store Models DB

Collection Agents

Data Storage

(Data Lake) Training

Inferencing Action

corelation

Models

Action Controllers (External

controllers such as K8S,

APPC, SDNC)

Training apps

Inferencing Apps

Correlation apps

Page 8: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Data analytics framework – Needs / Requirements

Page 9: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Medium Edge

Collect

Distribute

Predict

Small Edge

Stack Goals - Decentralization

Cloud Site

Cloud Site

Cloud Site

Central Site

Co

llect

Dis

trib

ute

Sto

re

Trai

n

pre

dic

t

Co

rrel

ate

Act

ion

Central Analytics

Challenges: • Petabytes of data need to be transferred • Not edge friendly • Data Sovereignty challenges

Small Edge

Collect

Medium Edge

Collect Distribute

Predict

Medium Edge

Predict

Large Edge

Action

Collect Distribute

Store, Train Predict (s) Action(s)

Edge Clouds

Distribute

Predict Action

Private Cloud

Train

Public Cloud

Train Store Store

Correlate

Page 10: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Medium Edge

Collect

Distribute

Predict

Small Edge

Stack Goals – Bulk Deployment & configuration

Small Edge

Collect

Medium Edge

Collect Distribute Predict (s)

Medium Edge

Predict

Large Edge

Action

Collect Distribute Train (s)

Predict (s) Action(s)

Regional

Distribute

Predict (s) Action

Regional Regional

Automation & configuration • Edges could be in thousands • Each edge would have its own site orchestrator

(e.g K8S)

• Need for centralized automation • To deploy various parts of analytics stack in

multiple locations. • Configure them to make them work

together • Support for dynamic edges

Goal: Deploy and Configure big data analytics framework in hours instead of months

Train Store

Train Store

Page 11: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Stack Goals – Use known big data frameworks

• Goal is to leverage known and open source packages

• Choose right versions • AI (ML/DL) training and

inferencing • Make them work together • Gaps and fixes.

Page 12: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Stack Goals – Follow Cloud Native and Micro-services design patterns

Security • ISTIO CA, Envoy proxy for Mutual TLS among PODs. • ISTIO ingress for communication outside of clusters. • ISTIO RBAC

Security away from applications

Load balancing & CI/CD • ISTIO Envoy with Cilium acceleration • Visualization and monitoring • A/B testing & Canary

Operators • To bring up PODs. • To configure using CRDs

Functions • To enable developer functions to get executed

in the pipeline • Knative

Page 13: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Stack Goals – Reduce training time

Distributed Learning with data parallelism

Leverage hardware accelerators for performance

Page 14: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Stack Goals – Federated/Distributed Learning

• To honor Data sovereignty • Data does not leave the site. • Aggregation server based

model • Tensorflow based DL Data

Owner1

Aggregator Aggregator

train

Data Owner2

Data Owner1

Data Owner2

train train train

Distributed learning orchestration

Page 15: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Data analytics framework – An opinionated stack

Page 16: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Stack - Packages & enhancements to realize analytics stack

Distribution

Store Models DB

Collection Agents

Data Storage

(Data Lake)

Training

Inferencing

Messaging

CollectD Node Exporter

cAdvisor Collectd Operator

Prometheus Operator

HDFS M3DB, M3DB

operator

HDFS Writer HDFS Operator

Spark, spark operator

Scikit learn Horovod

TF, TF Operator Model

Dispatcher

Minio

TF, Seldon Serving

Kafka Broker Kafka Operator Zookeeper

Visualization Grafana

Hue

• An opinioned stack • Helm Charts • Container images • Customized Day0 profiles • Day2 templates for

dynamic configuration • Deployed using K8S in

each site • Security via ISTIO • Deploy with ONAP-K8S. • Sample applications • Code:

https://github.com/onap/demo/tree/master/vnfs/DAaaS

Deployment and Automation using ONAP-K8S Support

Action corelation

Action Controllers

knative

Filter Minio Opeator

Page 17: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Roadmap

Page 18: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

18

Roadmap

Automation – One click deployment of stack in multiple locations

Pipelining and DAC support for Analytics applications

Monitoring – Stack as well as Analytics applications

Knative support – for action functions

Federated Learning - to maintain data sovereignty challenges

Generic action functions for K8S and ONAP

Model LCM and Model security

Leverage hardware accelerators and OpenVINO

Inferencing using non-TF (non-DL)

Others…

Page 19: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

19

Call to action

It is open source, contributions are welcome. Source code repo is in LFN/ONAP. But… it is modular and can be deployed without ONAP

Feedback…. Feedback… Feedback…. - Try & let us know. - New collection mechanisms? - Lightweight messaging? - Model repositories? - Any data lakes?

Want to try your data analytics applications? Please contact us….

Page 20: Microservices based AI framework stack...VIM (Openstack/K8S) Compute nodes/Switches/PNFs Action Controller Telemetry Analytics Analytics ... Apps Correlation apps. ... Goal: Deploy

Q&A