56
Streaming Analytics & IoT Peter Pugh-Jones

Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Streaming Analytics & IoT

Peter Pugh-Jones

Page 2: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Streaming Analytics

Page 3: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

insights on streaming data

Page 4: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Streaming and/or Real-Time?

“Streaming analytics typically means making analytically informed decisions in milliseconds, while

examining many thousands of events per second, generated from many millions of devices which can

also be enriched by many other disparate sources of data”Jerry Baulier, Senior Director SAS R&D, Event Stream Processing

Because of this, Streaming Analytics is definitely a Big Data topic, but not necessarily restricted to being only

a Real-Time topic.

Page 5: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Big data processing with SAS

I have a lot of sources of data that I want to grab from at high speed, in very large quantities

Streaming (ESP)

I want to keep all the data and access it really fast for as long as possible

Storage (HADOOP & LASR)

I want to perform predictive analytics on the data and generate models to use from it

Analytics & Visualisation (LASR, IMSTAT, Embedded Process, Visual Statistics, Visual Analytics)

I want to deploy and enable predictive models I have built into the streams, to help generate value as early as possible

Streaming Analytics (ESP)

Page 6: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

WHAT IS THE INTERNET OF THINGS?

Page 9: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

IOT

• Dreamliner• SENSORS

• Many of the sensors in the Dreamliner are connected wirelessly to a central data processor. For instance, an "active gust

alleviation" system uses sensors to measure turbulence at the nose, then instantly adjusts wing flaps to counter it.

• DIAGNOSTICS

• A self-monitoring diagnostic system sends real-time data to technicians on the ground via a wireless broadband link. The

system can predict mechanical problems, meaning less time in the hangar and fewer delays.

An example of an IoT ecosystem might be a house, car or an airplane like a ……

A good read on IoT ecosystems

Page 10: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

HOW DOES SAS FIT?

Because we have been collecting and analysing data from devices and sensors since we started in 1976 SAS

has the analytics to help with IoT already.

The differences are, that we can now do the analytics:-

• Continuously (as the data is in flight, or in the stream)

• At the base level of granularity (allowing the predictions and decisions we make to be more accurate)

• At high speed (as it’s streaming or real-time)

• With machine generated data (cleaner from the start, as it is the source)

• From more and more devices as they become internet enabled

Page 11: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

WHAT IS THE IOT TO SAS?

Streaming Analytics is a great way of enabling what SAS already does so well,

within the Internet of Things.

Page 12: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

WHAT IS EVENT STREAM PROCESSING?

Page 13: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

MQ

XML

Cloud

CONSUMERSSOURCES SAS® Event Stream Processing

Apply analytics and rules to

pinpoint event relevance and

urgency with continuous

pattern detection

Manipulate live data streams

to detect, filter, aggregate &

correlate signals from the

noise

Conduct low latency

assessment of high-volume,

high-velocity data – triggering

real-time actions/alerts

ENGINEERED FOR FAST ADAPTIVE ACTION

Page 14: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

WHAT IS AN EVENT?

AN EVENT is an occurrence happening at a determinable time and place, with orwithout the participation of human agents, that can be recorded as a collection offields containing DATA that we want to use to help contribute to:

• Making more informed decisions• More accurate predictions and forecasts• Capturing new patterns and behaviors we may have missed in the past

WORKING WITH EVENTS allows us to apply analytics while the event is, ‘IN MOTION’, before it is stored, (or lost), and may provide analytical insights you may never have had access to before and much sooner than before, generating new value from your data!

Page 15: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

WHAT IS EVENT STREAM PROCESSING?

Event Stream Processing (ESP) is a subcategory of Complex Event Processing (CEP), focused on working with ‘Events In Motion’ called Event Streams*.

CEP encompasses methods, techniques, and tools for processing events while they occur, i.e.,in a continuous and timely fashion and derives valuable, higher-level knowledge from lower-level events; this knowledge takes the form of so called Complex Events, that is, situationsthat can only be recognized as a combination of several events.

Complex Events can therefore be considered the output or result of performing analysis andprocessing on Event Streams.

* Definition provided by the Event Processing Technical Society

Page 16: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Simple streaming example

Page 17: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

How many dots can you see?

(Each dot is an event)

Page 18: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 19: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 20: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Four

Page 21: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 22: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 23: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Seven

Page 24: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 25: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 26: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Sixteen

Page 27: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Let’s add some complexity

Page 28: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

How many red dots can you see?

Page 29: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 30: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 31: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Fourteen

Page 32: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Let’s add some

temporal complexity

Page 33: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

How many yellow dots

appear after the blue dots?

Page 34: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 35: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 36: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Twenty

Page 37: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

HOW MANY RED DOTS WILL there Be next?

IF there were 20 red dots, put the next three blue dots in a specific category.

If there are ten blue dots, join to alternate data and see how many blue dots there are in total ?

If there were 6 green dots, wait for an hour, then compare again.

WHAT ELSE?

Page 38: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

HIGH SPEED PROCESSING

SAS EVENT STREAM PROCESSING ENGINE

DATA IN (called Events)

DATA OUT(Events)

From huge volume of

streaming data flowing at

very high rate :

Millions of records/sec

Data (Events) are

processed as soon as they

arrive (happen) :

Latency: milliseconds

Page 39: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

STREAMING DATA ANALYTICS: BENEFITS

BUSINESS

USERS

• Descriptive, Predictive, and Optimization Analytics on

continuous data streams, right now!

• Access to streaming data information

• Reduced time to decision and action

• Generation of new opportunities

• Reduced risk and cost

IT USERS

• Ability to deal with the increasing volume and speed of

big data flow

• Reduced storage (due to data aggregation) and

computational needs

• Faster to market

• Easier & quicker maintenance of analytical models

• Focus on business logic rather than production features

Page 40: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

REAL-TIME DECISIONING & ACTIONS:

BENEFITS ACROSS INDUSTRIES

Fraud Detection

Cyber Security

Telecom

Real Time Marketing

Capital Markets

ManufacturingIndustry, Energy

Supply Chain

Enterprise

Decision Management IT Operational

Page 41: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Analyze data in motion with low latencies

SAS EVENT STREAM PROCESSING ENGINE

DATA IN (called Events)

DATA OUT(Events)

Detect Events & Patterns

of Interest

Filtering

Aggregation

Pattern detection

Computations

Correlations

Procedural

Text analytics

Data quality

and much more…

Page 42: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Analytics and Insights ON Streaming DATA

TECHNOLOGY

INTEGRATION

• SAS® Event Stream Processing Engine is integrated

into some SAS Solutions and can be deployed at the

front-end of most others

• Complements batch and real-time capabilities of SAS

solutions with streaming data analysis

STREAMING

ANALYTICS

• Enables SAS analytic solutions to process streaming

events.

• Leverages analytical model results to provide real time

insights and action on streaming data

• Enabled the deployment of additive and incremental

analytic models on streaming data

Page 43: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

KEY CONCEPT

Page 44: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

DATAFLOW CENTRIC

SAS EVENT STREAM PROCESSING ENGINE

DATA IN (Events)

DATA OUT(Events)

Event

Stream

Event

Stream

Event

StreamEvent

Stream

Event

Stream

Event

Stream

Event

Stream

Design of the rule model (called “Continuous Query”)

using components (called “Windows”)

Event

Stream

DATA IN (Events)

DATA IN (Events)

DATA OUT(Events)

SOURCE1

WINDOW

SOURCE2

WINDOW

SOURCE3

WINDOW

FILTER

WINDOW

CALCULATIONS

WINDOW

JOIN

WINDOW

JOIN

WINDOW

CALCULATIONS

WINDOW

THRESHOLD

WINDOW

Page 45: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

HIGH LEVEL DEPLOYED EXAMPLES

Page 46: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Continuous Data

Integration & Streaming Analytics

Analytics Servers

High Level EXAMPLE - INVESTMENT BANK SAS &

HADOOP SOLUTION

SAS Reporting & Aggregation

In-memory data

Ad-hoc Analytics

Event Stream Processing (ESP)

RISK AGGREGATION AND

REPORTING

LONG TERM-CACHE(S)

(RISK) DATA LAKE

3rd Party Reporting, API’s & Adjustment

Applications

API

NEAR-CACHE

CLOUDERA HADOOP

DISPARATE

DATA SOURCES

Page 47: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

• Accuracy• Granularity

• Transparency• Timeliness

Performance of

existing offering is

slow and

cumbersome

Access to data takes

a long time requiring

many ad-hoc sets of

data to be extracted

by Risk IT Lack of Access

to Data and

limited Analysis

ability

Raw and

standardised

views of data

not available

No ability to trace

information through

from front-office

trades/positions to VaR

Current manual analysis

process introduces

inaccuracies

Comparison of data against

front-office reports highlights

numerous data

inconsistencies

No ability to

access granular,

low level data for

analysis

Hierarchy drilldown,

querying and filtering

capabilities required

for proper analysis

are not available

INVESTMENT BANK CHALLENGES ADDRESSED BY THE SAS

SOLUTION

Risk Aggregation

and reporting

Page 48: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

Solution Overview

High level architecture (ESP-RTDM)

Page 49: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

use case • Value from a Real case*

Take Rate: 5x Business As Usual1

Revenue Increase: €100k/day2

Setup: 8 Weeks, Payback < 6 Months 3

* 21M Active Customers, 50K TPS, Chile Argentina Spain

Page 50: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

INTERACTING WITH ESP

Page 51: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

SAS EVENT STREAM PROCESSING CQ MODEL

SOURCE1

WINDOW

SOURCE2

WINDOW

SOURCE3

WINDOW

FILTER

WINDOW

CALCULATIONS

WINDOW

JOIN

WINDOW

JOIN

WINDOW

CALCULATIONS

WINDOW

THRESHOLD

WINDOW

BUILDING AND INTERACTING WITH ESP CQ MODELS

There are five interaction points with ESP.

1) Other SAS products and solutions• such as Visual Scenario Designer

2) ESP Studio

3) ESP STREAMVIEWER

4) ESP XML Server engine

5) ESP C++ API Framework

Page 52: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

STREAMVIEWER

• HTML5 interface

• Uses HTTP (RESTful) XML server

• Has 2 modes:

• Streaming mode: display all events

• Update : events processed with

opcode

• Subscribe & Publish

Page 53: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands
Page 54: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

ESP Model

builder

ESP MODEL

Page 55: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

www.SAS.com

Q & A

Page 56: Streaming Analytics & IoT - Sas Institute · 2016-03-11 · “Streaming analytics typically means making analytically informed decisions in milliseconds, while examining many thousands

www.SAS.com

Come visit me at the IoT stand!

[email protected]