16
Copyright © 2014, SAS Institute Inc. All rights reserved. make connections • share ideas • be inspired SAS – A Contemporary Analytics Environment Adrian Jones Advisory Business Solutions Manager, SAS

SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Embed Size (px)

Citation preview

Page 1: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

make connections • share ideas • be inspired

SAS – A Contemporary Analytics EnvironmentAdrian JonesAdvisory Business Solutions Manager, SAS

Page 2: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

Experimentation + Innovation Productionization

Two Mindsets

Page 3: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

Innovation & Experimentation = Market Leadership…

Experimentation should enable Innovation

Experimentation needs data from within AND outside your organization – in fact there should be NOTHING that is off limits respecting privacy of course

Innovation, hopefully, leads to new services and new approaches that can be moved into production in a targeted and governed manner to ensure compliance amongst other things

Innovation should not be constrained by the

data you already have or can “afford” to use!

Page 4: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

Idea

Explore

Validate

010101010101010

01010101010101010

0101010101010101010

010101010101010101010

01010101010101010101010

0101010101010101010101010

01010101010101010101010

010101010101010101010

0101010101010101010

01010101010101010

010101010101010

Internal

Data

External

Data

Outcomes

No such thing as a bad

outcome just the result

of testing an idea!

The Big Data Lab - Unlocking possibilities

Page 5: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

The Big Data Lab - Unlocking possibilities at the lowest possible cost with accessible skills

• Commodity HW

• Low Entry Point

• Easy to expand

• Cheap Storage/Disk

Hardware Software Skills

• Scalable Foundation

• Data Ingestion

• Data Transformation

• Data Visualization

• Broad Array of Analytics

• Programming

• Data Manipulation

• Analytics

= Data Scientist or just

an inquisitive mind?

Mandate is to come up with ideas for transformative

disruptive change not just incremental improvements

Page 6: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

The Big Data Lab is driving

the adoption of Hadoop at

MANY organizations

Page 7: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

Hadoop Adoption - What are they doing right now?

Page 8: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

Business ProcessesIntegration

AnalyticsData

Looking for efficiency and repeatability to produce something of the highest value at the lowest cost possible.

Next Generation Analytic Platform - Need to focus on more than just data!

Page 9: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

Repeatable steps for obtaining the most value from all styles of analytics

Leading companies often recognise the need for a process, standardize it and then continuously focus on speeding up the cycle

A problem need not touch every part of this cycle

Big Data is already having an impact on the whole of the analytics lifecycle

Next Generation Analytic Platform - Need to support ALL parts of the Analytics Lifecycle

IDENTIFY /

FORMULATE

PROBLEM

DATA

PREPARATION

DATA

EXPLORATION

TRANSFORM

& SELECT

BUILD

MODEL

VALIDATE

MODEL

DEPLOY

MODEL

EVALUATE /

MONITOR

RESULTS

Page 10: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

The new wave - Can Hadoop play a major role going forwards in the factory?

The EDW and Data Marts are not going away

The role of the EDW might change

Ad-hoc BI and Analytics does not need an EDW designed for transactions!

Is Hadoop the technology to free us from the shackles?

Skills in Hadoop technologies is a big issue

Page 11: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

Hadoop as a new

data store

Hadoop as an additional input to

the EDW

Hadoop as a foundation for BI &

Analytics

Hadoop as a “data lake” or

staging area for the

Warehouse

Many possible roles for Hadoop…

Page 12: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

IDENTIFY /

FORMULATE

PROBLEM

DATA

PREPARATION

DATA

EXPLORATION

TRANSFORM

& SELECT

BUILD

MODEL

VALIDATE

MODEL

DEPLOY

MODEL

EVALUATE /

MONITOR

RESULTSSAS Visual Analytics

SAS Visual Statistics

SAS In-Memory Statistics for Hadoop

Done using either the Data

Preparation, Data Exploration or

Build Model Tools

SAS High Performance Analytics Offerings

supported by relevant clients like SAS

Enterprise Miner, SAS/STAT etc.

Done using the Build Model

Tools and other checks

SAS Scoring Accelerator for Hadoop

SAS Code Accelerator for Hadoop

SAS Visual Analytics

SAS Data Loader for Hadoop, SAS Access

to Hadoop/Impala, SAS Event Stream

Processing Engine, SAS Federation Server

…and now we are delivering solutions on top

of this such as SAS High Performance Anti-

Money Laundering, SAS High Performance

Risk, SAS High Performance Marketing

Optimization etc.

Our strategy is to enable the entire lifecycle around HADOOP

Page 13: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

IDENTIFY /

FORMULATE

PROBLEM

DATA

PREPARATION

DATA

EXPLORATION

TRANSFORM

& SELECT

BUILD

MODEL

VALIDATE

MODEL

DEPLOY

MODEL

EVALUATE /

MONITOR

RESULTSSAS Visual Analytics

SAS Visual Statistics

SAS In-Memory Statistics for Hadoop

Done using either the Data Preparation,

Data Exploration or Build Model Tools

SAS High Performance Analytics Offerings

supported by relevant clients like SAS

Enterprise Miner, SAS/STAT etc.

Done using the Build Model

Tools and other checks

SAS Scoring Accelerator for Hadoop

SAS Code Accelerator for Hadoop

SAS Visual Analytics

SAS Data Loader for Hadoop (Includes the DQ

Accelerator for Hadoop and the Code

Accelerator for Hadoop), SAS Access to

Hadoop/Impala, SAS Event Stream Processing

Engine, SAS Federation Server

Scoring Accelerator: Also available for SAP HANA,

Oracle Exadata, Teradata, Pivotal (Greenplum), IBM

DB2, Netezza (IBM Pure Data for Analytics)

Also available for Oracle Exadata,

Teradata, Pivotal (Greenplum)

Code Accelerator: Also available for Teradata,

Pivotal (Greenplum)

Visual Analytics and Visual

Statistics: Also available to run

directly on Oracle Exadata,

Teradata, Pivotal (Greenplum) or

on a standard server reading data

from almost anywhere.

Support for most major RDBMS

systems

Visual Analytics : Also available to run

directly on Oracle Exadata, Teradata,

Pivotal (Greenplum) or on a standard

server reading data from almost anywhere.

Our strategy is to enable the entire lifecycle around HADOOP (and RDBMS if that is your choice)

Page 14: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

Hadoop is scalable and cheap (storage & processing)

Hadoop is rapidly evolving and maturing

Hadoop is rapidly becoming mainstream technology

Major marketplace interest, activity and spend

Increasing adoption rates by your competitors

Deals with two needs – cost reduction and innovation

The technologies already exist to help from SAS!

Hadoop + SAS – a contemporary analytics platform

Page 15: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

The overwhelming majority of organizations

have neither a finely honed analytical capability

nor a detailed plan to develop one.

Source: Competing on Analytics

Page 16: SAS – A Contemporary Analytics Environment · DB2, Netezza (IBM Pure Data for Analytics) Also available for Oracle Exadata, Teradata, Pivotal (Greenplum) Code Accelerator: Also

Copyright © 2014, SAS Institute Inc. All rights reserved.

make connections • share ideas • be inspired

[email protected]