Intro to Predixion Software - Real World Predictive Analytics

Preview:

DESCRIPTION

Intro deck for presentation to SQL Server User Group in Bangalore

Citation preview

By SQL Server MVP Lynn Langit

Real World Predictive Analytics

About Lynn Langit

Industry awards

Microsoft – MVP for SQL

Server

Google – GDE for Cloud

Platform

10Gen – Master for MongoDB

Practicing Architect

Technical author / trainerGoogle Cloud

SeriesSQL Server

Series

Cloudera certified

Developer

2 books on SQL Server BI

What is Predictive Analytics?

“Technology that LEARNS from EXPERIENCE (data) to PREDICT the future”

It uses techniques from the following:Statistics Machine LearningData MiningGame theory

What Is Predixion Insight?

Run Server-based Predictive Applications • In the Cloud –OR-• On Premises (Uses SQL Server 2012 SSAS)

Model in Excel • Integrated Toolbars

Use Varied Data Sources• Excel, PowerPivot• SQL Server• Hadoop (future)

Collaborate with Smart Output Viewers• Visualize results of analysis in Excel or in a browser• Share results via email, comments, publishing

Predixion Insight ServerEnterprise Data Sources

Client Tools

and/or

PredixionAPI Connexion

Deployment

Predixion Cloud Architecture

Private Cloud

Excel 2007 – 2013

PowerPivot Add-in

Predixion Insight Client

Datasets, Models and Automation

Framework

PowerQuery Add-in

(SAS, SPSS, R)

PredixionPMML Connexion

Predixion Insight ServerEnterprise Data Sources

Client Tools

and/or

PredixionAPI Connexion

Deployment

Predixion On Premise Architecture

SQL Server 2012 SSAS

Excel 2007 – 2013

PowerPivot Add-in

Predixion Insight Client

Datasets, Models and Automation

Framework

PowerQuery Add-in

(SAS, SPSS, R)

PredixionPMML Connexion

Visualizing Predixion Software

Some Customer Scenarios

92%

Healthcare: Preventable Hospital Readmissions

• Hospital data is used to build a hospital-specific Predixion model • The Predixion model is then used to risk stratify patients upon admission • Risk scores are updated throughout the patient’s stay • Readmission risk scores are used by care givers to target appropriate

patient care paths.

Oil and Gas: High Velocity Streaming Predictions

Predixion model pushed into CEP (complex event processing)/StreamThousands of predictions scored per second in memory, near zero latencyContinuous processing of event streams from multiple sourcesGathers and preprocesses data into predictors for predictive modelsReports in real time on potential failures for preventative maintenance

Streaming Sensor data

Real Time Scoring

Digital oilfield In streaming data scoring Preventative Maintenance

Getting Started with Predixion

Cloud-based Predictive Analytics• Simple to set-up• Install Excel add-in• Connect to Predixion

Cloud Service

On-premise Predictive Analytics• Requires local SQL Server• Must follow install

instructions • Keeps all data on

premises

Try it out!

Insight Now DEMO

Insight Now Ribbon in Excel

Identifying Key Influencers

Many Data Viewers

• Decision Trees• TreeMap• Predictor• Application-level views• Many more • Displayed – in Excel or – in the browser

Data Viewers in Browser

Visual Macros

Insight Analytics DEMO

Preparing the data

• Use the Insight Workbench • Perform data profiling• Sample the data• Create an Application

Machine Learning Semantic Model

• PAX – Predictive Analytics Expressions• Example ‘Z-Score normalization’• N = (v-average(v))/stdev(v)

• Data Access • Insight Workbench

Insight Workbench

Insight Workbench DEMO

Insight Analytics Model Algorithms

Predixion Insight Analytics

Data Mining Algorithm

Classify Logistic Regression

Estimate Regression Trees, Gaussian Trees, Neural NetworkLinear Regression, Logistic Regression

Segment Probabilistic Clustering or K-Means Clustering

Associate Associative (Decision) Trees or Association Rules (Market Basket)

Forecast Blended Time Series (ARIMA & ARTXP)

Test the Model’s Accuracy

Manage and Share Results

Introduced in 3.0

Extensible Machine Learning Library Architecture• Pluggable to support Mahout and more

In-place Scoring APIs• Support streaming APIs, such as StreamInsight• In-database scoring with SQL Server• Brings library to the data (like MapReduce)

Marketplace• Reusable applications (by vertical)

Support in SQL Server 2012

• In-database scoring• Integration with SSIS

PAX Functions

Predixion Marketplace

Predixion Marketplace DEMO

Looking Forward

• Support for Mahout (Hadoop)• Support for R

www.TeachingKidsProgramming.org• Free Courseware • Do a Recipe Teach a Kid (Ages 10 ++)• Java or Microsoft SmallBasic• C# on Pluralsight

• recipes)

VOTECONFIRMSHARE

Keep Learning• Twitter: @LynnLangit• YouTube:

http://www.youtube.com/user/SoCalDevGal

• Hire me– To help build your BI/Big Data solution– To teach your team next gen BI– To learn more about using NoSQL

solutions

Recommended