Data Mining with Excel 2010 and PowerPivot
Mark Tabladillo Ph.D.
MTabladillo <(at)> solidq.com
September 18, 2010

SQL Saturday 46 -- Raleigh NC
#sqlsat46

© 2010 Mark Tabladillo Ph.D.

MarkTab & Data Mining

Outline
• What is Data Mining
• What is PowerPivot
• Demos

Data Mining as a Service

Outline
• What is Data Mining
• What is PowerPivot
• Demos

Data Mining Definitions
• Data mining
• Machine Learning
• Data mining algorithms -- typically use estimation or optimization to achieve results (as opposed to only calculations).
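The contrast between a calculation and an estimate can be sketched in a few lines. This is a minimal illustration, not the add-in's implementation: the toy data, the function names, and the choice of plain gradient descent are all assumptions made for the example. The renewal rate is a one-pass calculation, while the logistic regression coefficients have no closed form and must be found by repeatedly improving a guess.

```python
import math

# Hypothetical toy data (invented): x = hours of product use, y = 1 if renewed.
xs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0]
ys = [0, 0, 0, 1, 0, 1, 1, 1]

# A pure calculation: one pass over the data, exact answer, nothing to tune.
renewal_rate = sum(ys) / len(ys)


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def log_loss(w, b):
    """Average negative log-likelihood of the data under weights (w, b)."""
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for x, y in zip(xs, ys):
        p = sigmoid(w * x + b)
        total -= y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
    return total / len(xs)


def fit_logistic(steps=2000, lr=0.1):
    """An estimate: start from a guess and step downhill on the loss."""
    w, b = 0.0, 0.0
    for _ in range(steps):
        grad_w = sum((sigmoid(w * x + b) - y) * x for x, y in zip(xs, ys)) / len(xs)
        grad_b = sum(sigmoid(w * x + b) - y for x, y in zip(xs, ys)) / len(xs)
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


w, b = fit_logistic()
print("renewal rate (calculated):", renewal_rate)
print("loss before fitting:", log_loss(0.0, 0.0))
print("loss after fitting:", log_loss(w, b))
```

The fitted loss depends on the step count and learning rate, which is the practical difference from a calculation: two runs with different settings can give slightly different answers.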

Data Mining Tasks
• Supervised
  • Answer known, what is correlated?
• Unsupervised
  • Answer unknown (unspecified), what are the groups?
• Forecasting
  • Given a trend, what is next?
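The three task types can be sketched on toy data as follows. Everything here is an assumption for illustration: the data are invented, the function names are hypothetical, and the simple techniques (a threshold rule, one-dimensional two-means, a least-squares line) stand in for the far richer algorithms in Analysis Services.

```python
# Hypothetical toy data, invented for illustration only.
customers = [
    {"age": 22, "bought": "no"},
    {"age": 25, "bought": "no"},
    {"age": 31, "bought": "no"},
    {"age": 47, "bought": "yes"},
    {"age": 52, "bought": "yes"},
    {"age": 58, "bought": "yes"},
]
monthly_sales = [100.0, 110.0, 120.0, 130.0]


def supervised_rule(rows, threshold):
    """Supervised: the answer ('bought') is known; score how well age explains it."""
    hits = sum((r["age"] >= threshold) == (r["bought"] == "yes") for r in rows)
    return hits / len(rows)


def two_means(values, iterations=10):
    """Unsupervised: no answer column; split values into two groups (1-D k-means, k=2).

    Assumes at least two distinct values so neither group ends up empty.
    """
    lo, hi = min(values), max(values)
    for _ in range(iterations):
        low_group = [v for v in values if abs(v - lo) <= abs(v - hi)]
        high_group = [v for v in values if abs(v - lo) > abs(v - hi)]
        lo = sum(low_group) / len(low_group)
        hi = sum(high_group) / len(high_group)
    return sorted(low_group), sorted(high_group)


def forecast_next(series):
    """Forecasting: fit a least-squares line to the trend and extend it one step."""
    n = len(series)
    t_mean = (n - 1) / 2
    y_mean = sum(series) / n
    numerator = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(series))
    denominator = sum((t - t_mean) ** 2 for t in range(n))
    slope = numerator / denominator
    return y_mean + slope * (n - t_mean)


print(supervised_rule(customers, threshold=40))
print(two_means([c["age"] for c in customers]))
print(forecast_next(monthly_sales))
```

Note that the unsupervised call never sees the "bought" column: it only receives the ages and has to discover the grouping on its own.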

Value Slide

Data Mining Add-In for Excel
• Requires Analysis Services instance
• Version 10.00.2531.00 (April 2009)
• 32-Bit Add-In
• Microsoft .NET Framework 2.0 (32-bit)
• Office 2007 (Professional, Professional Plus, Ultimate, Enterprise)
• SQL Server Enterprise or Standard (or Developer) 2008 or higher

The Analyze Tab

Menu Option                   | Data Mining Algorithm
Analyze Key Influencers       | Naïve Bayes
Detect Categories             | Clustering
Fill from Example             | Logistic Regression
Forecast                      | Time Series
Highlight Exceptions          | Clustering
Scenario Analysis (Goal Seek) | Logistic Regression
Scenario Analysis (What If)   | Logistic Regression
Prediction Calculator         | Logistic Regression
Shopping Basket Analysis      | Association Rules
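Read the other way around, the table shows that nine menu options rest on only five algorithms. A short sketch of that inversion follows; the dictionary transcribes the table above, while the variable and function names are hypothetical and not part of the add-in.

```python
from collections import defaultdict

# Transcribed from the Analyze tab table; names below are illustrative only.
ANALYZE_TAB = {
    "Analyze Key Influencers": "Naïve Bayes",
    "Detect Categories": "Clustering",
    "Fill from Example": "Logistic Regression",
    "Forecast": "Time Series",
    "Highlight Exceptions": "Clustering",
    "Scenario Analysis (Goal Seek)": "Logistic Regression",
    "Scenario Analysis (What If)": "Logistic Regression",
    "Prediction Calculator": "Logistic Regression",
    "Shopping Basket Analysis": "Association Rules",
}


def options_by_algorithm(mapping):
    """Group menu options under the algorithm each one uses."""
    grouped = defaultdict(list)
    for option, algorithm in mapping.items():
        grouped[algorithm].append(option)
    return dict(grouped)


grouped = options_by_algorithm(ANALYZE_TAB)
for algorithm, options in grouped.items():
    print(f"{algorithm}: {', '.join(options)}")
```

Logistic Regression backs four of the nine options, and Clustering backs both Detect Categories and Highlight Exceptions.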

Data Mining Tab

Data Mining Capacities

SQL Server 2008 R2 Analysis Services Object                    | Maximum sizes/numbers
Maximum data mining models per structure                       | 2^31-1 = 2,147,483,647
Maximum data mining structures per solution                    | 2^31-1 = 2,147,483,647
Maximum data mining structures per Analysis Services database  | 2^31-1 = 2,147,483,647
Maximum data mining attributes (variables) per structure       | 2^31-1 = 2,147,483,647

Reference: http://www.marktab.net/datamining/index.php/2010/08/01/sql-server-data-mining-capacities-2008-r2/
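The limit 2^31-1 is the largest value a signed 32-bit integer can hold, which suggests (as an inference, not something the slide states) that these objects are counted with a 32-bit signed index. The arithmetic can be checked with a short sketch; the helper name is hypothetical.

```python
import struct

LIMIT = 2**31 - 1


def fits_in_int32(n):
    """Return True if n can be packed as a signed 32-bit integer."""
    try:
        struct.pack("<i", n)
        return True
    except struct.error:
        return False


print(f"{LIMIT:,}")              # formatted with thousands separators
print(fits_in_int32(LIMIT))      # the limit itself fits
print(fits_in_int32(LIMIT + 1))  # one more does not
```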

Data Mining Tab

Outline
• What is Data Mining
• What is PowerPivot
• Demos

PowerPivot for Excel
• Take advantage of familiar Excel tools and features
• Process massive amounts of data in seconds
• Load even the largest data sets from virtually any source
• Use powerful new analytical capabilities, such as Data Analysis Expressions (DAX)
• Make the most of multi-core processors and gigabytes of memory

PowerPivot for Excel Sources
• SQL Server
• SQL Azure
• Oracle, Teradata, Sybase, Informix, IBM DB2
• OLEDB/ODBC
• Analysis Services (SSAS)
• Reporting Services (SSRS)
• Excel, Text File

PowerPivot Reference
• http://www.powerpivot.com (Product Site)
• http://www.powerpivotpro.com (Blog Site)

Outline
• What is Data Mining
• What is PowerPivot
• Demos

Resources
• MarkTab.NET
  Blog, links, video resources and information for data mining
• Blog: http://marktab.net/datamining
• Twitter: @MarkTabNet

Regroup and Conclusion
• Main Points from this Presentation

Contact Information
• Mark Tabladillo
  mtabladillo <{at}> solidq.com
• Also on: Twitter, LinkedIn
