15
Previously known as Think Big. Move Fast.

Ds03 data analysis

Embed Size (px)

Citation preview

Page 1: Ds03   data analysis

Previously known as

Think Big. Move Fast.

Page 2: Ds03   data analysis

Template designed by

brought to you by

Page 3: Ds03   data analysis

SolidQ

• Born in 2002 in USA and Spain

• Established in 2007 in Italy

• More than 1000 customers and more than 200 consultants worldwide

• Dedicated to Data Management on the Microsoft Platform

• Books Authors, Conference Speakers, SQL Server MVPs and Regional Directors

• www.solidq.com

Page 4: Ds03   data analysis

Davide Mauri

• 18 Years of experience on the SQL Server Platform

• Specialized in Data Solution Architecture, Database Design, Performance Tuning, Business Intelligence

• Microsoft SQL Server MVP

• President of UGISS (Italian SQL Server UG)

• Mentor @ SolidQ

• Video, Book & Article Author

• Regular Speaker @ SQL Server events

• Projects, Consulting, Mentoring & Training

Page 5: Ds03   data analysis

Data Analysis

Page 6: Ds03   data analysis

Data Analysis

• Enterprise• SQL Server Analysis Services

• Multimensional

• Tabular

• Self-Service• Power Pivot

Page 7: Ds03   data analysis

Data Analysis

• Multidimensional -> MDX

• Strictly tied to Kimball concepts: Fact, Measures, Dimensions

• Mature Product

• Provide optimum performance on most of the cases

• Medium / High Complexity • Even for simple situations

Page 8: Ds03   data analysis

Analysis Services Multidimensional

Page 9: Ds03   data analysis

Data Analysis

• Tabular / Power Pivot -> DAX

• Based on the idea of Tables and Relationships

• Based on the concept of “Contexts”• Row & Filter Context

• In-Memory Engine• Column Store• Visually very similar to Excel

• Simple for simple things. Can become very complex for medium/complex things.

Page 10: Ds03   data analysis

Power Pivot & Analysis Services Tabular

Page 11: Ds03   data analysis

Data Analysis

• Data Mining features included in the Multidimensional Engine

• Classification algorithms • predict one or more discrete variables, based on the other attributes in the dataset.

• Regression algorithms • predict one or more continuous variables, such as profit or loss, based on other attributes in the

dataset.

• Segmentation algorithms • divide data into groups, or clusters, of items that have similar properties.

• Association algorithms • find correlations between different attributes in a dataset.

• Sequence analysis • algorithms summarize frequent sequences or episodes in data, such as a Web path flow

Page 12: Ds03   data analysis

Data Analysis

• Microsoft Association Algorithm

• Microsoft Clustering Algorithm

• Microsoft Decision Trees Algorithm

• Microsoft Linear Regression Algorithm

• Microsoft Logistic Regression Algorithm

• Microsoft Naive Bayes Algorithm

• Microsoft Neural Network Algorithm

• Microsoft Sequence Clustering Algorithm

• Microsoft Time Series Algorithm

• Plugin Algorithms

Page 13: Ds03   data analysis

Data Analysis

• Data Mining Language: DMX

• Data Mining also available through Excel AddIn

Page 14: Ds03   data analysis

Data Mining

Page 15: Ds03   data analysis

Previously known as

Think Big. Move Fast.