Upload
others
View
4
Download
0
Embed Size (px)
Citation preview
Study Unit 1
CRISP-DM
ANL 309 | Business Analytics Applications
© 2014 SIM University. All rights reserved.
Introduction
• There are various steps in the cross-industry t d d f d t i i (CRISP DM)standard process for data mining (CRISP-DM)
framework
© 2014 SIM University. All rights reserved.
Why Is There a Need for a Framework?
• To be effective, business analytics must be part of the business process.
• This implies that the results of a business analytics project must be p y p jactionable.
• To exact the optimum benefits of applying business analytics, someTo exact the optimum benefits of applying business analytics, some process must be in placed to integrate the business analytics into the business process.
© 2014 SIM University. All rights reserved.
Virtuous Cycle of Data Mining
• Berry and Linoff (2000,2004) term this process as the “virtuousBerry and Linoff (2000,2004) term this process as the virtuous cycle of data mining” and define it as “an iterative learning process that builds on results over time” (Berry and Linoff, 2004, p22).
• Business analytics is continuous learning process integrated into the business process and produces results, actedinto the business process and produces results, acted upon by the companies to increase its customer value.
© 2014 SIM University. All rights reserved.
CRISP-DM Framework
• The Cross industry Standard Process for Data Mining• The Cross-industry Standard Process for Data Mining (CRISP-DM): This is a framework adopted in the application of business analytics to customer relationship management
• The CRISP-DM integrates business analytics into the business process by focusing data mining or business analytics technology on specific business problemstechnology on specific business problems.
© 2014 SIM University. All rights reserved.
Six Phases in CRISP-DM
1. Business Understanding
2. Data Understanding
3. Data Preparation
4 Modeling4. Modeling
5. Evaluation
6. Deployment
© 2014 SIM University. All rights reserved.
CRISP – DM Framework
© 2014 SIM University. All rights reserved.
Data Understanding
Collect the data
Perform exploratory data analysis to understand the dataPerform exploratory data analysis to understand the data
Evaluate the quality of the data
Identify subset of the data for modelling
© 2014 SIM University. All rights reserved.
Data Preparation
Prepare the raw data setPrepare the raw data set
Select the relevant variables and cases for data mining objectives
Apply the necessary transformations to the data
Deal with missing values
Most labour-intensive phaseost abou te s e p ase
© 2014 SIM University. All rights reserved.
Modelling
• Select and apply the appropriate modeling techniques.
• Build and calibrate several models where appropriate.pp p
• Loop back to the data preparation phase if necessary.
© 2014 SIM University. All rights reserved.
Evaluation
E l t th d l i t f th d t i i bj ti d• Evaluate the models in terms of the data mining objectives and the business goals as identified in the business understanding phase.
• Review the process and decide on the follow-up action.
© 2014 SIM University. All rights reserved.
Deployment
Prod ce final report and presentation• Produce final report and presentation.
• Identify how the results can be translated into appropriate business strategies to achieve the business goals.
© 2014 SIM University. All rights reserved.
Business Analytics: An Iterative Process
• The sequence of the phases is not rigid. In fact, moving back and forth among the phases is always required.
• The business analytics process is iterative, requiring the lessons learnt during the processes be it inputs into the existing or new and more focused business questions and processes
© 2014 SIM University. All rights reserved.