Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Gürdal Ertek | College of Business | Abu Dhabi University, UAE
Abu Dhabi | Dubai | Al Ain | Al Dhafra
Data-Driven Marketing: A 20-Minute Crash Course
IT in 21st Century• Emerging information technologies (IT)
• shaping and transforming • organizations, • industries, and even • nations
• include• big data, • data science and artificial intelligence, • cyber security, • virtual reality,• cloud systems, • distributed computing, • mobile technologies, and • RFID (radio frequency identification). 2
• "Data science is the study of data and
• a data scientist is someone who solves problems by studying data.
• So pretty much, all science is data science."
Siraj Raval
3
What is Data Science?
Sou
rce:
htt
ps:
//b
it.ly
/2e2
zF4
L
4
Who are involved in data science?
Source: https://bit.ly/2GMA1cQ
5
How is data analyzed?
Source: https://bit.ly/2J7u6gVCross-industry standard process for data mining
6
▪Research & Teaching:▪ Data Science & AI▪ Project Management▪ Supply Chain Management▪ R&D Management
▪ErtekProjects.com▪ Publications▪ Online Training
▪Email: [email protected]
Speaker: Dr. Gurdal Ertek[Dr. Good News]
7
Abu Dhabi University – College of Business
7
•Abu Dhabi
•Dubai
•Al Ain
•Al Dhafra
Outline• Data Science
• Roles in Data Science
• How is Data Analyzed?
• Speaker Bio
• Lessons1. Pivot Table Analysis
2. Clustering Similar Customers
3. Clustering Similar Products
4. Market Basket Analysis
5. Time Series Prediction
8Available Under: https://ertekprojects.com/cet2018
9
LESSON 1
Pivot Table Analysis
1
Available Under: https://ertekprojects.com/cet2018
10
Source Data: Multinational Sales Transactions
Source: https://tinyurl.com/cet2018lesson1
11
Planning the Pivot Table Analysis
12
Sample Pivot Tables
13
LESSON 2
Clustering Similar Customers
2
Available Under: https://ertekprojects.com/cet2018
14
Source Data: Pivot Table
…
…
15
Clustering Similar Products
16
Clustering Similar Customers
17
Customers in Each Cluster (with similar purchase patterns)
Cluster 4
Cluster 5
18
Profiling Customer Clusters
19
LESSON 3
Clustering Similar Products
3
Available Under: https://ertekprojects.com/cet2018
20
Source Data: Weekly Sales of P1…P811
…
…Source: https://tinyurl.com/cet2018lesson3
21
Clustering Similar Products
40
881
22
Clustering Similar Products
23
Products in Each Cluster (with similar sales patterns)
Cluster 19Cluster 20
Cluster 21
24
LESSON 4
Market Basket Analysis
4
Available Under: https://ertekprojects.com/cet2018
Association Mining for Market Basket Analysis• Association Mining
• Very popular analytical method.
• Interpretable and actionable results.
• Patterns of "appearing together".
• Frequent itemsets• Sets of items appearing together frequently.
• Association Rule [if A then B]• If Item A is observed,
Then Item B is also observed.
25
Association Mining for Market Basket Analysis
• Association Rule [if A then B]• If Item A is observed,
Then Item B is also observed.
26
Support: Fraction of transactionshaving both Item A and Item B.
Confidence: Conditional probability ofobserving Item B, given that Item A is observed.
𝑆 =𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦(𝐴 ∩ 𝐵)
𝑇
𝐶 =𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦(𝐴 ∩ 𝐵)
𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦(𝐴)
27
Source Data: Grocery Sales Transactions
… …
Source: https://tinyurl.com/cet2018lesson4
28
Association Mining
29
Association Rules
IF THEN
30
Association Rule Graph
31
Filtering the Rules for an Item
IF baking_powderTHEN
• whole_milk (with 52% confidence)• other_vegetables (with 41% confidence)• whipped_sour_cream (with 26% confidence)
32
LESSON 5
Time Series Prediction
5
Available Under: https://ertekprojects.com/cet2018
33
Source Data: Grocery Sales Transactions
… …
34
Time Series Prediction
35
Time Series Prediction
36
BONUS: LESSON 6
Chatbots
6
Available Under: https://ertekprojects.com/cet2018
37
Google DialogFlow
38
Google DialogFlow
39
Google DialogFlow
40
Google DialogFlow
41
Google DialogFlow
42
FINAL WORDS
:)
Available Under: https://ertekprojects.com/cet2018
Lessons Learned
43
• 80% of project will be data engineering & preparation.
• "A Taxonomy of Dirty Data" filetype:pdf
• MS Excel (especially Pivot Table) is sufficient for many projects.
• Target visual analytics before machine learning.
• Start with free visual modeling software: Orange, RapidMiner.
44
• For big data, cloud computing is inevitable.• SAP, IBM, Oracle, Microsoft, AWS, Google
• Digital marketing data is at least as valuable as ERP data.
• Junior Data Scientist = $100 for Udemy Courses
+ Free Kaggle datasets+ 1,000 hours hard work
Lessons Learned
Gürdal Ertek | College of Business | Abu Dhabi University, UAE
Abu Dhabi | Dubai | Al Ain | Al Dhafra
Data-Driven Marketing: A 20-Minute Crash Course
Acknowledgement
46