59
Database Marketing Dr. Ron Rymon Marketing Communications Program IDC, Herzliya

Database Marketing

Embed Size (px)

DESCRIPTION

Database Marketing. Dr. Ron Rymon Marketing Communications Program IDC, Herzliya. Overview. Goal: describe the framework, and touch on the current trends and buzzwords Outline: Uses of the marketing database The Data Implementation technologies Analysis techniques Modeling techniques. - PowerPoint PPT Presentation

Citation preview

Page 1: Database Marketing

Database Marketing

Dr. Ron Rymon

Marketing Communications Program

IDC, Herzliya

Page 2: Database Marketing

Overview

Goal: describe the framework, and touch on the current trends and buzzwords

Outline: Uses of the marketing database The Data Implementation technologies Analysis techniques Modeling techniques

Page 3: Database Marketing

Uses of the Marketing Database

Page 4: Database Marketing

The Marketing Database

• Comprehensive collection of interrelated data ...

• Arranged around each customer ...

• Allow timely and accurate retrieval ...

• Support analytical, predictive, operational needs ...

• Serving multiple applications …

Page 5: Database Marketing

Active Database:An Integrated Business Resource

Marketing

Distribution

CustomerService

SalesResearch

Finance

Database

Page 6: Database Marketing

Information is Power:Active databases drive the business…

• Identify your best customers– profitability analysis, clustering

• Develop new customers and cross-sell– similar to current, identify competitors’ customers

• Improve delivery of sales promotion– response modeling / targeting

• Personalize message– based on purchase patterns, volume

• Use as a research tool across the organization– customer, product, market research

Page 7: Database Marketing

Key Building Blocks

• Data– a database is only as powerful as its data

• Implementation technologies– hardware, networking, warehouses

• Analysis techniques– RFM, LTV, OLAP, Segmentation, Visualization

• Modeling– Regressions, Artificial Intelligence, Data Mining

Page 8: Database Marketing

The Data

A database is only as powerful as the data it houses.

Page 9: Database Marketing

• Every database is a collection of records• Each record is a collection of fields• Here, one record per customer and/or prospect

– unique identifier– general customer/account information

• including demographic, psychographic, socio-economic

– our offers+communications to the customer– customer’s actions: response, purchase, payment

Customer-centric Database

Page 10: Database Marketing

What Data To Hold

• Too often, data is collected based on availability, and not based on projected need

• Should accumulate internally– data that can be used to support current and future

strategies (mktg and otherwise, e.g., operations)

– …. data that may be valuable to other organizations

• Should source external data– unavailable internally

– too expensive to maintain/update

Page 11: Database Marketing

What’s in YOUR database?

Data Requirements

Basic Demo Usage Payment

Targeting

Retention

Mar

keti

ng +

oth

erA

ppli

cati

ons

Usageupgrading

Page 12: Database Marketing

Data Sources : Internal

• Operations / Sales– past usage/purchase, e.g., amount, variability

• Finance– payments, e.g., timeliness, amounts

• Customer service– e.g., inquiries, complaints

• Other data collection methods:– sales/orders, promotions/drawings, inquiries, surveys,

warranty cards, research panels...

Page 13: Database Marketing

Data Sources: Distribution Channels

• Many companies use distributors, retailers

• Problem: lack of direct communications with end-customer, no “relationship”

• Part-solution 1: keep tabs on channel + aggregate statistics on customers

• More aggressive solution: special marketing programs to reach customers

Page 14: Database Marketing

Data Sources : External Lists

• 50% of U.S. DMers sell their lists

• Use to enlarge universe : new names

– can buy segments by specific features (model)

• Enhance data : cross information

– U.S. census data

– Credit bureau

– Various marketers of related products

– List compilers / maintainers / sellers

John Smith 60 10.2

Eric Cohen 35 1.3

Jack Marshal 20 0

enhance data

enlargeuniverse

Page 15: Database Marketing

Other Data Sources

• Mass-advertised offers– TV call-ins, direct response

• Joint offers with other merchants– take-one brochures in banks, restaurants– drawings

• Trade shows, happenings, community activity

• Referrals!

Page 16: Database Marketing

Data Management

• Many sources:– conversions, transformations, cleaning, merge-purge

• Many “clients”– marketing, sales, product managers, operations

• Temporal issues– updates, audits, archives/deltas

• Quantities: huge databases– Storage, access, processing, communications

• Resolution, Enhancement

Page 17: Database Marketing

Merge-Purge : Example

• Palmer, Robert and Mary, 123 Sun Avenue, Apt 7, Key West, FL 31250

• Dr. Robert C. Palmer, Custom Engineers LLC, 123 Sun Avenue, 7th Floor, Key West, FL 31250

• Rob Charles Palmer, CE Inc., 123 Sun Ave #7, Key West FL 31250

• Bob Palmer Jr., 123 Sun #7, Key West, FL 31252• Maria Palmer, 123 Sun Avenue, Suite 7, Key

West, FL 31250

Page 18: Database Marketing

Other Issues

• Legal– Privacy Act– Anti-discrimination– Advertising Code– Telephone Consumer Protection Act

• Consumer groups– Right to be omitted (just write to DMA)– Environmental issues

• DMA invests in education:– Dmers: best practice– Customers: better image

Page 19: Database Marketing

Implementation Technologies

Page 20: Database Marketing

Computing Platforms

• Issues / Needs:– Information sources + integration

– Storage/access, maintenance, completeness, update

– Computation: process queries, algorithms

– Analyses and reports, feed to operations, customer/user interaction

• Trends:– Traditionally, all DMers used mainframes

– Today, some migration to mid-range (UNIX)

– PC-based computers gaining power (NT)

– Client/Server architectures

– Everything networked

Page 21: Database Marketing

Applications

• Database is a foundational software

• Must support variety of applications:– transaction processing

– analyses

– on-line interaction

• Trends:– Relational databases

– Data warehouses

– Data redundancy/multiplicity

Database

Database Management

O.S.

Page 22: Database Marketing

Relational Database

ID Cust Name Address …1234 John Brown 123 Main St ….

ID Date Cust Product Quant. Price98765 3.5.98 1234 A703 5 150.0098766 4.5.98 1234 A707 2 240.0098767 4.5.98 1235 A703 1 30.00

Transactions for John Brown

3.5.98 5 Levis Jeans $1504.5.98 2 CK Jacket $120

Purchases of Levis Jeans

3.5.98 John Brown 5 $1504.5.98 Jane Doe 1 $30

ID Product Supplier …..A703 Levis Jeans S7003 ….

Tables

Reports (SQL Queries)

Page 23: Database Marketing

Data Warehouse

• Stores data for informational and analytical processing

– Separate from operations

– Subject-oriented

– Integrated

– Historical

Operational Data Warehouse

loans

credit card

savings customer

product

investments

Page 24: Database Marketing

Example: Computer-by-Mail Inc.

House Files

TelemarketersClient --- Server

mail

operations

Analyst

MktgExecutive

DataWarehouse

Page 25: Database Marketing

Analysis Techniques

Page 26: Database Marketing

Data Limitations

• Important: The data is a limited encoding of reality

• Many potholes:– Omission

– Errors, noise

– Representation

– Sampling bias

• Cannot be too careful !

Page 27: Database Marketing

Exploratory data analysis :Single-variable

• Descriptive statistics– Mean, Median– Variance

• Histograms– Shows distribution

0%

5%

10%

15%

20%

25%

30%

35%

40%

0-20 20-30 30-40 40+

Page 28: Database Marketing

Exploratory data analysis:Multi-variable

• Examine relationship between two or more variables– Cross tabs

– Correlation

– Scatter plots

– Clustered histograms

0%

5%

10%

15%

20%

25%

30%

35%

40%

45%

0-20 20-30 30-40 40+

BuyerNoBuy

Page 29: Database Marketing

RFM Analysis

• RFM score– Recency: how close is the last purchase– Frequency: number of recent purchases– Monetary: dollars spent recently

• Example:– Recency: 10 pts if within 3 mos, 1 pt lower per additional

month, to 1– Frequency: 1 pt for each purchase within 12 months– Monetary: 1pt for each $100 in past year, to 10– Score=R*F*M, the higher the better

Page 30: Database Marketing

Life-Time Value of Customers

• LTV Goal:– recognize each customer’s contribution

• Method:– calculate the “expected” net revenue

– discounted:• risk of attrition

• probability of sales

• rate of money

• Typically computed per 1000, if possible by segment

Page 31: Database Marketing

OLAP Tools

• OLAP : On-Line Analytical Processing

• Goal: A database-driven system that provides– Fast

– Analysis• common business reports, statistics

– of Shared• same information available to many users

– Multi-dimensional• every piece of information is multiply categorized

– Information“The OLAP Report”

Page 32: Database Marketing

OLAP Tools

• Data represented internally as multi-dimensional cube

– e.g., customer’s attributes, purchases, payments, etc.

• User chooses presenting two dimensions at a time

– e.g., show $-sales, by geographic region and income

• Heavy use of hierarchical variables, with drilling capabilities:

– time: year, quarter, month, week, day, hour

– product: hardware, printers, small printers, PX-1000

– dollars: by ranges 0-1000, 1000-5000, 5000-25000, etc.

• Analyses, highlights of interesting cases, etc.

Page 33: Database Marketing

OLAP Tools: Example Screen

Page 34: Database Marketing

Data Visualization Tools

• Many relationships are best communicated visually:– histograms, pie-charts, scatter plots, graphs

– use color/texture, shapes

– temporal animations

• Visualization software allows– single-variable over time

– one variable as a function of another

– interaction detection

– segmentation

Page 35: Database Marketing

Modeling Techniques

Page 36: Database Marketing

Modeling Behavior

• Target variable– a.k.a. dependent/modeled/explained variable

– typically, whether bought/responded or not

• Goal:– Use other variables in a model to classify/predict

– other variables: a.k.a. independent, observable, explaining

– model: formula, algorithm

• Success criterion: future performance

Page 37: Database Marketing

Modeling and Validation Framework

• Data flow:– Historical data

– Modeling software

– Constructs model

– Tested on more historical data

– Repeat until satisfied

– Use model to predict

Training SetModel

Productionor Test Set

Output

Page 38: Database Marketing

Critical Success Factors

• Choice of data– scope: same/similar period, audience, offer,

communication

– explaining variables: available, useful, well-represented

• Choice of modeling technique– appropriate for the goal

– powerful: good fitting power

• Careful and “pessimistic” testing and validation

Page 39: Database Marketing

Validation

• NEVER test on same data set– avoid “memorizing” the data, overfitting

• Out-of-sample methods– separate training set and test set– cross-validation, a.k.a. jack-knifing– remember temporal aspect

• Evaluate the model’s robustness– estimate chance probability, bootstrapping

Page 40: Database Marketing

Classification v. Prediction Systems

• Classification systems:– distinguish few types of customers, e.g., responded or not– technically, target variable is discrete/categorical– validation through “hit rate”

• Prediction systems– predict probability of purchase, or purchase dollars– technically, target variable is continuous– validation through “closeness” measures

Page 41: Database Marketing

Linear Scoring Systems

• Use linear regression

• Coefficients evaluated using historical data

• Higher score interpreted as greater likelihood of responding

• Every coefficient measures “independent” contribution

• Classification variant: discriminant analysis– e.g., predict response if score is above 0.3

dcba AgesePastPurchaIncomeScore

Page 42: Database Marketing

Logistic Regressions

• Logistic regression (logit)

• Target variable– historical data: 0 or 1

– future application: used as probability

• Independent variables: continuous or categorical• Probit: variation that relies on normal distribution

dcba

dcba

1

AgesePastPurchaIncome

AgesePastPurchaIncome

e

eyProbabilit

Page 43: Database Marketing

Presenting and Evaluating Results

• Lift table

Top Scoring % Respond %Non-Respond5% 26.8% 4.9%

10% 41.2% 9.8%15% 52.4% 14.8%20% 62.6% 19.8%

50% 87.9% 49.7%75% 96.5% 74.8%

Page 44: Database Marketing

Presenting and Evaluating Results

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

% Non-Responders

%R

es

po

nd

ers

• Lift curve (a.k.a. Receiver Operating Characteristic curve)

Page 45: Database Marketing

Presenting and Evaluating Results

• Confusion Matrix (given a specific threshold)

• Accuracy=(Pr,r+Pn,n)/Total• Detection=Pr,r/(Pr,r+Pr,n)• Two error types: Pr,n and Pn,r

PredictedRespond

PredictedNon-Respond

Actual Respond Pr,r Pn,r

ActualNon-Respond

Pr,n Pn,n

Page 46: Database Marketing

Break-even Analysis

• Issue: how much to mail?• Solution: find break-even point• e.g.:

• Caution: use held-out data!

%mail %respond Income Cost ofproduct

Cost ofmail

MarginalProfit

5% 25% $1,250 $750 $50 $45010% 21% $1,050 $630 $50 $37015% 16% $800 $480 $50 $27020% 11% $550 $330 $50 $17025% 6% $300 $180 $50 $7030% 3% $150 $90 $50 $1035% 2% $100 $60 $50 -$10

Page 47: Database Marketing

Non-Linear Systems

• In regressions, a change in one independent variables always affects in same direction– e.g., if age affects positively, then the older the better, always

• One solution: transformations– e.g., if U-shaped relation, use quadratic form

• Or, use non-linear techniques:– Neural networks

– Decision trees

– Other: rule-based systems, genetic algorithms, Bayesian nets

Page 48: Database Marketing

Neural Networks

• Motivated by biological nervous system

• Perceptron = a model of a neuron

WiXi

WiXi

e

eactivation

1

1x 2x 3x 4x 5x

1w5w

4w2w 3w

Page 49: Database Marketing

Classical Neural Net

• Multi-layer network of perceptrons

• Proper weights are “discovered” from random– forward propagation of training set

– compare output to actual target variable

– back propagation of error to adapt weights

Page 50: Database Marketing

50

Decision Trees

• Partition the data based on one attribute...

A B C D Resp.0 0 1 0 Buy0 1 1 1 Buy1 0 1 0 No1 1 0 0 Buy1 1 1 1 No

Page 51: Database Marketing

51

A=0

A=1

Induction of Decision TreesInduction of Decision Trees

Recursively, partition each of the nodes

A B C D Resp.0 0 1 0 Buy0 1 1 1 Buy1 0 1 0 No1 1 0 0 Buy1 1 1 1 No0 0 1 0 Buy

0 1 1 1 Buy1 0 1 0 No1 1 0 0 Buy1 1 1 1 No

Page 52: Database Marketing

52

0 0 1 0 Buy0 1 1 1 Buy

1 0 1 0 No1 1 1 1 No

A B C D Resp.0 0 1 0 Buy0 1 1 1 Buy1 0 1 0 No1 1 0 0 Buy1 1 1 1 No

Induction of Decision TreesInduction of Decision Trees

…until the node is homogeneous

1 1 0 0 Buy

1 0 1 0 No1 1 0 0 Buy1 1 1 1 No

A=0

A=1

C=0 C=1

A B C D Resp.0 0 1 0 Buy0 1 1 1 Buy1 0 1 0 No1 1 0 0 Buy1 1 1 1 No

Page 53: Database Marketing

53

Classification

• Go down a matching path...

A=0

A=1

C=0 C=1

Buy No

(A=1,B=0,C=0,D=1)

Buy

Page 54: Database Marketing

54

ClassificationClassification

Continue...

A=0

A=1

C=0 C=1

(A=1,B=0,C=0,D=1)

Buy

Buy No

Page 55: Database Marketing

55

Classification

A=0

A=1

C=0 C=1

No

(A=1,B=0,C=0,D=1)

NoBuy

Buy

…until reaching a leaf Use the leaf’s probability

Page 56: Database Marketing

56

Set of Rules, or Market Segments

A=0

A=1

C=0 C=1

A=0 => BuyA=1 and C=0 => BuyA=1 and C=1 => No

Buy

Buy

NoNo

• Each rule represents a market segment

Page 57: Database Marketing

Which Modeling Technique?

• Decision trees (ChAID, CART, C4.5)– symbolic: model is interpretable as set of rules

– essentially is a segmentation

– useful when few “classes”, e.g., based on action (send/not, or few offer types)

• Regressions, neural nets– numeric: allows fine-tuning, e.g., for prediction or ranking

– model is hard to interpret and used as “black-box”

– useful when target is continuous

Page 58: Database Marketing

Data Mining

• Knowledge Discovery in Databases (KDD)

• KDD is the process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data

• Includes all steps of data preparation + management• Data mining step uses statistical techniques, decision

trees, neural nets, etc.

Page 59: Database Marketing

Summary

• Customer data can be leveraged to better understand and manage current customers, and target new ones

• Data analysis and visualization– insights about our customers

– business economics

• Modeling– “mined” insights

– classify/predict behavior