18
Predicting Customer Satisfaction on Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui

RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

Predicting Customer Satisfaction on Airbnb

RATE MY LISTING

1

By Aysa Fan, Mohammad Habib, Janson Lui

Page 2: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

Understanding the problem

Project Objective

Assumptions

Demonstration

How It Works

● Exploratory Data Analysis

● Baseline Models

● Final Model

Takeaways

TOC

2

Page 3: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com

Understanding the Problem

3

Highly competitive rental market Customer Satisfaction Matters What drives customer

satisfaction?

Challenge 1 Challenge 2 Challenge 3

Page 4: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

Project Objective

Predict Customer Satisfaction Score using the features of an AirBnb listing.(The higher the better)

Target Audience

● Existing hosts● Potential hosts

Prediction Result

1. Customer Satisfaction Score2. Amenity List for Improvement3. Listing Comparison

4

Page 5: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

5

Our data is specific to San Francisco.

Regional Data

Listing score is an acceptable proxy for customer satisfaction.

Listing Score as Customer Satisfaction Score

We use fixed features, such as neighborhood and property type, to establish a baseline customer satisfaction score for each listing, and use changeable features like air conditioning, bed type or parking to further refine the score.

Focusing on Changeable Features

Assumptions

Page 6: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

Let’s see it in action

6

Page 7: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com

How It Works

7

Web Form

User

GunicornServer

Flask (init)

Web Form

Post

Dispatch

Response

Response

Model

Load model

Page 8: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com

Ratings Distribution Transformations

8

EDA: Dependent Variable (Ratings)

Page 9: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com9

EDA: Independent Variables - Part 1

Some amenities have higher weights in the overall model than others.

Page 10: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com10

EDA: Independent Variables - Part 2

We kept neighborhood in the model to establish a baseline score, over which the changeable features affect the score.

Page 11: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com11

EDA: Omitted Data

We omitted 3% of the data (rating < 80) from the main model

Page 12: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com12

Baseline Models: Part 1

MODEL MSE R-Square

Lasso 21.67 0.13

Median 27.246 -0.12

Mean 24.84 0

Single-Feature 25.702 -0.012

Page 13: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com13

Baseline Models: Part 2

Polynomials with different degrees Adding more data into the training model, things start to get better.

Page 14: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com14

Final Model Construction

Data

ElasticNet

Ridge

GradientBoostingRegressor

AdaBoostRegressor

Random Forest Regressor

XGBRegressor Prediction

10-Fold Cross Validation

vecstack StackingTransformer

Page 15: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

www.companyname.com15

Final Model Results

Feature Name

MSE - Training

R2 - Training

MSE - Test R2 - Test

Numeric Features

12.80 0.5 15.03 0.39

Categorical Features

19.71 0.22 21.30 0.14

Amenities 5.37 0.79 6.19 0.75

Final Model Overall

4.89 0.81 5.75 0.77

We performed permutation test on the model, and the prediction on actual data was significantly better than on randomly permuted data.

Page 16: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

Our tool can accurately predict the customer

satisfaction rating using features of a listing.

16

Hosts can improve customer satisfaction by updating their properties, and offering different amenities.

In short...

Page 17: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

Breakfast, guests, pets, and a washer/dryer matter most to customers in San Francisco.

WiFi and Cable TV, not so much!

17

Takeaway

Page 18: RATE MY LISTING - UC Berkeley School of Information · Airbnb RATE MY LISTING 1 By Aysa Fan, Mohammad Habib, Janson Lui. Understanding the problem Project Objective Assumptions Demonstration

Thanks for WatchingQuestions?

18