Exploiting Social Context for Review Quality Prediction


Yue Lu (University of Illinois at Urbana-Champaign), Panayiotis Tsaparas (Microsoft Research), Alexandros Ntoulas (Microsoft Research), Livia Polanyi (Microsoft)

April 28, WWW’2010 Raleigh, NC


Why do we care about Predicting Review Quality?

User reviews (1764)

User “helpfulness” votes help prioritize reading

But not all reviews have votes:
1. New reviews
2. Reviews aggregated from multiple sources


What has been done?

• Treated as a classification or regression problem: learn from labeled reviews (√ / ×), predict the quality of unlabeled reviews (?)
• Textual features
• Meta-data features

[Zhang & Varadarajan '06] [Kim et al. '06] [Liu et al. '08] [Ghose & Ipeirotis '10]


Reviews are NOT Stand-Alone Documents

We also observe…

Reviewer Identity + Social Network = Social Context

Our Work: Exploiting Social Context for Review Quality Prediction


Roadmap

• Motivation
• Review Quality Prediction Algorithms
• Experimental Evaluation
• Conclusions


Text-only Baseline

FeatureVector(review) = Textual Features:

• Text Statistics: NumSent, NumTokens, SentLen, CapRatio, UniqWordRatio
• Syntactic: POS:RB, POS:PP, POS:V, POS:CD, POS:JJ, POS:NN, POS:SYM, POS:COM, POS:FW
• Conformity: KLDiv
• Sentiment: SentiPositive, SentiNegative
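As an illustration (not the authors' code), the Text Statistics group could be computed roughly as follows; the Syntactic, Conformity, and Sentiment groups would additionally need a POS tagger, a reference language model for the KL-divergence, and a sentiment lexicon.

```python
import re

def text_statistics(review_text):
    """Compute the Text Statistics features of a review (illustrative sketch).
    CapRatio is assumed here to be the fraction of capitalized tokens."""
    sentences = [s for s in re.split(r"[.!?]+", review_text) if s.strip()]
    tokens = review_text.split()
    num_sent = len(sentences)
    num_tokens = len(tokens)
    sent_len = num_tokens / num_sent if num_sent else 0.0            # avg tokens per sentence
    cap_ratio = sum(t[0].isupper() for t in tokens) / max(num_tokens, 1)
    uniq_word_ratio = len({t.lower() for t in tokens}) / max(num_tokens, 1)
    return [num_sent, num_tokens, sent_len, cap_ratio, uniq_word_ratio]
```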


Base Model: Linear Regression

Quality(r_i) = Weights × FeatureVector(r_i) = $w^\top x_i$

$w = \arg\min_w \sum_{i \in \text{labeled}} \left( w^\top x_i - q_i \right)^2$, where $q_i$ is the gold-standard quality of review $r_i$

Closed-form: $w = (X^\top X)^{-1} X^\top q$, with $X$ the review-feature matrix and $q$ the quality vector of the labeled reviews
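A minimal numpy sketch of this base model (illustrative, not the authors' code): `X` stacks the textual feature vectors of the labeled reviews and `q` holds their gold-standard quality scores.

```python
import numpy as np

def fit_base_model(X, q):
    """Ordinary least squares in closed form: w = (X^T X)^{-1} X^T q."""
    return np.linalg.solve(X.T @ X, X.T @ q)

def predict_quality(X, w):
    """Predicted quality of each review is the dot product of its features with w."""
    return X @ w
```

In practice a small ridge term ($X^\top X + \lambda I$) is often added so the linear system stays well conditioned.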


Straightforward Approach: Adding Social Context as Features

FeatureVector(review) = Textual Features + Social Context Features

• Reviewer History: NumReview, AvgRating
• Social Network: InDegree, OutDegree, PageRank

Disadvantages:
• Social context features are not always available
  – Anonymous reviews?
  – A new reviewer?
• Need more training data
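A tiny sketch of this AddFeatures baseline (function and field names are illustrative): the social-context block is simply appended to the textual vector and has to be zero-filled when the reviewer is anonymous or new, which is exactly the weakness noted above.

```python
import numpy as np

def add_features(text_features, social_features=None):
    """Concatenate textual and social-context features.
    social_features = [NumReview, AvgRating, InDegree, OutDegree, PageRank];
    falls back to zeros when the reviewer is anonymous or has no history."""
    if social_features is None:
        social_features = np.zeros(5)
    return np.concatenate([np.asarray(text_features, dtype=float),
                           np.asarray(social_features, dtype=float)])
```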


Our Approach: Social Context as Constraints

Our Intuitions:
• Reviewer Identity: Quality(review_1) is related to Quality(review_2) when both reviews come from the same reviewer
• Social Network: Quality(review) is related to the reviewer's Social Network

How to combine such intuitions with Textual info?


Formally: Graph-based Regularizers

$w = \arg\min_w \left\{ \sum_{i \in \text{labeled}} (w^\top x_i - q_i)^2 + \beta \times \text{GraphRegularizer}(w) \right\}$

The first term is the baseline loss function over the labeled data; the graph regularizer is designed to "favor" our intuitions and also covers the unlabeled data; β is a trade-off parameter.

Advantages:
• Semi-supervised: makes use of unlabeled data
• Applicable to reviews without social context

We will define four regularizers based on four hypotheses.
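Read as code, the framework looks roughly like this (a sketch, not the paper's implementation): the squared loss only sees labeled reviews, while the regularizer, plugged in as a function of w, may touch every review.

```python
import numpy as np

def regularized_objective(w, X_labeled, q_labeled, beta, graph_regularizer):
    """Baseline squared loss on labeled reviews plus a graph-based penalty.
    graph_regularizer(w) implements one of the four consistency hypotheses
    and may use both labeled and unlabeled reviews."""
    loss = np.sum((X_labeled @ w - q_labeled) ** 2)
    return loss + beta * graph_regularizer(w)
```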


1. Reviewer Consistency Hypothesis

Quality(r_i) ~ Quality(r_j) for reviews r_i and r_j written by the same reviewer

Reviewers are consistent!


Regularizer for Reviewer Consistency

Reviewer Regularizer = $\sum_{(r_i, r_j) \in A} \left[ \text{Quality}(r_i) - \text{Quality}(r_j) \right]^2$

Sum over all data (train + test), for all pairs of reviews in the Same-Author Graph (A)

Closed-form solution! [Zhou et al. '03] [Zhu et al. '03] [Belkin et al. '06]

$w = (X_L^\top X_L + \beta X^\top L_A X)^{-1} X_L^\top q_L$, where $L_A$ is the graph Laplacian of the Same-Author Graph, $X$ is the review-feature matrix over all reviews, and $X_L$, $q_L$ cover the labeled reviews
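A numpy sketch of that closed form under the stated objective (variable names are mine, not the paper's): the quadratic penalty over same-author pairs equals $(Xw)^\top L_A (Xw)$, so the whole problem remains a linear system.

```python
import numpy as np

def graph_laplacian(W):
    """Unnormalized Laplacian L = D - W of a symmetric adjacency matrix W."""
    return np.diag(W.sum(axis=1)) - W

def fit_reviewer_consistency(X_lab, q_lab, X_all, A, beta):
    """Closed form for the Reviewer Consistency regularizer (a sketch):
    minimizes ||X_lab w - q_lab||^2 + beta * (X_all w)^T L_A (X_all w),
    where A is the same-author adjacency over all (train + test) reviews."""
    L_A = graph_laplacian(A)
    lhs = X_lab.T @ X_lab + beta * X_all.T @ L_A @ X_all
    return np.linalg.solve(lhs, X_lab.T @ q_lab)
```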


2. Trust Consistency Hypothesis

If reviewer A trusts reviewer B: Quality(A) - Quality(B) ≤ 0

"I trust people with quality at least as good as mine!"

Quality(reviewer) is defined as AVG( Quality(reviews written by that reviewer) )


Regularizer for Trust Consistency

Trust Regularizer = $\sum_{(A,B):\, A \text{ trusts } B} \max\left[ 0, \text{Quality}(A) - \text{Quality}(B) \right]^2$

Sum over all data (train + test), for all pairs of reviewers connected in the Trust Graph

No closed-form solution… but still convex → Gradient Descent
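A sketch of how the gradient-descent step could look (the reviewer-by-review matrix `M`, the edge encoding, and the hyper-parameters are my own framing of the slide, not the paper's code): reviewer quality is the average predicted quality of that reviewer's reviews, and only violated trust edges contribute to the gradient.

```python
import numpy as np

def fit_trust_consistency(X_lab, q_lab, X_all, M, trust_edges, beta,
                          lr=1e-4, n_iter=2000):
    """Minimize ||X_lab w - q_lab||^2 + beta * sum max(0, r_a - r_b)^2 over
    trust edges (a trusts b), where r = M @ (X_all @ w) is the vector of
    reviewer qualities and M is a row-normalized reviewer-by-review matrix."""
    w = np.zeros(X_lab.shape[1])
    src = np.array([a for a, b in trust_edges], dtype=int)  # trusting reviewers
    dst = np.array([b for a, b in trust_edges], dtype=int)  # trusted reviewers
    for _ in range(n_iter):
        grad = 2 * X_lab.T @ (X_lab @ w - q_lab)   # gradient of the squared loss
        r = M @ (X_all @ w)                        # current reviewer qualities
        viol = np.maximum(0.0, r[src] - r[dst])    # hinge: only violated edges
        g_r = np.zeros_like(r)                     # gradient w.r.t. reviewer qualities
        np.add.at(g_r, src, 2 * viol)
        np.add.at(g_r, dst, -2 * viol)
        grad += beta * X_all.T @ (M.T @ g_r)       # chain rule back to w
        w -= lr * grad
    return w
```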


3. Co-Citation Consistency Hypothesis

If reviewers A and B are co-cited (trusted by the same person in the Trust Graph): Quality(A) - Quality(B) → 0

"I am consistent with my 'trust standard'!"


Regularizer for Co-citation Consistency

Co-citation Regularizer = $\sum_{(A,B) \in C} \left[ \text{Quality}(A) - \text{Quality}(B) \right]^2$

Sum over all data (train + test), for all pairs of reviewers connected in the Co-citation Graph (C)

Closed-form solution! $w = (X_L^\top X_L + \beta X^\top M^\top L_C M X)^{-1} X_L^\top q_L$, where $M$ is the review-reviewer matrix mapping review qualities to reviewer qualities and $L_C$ is the graph Laplacian of C
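The corresponding numpy sketch (my derivation of the slide's closed form, with illustrative names): the regularizer becomes $(MXw)^\top L_C (MXw)$, so reviewer-level smoothing still yields a linear system. Passing the Link Graph instead of C gives the Link Consistency regularizer of the next slide.

```python
import numpy as np

def fit_reviewer_graph_consistency(X_lab, q_lab, X_all, M, G, beta):
    """Closed form for a reviewer-level graph regularizer (a sketch).
    M: row-normalized reviewer-by-review matrix, so M @ (X_all @ w) gives the
       average predicted quality per reviewer.
    G: symmetric reviewer adjacency (co-citation graph C, or the link graph)."""
    L_G = np.diag(G.sum(axis=1)) - G               # graph Laplacian of G
    P = M @ X_all                                  # reviewer-level feature matrix
    lhs = X_lab.T @ X_lab + beta * P.T @ L_G @ P
    return np.linalg.solve(lhs, X_lab.T @ q_lab)
```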


4. Link Consistency Hypothesis

If reviewers A and B are linked in the Trust Graph (in either direction): Quality(A) - Quality(B) → 0

"I trust people with similar quality as mine!"


Regularizer for Link Consistency

Link Regularizer = $\sum_{(A,B) \in \text{Link Graph}} \left[ \text{Quality}(A) - \text{Quality}(B) \right]^2$

Sum over all data (train + test), for all pairs of reviewers connected in the Link Graph

Closed-form solution! (same form as the Co-citation regularizer, with the Link Graph Laplacian in place of $L_C$)


Roadmap

• Motivation
• Review Quality Prediction Algorithms
• Experimental Evaluation
• Conclusions


Data from Ciao UK

Statistics                      Cellphone   Beauty   Digital Camera
# Reviews                       1943        4849     3697
Reviews/Reviewer ratio          2.21        2.84     1.06
Trust Graph Density             0.0075      0.014    0.0006

Summary                         Cellphone   Beauty   Digital Camera
Social Context                  rich        rich     sparse
Gold-std Quality Distribution   balanced    skewed   balanced


Hypotheses Testing: Reviewer Consistency

Compare |Qg(r_1) - Qg(r_2)| for review pairs from the same reviewer with |Qg(r_1) - Qg(r_3)| for pairs from different reviewers (Qg: gold-standard quality).

[Figure: density of the difference in review quality, for pairs from the same reviewer vs. pairs from different reviewers (Cellphone)]

The Reviewer Consistency Hypothesis is supported by the data


Hypotheses Testing: Social Network-based Consistencies

Compare Qg(B) - Qg(A) for reviewer pairs where: B is not linked to A, B trusts A, B is co-cited with A, B is linked to A.

[Figure: density of the difference in reviewer quality for each type of reviewer pair (Cellphone)]

The Social Network-based Consistencies are supported by the data


Prediction Performance: Exploiting Social Context

[Figure: % of MSE difference relative to the text-only baseline (lower is better) for AddFeatures, Reg:Reviewer, Reg:Trust, Reg:Cocitation, and Reg:Link, at 10%, 25%, 50%, and 100% of the training data (Cellphone)]

AddFeatures is most effective given sufficient training data

With limited training data, Reg methods work best

Reg:Reviewer > Reg:Trust > Reg:Cocitation > Reg:Link


Prediction Performance: Compare Three Categories

[Figure: % of MSE difference relative to the text-only baseline (lower is better) for Reg:Reviewer, Reg:Trust, Reg:Cocitation, and Reg:Link on Cellphone, Beauty, and Digital Camera]

Improvement on Digital Camera is smaller due to sparse social context (Reviews/Reviewer ratio = 1.06)


Parameter Sensitivity

[Figure: Mean Squared Error vs. the regularization parameter β, compared to the Text-only Baseline, on Cellphone and Beauty]

The regularized models are consistently better than the Baseline when the parameter is below 0.1


Conclusions

• Improve Review Quality Prediction using Social Context
• Formalize into a Semi-supervised Graph Regularization framework
  – Utilize both labeled and unlabeled data
  – Applicable to data with no social context
• Promising results on real-world data
  – Especially with limited labels and rich social context


Future Work

• Combine multiple regularizers
• Optimize for nDCG instead of MSE
• Infer the trust network
• Spam detection

Thank you! Questions?
