16
Content Based Recommendations Enhanced with Collaborative Information POLITECNICO DI MILANO Scuola di Ingegneria Industriale e dell'Informazione Corso di Laurea Magistrale in Ingegneria Informatica Anno Accademico 2014 – 2015 Candidato: Alessandro Liparoti (819828) Relatore: Prof. Paolo Cremonesi

Content - Based Recommendations Enhanced with Collaborative Information

Embed Size (px)

Citation preview

Page 1: Content - Based Recommendations Enhanced with Collaborative Information

Content Based Recommendations Enhanced with Collaborative Information

POLITECNICO DI MILANO

Scuola di Ingegneria Industriale e dell'Informazione

Corso di Laurea Magistrale in Ingegneria Informatica

Anno Accademico 2014 – 2015

Candidato: Alessandro Liparoti (819828)

Relatore: Prof. Paolo Cremonesi

Page 2: Content - Based Recommendations Enhanced with Collaborative Information

RECOMMENDER SYSTEMS

software tools which analyze different source of data in order to predict the rating that a user would give to an item

Main Families:

• Collaborative Filtering• Content-based Filtering• Hybrid algorithms

POLITECNICO DI MILANO

Page 3: Content - Based Recommendations Enhanced with Collaborative Information

COLLABORATIVE FILTERING

Collaborative Filteringassumption: users who agreed in the past will also agree in the

future

analyze past users’ ratings to compute predictions

User-Rating-Matrix (URM) is the rating given by user u to item i

POLITECNICO DI MILANO

✔ good performances

✘ not applicable if no enough ratings

for both users and items (cold-start problems)

Page 4: Content - Based Recommendations Enhanced with Collaborative Information

CONTENT-BASED FILTERING

Content-Based Filteringassumption: users will like items similar to those they liked in the

past

compute items similarities’ scores considering item features

Item-Content-Matrix (ICM) if item i hasthe feature k

POLITECNICO DI MILANO

✔ no need of items’ ratings (i.e. works in a new-item scenario)

✘ ignoring users’relations leads to worse performances

Page 5: Content - Based Recommendations Enhanced with Collaborative Information

HYBRID ALGORITHMS• new-item recommendations

- Factorization Machines (FM)- generic factorization model- can represent different types of models

- UFSM- computes item similarities as a CB approach- uses collaborative data to personalize them for each user

• no new-item recommendations - SSLIM

- learns a matrix of item-item coefficients- improves SLIM adding side information

POLITECNICO DI MILANO

Page 6: Content - Based Recommendations Enhanced with Collaborative Information

GOAL OF THE THESIS

usual hybridization use item content data to improve collaborative models

our hybridization build a content-based model enhanced with collaborative data

CONTENT-BASED COLLABORATIVE

POLITECNICO DI MILANO

✔exploits collaborative data (also in a new-item scenario)

✔uses weigths for features and user-feature relations

Page 7: Content - Based Recommendations Enhanced with Collaborative Information

CONTENT BASED COLLABORATIVE

content-based similarity function

CBC similarity function

• bk control the importance of feature k(e.g. usually genre > year of production for movies)

• cu,k control the importance of the relation between user u and feature k

(e.g. a user likes a particular actor)POLITECNICO DI MILANO

Page 8: Content - Based Recommendations Enhanced with Collaborative Information

CBC VERSIONS

rating prediction

CBCrmse

partial effects:analytical sequential approach

item recommendation

CBCbpr

stochastic gradient descent approach

POLITECNICO DI MILANO

CBC parameters θ are learned minimizing an error function

Page 9: Content - Based Recommendations Enhanced with Collaborative Information

DATASET

POLITECNICO DI MILANO

HetRec2011-Movielens

RecSys-Polimi IMDB

Page 10: Content - Based Recommendations Enhanced with Collaborative Information

SIMULATIONS

POLITECNICO DI MILANO

URM was split in three parts

two types of simulation

collaborative train on A+B test on C

new-item train on A test on C

Page 11: Content - Based Recommendations Enhanced with Collaborative Information

TESTING

POLITECNICO DI MILANO

- rating prediction metricsRMSE

RMSEp (only on positive ratings)

- item recommendation metricsprecision

recall

mean average precision (MAP)

mean reciprocal rank (MRR)

normalized discounted cumulative gain (NDCG)

Page 12: Content - Based Recommendations Enhanced with Collaborative Information

ALGORITHMS OF COMPARISON

POLITECNICO DI MILANO

• Collaborative - Matrix Factorization (MF)

Asymmetric-SVD

Bayesian Personalized Ranking MF

• Hybrid

Factorization Machines

• Content-Based

item k-nearest-neighbor

Page 13: Content - Based Recommendations Enhanced with Collaborative Information

RESULTS

POLITECNICO DI MILANO

collaborative simulations

Page 14: Content - Based Recommendations Enhanced with Collaborative Information

RESULTS – ITEM RECOMMENDATION

POLITECNICO DI MILANO

new-item simulations

Page 15: Content - Based Recommendations Enhanced with Collaborative Information

THINGS TO DO

POLITECNICO DI MILANO

use weights to control the importance of relations among items’ features

(e.g. two actors appearing together in many movies)

use a CBC-like similarity function to compare users instead of items

(e.g. gender, age, demographic information,…)

collaborative method (BPR-MF) not applicable

CBC is the best algorithm

Page 16: Content - Based Recommendations Enhanced with Collaborative Information

THE END

POLITECNICO DI MILANO

THANK YOU