28
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Embed Size (px)

Citation preview

Page 1: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu

Harbin Institute of TechnologySchool of Computer Science and

Technology

1

19/11/2013 12:09 PM

Page 2: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

OutlinesBackground and challengesRelated worksMethodology descriptionExperiments and analysesConclusion and further works

2

Page 3: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

BackgroundWith the rapid development of the Internet,

social network services such as Twitter, Facebook, and Tencent Weibo become more and more popular.

Though social network benefits us a lot, it also wastes us too much time to search useful information from massive information.

To deal with the problem of choosing information sources efficiently, it is important to develop effective social recommender system to help users easily find the potential items they want to follow.

3

Page 4: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

ChallengesSocial recommender system has much more

social information to use than traditional recommender system.

Social information includes social relationships, user’s profile, user’s social action and so on.

How to take advantage of these information to improve the performance of social recommender system is both a good way and a big challenge.

4

Page 5: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

SolutionsTo this end, we concentrate on combining

social relationships into recommendation algorithm to improve the performance of social recommender system.

5

Page 6: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

OutlinesBackground and challengesRelated worksMethodology descriptionExperiments and analysesConclusion and further works

6

Page 7: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Related worksThe related works are told from two aspects:

The popular methods of recommender system The research of social recommendation

7

Page 8: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Methods for recommender system The goal of recommender system is to suggest users

with items which may fit the users’ tastes. Recommendation methods can be classified into the

three categories: content-based, collaborative filtering (CF), and hybrid recommendation approaches.

As one of the most successful methods, CF has been widely used in business and deeply studied in academia.

To deal with the data sparsity challenge, many algorithms derived from CF have been proposed, such as Singular Value Decomposition (SVD) and SVD++.

8

Page 9: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Researches for social recommendation Social recommendation aims at forming a

recommender system by using recommendation methods with information from social networks such as user profile, user action, and social relationship.

9

Page 10: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Two recommendation tasksOne is KDD Cup 2012 Track1, which is to

develop such a system aiming at recommending items (celebrities) which could be persons, organizations, or groups in Tencent Weibo.

The other is Artist recommendation in Last.fm online music system, which proposes a novel hierarchical Bayesian model incorporating topic modeling and probabilistic matrix factorization.

10

Page 11: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

OutlinesBackground and challengesRelated worksMethodology descriptionExperiments and analysesConclusion and further works

11

Page 12: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Latent Factor ModelWe use Latent factor model as our basic

method.Latent factor models have been widely used

in rating prediction problem. The main idea of latent factor model is to predict a rating by the dot product of a user latent factor and an item latent factor.

Singular Value Decomposition (SVD) is one of the most popular latent factor models.

12

Page 13: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

SVDThe basic SVD model with user bias, item

bias and global bias can be described in the following equation:

Here is the global bias, is the user bias, is the item bias, is the user latent factor and is the item latent factor.

13

,T

u i u i u i gr p q b b b

gb u

bib

iq

up

Page 14: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

SGDStochastic Gradient Descent algorithm (SGD)

is an efficient and popular strategy to solve the above optimization problem.

The main idea of SGD is to minimize the error by taking a small step on parameters along the direction of gradient descent.

14

Page 15: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Incorporate Implicit FeedbackThe datasets are usually much more

complicated for building recommender system in the real world.

For example, in a movie recommender system, there are many other kinds of information which may include movie tags, movie category, user profile, user social network and so on.

How to utilize these diverse implicit feedback information is beyond the ability of SVD, while SVD++ can do it well by modeling them as latent features.

15

Page 16: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

SVD++ and implicit informationThe only thing has to be done is extending

the SVD++ to the following form:

The adding sum term represents the perspective of implicit feedback. can be user’s neighbors, tags or other implicit feedbacks. is the latent factor and is its weight.

16

,( )

( )Tu i u j j i u i g

j N u

r p y q b b b

( )N u

j

Page 17: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

BenefitUser information such as user’s tags,

keywords and history following records are very sparse which will lead to the features being ineffective.

Using social relationships can deal with the feature sparsity problem to some extent. We can use friends’ tags, keywords and history following records as a kind of expanding features.

17

Page 18: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

OutlinesBackground and challengesRelated worksMethodology descriptionExperiments and analysesConclusion and further works

18

Page 19: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

DatasetsWe conducted our experiments on two real

world datasets. The first one is a subset of dataset of Track 1

in KDD Cup 2012 provided by Tecent Weibo. The dataset is a snapshot of Tencent Weibo of five days.

Another dataset is hetrec2011-lastfm-2k obtained from Last.fm online music system. There are listening histories, tagging histories and friend relationships in the dataset.

19

Page 20: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Statistics on two datasets

20

Page 21: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Metrics(Mean Average Precision of Top N) Recall@nThe metric are defined as:

21

1

@ ( ) /n

k

ap n P k m

1

@ @ /N

ii

MAP n ap n N

@number of items the user likes in Top n recommendations

recall ntotal number of items the user likes

Page 22: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Experiment descriptionFor Tencent Weibo dataset, we conducted

our experiments on four user features: follow histories, tags, keywords and actions.

For Last.fm dataset, we conducted our experiments on the feature of tags. Tag features in Last.fm dataset are tags a user has made for artists.

22

Page 23: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Results for Tencent Weibo

23

Page 24: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Results for Last.fm dataset

24

Page 25: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

Analyses of resultsPrevious two tables showed the results of

our experiments on Tencent Weibo dataset and Last.fm dataset.

We can see that all of the expanding features can achieve improvement when used independently or by combination.

Except follow history in Tencent Weibo dataset, other expanding features in both datasets were even more useful than the original features.

25

Page 26: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

OutlinesBackground and challengesRelated worksMethodology descriptionExperiments and analysesConclusion

26

Page 27: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

ConclusionWe explored a new way of using

social relationships by expanding user features in social recommender systems.

Experimental results showed that expanding features by social relationships can improve the performance of recommender system efficiently.

27

Page 28: Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/2013 12:09 PM

End

Thank you!

28