30
A formal model to the routing questions problem in the context of twitter Cleyton Caetano de Souza

A formal model to the routing questions problem

Embed Size (px)

DESCRIPTION

Apresentação no ICWI 2011

Citation preview

Page 1: A formal model to the routing questions problem

A formal model to the routing questions problem in the context of

twitter

Cleyton Caetano de Souza

Page 2: A formal model to the routing questions problem

Schedule

1. Introduction

1. Problem

2. Related Works

3. The model

1. The problem

2. Details

4. A solution to the model

5. Conclusion

6. Future Works Cleyton-UFCG 2

Page 3: A formal model to the routing questions problem

Introduction

• Web has became essential

– Web, a repository of information

• Search Engines

– Looking answers

• Social Networks

– Waiting answers

Cleyton-UFCG 3

Page 4: A formal model to the routing questions problem

Problem

• Could occurs problems when you publish your question

– None answer

– None see

– Many answers

• Direct the answer to someone

– You ensure a answer, but will be a good one?

Cleyton-UFCG 4

Page 5: A formal model to the routing questions problem

Problem

• Informally, the problem that we proposes to solve is given a question posted by a user (asker) in Twitter, find among his followers that user with the characteristics:

– (1) knows the answer

– (2) has the trust of the questioner

– (3) provide the answer quickly

Cleyton-UFCG 5

Page 6: A formal model to the routing questions problem

Related Works

• (Morris, Teevan e Panovich 2010a)

– 93.5% of users received answers to their question after post them and these responses

– in 90.1% of cases, were provided within one day

• Applications

– Aardvark (Horowitz and Kamvar 2010)

– Q-Sabe (Andrade et al 2003)

• The differential of our research

Cleyton-UFCG 6

Page 7: A formal model to the routing questions problem

The Model

• The twitter is defined by the tuple

𝑇 = {𝑈, 𝑅}

• Where 𝑈 = {𝑢1, … , 𝑢 𝑈 } is a set of users

• And 𝑅 is the set of all relationships 𝑟𝑖,𝑗 between two users 𝑖 and 𝑗.

– The existence of 𝑟𝑖,𝑗 means that i follows j, this

way 𝑟𝑖,𝑗 ≠ 𝑟𝑗,𝑖

Cleyton-UFCG 7

Page 8: A formal model to the routing questions problem

The Model

• Each useru has the attributes

– 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑢 that contains all users which follows 𝑢

– 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔𝑢 that contains all users which are followed by 𝑢

– 𝑀𝑢 = 𝑚1, … ,𝑚 𝑀 a ordered list that contains all

messages posted for 𝑢

• Each message 𝑚 has the attributes

– 𝑑𝑚- the post date

– 𝑠𝑚- the string posted

Cleyton-UFCG 8

Page 9: A formal model to the routing questions problem

The Problem

Given a query 𝑞 posted by 𝑢,

𝑓 ∈ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑢 and 𝑝𝑓,𝑞 a function

that tell us the chances of

𝑓 provides a good answer

– Find: 𝑓

– To: 𝑀𝑎𝑥 𝑝𝑓,𝑞

– Over: 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑢

Cleyton-UFCG 9

Page 10: A formal model to the routing questions problem

The problem

• We believe that 𝑝𝑓,𝑞 has a correlation with

three things

– 𝑘𝑓,𝑞 – the knowledge that 𝑓 in relation with 𝑞

– 𝑡𝑢,𝑓 – the trust of 𝑢 has in 𝑓

– 𝑎𝑓 – the level of activity of 𝑓

• That way will actually want to find the best combination of: 𝑘𝑓,𝑞, 𝑡𝑢,𝑓 and 𝑎𝑓

Cleyton-UFCG 10

Page 11: A formal model to the routing questions problem

Knowledge

• Each message 𝑚𝑢 corresponds a fraction of the total expertise of 𝑢

𝑘𝑢 = 𝑘𝑚𝑢𝑚𝑢∈𝑀𝑢

• In IR we represent this fraction as a vector of the words/token contained in 𝑚𝑢

• So the 𝑘𝑢 is a vector where each coordinate represents a token and its value is the frequency of this token in all messages 𝑚𝑢

Cleyton-UFCG 11

Page 12: A formal model to the routing questions problem

Knowledge

• If 𝑡𝑞 is the frequency of the token 𝑡 in 𝑞, the

knowledge needed to answer satisfactorily the question is calculated as a inner product between the vector that represent the follower and the vector that represent the question

𝑘𝑓,𝑞 = 𝑡𝑞 ∗ 𝑡𝑘𝑢𝑡∈𝑞

Cleyton-UFCG 12

Page 13: A formal model to the routing questions problem

Trust

• Trust is related to

– Friendship [Schenkel et al 2008]

– Similarity [Kuter and Golbeck 2010]

• So we believe (and simplify) 𝑡𝑢,𝑣 = 𝑓𝑢,𝑣 ∗ 𝑠𝑖𝑚 𝑢, 𝑣

Cleyton-UFCG 13

Page 14: A formal model to the routing questions problem

Friendship

• Friendship measures the importance of a user to another

• In Twitter a good estimative of friendship should consider the mentions (connections) between 𝑢 and 𝑣, so

𝑓𝑢,𝑣 =|𝑚𝑒𝑛𝑡𝑖𝑜𝑛𝑠𝑢 𝑣 |

𝑚𝑒𝑛𝑡𝑖𝑜𝑛𝑠𝑢

Cleyton-UFCG 14

Page 15: A formal model to the routing questions problem

Similarity

• The similarity measures how to users are equal under some criterion

• Appears intuitively that the similarity is related to equality among the attributes

𝑠𝑖𝑚1 𝑢, 𝑣 ∝𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑢 ∩ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑣𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑢 ∪ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑣

𝑠𝑖𝑚2 𝑢, 𝑣 ∝𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔𝑢 ∩ 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔𝑣𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔𝑢 ∪ 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔𝑣

𝑠𝑖𝑚3 𝑢, 𝑣 ∝ 𝑠𝑖𝑚(𝑘𝑢, 𝑘𝑣)

Cleyton-UFCG 15

Page 16: A formal model to the routing questions problem

Similarity

• Any combination of this equations could be used

• We choose use

𝑠𝑖𝑚 𝑢, 𝑣 =𝑠𝑖𝑚1 𝑢, 𝑣

1 − 𝑠𝑖𝑚1 𝑢, 𝑣∗𝑠𝑖𝑚2 𝑢, 𝑣

1 − 𝑠𝑖𝑚2 𝑢, 𝑣∗𝑠𝑖𝑚3 𝑢, 𝑣

1 − 𝑠𝑖𝑚3 𝑢, 𝑣

Cleyton-UFCG 16

Page 17: A formal model to the routing questions problem

Activity

• Users not interact with the same intensity

• It seems intuitive that the activity level of a user depends on the frequency with he/she post new tweets

Cleyton-UFCG 17

Page 18: A formal model to the routing questions problem

Activity

• Activity means the mean time between the messages posted by 𝑢

𝑎𝑢 =𝑡𝑜𝑑𝑎𝑦 − 𝑑𝑚, 𝑀𝑢 + 𝑑𝑚,𝑖+1 − 𝑑𝑚,𝑖

|𝑀|𝑖=1

𝑀𝑢 + 1

• As lower this value, most active is the user and bigger the chances of him give a answer quickly

Cleyton-UFCG 18

Page 19: A formal model to the routing questions problem

Solving the Model

• Calculate the tuples (𝑘𝑓,𝑞 , 𝑡𝑢,𝑓, 𝑎𝑓) to each

user is a simple task

• But, how decides who is the best?

Cleyton-UFCG 19

Page 20: A formal model to the routing questions problem

Solving the Model

• We consider this is a problem of decision making with multiple criteria

• We decide to use the Weight Product Model to solve based on [Triantaphyllou and Mann 1989]

Cleyton-UFCG 20

Page 21: A formal model to the routing questions problem

Solving the Model-Step 1

• The resolution of the model starts calculating the tuple (𝑘𝑓,𝑞 , 𝑡𝑢,𝑓, 𝑎𝑓) to each user

𝑓𝑢 ∈ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑢

Cleyton-UFCG 21

Page 22: A formal model to the routing questions problem

Solving the Model-Step 2

• The we display this users in a matrix 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑢 𝑥|𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠𝑢|

Cleyton-UFCG 22

Page 23: A formal model to the routing questions problem

Solving the Model-Step 3

• We create a function 𝑚𝑎𝑝 𝑥 which will map the values of (𝑘𝑓,𝑞 , 𝑡𝑢,𝑓, 𝑎𝑓) in a same scale

Cleyton-UFCG 23

Page 24: A formal model to the routing questions problem

Solving the Model-Step 4

• For each pair 𝑓1, 𝑓2 |𝑓1 ≠ 𝑓2we calculate

𝑝𝑓1,𝑓2 =𝑘𝑓1,𝑞

𝑘𝑓2,𝑞

𝑥

∗𝑡𝑢,𝑓1𝑡𝑢,𝑓2

𝑦

*𝑎𝑓1𝑎𝑓2

𝑧

• The values 𝑥,𝑦 and 𝑧 are factors of importance and must be between 0 and 1, besides that 𝑥 + 𝑦 + 𝑧 = 1

Cleyton-UFCG 24

Page 25: A formal model to the routing questions problem

Solving the Model-Step 5

• If 𝑝𝑓1,𝑓2 > 0 we put 1 in position (𝑓1, 𝑓2) and 0

in position (𝑓2, 𝑓1)

• If 𝑝𝑓1,𝑓2 < 0 we put 0 in position (𝑓1, 𝑓2) and 1

in position (𝑓2, 𝑓1)

• If 𝑝𝑓1,𝑓2 = 0 we put 1 in position (𝑓1, 𝑓2) and 1

in position (𝑓2, 𝑓1)

Cleyton-UFCG 25

Page 26: A formal model to the routing questions problem

Solving the Model-Step 5

Cleyton-UFCG 26

Page 27: A formal model to the routing questions problem

Solving the Model-Step 6 (End)

• We calculate the sum of each line of the matrix, this number represents the number of victories of each user

• In the end we have

• The question will be

routed to the user

with more victories

Cleyton-UFCG 27

Page 28: A formal model to the routing questions problem

Conclusion

• The differential of our research

– We focus in a successful network

– We treat the problem over a new perspective

– We lead with a recent and interesting problem

Cleyton-UFCG 28

Page 29: A formal model to the routing questions problem

Future Works

• The model was already implemented

• We are investigating if our heuristics are coherent

• We will investigating

– If the indications of the model are accurate

– If direct questions is more effective

– What factor of importance is most important

Cleyton-UFCG 29

Page 30: A formal model to the routing questions problem

Thank You

• Any Question?

Cleyton-UFCG 30