53
Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan Adar, Christine Alvarado, Eric Horvitz, Rosie Jones, and Michael Potts

Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Embed Size (px)

Citation preview

Page 1: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Finding and Re-Finding Through Personalization

Jaime Teevan

MIT, CSAIL

David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan Adar, Christine Alvarado, Eric Horvitz, Rosie Jones, and Michael Potts

Page 2: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Thesis Overview

• Supporting Finding– How people find– Individual differences affect finding– Personalized finding tool

• Supporting Re-Finding– How people re-find– Finding and re-finding conflict– Personalized finding and re-finding tool

Page 3: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan
Page 4: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan
Page 5: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Old

New

Page 6: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Thesis Overview

• Supporting Finding– How people find– How individuals find– Personalized finding tool

• Supporting Re-Finding– How people re-find– Finding and re-finding conflict– Personalized finding and re-finding tool

Page 7: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Supporting Re-Finding

• How people re-find– People repeat searches– Look for old and new

• Finding and re-finding conflict– Result changes cause problems

• Personalized finding and re-finding tool– Identify what is memorable– Merge in new information

Page 8: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Supporting Re-Finding

• How people find– People repeat searches– Look for old and new

• Finding and re-finding conflict– Result changes cause problems

• Personalized finding and re-finding tool– Identify what is memorable– Merge in new information

Query log analysis

Memorability study

Re:Search Engine

Page 9: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Related Work

• How people re-find– Know a lot of meta-information [Dumais]

– Follow known paths [Capra]

• Changes cause problems re-finding– Dynamic menus [Shneiderman]

– Dynamic search result lists [White]

• Relevance relative to expectation [Joachims]

Page 10: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Query Log Analysis

• Previous log analysis studies– People re-visit Web pages [Greenberg]

– Query logs: Sessions [Jones]

• Yahoo! log analysis– 114 people over the course of a year– 13,060 queries and their clicks

• Can we identify re-finding behavior?

• What happens when results change?

Page 11: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Re-Finding Common

Repeat query

Repeat clickUnique click

40% 86%

33%

87% 38%

26%

of queries of queries

of queriesof queries

of repeat queries

of repeat queries

Page 12: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Change Reduces Re-Finding

• Results change rank

• Change reduces probability of repeat click– No rank change: 88% chance– Rank change: 53% chance

• Why?– Gone?– Not seen?– New results are better?

Page 13: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan
Page 14: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Change Slows Re-Finding

• Look at time to click as proxy for Ease

• Rank change slower repeat click– Compared with initial search to click– No rank change: Re-click is faster– Rank change: Re-click is slower

• Changes interfere with re-finding

?

Page 15: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Old

New

Page 16: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

“Pick a card, any card.”

Page 17: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Case 1 Case 2 Case 3 Case 4 Case 5 Case 6

Page 18: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Your Card is GONE!

Page 19: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

People Forget a Lot

Page 20: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Change Blindness

Page 21: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Change Blindness

Page 22: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Old

New

Page 23: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

We still need magic!

Page 24: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Memorability Study

• Participants issued self-selected query

• After an hour, asked to fill out a survey

• 129 people remembered something

Page 25: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Memorability a Function of Rank

00.10.20.30.40.50.60.70.8

1 2 3 4 5 6 7 8 9 10

Rank - R

P(R

emem

|R,C

)

Clicked - C Not clicked

Page 26: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Remembered Results Ranked High

-2

0

2

4

6

8

10

12

-2 0 2 4 6 8 10 12

Actual Rank

Rem

embe

red

Ran

k

Page 27: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Old

New

Page 28: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

result list 1

result list 2

result list n

Re:Search Engine Architecture

User client

Web browser

MergeIndex of past queries

Result cache

Search engine

User interaction cache

query result list

query 1

query 2

query n

score 1

score 2

score n

result list

Page 29: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Components of Re:Search Engine

• Index of Past Queries

• Result Cache

• User Interaction Cache

• Merge Algorithm

Index of past queries

queryquery 1

query 2

query n

score 1

score 2

score n

result list 1

result list 2

result list n

Result cache

query 1

query 2

query n

User interaction cache

result list 1

result list 2

result list n

Merge result list

result list

Page 30: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Index of Past Queries

• Studied how queries differ– Log analysis– Survey of how people remember queries

• Unimportant: case, stop words, word order

• Likelihood of re-finding deceases with time

• Get the user to tell us if they are re-finding– Encourage recognition, not recall

Index of past queries

queryquery 1

query 2

query n

score 1

score 2

score n

Page 31: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Merge Algorithm

• Benefit of New Information score– How likely new result is to be useful…– …In a particular rank

• Memorability score– How likely old result is to be remembered…– …In a particular rank

• Chose list maximizes memorability and benefit of new information

result list 1

result list 2

result list n

Merge result list

result list

Page 32: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Benefit of New Information

• Ideal: Use search engine score

• Approximation: Use rank

• Results that are ranked higher are more likely to be seen– Greatest benefit given to highly ranked results

being ranked highly

Page 33: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Memorability Score

• How memorable is a result?

• How likely is it to be remembered at a particular rank?

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

1 2 3 4 5 6 7 8 9 10

-2

0

2

4

6

8

10

12

-2 0 2 4 6 8 10 12

Page 34: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Choose Best Possible List

• Consider every combination

• Include at least three old and three new

• Min-cost network flow problem

…10

7

7

10

m2

m1

m10

b10

b2

b1

st

Old

New

Slots

Page 35: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Old

New

Page 36: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Evaluation

• Does merged list look unchanged?– List recognition study

• Does merging make re-finding easier?– List interaction study

• Is search experience improved overall?– Longitudinal study

Page 37: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

List Interaction Study

• 42 participants

• Two sessions a day apart – 12 tasks each session

• Tasks based on queries• Queries selected based on log analysis

– Session 1– Session 2

• Re-finding• New-finding

(“stomach flu”)

(“Symptoms of stomach flu?”)

(“Symptoms of stomach flu?”)(“What to expect at the ER?”)

Page 38: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

List Interaction Study

Page 39: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

New 1

New 2New 3New 4

New 5New 6

Old 5New 1Old 1Old 7New 2New 3New 4Old 4New 5New 6

Old

New

Experimental Conditions

• Six re-finding tasks– Original result list– Dumb merging– Intelligent merging

• Six new-finding tasks– New result list– Dumb merging– Intelligent merging

Page 40: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Old

New

Experimental Conditions

• Six re-finding tasks– Original result list– Dumb merging– Intelligent merging

• Six new-finding tasks– New result list– Dumb merging– Intelligent merging

Old 1Old 2Old 4New 1New 2New 3New 4New 5New 6Old 10

Old 1Old 2Old 4

Old 10

Page 41: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Measures

• Performance– Correct– Time

• Subjective– Task difficulty– Result quality

Page 42: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Experimental Conditions

• Six re-finding tasks– Original result list– Dumb merging– Intelligent merging

• Six new-finding tasks– New result list– Dumb merging– Intelligent merging

Faster, fewer clicks, more correct answers, and easier!

Similar to Session 1

Page 43: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Results: Re-Finding

Performance Original Dumb Intelligent

% correct 96%

Time (seconds)

99% 88%

38.7 45.670.9

Page 44: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Results: Re-Finding

Subjective Original Dumb Intelligent

% correct 99% 88% 96%

Time (seconds) 38.7 70.9 45.6

Task difficulty 1.57

Result quality 3.61 3.42 3.70

1.531.79

Page 45: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Results: Re-Finding

Original Dumb Intelligent

% correct 99% 88% 96%

Time (seconds) 38.7 70.9 45.6

Task difficulty 1.57 1.79 1.53

Result quality 3.61 3.42 3.70

List same?

• Intelligent merging better than Dumb

• Almost as good as the Original list

Similarity

60% 76%76%

Page 46: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Results: New-Finding

Performance New Dumb Intelligent

% correct 73% 74% 84%

Time (seconds) 139.3 120.5153.8

Page 47: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Results: New-Finding

Subjective New Dumb Intelligent

% correct 73% 74% 84%

Time (seconds) 139.3 153.8 120.5

Task difficulty 2.51 2.72 2.61

Result quality 3.193.38 2.94

Page 48: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Results: New-Finding

New Dumb Intelligent

% correct 73% 74% 84%

Time (seconds) 139.3 153.8 120.5

Task difficulty 2.51 2.72 2.61

Result quality 3.38 2.94 3.19

List same?

• Knowledge re-use can help

• No difference between New and Intelligent

Similarity

38% 50% 61%

Page 49: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Results: Summary

• Re-finding– Intelligent merging better than Dumb– Almost as good as the Original list

• New-finding– Knowledge re-use can help– No difference between New and Intelligent

• Intelligent merging best of both worlds

Page 50: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Conclusion

• How people re-find– People repeat searches– Look for old and new

• Finding and re-finding conflict– Result changes cause problems

• Personalized finding and re-finding tool– Identify what is memorable– Merge in new information

Page 51: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Future Work

• Improve and generalize model– More sophisticated measures of memorability– Other types of lists (inboxes, directory listings)

• Effectively use model– Highlight change as well as hide it

• Present change at the right time– This talk’s focus: what and how– What about when to display new information?

Page 52: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

Thesis Overview

• Supporting Finding– How people find– How individuals find– Personalized finding tool

• Supporting Re-Finding– How people re-find– Finding and re-finding conflict– Personalized finding and re-finding tool

Page 53: Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan

David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan Adar, Christine Alvarado, Eric Horvitz, Rosie Jones, and Michael Potts

Thank You!

Jaime Teevan

[email protected]