Upload
others
View
3
Download
0
Embed Size (px)
Citation preview
Nicola Ferro, and Gianmaria SilvelloUniversity of Padua, Italy
Marco Angelini, Giuseppe Santucci “La Sapienza” University of Rome, Italy
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012
Visual Interactive Failure Analysis: Supporting Users in
Information Retrieval Evaluation
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Outline
Motivations
Models for Interaction
The Prototype and Application Examples
The Domino Effect
Final Remarks and On-Going Works
2
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Motivation
3
Reduce the time needed to understand and analyze the behavior of an IR system providing a visual interactive
tool for evaluation
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Motivation
3
Failure Analysis
or “Understand What is Wrong”.
Is it better to re-rank or to re-query?
Reduce the time needed to understand and analyze the behavior of an IR system providing a visual interactive
tool for evaluation
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Motivation
3
Failure Analysis
or “Understand What is Wrong”.
Is it better to re-rank or to re-query?
Reduce the time needed to understand and analyze the behavior of an IR system providing a visual interactive
tool for evaluation
What-if Analysis
“Try to anticipate the effect of a modification of the system”.
?
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
The Underlying Idea
4
Visual Analytics is not only a mean to improve the presentation of results, but also a mean allowing the users to
analyze and interact with data
IR Evaluation Visual Analytics+
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Models for Interaction
5
Rank Gain/Loss Model
Clustering via Learning to Rank
What-If Analysis Model
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide 6
Models for Interaction
Rank Gain/Loss Model
DCG is good if we want to compare performances between different systems but what if we want to look at
misplaced documents?
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide 6
Models for Interaction
Rank Gain/Loss Model
DCG is good if we want to compare performances between different systems but what if we want to look at
misplaced documents?
Relative Position = RPos let us determine how much a document is misplaced with respect to its ideal rank.
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide 6
Models for Interaction
Rank Gain/Loss Model Rank Gain/Loss Model: R_Pos
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide 6
Models for Interaction
Rank Gain/Loss Model
By means of R_Pos we can say if a document is ranked too soon or too late. But, can we quantify “too soon”
and “too late”?
How much these misplaced documents impact on the dcg?
too soon too late
Rank Gain/Loss Model: R_Pos
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Models for Interaction
7
Rank Gain/Loss Model : Delta Gain
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Models for Interaction
7
Rank Gain/Loss Model : Delta Gain
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Models for Interaction
8
Failure
Analysis
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Models for Interaction
9
Clustering via Learning to Rank
Learn the ranking model of the IR system under investigation in order to simulate the way in which it ranks the documents
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Models for Interaction
9
Clustering via Learning to Rankq1 q2 qn
x
(1)1
x
(1)2
x
(1)m1 x
(2)m(2)
x
(2)1
x
(2)2
x
(n)1
x
(n)2
x
(n)m(n)
......
...
. . .
Training Data
y(1) y(2) y(n)
q
x1
xm
h(x)
?
x2
...qx1
xm
x2
...Test Data
Predicton
Learning System
Ranking System
Modelh
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Models for Interaction
9
Clustering via Learning to Rankq1 q2 qn
x
(1)1
x
(1)2
x
(1)m1 x
(2)m(2)
x
(2)1
x
(2)2
x
(n)1
x
(n)2
x
(n)m(n)
......
...
. . .
Training Data
y(1) y(2) y(n)
q
x1
xm
h(x)
?
x2
...qx1
xm
x2
...Test Data
Predicton
Learning System
Ranking System
Modelh
• Lever on the clustering hypothesis
• Group together the documents which are similar from the considered ranking model point-of-view
Generating the clusters
(1) Submit each doc in Dj as a query and retrieve a set of docs Di;
(2) determine Ci = Dj \ Di;
(3) ranking the documents in Ci by employing the learned ranking model
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Models for Interaction
10
What-if Analysis
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Models for Interaction
10
What-if Analysis
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
To Summarize: Data Pipeline
11
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Application Example: Free Movement
12
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Application Example: Free Movement
12
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide 13
Application Example: Capped Movement
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide 13
Application Example: Capped Movement
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Application Example: New Entry Evolution
14
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Application Example: New Entry Evolution
14
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
The Domino Effect
15
How does the change for topic 351 affect the other topics?
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
The Domino Effect
15
The change for 351 worsens the DCG curve of topic 355
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
The Domino Effect
15
The change for 351 improves the DCG curve of topic 400
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
The Domino Effect
16
Aggregate view for the whole set of documents before and after the movement
lunedì 20 agosto 12
Information Interaction in Context (IIiX 2012)Nijmegen, the Netherlands, August 21-24, 2012 Gianmaria Silvello slide
Final Remarks and On-Going Work
We presented the visual interactive tool allowing analysts to perform failure and what-if analyses
We described the prototype implementing an actual fusion between IR evaluation and visual analytics tested on the TREC7 collection
We are performing additional tests employing different learning to rank algorithms to construct the clusters
We are investigating whether (and how) custom features extraction and selection may allow us to understand on which component of the IR system a change of ranking has an impact on
17
lunedì 20 agosto 12