Visual Interactive Failure Analysis: Supporting Users in Information Retrieval Evaluation

Nicola Ferro and Gianmaria Silvello, University of Padua, Italy

Marco Angelini and Giuseppe Santucci, “La Sapienza” University of Rome, Italy

Information Interaction in Context (IIiX 2012), Nijmegen, the Netherlands, August 21-24, 2012

Monday, 20 August 2012

Outline

Motivations

Models for Interaction

The Prototype and Application Examples

The Domino Effect

Final Remarks and On-Going Work

Motivation

Reduce the time needed to understand and analyze the behavior of an IR system by providing a visual interactive tool for evaluation.

  • Failure Analysis, or “Understand What Is Wrong”: is it better to re-rank or to re-query?

  • What-if Analysis: “Try to anticipate the effect of a modification of the system”.

The Underlying Idea

Visual Analytics is not only a means to improve the presentation of results, but also a means that allows users to analyze and interact with data.

IR Evaluation + Visual Analytics

Models for Interaction

Rank Gain/Loss Model

Clustering via Learning to Rank

What-If Analysis Model

Rank Gain/Loss Model

DCG is good if we want to compare performance across different systems, but what if we want to look at misplaced documents?

Relative Position (R_Pos) lets us determine how much a document is misplaced with respect to its ideal rank.
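
The R_Pos formula is not spelled out in the text here, so the following is only a sketch under a stated assumption: R_Pos is taken to be the signed distance between a document's actual rank and the interval of ranks that its relevance grade occupies in the ideal (grade-sorted) ranking. The function name `r_pos` and the sign convention (positive for "too soon", negative for "too late") are illustrative choices, not taken from the slides.

```python
def r_pos(gains):
    """Signed misplacement of each ranked document (assumed definition).

    gains[i] is the relevance grade of the document at rank i + 1.
    """
    ideal = sorted(gains, reverse=True)
    # Rank interval (1-based, inclusive) that each grade occupies in the ideal ranking.
    first, last = {}, {}
    for rank, grade in enumerate(ideal, start=1):
        first.setdefault(grade, rank)
        last[grade] = rank

    result = []
    for rank, grade in enumerate(gains, start=1):
        if rank < first[grade]:
            result.append(first[grade] - rank)  # positive: ranked too soon
        elif rank > last[grade]:
            result.append(last[grade] - rank)   # negative: ranked too late
        else:
            result.append(0)                    # inside its ideal interval
    return result


# Toy relevance grades, invented for illustration only.
print(r_pos([0, 2, 1, 2, 0, 1]))  # [4, 0, 0, -2, 0, -2]
```

In the toy list, the non-relevant document at rank 1 sits four positions above the first rank where a grade-0 document belongs, while the grade-2 document at rank 4 sits two positions below the last rank of its ideal interval.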

By means of R_Pos we can tell whether a document is ranked too soon or too late. But can we quantify “too soon” and “too late”?

How much do these misplaced documents impact the DCG?

[Figure: Rank Gain/Loss Model, R_Pos, marking documents ranked too soon and too late]

Rank Gain/Loss Model: Delta Gain
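
One way to put a number on "too soon" and "too late" is to compare, rank by rank, the discounted gain of the actual ranking with that of the ideal ranking. The sketch below assumes exactly that reading of Delta Gain, together with a log2 discount that starts at rank 2; both are assumptions made for illustration, and the helper names `discounted` and `delta_gain` are hypothetical.

```python
import math


def discounted(gains):
    """Discounted gain at each rank; ranks 1 and 2 are left undiscounted (assumption)."""
    return [g / max(1.0, math.log2(rank)) for rank, g in enumerate(gains, start=1)]


def delta_gain(gains):
    """Per-rank difference between actual and ideal discounted gain (assumed definition)."""
    ideal = sorted(gains, reverse=True)
    return [a - b for a, b in zip(discounted(gains), discounted(ideal))]


# Toy relevance grades, invented for illustration only.
gains = [0, 2, 1, 2, 0, 1]
deltas = delta_gain(gains)
print([round(d, 3) for d in deltas])
# The per-rank deltas add up to DCG(actual) - DCG(ideal), which is never positive.
print(round(sum(deltas), 3))
```

A negative value at a rank means the actual ranking collects less discounted gain there than the ideal ranking does; a positive value means it collects more, typically because a relevant document arrived later than it should have.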

[Figure: Failure Analysis]

Clustering via Learning to Rank

Learn the ranking model of the IR system under investigation in order to simulate the way in which it ranks the documents
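
As a rough illustration of what "learning the ranking model" of an observed system can mean, the sketch below fits a linear scorer h(x) = w · x with a pairwise perceptron so that it reproduces the order in which the system ranked some documents. The pairwise perceptron is a stand-in chosen for brevity, not the learning to rank algorithm used in the work, and the two-dimensional feature vectors are invented toy data.

```python
from itertools import combinations


def h(w, x):
    """Linear scoring function h(x) = w . x."""
    return sum(wi * xi for wi, xi in zip(w, x))


def learn_ranker(ranked_lists, epochs=50):
    """Fit w so that h imitates the observed rankings.

    ranked_lists: one list per query; each list holds feature vectors in the
    order the observed system ranked them (best first).
    """
    w = [0.0] * len(ranked_lists[0][0])
    for _ in range(epochs):
        mistakes = 0
        for ranking in ranked_lists:
            # combinations() keeps list order, so `hi` was ranked above `lo`.
            for hi, lo in combinations(ranking, 2):
                diff = [a - b for a, b in zip(hi, lo)]
                if h(w, diff) <= 0:  # pair scored in the wrong order
                    w = [wi + di for wi, di in zip(w, diff)]
                    mistakes += 1
        if mistakes == 0:  # every observed pair is reproduced
            break
    return w


# Toy training data (invented): two queries with 2-d document features.
observed = [
    [(3, 1), (2, 2), (1, 1), (0, 1)],
    [(2, 0), (1, 1), (0, 2)],
]
w = learn_ranker(observed)
for ranking in observed:
    print(sorted(ranking, key=lambda x: h(w, x), reverse=True) == ranking)
```

Once such a scorer reproduces the observed order, it can be applied to documents the system never ranked, which is what the clustering step relies on.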

[Figure: learning to rank framework. Training data (queries q1, q2, ..., qn, each with document feature vectors x and labels y) feeds a Learning System that produces a model h; the Ranking System applies h(x) to test data (a query q with documents x1, x2, ..., xm) to produce a prediction.]

• Leverage the cluster hypothesis

• Group together the documents that are similar from the point of view of the considered ranking model

Generating the clusters

(1) submit each document in Dj as a query and retrieve a set of documents Di;

(2) determine Ci = Dj ∩ Di;

(3) rank the documents in Ci by employing the learned ranking model
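
The three steps above can be sketched as follows. The sketch reads step (2) as the intersection of Dj and Di, and it takes the retrieval system and the learned ranking model as plain callables; `retrieve`, `score`, and the toy lookup tables are hypothetical stand-ins, not part of the described prototype.

```python
def build_clusters(d_j, retrieve, score):
    """Build one cluster per document of the result list d_j.

    retrieve(doc) -> documents returned when `doc` is submitted as a query.
    score(doc)    -> score assigned by the learned ranking model.
    """
    clusters = {}
    for doc in d_j:
        d_i = set(retrieve(doc))                    # step (1)
        c_i = [d for d in d_j if d in d_i]          # step (2): Dj ∩ Di
        clusters[doc] = sorted(c_i, key=score, reverse=True)  # step (3)
    return clusters


# Toy stand-ins (invented): lookup tables play the IR system and the learned model.
neighbours = {
    "d1": ["d1", "d2", "d9"],
    "d2": ["d2", "d1", "d3"],
    "d3": ["d3", "d2"],
}
scores = {"d1": 0.2, "d2": 0.9, "d3": 0.5}

clusters = build_clusters(
    ["d1", "d2", "d3"],
    retrieve=lambda doc: neighbours[doc],
    score=lambda doc: scores[doc],
)
print(clusters["d1"])  # ['d2', 'd1']
print(clusters["d2"])  # ['d2', 'd3', 'd1']
```

Document d9 is retrieved for d1 but never enters a cluster, because it is not part of the original result list Dj.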

What-if Analysis
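
A minimal way to picture a what-if step is to move one document to a new rank and compare the DCG before and after. The sketch below assumes a log2 discount that starts at rank 2 and moves a single document only; it does not model how a move would propagate to the document's cluster, and the helper names `dcg` and `move` are hypothetical.

```python
import math


def dcg(gains):
    """DCG of a ranked list of relevance grades (log2 discount from rank 2, assumed)."""
    return sum(g / max(1.0, math.log2(rank)) for rank, g in enumerate(gains, start=1))


def move(ranking, src, dst):
    """Return a copy of `ranking` with the item at 0-based index src moved to index dst."""
    out = list(ranking)
    out.insert(dst, out.pop(src))
    return out


# Toy relevance grades, invented for illustration only.
before = [0, 2, 1, 2, 0, 1]
after = move(before, src=3, dst=0)   # promote the relevant document at rank 4 to rank 1
print(after)                         # [2, 0, 2, 1, 0, 1]
print(round(dcg(before), 3), round(dcg(after), 3))
```

Comparing the two DCG values gives an estimate of what the change would buy before the system itself is modified.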

To Summarize: Data Pipeline

Application Example: Free Movement

Application Example: Capped Movement

Application Example: New Entry Evolution

The Domino Effect

How does the change for topic 351 affect the other topics?

The change for 351 worsens the DCG curve of topic 355

The change for 351 improves the DCG curve of topic 400

Aggregate view for the whole set of documents before and after the movement
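
The per-topic and aggregate comparisons can be illustrated with a small sketch that takes the ranked relevance grades of each topic before and after a change and reports the DCG difference. The topic labels and grades are invented toy values, not TREC results, and the log2 discount starting at rank 2 is an assumption.

```python
import math


def dcg(gains):
    """DCG of a ranked list of relevance grades (log2 discount from rank 2, assumed)."""
    return sum(g / max(1.0, math.log2(rank)) for rank, g in enumerate(gains, start=1))


def domino(before, after):
    """Per-topic DCG change, given {topic: grades by rank} before and after a change."""
    return {topic: dcg(after[topic]) - dcg(before[topic]) for topic in before}


# Invented toy rankings for three hypothetical topics.
before = {"A": [0, 1, 0, 1], "B": [1, 1, 0, 0], "C": [0, 0, 1, 0]}
after = {"A": [1, 1, 0, 0], "B": [1, 0, 0, 1], "C": [1, 0, 0, 0]}

deltas = domino(before, after)
for topic, delta in deltas.items():
    print(topic, "improves" if delta > 0 else "worsens", round(delta, 3))
print("mean change:", round(sum(deltas.values()) / len(deltas), 3))
```

A change that helps one topic can hurt another, so the mean over all topics is what indicates whether the modification pays off overall.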

Final Remarks and On-Going Work

We presented a visual interactive tool that allows analysts to perform failure and what-if analyses

We described the prototype, which implements an actual fusion of IR evaluation and visual analytics and was tested on the TREC7 collection

We are performing additional tests that employ different learning to rank algorithms to construct the clusters

We are investigating whether (and how) custom feature extraction and selection may allow us to understand which component of the IR system is affected by a change of ranking
