© CvR1 The Geometry of IR Keith van Rijsbergen Tampere 15 th August, 2002 (lost in Hilbert space!)

Preview:

Citation preview

© CvR 1

The Geometry of IR

Keith van Rijsbergen

Tampere 15th August, 2002(lost in Hilbert space!)

© CvR 2

“Unscripted” comments I

StatesObservablesMeasurement

=> Reality?Projection PostulatesCognitive State Changes

© CvR 3

“Unscripted” comments II (quoting John von Neumann)

However, all quantum mechanical probabilities are defined byinner products of vectors. Essentially if a state of a system is givenby one vector, the transition probability in another state is theinner product of the two which is the square of the angle betweenthem. In other words, probability correspond precisely to intro-ducing the angles geometrically. Furthermore, there is only oneway to introduce it. The more so because in the quantum mechanicalmachinery the negation of a statement, so the negation of a statementwhich represented by a linear set of vectors, correponds to theorthogonal complement of this linear space.

Unsolved problems in mathematics, typescript, September, 1954

© CvR 4

What is this talk about?

Not about quantum computation.see Nielsen and Chuang, CUP, 2000

Not about Logicsee Engesser and Gabbay, AI, 2002

• History (von Neumann, Dirac, Schroedinger)• Motivation (complementarity)• Duality (Syntax/Semantics)• Measurement (Incompatibility)• Projections (subspaces)• Probability (inner products)• IR application (feedback, clusters, ostension)

© CvR 5

© CvR 6

© CvR 7

Images not Text: how might thatmake a difference?

no visual keywords (yet)- tf/idf issue

aboutness revisable (eg Maron)relevance revisable (eg Goffman)feedback requires salienceaboutness -> relevance -> aboutness

© CvR 8

This is not new!

Goffman, 1969: ‘..that the relevance of the informationfrom one document depends upon what is already knownabout the subject, and in turn affects the relevance of otherdocuments subsequently examined.’

Maron, : ‘Just because a document is about the subject sought by a patron, that fact does not imply that he wouldjudge it relevant.’

© CvR 9

Maron’s theory of indexing

…..in the case where the query consists of singleterm, call it B, the probability that a given documentwill be judged relevant by a patron submitting Bis simply the ratio of the number of patrons who submitB as their query and judge that document as relevant,to the number of patrons, who submit B as their search query

© CvR 10

JPEG Image

In 1949 D.M Mackay wrote a paper ‘Quantalaspects of scientific information’, SER, vol 41, no.314,in which he alluded to using the quantum mechanicsparadigm to IR

© CvR 11

Expectation Catalogue

It (-function) is now the means for predicting probability ofmeasurement results. In it is embodied themomentarily-attained sum of theoretically basedfuture expectation, somewhat as laid down in acatalogue. It is the relation-and-determinacy-bridgebetween measurements and measurements......It is, in principle, determined by a finite number of suitablychosen measurement on the object.....Thus the catalogueof expectations is initially compiled.

Schrödinger, 1935 &1980

© CvR 12

Hypotheses

Cluster Hypothesis: closely associated documents tend to berelevant to the same requests. (1971)

[co-ordination is positively correlated with external relevance,Jackson, 1969]

Association Hypothesis: If an index term is good at discriminatingrelevant from non-relevant documents then any closely associatedindex term is also likely to be good at this. (1979)

[co-occurrence of terms within documents is a suitable measureof similarity between terms, Jackson,1971]

© CvR 13

Navigation - Browsing

T-space

D-space

© CvR 14

DUALITY

Direct file/Inverted file Statespace/Space of Projections

d’ = (x,y,z,u,v,w)d” =(u,v,w,k,l,m)

[[u]] = {d’,d”}; [[x]] = {d’}; [[m]] = {d”}

Boolean Logic: [[ux]] = {d”}; [[xm]] ={d’,d’’}Quantum Logic: [[ux]] = same; [[xm]] = different

© CvR 15

The mathematics you need

Hilbert space (complex!!!)

•inner product <x|y>•norms ||x||2 = <x|x>•operator (linear) <x|A|y>•Hermitian A*=A•trace tr(A) =aii

•eigenvalues Ax = x

© CvR 16

Crash course on Dirac notation

|x> : vector (called ket)<x| = |x>*: functional (bra)

<x|y> = (row vector)(column vector)= xi*yi

|x><y| : linear operator|x><x| : a projector onto ray xtr(|x><y|) = <x|y>

I = |i><i| : universal projector

© CvR 17

Hierarchy of ProjectorsP0 = Pn = I

P1 = |1><1|P2 = |1><1| +|2><2|

.

.

.Pn = |1><1| +…+|n><n|

© CvR 18

Summary

Relevance/Aboutness

Documents

Queries

Observables

Operators

State function

Operators can be applied to state function; andoperators can be decomposed into projectors.

A = aiPi

© CvR 19

‘That is the relevance or irrelevance ofa given retrieved document may affectthe user’s current state of knowledgeresulting in a change of the user’sinformation need, which may lead toa change of the user’s perception/interpretation of the subsequentretrieved documents….’ Borlund, 2000

© CvR 20

T T

T

RR

Y

N

Y

N

N

Y

Relevance/Aboutnes is

Interaction/User dependent

© CvR 21

probability as inner product

|t><t||r><r||t><t| =|t> <t|r><r|t> <t| = <t|r><r|t> |t><t| = |<t|r>|2 |t><t| = cos2 |t><t| (in real Hilbert space)

© CvR 22

|r=1>

|t=1>

|t=0>

|r=0> x

© CvR 23

An operator T is of trace-class provided that T is positive(<x|T|x> 0, x) and trace of T is finite (<ei|T|ei>)

T is a density operator if T is trace-class and tr(T) = 1

T = aiPi is a density operator if 0 ai and ai = 1

© CvR 24

Theorem

Letbe any measure on the closed subspaces of aseparable (real or complex) Hilbert space H of dimensionat least 3. There exists a positive self-adjoint operator Tof trace class such that, for all closed subspace L of H,

(L) = Tr(TPL)

If is to be a probability measure, thus requiring that(H) = 1, then Tr(T) =1, that is, T is a density operator.

© CvR 25

Conditional Probability

P(LA|LB) = tr(PBDPBPA) / tr(DPB)

Note that PA could be E -> F

© CvR 26

What is T? – without blinding you with science

-Relevance Feedback ( a mixture with log weights)-Pseudo relevance feedback (a mixture with similarity weights)-Clustering (superposition of members?)-Ostension (a history)

© CvR 27

Conclusions?

Is it worth it? Does it matter?

- images- logic/probability/information/vectors- language

© CvR 28

Useful ReferencesReadings in Information Retrieval,Morgan Kaufman, Edited by Sparck Jones and Willett

Advances in Information Retrieval: Recent Research from CIIR, Edited by Bruce Croft.

Information Retrieval: Uncertainty and Logics,Advanced Models for the Representation and Retrieval of Information, Edited by Crestani, Lalmas, Van Rijsbergen.

Finding out about, Richard Belew.

Recommended