20
1 Thomas Mandl: GeoCLEF Track Overview 2007 Cross-Language Evaluation Forum (CLEF) homas Mandl, (U. Hildesheim) [email protected] 8 th Workshop of the Cross-Language Evaluation Forum (CLEF) Budapest 20 Sept. 2007 GeoCLEF Track Overview: Query Classification and Geographic Search

Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) [email protected] 8 th Workshop

Embed Size (px)

Citation preview

Page 1: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

1Thomas Mandl: GeoCLEF Track Overview 2007

Cross-Language

Evaluation Forum (CLEF)

Thomas Mandl, (U. Hildesheim) [email protected]

8th Workshop of theCross-Language Evaluation Forum (CLEF)Budapest 20 Sept. 2007

GeoCLEF Track Overview: Query Classification and Geographic Search

Page 2: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

2Thomas Mandl: GeoCLEF Track Overview 2007

GeoCLEF AdministrationGeoCLEF AdministrationGeoCLEF AdministrationGeoCLEF Administration

• Joint effort of

– Fredric Gey (U. California at Berkeley)– Diana Santos (Linguateca, SINTEF ICT, Norway)– Mark Sanderson (U. Sheffield)– Nicola Ferro, Giorgio Di Nunzio (U. Padua)– Thomas Mandl, Christa Womser-Hacker

(U. Hildesheim)

– and many others ...

Page 3: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

3Thomas Mandl: GeoCLEF Track Overview 2007

Geographic Information SystemsGeographic Information SystemsGeographic Information SystemsGeographic Information Systems

Page 4: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

4Thomas Mandl: GeoCLEF Track Overview 2007

Initial Aim of GeoCLEFInitial Aim of GeoCLEFInitial Aim of GeoCLEFInitial Aim of GeoCLEF

• Aim: to evaluate retrieval of multilingual documents with an emphasis on geographic search (GIR) – “find me news stories about riots near Dublin”

(Fred Gey @ CLEF Workshop 2005)

Page 5: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

5Thomas Mandl: GeoCLEF Track Overview 2007

ParticipationParticipationParticipationParticipation

CLEF Year 2005 2006 2007

Nr. of Participants

11 17 13

Nr. of submitted Experiments

117 149 108

New: Query New: Query Classification TaskClassification Task6 participants6 participants

Page 6: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

6Thomas Mandl: GeoCLEF Track Overview 2007

Search Task 2007Search Task 2007Search Task 2007Search Task 2007

• Three languages

• 600,000 + docs

• 25 topics (75 in three years now)

• Intention behind topics– geographically

challenging

Page 7: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

7Thomas Mandl: GeoCLEF Track Overview 2007

Reliability?Reliability?Reliability?Reliability?

• 25 topics are sufficient under most circumstances to reliably order systems (Sanderson & Zobel 2005)

Page 8: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

8Thomas Mandl: GeoCLEF Track Overview 2007

Partial Swap Rate AnalysisPartial Swap Rate AnalysisPartial Swap Rate AnalysisPartial Swap Rate Analysis

0.0

0.10.2

0.30.4

0.5

0.60.7

0.80.9

1.0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25

Average Correlation between system rankings of full and partial topic setfor German Mono-lingual task 2006

Page 9: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

9Thomas Mandl: GeoCLEF Track Overview 2007

Search TaskSearch TaskSearch TaskSearch Task

• How much and which geo knowledge and reasoning is necessary?

• Each year, keyword based systems do well on the task

Page 10: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

10Thomas Mandl: GeoCLEF Track Overview 2007

Query Classification TaskQuery Classification TaskQuery Classification TaskQuery Classification Task

• Goal: find geo queries in a log of real queries

• New in 2007

• Organized by Xing Xie (Microsoft Research Asia, Beijing, China)

Page 11: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

11Thomas Mandl: GeoCLEF Track Overview 2007

DataDataDataData

• Query log from the MSN search engine– in English– 800.000 queries (collected August 2006)– 500 queries were labelled and used for

evaluation• 100 queries for training• 400 for testing

Page 12: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

12Thomas Mandl: GeoCLEF Track Overview 2007

TaskTaskTaskTask

• Find queries with a geographic scope– Extract where component– Extract geo-relation-type– Extract what component– Classify what type {information, yellow page, map}

Example:

< what>lottery<what-type>information

<where>Florida, US

< geo-relation>in

<local>YES

Lottery in Florida

Page 13: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

13Thomas Mandl: GeoCLEF Track Overview 2007

Geo-relation-typesGeo-relation-typesGeo-relation-typesGeo-relation-types

• 27 classes

• Examples:– In– On– Near– Along– Distance– North_of– North_west_of– North_to– …

Page 14: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

14Thomas Mandl: GeoCLEF Track Overview 2007

Evaluation SetEvaluation SetEvaluation SetEvaluation Set

1. Choose 800 queries randomly from the query set.

2. Remove the typos and the ambiguous queries from the 800 ones manually.

3. Select the queries with special geo-relations from the remainder queries in the query set manually and add them to the evaluation set.

4. Select 500 queries for the final evaluation set.

Page 15: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

15Thomas Mandl: GeoCLEF Track Overview 2007

Query Types in Evaluation SetQuery Types in Evaluation SetQuery Types in Evaluation SetQuery Types in Evaluation Set

Page 16: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

16Thomas Mandl: GeoCLEF Track Overview 2007

Evaluation MetricsEvaluation MetricsEvaluation MetricsEvaluation Metrics

• Three assessors – individually assessed all system answers– reached an agreement

• Fully Correct classified query instances

• Recall, precision and combined F1-Score

Page 17: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

17Thomas Mandl: GeoCLEF Track Overview 2007

ResultsResultsResultsResults

Team Precision Recall F1

Ask 0.625 0.258 0.365

Csusm 0.201 0.197 0.199

Linguit 0.112 0.038 0.057

Miracle 0.428 0.566 0.488

Talp 0.222 0.249 0.235

Xldb 0.096 0.08 0.088

Page 18: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

18Thomas Mandl: GeoCLEF Track Overview 2007

ApproachesApproachesApproachesApproaches

• Gazeteers for location identification– Large base of geo names

• Pre-defined Rules

• Issues– Low Perfomance– Few training classes for many geo-types

Page 19: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

19Thomas Mandl: GeoCLEF Track Overview 2007

Parallel Session on Friday 9:00 – 10:30

Please come to the Breakout Session on Friday 11:00 – 12:00

and help us to promote GeoCLEF

More on GeoCLEFMore on GeoCLEFMore on GeoCLEFMore on GeoCLEF

Page 20: Thomas Mandl: GeoCLEF Track Overview 2007 1 Cross-Language Evaluation Forum (CLEF) Thomas Mandl, (U. Hildesheim) mandl@uni-hildesheim.de 8 th Workshop

20Thomas Mandl: GeoCLEF Track Overview 2007

OverviewOverviewOverviewOverview

More on the geographic search task …

• Mark Sanderson: Topic Development and Relevance Assessment

• Giorgio Di Nunzio: Results

• Diana Santos: Approaches and Interpretation