Transcript
Page 1: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

Welcome to CLEF 2005

Carol PetersISTI-CNR Pisa, Italy

Page 2: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

Cross-Language System Evaluation

9 years of activity

CLIR track at TREC (1997-1999) CLEF 2001 & 2000 - sponsored by DELOS

Network of Excellence (5FP) and US National Institute of Standards and technology

CLEF 2002 & 2003 - IST-2000-31002 CLEF 2004 & 2005 again sponsored by DELOS

Network of Excellence

plus

Page 3: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF Coordination

Centre for the Evaluation of Human Language and Multimodal Communication Technologies (CELCT), Trento, Italy

Centro per la Ricerca Scientifica e Tecnologica, Istituto Trentino di Cultura, Trento, Italy

College of Information Studies and Institute for Advanced Computer Studies, University of Maryland, USA

Department of Computer Science, University of Helsinki

Department of Computer Science and Information Systems, University of Limerick, Ireland

Department of Information Engineering, University of Padua, Italy

Department of Information Studies, University of Sheffield, UK

Evaluations and Language Resources Distribution Agency Sarl, Paris, France

German Research Centre for Artificial Intelligence, DFKI, Saarbrücken, Germany

Information and Language Processing Systems, University of Amsterdam, Netherlands

InformationsZentrum Sozialwissenschaften, Bonn, Germany

Lenguajes y Sistemas Informáticos, Universidad Nacional de Educación a Distancia, Madrid, Spain

Linguateca, Sintef, Oslo, Norway; University of Minho, Braga, Portugal

Linguistic Modelling Laboratory, Bulgarian Academy of Sciences

National Institute of Standards and Technology, Gaithersburg MD, USA

Oregon Health and Science University, USA Research Computing Center of Moscow State

University Research Institute for Linguistics, Hungarian

Academy of Sciences School of Computing, Dublin City University,

Ireland UC Data Archive and School of Information

Management and Systems, UC Berkeley, USA University Hospitals and University of Geneva,

Switzerland 

CLEF is coordinated by the Istituto di Scienza e Tecnologie dell'Informazione, Consiglio Nazionale delle Ricerche, PisaThe following Institutions are contributing to the organisation of the different tracks of the CLEF 2005 campaign:

Page 4: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEFSteering Committee

Maristella Agosti, University of Padova, Italy Eija Airio, University of Tampere, Finland Martin Braschler, Zurich, Switzerland Amedeo Cappelli, ISTI-CNR & CELCT, Italy Hsin-Hsi Chen, National Taiwan University, Taipei, Taiwan Khalid Choukri, Evaluations and Language resources

Distribution Agency, Paris, France Paul Clough, University of Sheffield, UK David A. Evans, Clairvoyance Corporation, USA Marcello Federico, ITC-irst, Trento, Italy Christian Fluhr, CEA-LIST, Fontenay-aux-Roses, France Norbert Fuhr, University of Duisburg, Germany Frederic C. Gey, U.C. Berkeley, USA Julio Gonzalo, LSI-UNED, Madrid, Spain Donna Harman, National Institute of Standards and

Technology, USA Gareth Jones, Dublin City University, Ireland Franciska de Jong, University of Twente, Netherlands Noriko Kando, National Institute of Informatics, Tokyo,

Japan Jussi Karlgren, Swedish Institute of Computer Science,

Sweden

Michael Kluck, German Institute for International and Security Affairs, Berlin, Germany

Natalia Loukachevitch, Moscow State University, Russia Bernardo Magnini, ITC-irst, Trento, Italy Paul McNamee, Johns Hopkins University, USA Henning Müller, University & University Hospitals of

Geneva, Switzerland Douglas W. Oard, University of Maryland, USA Maarten de Rijke, University of Amsterdam, Netherlands Jacques Savoy, University of Neuchatel, Switzerland Peter Schäuble, Eurospider Information Technologies,

Switzerland Richard Sutcliffe, University of Limerick, Ireland Max Stempfhuber, Informationszentrum

Sozialwissenschaften Bonn, Germany Hans Uszkoreit, German Research Center for Artificial

Intelligence (DFKI), Germany Felisa Verdejo, LSI-UNED, Madrid, Spain José Luis Vicedo, University of Alicante, Spain Ellen Voorhees, National Institute of Standards and

Technology, USA Christa Womser-Hacker, University of Hildesheim, Germany

Page 5: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF 2000 mono-, bi- and multilingual textual document retrieval on news collections

(Ad Hoc) mono- and cross-language information on structured scientific data

(Domain-Specific)

CLEF 2000-2005: Evaluation Tracks

CLEF 2001 interactive cross-language retrieval (iCLEF)

CLEF 2003 multiple language question answering (QA@CLEF) cross-language retrieval in image collections (ImageCLEF)

CLEF 2002 cross-language spoken document retrieval (CL-SR)

CLEF 2005 multilingual retrieval of Web documents (WebCLEF) cross-language geographical retrieval (GeoCLEF)

Page 6: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF 2005: Track Coordinators

Ad Hoc: Giorgio Di Nunzio, Nicola Ferro and Gareth Jones Domain-Specific: Michael Kluck and Natalia Loukachevitch iCLEF: Julio Gonzalo, Paul Clough and Alessandro Vallin QA@CLEF: Bernardo Magnini, Alessandro Vallin, Danilo Giampiccolo,

Lili Aunimo, Christelle Ayache, Petya Osenova, Anselmo Peñas, Maarten de Rijke, Bogdan Sacaleanu, Diana Santos and Richard Sutcliffe

ImageCLEF: Paul Clough, Henning Müller, Thomas Deselaers , Michael Grubinger, Thomas Lehmann, Jeffery Jensen, and William Hersh

CL-SR: Ryen W. White, Douglas W. Oard, Gareth J. F. Jones, Dagobert Soergel, Xiaoli Huang

Web-CLEF: Börkur Sigurbjörnsson, Jaap Kamps, Maarten de Rijke GeoCLEF: Fredric Gey, Ray Larson, Mark Sanderson, Hideo Joho and

Paul Clough

Page 7: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF 2005: Participating Groups

Budapest U. Tech.&Economics, Hungary Bulgarian Acad.Sci –TreeBank, Bulgaria * California State U. - Comp.Sci, USA CEA-LIST / LIC2M, France ** Chinese U. of Hong Kong, China CLIPS-Grenoble, France ** CMU - Language Technology, USA Daedalus & Madrid Univs, Spain ** DFKI-Artificial Intelligence, DE** Dublin City U. - Comp.Sci., Ireland * ENSM - St Etienne, France Hummingbird Core Tech., Canada **** Inst.Infocomm Research, Singapore IPAL-CNRS (IR2), Singapore ** IRIT/SIG,Toulouse, France *** ITC-irst Trento, Italy ***** Ist.Nac.Astrofisica, Optica, Electronica,

Mexico Johns Hopkins U., USA ***** LIMSI-CNRS, France ** Linguateca-Sintef, Norway * Linguit GmbH, Germany Metacarta Inc., USA Moscow State U.- Computing, Russia Mount Holyoke College, USA Nat.Chiao-Tung U.-Comp.Sci, Taiwan * Nat. Inst.Informatics, Japan *

U.Groningen - Inf.Sci, Netherlands U.Hagen – IICS, Germany ** U.Helsinki - Comp.Sci, Finland * U.Hildesheim - Inf.Sci, Germany ** U.Indonesia - Comp.Sci, Indonesia U.Jaen - Intell.Systems, Spain **** U.Liege - Elect.Eng.&CS, Belgium U.Limerick - Comp. Sci, Ireland ** U.Lisbon – Informatics, Portugal * U.Maryland - Comp.Sci, USA ***** U.Melbourne – NICTA, Australia U.Montreal, Canada ***** U. Nantes – Informatique, France U.Neuchatel – Informatique, Switzerland **** U.Ottawa - IT & Eng, Canada U.Pittsburgh – IR, USA U.Politecnica Catalunya – TALP, Spain U.Politecnica Valencia - Comp.Sci, Spain U.Salamanca – REINA, Spain *** U.Sheffield - Inf.Studies, UK ***** U.Stockholm, NLP, Sweden * U.Surugadai - Cultural Inf., Japan ** U.Waterloo - Comp.Sci, Canada UC Berkeley - IM&S-1, USA ***** UC Berkeley-IM&S-2, USA * UNED-LSI, Spain ****

Nat.Dong Hwa U., Taiwan * Nat.Taiwan U. - Comp-Sci, Taiwan

**** Nat.U. Singapore, Singapore Oregon Health & Sci. U., USA * Priberam Informatica, Portugal RWTH Aaachen-Comp.Sci., Germany

* RWTH Aachen - Med.Inf., Germany * SUNY Buffalo – Informatics, USA ** Swedish Inst.Comp.Sci, Sweden **** SYNAPSE Développement, France Thomson Legal Regulatory, USA

**** U. Hospitals Geneva, Switzerland * U.Alicante - Comp.Sci, Spain **** U.Amsterdam - Informatics,

Netherlands **** U.Amsterdam – Melange, Netherlands U.Autonomous Puebla - Comp.Sci,

Mexico U.Comahue - Comp.Sci, Argentina U.Concordia - Comp.Sci, Canada U.Evora – Informatics, Portugal * U.Geneva – Informatique, Switzerland U.Glasgow – IR, UK * U.Granada, Spain

Page 8: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF: Growth in Participation

CLEF 2000-2005

010

2030

405060

70

2000 2001 2002 2003 2004 2005

Years

Par

tici

pan

ts Others

North American

Asian

European

Page 9: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

No. of Participants per Track

Ad Hoc: Monolingual - 17 Bilingual - 16 Multilingual - 5

Domain-Specific - 7 iCLEF - 5

CL-SR - 7 QA@CLEF - 23 ImageCLEF - 24 WebCLEF - 15 GeoCLEF - 11

Page 10: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF 2000 – 2005Shift in Focus

0

5

10

15

20

25

2000 2001 2002 2003 2004 2005

Years

Pa

rtic

ipa

tin

g G

rou

ps

AdHoc: Mono

AdHoc: Bi

AdHoc: Multi

DomSpec

iCLEF

CL-SR

QA@CLEF

ImageCLEF

WebClef

GeoClef

Page 11: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF 2005 Document Collections

Ad Hoc, QA@CLEF, iCLEF, GeoCLEF CLEF multilingual comparable corpus of more than 2M news docs in 12 languages: DE,EN,ES,FI,FR,IT,NL,RU,SV, PT, BG and HU (new in 2005)Domain-Specific The GIRT-4 social science database in EN and DE: more that 300,000 docs The Russian Social Science Corpus: almost 100,000 docsImageCLEF St Andrews historical photographic archive: 28,000 images CasImage radiological medical database with case notes in FR and EN: 9,000 PEIR 33,000 images, MIR 2,000, PathoPic 9,000 IRMA collection in EN and DE for automatic medical image annotation: 10,000CL-SR Malach collection of spontaneous conversational speech derived from the Shoah archives: 589 hours WebCLEF EuroGOV, a multilingual collection of more than 2M webpages crawled from European governmental sites

Page 12: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF 2005 Topics

Ad hocMono- and Bi-: 50 topics in 13 languagesMultilanguage: 60 topics from CLEF 2003

Domain Specific 25 topics in 25 in EN, DE and RU

QA@CLEF200 questions in 10 languages

ImageCLEFAd Hoc 28 topics in 7 languages (All Fields) and 25 languages (title

only)Medical 25 topics: visual, text and visual, semantic; text in 3

languagesCL-SR

x training topics and 25 eval. Topics in EN, CZ, FR, DE, ESWebCLEF

> 500 topics in 11 languagesGeoCLEF

25 topics in DE, EN, ES, PT

Page 13: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF 2005: Results

Participation is up: 74 groups in 2005 (54 in 2004) Expansion of test-suite Great success of QA@CLEF and ImageCLEF Much interest in CL-SR, GeoCLEF and WebCLEF CLEF research community: synergy of diverse

expertise partly consequence of new tracks – IR, NLP, Image Processing, Speech Processing, GIS, …

CLEF 2005 Workshop 21-23 September, in conjunction with ECDL2005, >110 participants (ca 95 in 2004)

Page 14: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF Results in 9 Yrs Creation of strong CLIR research community (increase in

participation over years ) Strong profile (we are “known”) Promotion of research in key areas (multilingual IR; results

merging; cross-language access in multimedia; interactive query formulation and results presentation)

Encouraged take-up of techniques/resources between research groups

Stimulated synergy between researchers from different areas (IR, NLP, Image Processing, User Interfaces, …)

Literature: Working Notes, Proceedings and other publications report state-of-the-art plus emerging trends

Production of language resources; test-suites

Page 15: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

CLEF 2004 Proceedings

Page 16: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

The Future of CLEF

???2003

Can we survive?!

Page 17: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

The Future of CLEF

???CLEF 2004

It’s looking fine!

Page 18: Welcome to CLEF 2005 Carol Peters ISTI-CNR Pisa, Italy

CLEF 2005 Workshop, Vienna, Austria 21-23 September 2005

The Future of CLEF

???

Are we doing too much?!


Recommended