View
28
Download
1
Category
Preview:
DESCRIPTION
HLT: Current and Recent Research & Development in the Netherlands 2001-2008. Jan Odijk 24 Nov 2008. Overview. Earlier History Instruments & Programmes Players and their Projects Concluding Remarks. Earlier History. MT Projects in the 80’s Eurotra (EU, 1985- ca.1990) - PowerPoint PPT Presentation
Citation preview
HLT:Current and Recent Research & Development in the Netherlands2001-2008
Jan Odijk24 Nov 2008
OverviewEarlier HistoryInstruments & ProgrammesPlayers and their ProjectsConcluding Remarks
Earlier HistoryMT Projects in the 80sEurotra (EU, 1985- ca.1990)Distributed MT (BSO, 1984-ca.1990)Rosetta (Philips, 1985-1992)Emerging Community
Earlier HistoryCLIN (Computational Linguistics in the Netherlands) initiated in 1990After TIN (Linguistics in the Netherlands)But no associationYearly informal conference no or little pre-selectionYearly ProceedingsSelection of reviewed articleshttp://www.let.rug.nl/~vannoord/clin/clin.html
Earlier HistoryCommunity further strengthened by common projects in the 90sPriority Programme LST centered around public transportation information services (OVIS)Corpus Gesproken Nederlands (Spoken Dutch Corpus), together with Flanders
OverviewEarlier HistoryInstruments & ProgrammesPlayers and their ProjectsConcluding Remarks
InstrumentsNWO PionierNWO VernieuwingsimpulsVeni, Vidi, Vicihttp://www.nwo.nl/nwohome.nsf/pages/NWOA_4YJDQ3 EZ BsikIncrease knowledge and research capacity for 5 selected areas (incl. ICT)For mixed public/ private consortia that bundle knowledge, expertise and innovative capacity http://www.senternovem.nl/BSIK/ EC (IST)
OverviewEarlier HistoryInstruments & ProgrammesPlayers and their ProjectsConcluding Remarks
IMIXName: Interactive Multimodal Information ExtractionDuration: 2001-2008Budget: 2M4 small programmes, 3 post-doc projectsdemonstratorURL: http://www.nwo.nl/IMIX Funding: NWO
IMIX GoalsAims to develop knowledge and technology needed To find specific answers to specific questions in Dutch-language documentsusing multiple modalities at the input and output sides
STEVINName: STEVINDuration: 2004-2011Budget: 11.4M19 R&D projects14 demonstration projectsNetworking activities, educational activities, Funding: Netherlands 2/3 & Flanders 1/3 http://taalunieversum.org/taal/technologie/stevin/
STEVIN Goalscontribute to the further progress of HLT for the Dutch languagerealise an appropriate digital language infrastructure for the Dutch languagecarry out strategic research in the domains of language and speech technology, in particular in areas for which there is a large demand from specific applications and technologies; create networks and core research areas; promote the embedding of research and educate new generations of experts; encourage demand and knowledge transfer.
CATCHName: Continuous Access to Cultural HeritageDuration: 2005-??Budget: 6M until 2008 and 3M in 2008Currently 10 running projectsURL: http://www.nwo.nl/catch Funding: NWO + OCW. Cultural heritage institutes contribute in kind (2.8M so far)
CATCH GoalsAims to develop generic methods and techniquescutting across the areas of the humanities and computer science, aiming to facilitate an interaction with cultural heritage institutions.
IOP MMIName: Innovation-oriented Research Programme (IOP) Man Machine InteractionDuration: 1999-2003 (phase 1); 2004-2007Budget: ??URL: http://www.senternovem.nl/iopmensmachineinteractie/index.asp Funding: EZ (Min. of Economic Affairs)
IOP MMI Goals (phase 2)focus on the Design, Implementation and Evaluation of Intelligent Systems Which dynamic knowledge (of one another) should systems and users acquire and apply in order to optimally achieve their goals
CLARIN-NL(?)Name: Common Language Resource and Technology Infrastructure - NetherlandsDuration: 2009-2014Budget: 22M requested URL: Funding: OC&W (ESFRI)
CLARIN-NL GoalsCLARIN-NL aims to design, construct, validate, and exploit a research infrastructure that is needed to provide a sustainable and persistent eScience working environment for researchers in the Humanities and Social Sciences (HSS) who want to make use of language resources and technology.
OverviewEarlier HistoryInstruments & ProgrammesPlayers and their ProjectsConcluding Remarks
Amsterdam (UvA)Institute: Informatics InstituteCore Topics: Information RetrievalQuestion AnsweringKey peopleMaarten de Rijke
Amsterdam (UvA)QASSIR (NWO, 2006-2010)Question Answering as Semistructured Information RetrievalEfFoRT (NWO, 2006-2010)Effective Focused Retrieval TechniquesUvA, (Twente)MultiMATCH (EU, 2006-2009)Multilingual/Multimedia Access To Cultural Heritage11 partners from Europe incl. UvA
Amsterdam (UvA)MuNCH (CATCH, (2005-2009)Multimedia aNalysis for Cultural Heritage.UvA, Beeld & Geluid (B&G), Digitaal Erfgoed NederlandMuSeUM (CATCH, 2005-2009)Multiple-collection Searching Using Metadata.UvA, Gemeentemuseum Den Haag, Rijksbureau voor Kunsthistorisch Documentatie, Municipal Archives RotterdamA Model Checking Approach to Query Evaluation on XML Documents (NWO, 2004-2008).
Amsterdam (UvA)FactMine (IMIX, 2004-2007)Fact and Ontology Mining for Question AnsweringUvA, DFKI, Antwerpen, Erasmus MCAID (Dutch Government, 2004-2008)Adaptive Information Disclosure.
Amsterdam (UvA)ITEQA (NWO, 2004-2007)Inference for Temporal Question Answering.DuOMAn (STEVIN, 2008-2011)Dutch Online Media AnalysisUvA, Groningen, Gent, TrendLight, GridLineDAESO (STEVIN), Cornetto (STEVIN), KYOTO (EU IST), CLARIN-NL
Amsterdam (UvA)Institute: ILLCTopicsData Oriented Parsing (DOP)Key Persons:Remko SchaRens BodKhalil Simaan
Amsterdam (UvA)U-DOP (NWO VICI, 2006-2011)Unsupervised Learning with the DOP ModelDOP and Unsupervised Grammar Induction (NWO, 2004-2007) Unsupervised stochastic grammar induction from unlabeled data DOP and Learning Stochastic Tree-Grammars (NWO, 2003-2006)
Amsterdam (VU)Department: Language and CommunicationCore Topics: Computational Lexicology Key peoplePiek Vossen
Amsterdam (VU)Cornetto (STEVIN, 2006-2008)Combinatorial and Relational Network as Toolkit for Dutch Language TechnologyVU, UvA, Leuven, IrionKYOTO (EU FP7 ICT, 2008-2011)Knowledge Yielding Ontologies for Transition-based OrganizationVU, 8 other European partnersCLARIN
GroningenDepartment: Centre for Language and Cognition/ Computational LinguisticsCore Topics:Syntax and Parsing Key peopleJohn NerbonneGertjan van NoordGosse Bouma
GroningenAlpino (NWO PIONIER, 2000-2005)Algorithms for Linguistic ProcessingQADR (IMIX, 2004-2008)Question Answering for Dutch using Dependency RelationsGroningen, SpectrumCOREA (STEVIN, 2005-2007)Coreference Resolution for Extracting AnswersGroningen, Antwerpen, Language and Computing
GroningenLASSY (STEVIN, 2006-2009)Large Scale Syntactic Annotation of written DutchGroningen, LeuvenSCRATCH (CATCH, ??-??)SCRipt Analysis Tools for the Cultural HeritageGroningen, Nationaal ArchiefD-COI (STEVIN), IRME (STEVIN), DAISY (STEVIN), DuOMAn (STEVIN), PaCo-MT (STEVIN), CLARIN, CLARIN-NL
NijmegenInstitute: Centre for Language and Speech TechnologyCore Topics:Speech ProcessingLanguage Resource DevelopmentKey peopleLou Boves, Nelleke Oostdijk, Helmer Strik, Henk van den Heuvel
NijmegenNORISC (IMIX, 2004-2007)Next generatiOn template based Recognition for Interactive man-machine Speech CommunicationCOMIC (EU IST, 2002-2005)COnversational Multimodal Interaction with ComputersMATIS (IOP-MMI, ??-??)Multimodal Access to Transaction and Information Services
NijmegenD-COI (STEVIN, 2005-2006)Dutch Language Corpus InitiativeNijmegen, Tilburg, Twente, Groningen, Utrecht, Leuven, PolderlandSoNaR (STEVIN, 2008-2011)STEVIN Nederlandstalig ReferentiecorpusNijmegen, Tilburg, Twente, Utrecht, Leuven, GentJASMIN-CGN (STEVIN, 2005-2007)Extension of CGN with speech of children, non-natives, elderly and human-machine interactionNijmegen, Leuven, Talkinghome
NijmegenACORNS (EU, 2008-2010)Acquisition of Communication and Recognition Skills Intends to [] create an artificial agent that is capable of acquiring human verbal communication behaviour Nijmegen, 5 other European partnersAvoiding the ham in hamster (NWO VENI 2006-2010)Modelling the use of non-segmental information in human spoken-word recognition
NijmegenBATS (ICTRegie, IBBT, 2008-2012)Topic and Speaker Tracking in Broadcast Archives Nijmegen, LeuvenSPEXSpeech Processing Expertise CentreCollection, Annotation & ValidationELRA Speech Validation Centre
NijmegenAutonomata TOO (STEVIN, 2008-2010)Autonomata Transfer of Output Nijmegen, Gent, Utrecht, TeleAtlas, NuanceDISCO (STEVIN, 2008-2011)Development and Integration of Speech technology into COurseware for language learningNijmegen, Linguapolis Antwerpen, Taal- & Communicatiecentrum Nijmegen, PolderlandAutonomata (STEVIN), MIDAS (STEVIN), NBest (STEVIN), Praat (STEVIN), SPRAAK (STEVIN), CLARIN, CLARIN-NL, A Propos (IOP-MMI)
SoesterbergInstitute: TNO Defense & SecurityCore Topics:Speech ProcessingSpeech Technology EvaluationKey peopleDavid van Leeuwen
SoesterbergNBest (STEVIN, 2006-2008)Northern and Southern Dutch Benchmark Evaluation of Speech recognition TechnologyTNO Soesterberg, Nijmegen, Twente, Leuven, Gent. DelftSPRAAK (STEVIN)
TilburgInstitute: Tilburg Centre for Creative Computing, Induction of Linguistic Knowledge (ILK) Research groupCore Topics:Machine-learningMemory-Based learningKey peopleAntal van den Bosch
TilburgROLAQUAD (IMIX, 2004-2008)Robust Language Understanding in Question-Answering DialogueTilburg, TextkernelImplicit Linguistics (NWO VICI, 2005-2009)Machine Learning of Text-to-Text Processing,A Propos (IOP MMI, 2006-2009?)Proactive Personalization for Professional Document WritingTilburg, Nijmegen, industrial partners
TilburgMITCH (CATCH, 2007-2009?)Mining for Information in Texts from the Cultural HeritageNational Museum of Natural History, Tilburg D-COI (STEVIN), CLARIN, CLARIN-NL, SoNaR (STEVIN)
TilburgInstitute: Communication and Cognition Core Topics:Communication and CognitionLanguage GenerationMultimodalityKey peopleEmiel Krahmer
TilburgIMOGEN (IMIX, 2004-2008)Interactive Multimodal Output GenerationTilburg and TwenteBridging the gap between psycholinguistics and computational linguistics (NWO VICI, 2007-2011)Generation of referring expressionsDAESO (STEVIN, 2006-2009)Detecting and Exploiting Semantic OverlapTilburg, Antwerpen, Amsterdam (UvA), Textkernel
TilburgTUNA (EPSRC (UK), 2003-2007)Towards a Unified Algorithm for the Generation of Referring ExpressionsAberdeen, Open University (UK), TilburgFOAP (NWO Vidi, 2003-2007)Functions of Audio-Visual Prosody
TilburgInstitute: Communication Information Sciences Core Topics:Computational Semantics and PragmaticsDialogue TheoryMultimodal InteractionKey peopleHarry Bunt
TilburgParadime (IMIX, 2004-2008)Parallel Agent-based Dialogue Management Engine
TwenteInstitute: Human Media InteractionCore TopicsMultimodalitySpeech RecognitionKey peopleFranciska de Jong, Anton NijholtArjan van Hessen, Roelof Ordelman
TwenteAMI (EU IST, 2004-2006)Augmented Multi-party InteractionIDIAP (CH) and 11 other partners incl. TwenteM4 (EU IST, 2002-2005)MultiModal Meeting ManagerSheffield and 8 other partners incl. TwenteDRUID (Telematica??-2003)Multimedia Indexing & Retrieval on the basis of Image Processing & Language and Speech TechnologyTNO TPD, TNO TM, Twente, CWI
TwenteSAFIR (EU, 2003-2007)Speech Automatic Friendly Interface Research17 European partners incl. TwenteAngelica (NWO Meervoud (?), 2003-2007)A Natural-Language Generator for Embodied, Lifelike Conversational Agents
TwentePidgin (EZ, 2002-2004)Self-learning Cross Lingual InterfaceTwente, Irion, Carp, New Law FacilitiesChoral (CATCH, 2005-2009)Access to Oral HistoryTwente, Municipal Archives Rotterdam. Radio Rijnmond, Erasmus University RotterdamMultimediaN (Bsik, 2004-2008)MultimediaN/N5Multiple research groups and 20+ industrial partners
TwenteVIDIAM (IMIX, 2005-2008)Dialog Management and the Visual ChannelAMIDA (EU, 2006-2009)Augmented Multi-party Interaction with Distance AccessAMI consortium
TwenteMESH-EU (EU, 2006-2009)Multimedia Semantic Syndication for Enhanced News ServicesTwente and 10 other European partnersMediaCampaign (EU, 2006-2009)Discovering, inter-relating and navigating cross-media campaign knowledgeTwente and 7 other European partnersD-COI (STEVIN), NBest (STEVIN), SPRAAK (STEVIN), SoNaR (STEVIN), CLARIN, CLARIN-NL
UtrechtInstitute: UIL-OTSCore TopicsNetworksLanguage Technology for e-LearningLinguistic ResourcesLR and LT InfrastructureKey peopleJan OdijkSteven KrauwerPaola MonachesiGerrit Bloothooft
UtrechtELSNET (EU, 1991-Current)A Europe-based forum dedicated to human language technologies aiming to advance R&D in human language technologies in Europe LT4eL (EU IST, (2005-2008)Language Technology for eLearningUtrecht and 11 other European partnersLTfLL (EU IST, (2008-2011)Language Technology for LifeLong LearningOpen University and 8 other European partners incl. UtrechtIRME (STEVIN, 2005-2007)Identification and lexical Representation of Multiword ExpressionsUtrecht, Groningen, Van Dale
UtrechtCLARIN (EU ESFRI, 2008-2010)Common LAnguage Resource and technology InfrastructureUtrecht, Nijmegen, and 35+ other partnersFlaReNet (EU IST 2008-2011)Fostering Language Resources NetworkPisa, Utrecht, ILSP Athens, ELDA, LIMSI, Vienna, BarcelonaCLARIN-NL (submitted to OC&W, 2009-2014)Common LAnguage Resource and technology Infrastructure, Netherlands partUtrecht, Nijmegen, UvA, VU, Twente, Groningen, Tilburg and many others ISLE (EU IST), Autonomata (STEVIN), D-COI (STEVIN), SoNaR (STEVIN), Autonomata TOO
OverviewEarlier HistoryInstruments & ProgrammesPlayers and their ProjectsConcluding Remarks
Concluding RemarksSolid CommunityLargely complementary, some overlapBut always close collaborationOverall the situation in the Netherlands for LST research and development is pretty goodesp. in comparison to other countriesThe situation is getting more difficultlarge thematic programmes instead of specific technologiesThe options for fundamental research are limited and decreasing
Thank You
For Your Attention
Do NOT Go Beyond This Slide
Older or Marginally RelatedCHOICE (CATCH)semi-automatic semantic annotation and employing context information for ensuring continuous access to the cultural riches.B&G, VU, Telematica, MPI, ICN
Older or Marginally RelatedSTITCH (CATCH)Semantic Interoperability to Access Cultural HeritageKB, VU, MPI
Recommended