12
REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

Embed Size (px)

Citation preview

Page 1: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

NREACTION Workshop 2011.01.06Task 2 – Progress Report & PlansLisbon, PT and Austin, TX

Mário J. SilvaUniversity of Lisbon, Portugal

Page 2: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

Information Discovery

Relationship extraction techniques to support information discovery in journalists’ activities

• Entity Ranking: finding the relevant entities for a given topic

• Entity Distillation: finding relevant resources for a given entity

• Attribute Selection: finding a list of key aspects to compare and differentiate a given set of entities

Page 3: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

AnnotationSocrates reuniu hoje em Braga com

Mesquita Machado e Firmino Marques

<PERSON>Socrates</PERSON> reuniu hoje em <LOCAL>Braga</LOCAL> com

<PERSON>Mesquita Machado</PERSON> e <PERSON>Firmino Marques</PERSON>

NER

Mapping

<POWER id=1>Socrates</POWER> reuniu hoje em <GeoNetPT id=10>Braga</GeoNetPT> com

<POWER id=10>Mesquita Machado</POWER> e <PERSON>Firmino Marques</PERSON>

Annotated Corpus

Page 4: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

Analysis

Voos da CIA em PortugalEntity

Ranking Annotated Corpus

1. Luís Amado2. José Socrates (Power:1)

Entity Distillation

• XVII Governo Constitucional (Power:20)

• WikiLeaks

Attribute Selection

http://pt.wikipedia.org/wiki/Luís_AmadoOntology Extension

Page 5: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

First Approach

• NER– REMBRANDT (Reconhecimento de Entidades

Mencionadas Baseado em Relações e ANálise Detalhada do Texto)

• Mapping (Classification or Grounding)– String Matching Methods– Ontologies: POWER (task 1);

GeoNetPT; Yahoo! GeoPlanet

Page 6: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

Page 7: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

Page 8: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

Socrates reuniu hoje em Braga com Mesquita Machado e

Firmino Marques.

Page 9: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

Prototype: First Release

• April 2011• To be used in the Web Applications course

unit project• What’s missing?– Mapping– Interface– Evaluation• Precision and recall• Gold standard (Task 1)

Page 10: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

Prototype: Second Release

• August 2011• Evaluate and Analyze First Prototype Results• Improved NER and Mapping– Using machine learning• Conditional Random Fields

– Information Content• FiGO

Page 11: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

Prototype: Third Release

• December 2011• Containing Modules for:– Entity Ranking– Entity Distillation– Attribute Selection– Ontology Extension

• Participate in TREC (Entity Track)– http://ilps.science.uva.nl/trec-entity/

Page 12: REACTION REACTION Workshop 2011.01.06 Task 2 – Progress Report & Plans Lisbon, PT and Austin, TX Mário J. Silva University of Lisbon, Portugal

REAC

TIO

N

Prototype: Fourth Release

• August 2012• Containing Modules for:– Opinion mining• Using machine learning to• Detect and classify opinionated text• Targeting the identified entities and topics