Upload
erin-daugherty
View
213
Download
0
Tags:
Embed Size (px)
Citation preview
REAC
TIO
NREACTION Workshop 2011.01.06Task 2 – Progress Report & PlansLisbon, PT and Austin, TX
Mário J. SilvaUniversity of Lisbon, Portugal
REAC
TIO
N
Information Discovery
Relationship extraction techniques to support information discovery in journalists’ activities
• Entity Ranking: finding the relevant entities for a given topic
• Entity Distillation: finding relevant resources for a given entity
• Attribute Selection: finding a list of key aspects to compare and differentiate a given set of entities
REAC
TIO
N
AnnotationSocrates reuniu hoje em Braga com
Mesquita Machado e Firmino Marques
<PERSON>Socrates</PERSON> reuniu hoje em <LOCAL>Braga</LOCAL> com
<PERSON>Mesquita Machado</PERSON> e <PERSON>Firmino Marques</PERSON>
NER
Mapping
<POWER id=1>Socrates</POWER> reuniu hoje em <GeoNetPT id=10>Braga</GeoNetPT> com
<POWER id=10>Mesquita Machado</POWER> e <PERSON>Firmino Marques</PERSON>
Annotated Corpus
REAC
TIO
N
Analysis
Voos da CIA em PortugalEntity
Ranking Annotated Corpus
1. Luís Amado2. José Socrates (Power:1)
Entity Distillation
• XVII Governo Constitucional (Power:20)
• WikiLeaks
Attribute Selection
http://pt.wikipedia.org/wiki/Luís_AmadoOntology Extension
REAC
TIO
N
First Approach
• NER– REMBRANDT (Reconhecimento de Entidades
Mencionadas Baseado em Relações e ANálise Detalhada do Texto)
• Mapping (Classification or Grounding)– String Matching Methods– Ontologies: POWER (task 1);
GeoNetPT; Yahoo! GeoPlanet
REAC
TIO
N
REAC
TIO
N
REAC
TIO
N
Socrates reuniu hoje em Braga com Mesquita Machado e
Firmino Marques.
REAC
TIO
N
Prototype: First Release
• April 2011• To be used in the Web Applications course
unit project• What’s missing?– Mapping– Interface– Evaluation• Precision and recall• Gold standard (Task 1)
REAC
TIO
N
Prototype: Second Release
• August 2011• Evaluate and Analyze First Prototype Results• Improved NER and Mapping– Using machine learning• Conditional Random Fields
– Information Content• FiGO
REAC
TIO
N
Prototype: Third Release
• December 2011• Containing Modules for:– Entity Ranking– Entity Distillation– Attribute Selection– Ontology Extension
• Participate in TREC (Entity Track)– http://ilps.science.uva.nl/trec-entity/
REAC
TIO
N
Prototype: Fourth Release
• August 2012• Containing Modules for:– Opinion mining• Using machine learning to• Detect and classify opinionated text• Targeting the identified entities and topics