19
SALSA The Saarbrücken Lexical Semantics Annotation & Acquisition Project Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado, Manfred Pinkal

Semantic Annotation in SALSA

  • Upload
    albin

  • View
    42

  • Download
    3

Embed Size (px)

DESCRIPTION

SALSA The Sa arbrücken L exical S emantics A nnotation & Acquisition Project Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado, Manfred Pinkal. Semantic Annotation in SALSA. Manual semantic annotation of - PowerPoint PPT Presentation

Citation preview

Page 1: Semantic Annotation in SALSA

SALSA

The Saarbrücken Lexical Semantics Annotation & Acquisition Project

Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado,

Manfred Pinkal

Page 2: Semantic Annotation in SALSA

Semantic Annotation in SALSA

Manual semantic annotation of 0.8 million words of syntactically annotated German newspaper text (TIGER Corpus, Releases 1, 2)

with frames and frame elements (Berkeley FrameNet Database), staying as close as possible to the Berkeley FrameNet database

Page 3: Semantic Annotation in SALSA

SALSA: What's special?

SALSA is about GermanCross-lingual divergencies?

Page 4: Semantic Annotation in SALSA

Cross-lingual Divergencies

Convincing cross-lingual portability results (ED) in general

Adaptation necessary because of Inappropriate granularity of distinctions between FEs

Missing FEs

(Rare cases of) inappropriate granularity of frames

Page 5: Semantic Annotation in SALSA

SALSA: What's special?

SALSA is about GermanCross-lingual divergencies?

Corpus-driven lexicon development through exhaustive full-text annotation

Difficult cases

Incompleteness of Berkeley FrameNet

Page 6: Semantic Annotation in SALSA

Difficult cases

Metaphors

Support Verb Constructions

Idioms

Page 7: Semantic Annotation in SALSA

Difficult phenomena: Some Figures

 Sample of 246

Lemmas Sub-corpus nehmen

  Number % Number %

Standard readings 4638 85,7% 42 17,4%

Metaphor 369 6,8% 38 15,8%

Support 326 6,0% 132 54,8%

Idiom 79 1,5% 29 12,0%

Non-literal use 774 14,3% 199 82,6%

Total 5412 100,0% 241 100,0%

Page 8: Semantic Annotation in SALSA

SALSA corpus: Release I

Total size of 20.000 annotated instancesConsistent annotation through different verification steps All occurrences/readings of > 400 German verbal predicates (different frequency bands)Scheduled for Summer 2006

Page 9: Semantic Annotation in SALSA

The SALTO Annotation Tool

Page 10: Semantic Annotation in SALSA

SALSA II: Automatic Annotation and Acquisition

Fred, Rosy, and Shalmaneser: A tool-chain for shallow semantic analysis Talk by Katrin and Sebastian

Page 11: Semantic Annotation in SALSA
Page 12: Semantic Annotation in SALSA

SALSA II: Automation

Fred, Rosy, and Shalmaneser: A tool-chain for shallow semantic analysis Talk by Katrin and Sebastian

The Detour System (through WordNet to FrameNet) Talk by Anette and Al

Page 13: Semantic Annotation in SALSA

Fred &

Rosy

Fred,

Detour

& Rosy

Page 14: Semantic Annotation in SALSA

SALSAII: Automation

Fred, Rosy, and Shalmaneser: A tool-chain for shallow semantic analysis Talk by Katrin and SebastianThe Detour System (through WordNet to FrameNet) Talk by Anette and AlCross-lingual projection of frame-semantic information Katrin and Sebastian

Page 15: Semantic Annotation in SALSA

Cross-lingual Projection

Page 16: Semantic Annotation in SALSA

SALSAII: Automation & Application

Fred, Rosy, and Shalmaneser: A tool-chain for shallow semantic analysis Talk by Katrin and Sebastian

The Detour System (through WordNet to FrameNet) Talk by Anette and Al

Cross-lingual projection of frame-semantic information Katrin and Sebastian

Textual Entailment (RTE) Anette and Al

Page 17: Semantic Annotation in SALSA

t: In 1983, Aki Kaurismäki directed his first full-time feature.

h: Aki Kaurismäki directed a film.

Page 18: Semantic Annotation in SALSA

t: In 1983, Aki Kaurismäki directed his first full-time feature.

h: Aki Kaurismäki directed a film.

WordNetrelated

Grammaticallyrelated

Page 19: Semantic Annotation in SALSA

SALSA: Future Work

Bottstrapping frame information by data expansion techniquesLinking lexical semantic resourcs with upper-model ontologiesAnalysis of non-compositional phenomenaA worked-out semantic lexiconApplication to textual entailment