17
1(17) Relational Database GATE Format Handlers HTML docs RTF docs XML docs Named entity Core- ference ANNIE POS tagger Named entity Event extraction Custom application 1 Document content Document metadata Document format data Linguistic data File storage Oracle/ PostgresQL A Language Analysis Example

Relational Database

  • Upload
    kawena

  • View
    46

  • Download
    1

Embed Size (px)

DESCRIPTION

…. GATE Format Handlers. ANNIE. …. Named entity. HTML docs. RTF docs. XML docs. Core- ference. Document content Document metadata Document format data Linguistic data. POS tagger. …. Named entity. …. Event extraction. …. Custom application 1. Relational Database. - PowerPoint PPT Presentation

Citation preview

Page 1: Relational  Database

1(17)

Relational Database

GA

TE

Form

at Handlers

HTMLdocs

RTFdocs

XMLdocs

Named entity

Core-ference

ANNIE

POS tagger

Named entity

Eventextraction…

Custom application 1

…Document content

Document metadata

Document format data

Linguistic data

File storage

Oracle/PostgresQL

A Language AnalysisExample

Page 2: Relational  Database

2(17)

Vis

ual

Res

ourc

es

Page 3: Relational  Database

3(17)

Displaying Coreference Information

Page 4: Relational  Database

4(17)

Displaying Syntactic Information

Page 5: Relational  Database

5(17)

Lexicon Support – WordNet example

Page 6: Relational  Database

6(17)

 Performance Evaluation

• At document level – annotation diff

• At corpus level – corpus benchmark tool – tracking system’s performance over time

Page 7: Relational  Database

7(17)

Regression Testing – Corpus Benchmark Tool

Page 8: Relational  Database

8(17)

Populating Ontologies with IE

Page 9: Relational  Database

9(17)

Protégé and Ontology Management

Page 10: Relational  Database

10(17)

Information Retrieval SupportBased on the Lucene IR engine

Page 11: Relational  Database

11(17)

                     

GATE Unicode Kit (GUK) Java provides no special support for text input (this may change)

• Support for defining additional Input Methods (IMs)

• currently 30 IMs for 17 languages

• Pluggable in other applications

Editing Multilingual Data

Page 12: Relational  Database

12(17)

Processing Multilingual DataAll the visualisation and editing tools for ML LRs use enhanced Java facilities:

Page 13: Relational  Database

13(17)

Dialogue Systems

• GATE is being used in the Amities project for automating call centres• Creation of dialogue processing server components to run in the Galaxy Communicator Software Infrastructure• Easy adaptation of the portable IE components to work on noisy ASR output • Robustness and speed of GATE components for real-time dialogue systems

Page 14: Relational  Database

14(17)

Semantic Indexing in the MUMIS project

• Multimedia Indexing and Searching Environment • Composite index of a multimedia programme

from multiple sources in different languages• ASR, video processing, information extraction

(Dutch, English, German), merging, user interface• University of Twente/CTIT, University of Sheffield,

University of Nijmegen, DFKI, MPI, ESTEAM AB, VDA

Page 15: Relational  Database

15(17)

The Whole Picture

EN

DE FormalText

FormalText

FormalTextFormal

TextFormal

TextFormal

TextFormalText

FormalText

FormalTextText

Sources

IE

IE

IE

NL

FormalText

FormalText

FormalTextFormalText

FormalText

FormalTextFormalText

FormalText

FormalText

Transcriptions

ASR

Formal

Text

Formal

Text

Formal

Text

Formal

Text

Formal

Text

Formal

Text

Formal

Text

Formal

Text

Formal

Text

Formal

Text

Formal

Text

SpeechSignals

Merging Final Annotations

Formal

Text

Formal

TextForma

lText

Anno-tations

MultimediaData Base

Video & AudioSignal

UserInterface

Query

Results

Ontology & Lexicon

Page 16: Relational  Database

16(17)

User Interface

Page 17: Relational  Database

17(17)

Play