106
NLP & Semantic Computing Group N L P Semantic Perspectives for Contemporary Question Answering Systems Andre Freitas University of Passau JAIST, December 2016

Semantic Perspectives for Contemporary Question Answering Systems

Embed Size (px)

Citation preview

Page 1: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

N L P

Semantic Perspectives forContemporary Question Answering Systems

Andre FreitasUniversity of Passau

JAIST, December 2016

Page 2: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Outline Multiple Perspectives of Semantic

Representation Lightweight Semantic Representation Knowledge Graph Extraction from Text Querying Knowledge Graphs Text Entailment Reasoning Take-away Message

Page 3: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Multiple Perspectives of Semantic Representation

Page 4: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

QA & Semantics

• Question Answering is about managing semantic representation, extraction, selection trade-offs.

• And it is about integrating multiple components in a complex approach.

•Semantic best-effort, systems tolerant to noisy, inconsistent, vague, data.

4

Page 5: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

“Most semantic models have dealt with particular types of constructions, and have been carried out under very simplifying assumptions, in true lab conditions.”

“If these idealizations are removed it is not clear at all that modern semantics can give a full account of all but the simplest models/statements.”

Formal World Real World

Baroni et al. 2013

Semantics for a Complex World

5

Page 6: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Representation focal points•Types of knowledge to focus at the

representation: Facts vs Definitions Temporality Spatiality Modality Polarity Rhetorical structures Pragmatic categories …

6

Page 7: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Lightweight Semantic Representation

Page 8: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Objective•Provide a lightweight knowledge representation model which: Can represent textual discourse

information.• Maximizes the capture of textual information.

Is convenient to extract from text. Is convenient to access (query and

browse).8

Page 9: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Representation of Contextual Relations (Facts)General Electric Company, or GE , is an American multinational conglomerate corporation incorporated in Schenectady , New York 

9

Factoid shape

Page 10: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

RDF as the basic data modelGeneral Electric Company, or GE , is an American multinational conglomerate corporation incorporated in Schenectady , New York 

Instance

10

Instance

Instance

Class

Property

Page 11: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Representation Assumptions• Data integration:

Named entities (instances) Abstract classes (unary predicates)

• Rich taxonomical structures.

• Context representation as a first class citizen.

• Open vocabulary.

• Word instead of sense/concept.11

Page 12: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Representation Assumptions• Data integration:

Named entities (instances) Abstract classes (unary predicates)

• Rich taxonomical structures.

• Context representation as a first class citizen.

• Open vocabulary.

• Word instead of sense/concept.12

Page 13: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Data Integration pointsGeneral Electric Company, or GE , is an American multinational conglomerate corporation incorporated in Schenectady , New York 

Named entities are lower entropy integration points Pivot

points13

Page 14: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Data Integration pointsGeneral Electric Company, or GE , is an American multinational conglomerate corporation incorporated in Schenectady , New York 

Named entities are also low entropy entry points for answering queries Pivot

points14

Page 15: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Data Integration pointsGeneral Electric Company, or GE , is an American multinational conglomerate corporation incorporated in Schenectady , New York 

Also abstract classes … Pivot

points15

Page 16: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Data Integration pointsGeneral Electric Company, or GE , is an American multinational conglomerate corporation incorporated in Schenectady , New York 

They are also a very convenient way to represent. Pivot

points16

Page 17: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Representation Assumptions• Data integration:

Named entities (instances) Abstract classes (unary predicates)

• Rich taxonomical structures.

• Context representation as a first class citizen.

• Open vocabulary.

• Word instead of sense/concept.17

Page 18: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Taxonomy Extraction Are predicates with more complex compositional patterns

which describe sets.

Parsing complex nominals.

American multinational conglomerate corporation

 On the Semantic Representation and Extraction of Complex Category Descriptors, NLDB 2014

multinational conglomerate corporation

corporation

conglomerate corporation

is a

is a

is a

Pivot points

Page 19: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Representation Assumptions• Data integration:

Named entities (instances) Abstract classes (unary predicates)

• Rich taxonomical structures.

• Context representation as a first class citizen.

• Open vocabulary.

• Word instead of sense/concept.19

Page 20: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Context RepresentationGeneral Electric Company, or GE , is an American multinational conglomerate corporation incorporated in Schenectady , New York 

Reification as a first class representation element

20

Page 21: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Context RepresentationGeneral Electric Company, or GE , is an American multinational conglomerate corporation incorporated in Schenectady , New York 

Temporality, spatiality, modality, rhetorical relations …

21

Page 22: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Representation Assumptions• Data integration:

Named entities (instances) Abstract classes (unary predicates)

• Rich taxonomical structures.

• Context representation as a first class citizen.

• Open vocabulary.

• Word instead of sense/concept.22

Page 23: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Open VocabularyGeneral Electric Company, or GE , is an American multinational conglomerate corporation incorporated in Schenectady , New York 

Temporality, spatiality, modality, rhetorical relations …

23

Page 24: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Open Vocabulary

•Easier to extract but more difficult to consume.

•We pay the price at query time.

•How to operate over a large-scale semantically heterogeneous knowledge-graphs?

24

Page 25: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Representation Assumptions• Data integration:

Named entities (instances) Abstract classes (unary predicates)

• Rich taxonomical structures.

• Context representation as a first class citizen.

• Open vocabulary.

• Word instead of sense/concept.25

Page 26: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Words instead of Senses•Motivation: Disambiguation is a tough

problem.

•Sense granularity can be, at many situations, arbitrary (too context dependent).

•We treat a word as a superposition of senses, almost in a “quantum mechanical sense”. 26

Page 27: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Revisited RDF (for Representing Texts)• Triple as the basic fact unit.

• Data Model Types: Instance, Class, Property…

• RDFS: Taxonomic representation.

• Reification for contextual relations (subordinations).

• Blank nodes for n-ary relations.

• Labels over URIs.27

Page 28: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Lightweight Semantic Representation

Representing Texts as Contextualized Entity-Centric Linked Data Graphs, WebS 2013

Page 29: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional Semantics

Page 30: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional Semantic Models Semantic Model with low acquisition effort

(automatically built from text)

Simplification of the representation

Enables the construction of comprehensive commonsense/semantic KBs

What is the cost?

Some level of noise(semantic best-effort)

Limited semantic model30

Page 31: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional Semantics as Commonsense Knowledge

Commonsense is here

θ

car

dog

cat

bark

run

leashSemantic Approximation is

here

31

Page 32: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional-Relational Networks

Distributional Relational Networks, AAAI Symposium, 2013

Page 33: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

The vector space is segmented33

Dimensional reduction mechanism!

A Distributional Structured Semantic Space for Querying RDF Graph Data, IJSC 2012

Page 34: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Compositionality of Complex Nominals

Page 35: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Compositional-distributional model for Categories

35

Page 36: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Compositional-distributional model for paraphrases

A Compositional-Distributional Semantic Model for Searching Complex Entity Categories, *SEM (2016)

Page 37: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Knowledge Graph Extraction from Text

Page 38: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Graphene

Page 39: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Graph Extraction Pipeline

Text Transformati

on

N-ary Relation Extractio

nText Simplificatio

n GraphSerializatio

n

Taxonomy

Extraction

Storage

ML-based

Rule-based

Rule-based

ML-based

39

Argumentation Classification

Page 40: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Minimalistic Text Transformations

Text Transformati

on

N-ary Relation Extractio

nText Simplificatio

n GraphSerializatio

n

Taxonomy

Extraction

Storage

ML-based

Rule-based

Rule-based

ML-based

40

Argumentation Classification

Page 41: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Minimalistic Text Transformations

•Co-reference Resolution Pronominal co-references.

•Passive We have been approached by the investment

banker. The investment banker approached us.

•Genitive modifier Malaysia's crude palm oil output is estimated

to have risen. The crude palm oil output of Malasia is

estimated to have risen.41

Page 42: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Text Simplification

Text Transformati

on

N-ary Relation Extractio

nText Simplificatio

n GraphSerializatio

n

Taxonomy

Extraction

Storage

ML-based

Rule-based

Rule-based

ML-based

42

Argumentation Classification

Page 43: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Text Simplification for KG Extraction“A few hours later, Matthias Goerne, a German baritone, offered an all-German program at the Frick Collection.”

relations are spread across clauses relations are presented in non-canonical form

43

Page 44: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Text Simplification for KG Extraction

44

Page 45: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 46: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Approach• Linguistic analysis of sentences from the English Wikipedia to identify constructs which provide only secondary information:

• non-restrictive relative clauses• non-restrictive and restrictive appositive phrases• participial phrases offset by commas• adjective and adverb phrases delimited by punctuation• particular prepositional phrases• lead noun phrases• intra-sentential attributions• parentheticals• conjoined clauses with specific features• particular punctuation

•Rule-based simplification rules.

A Sentence Simplification System for Improving Open Relation Extraction COLING (2016)

Page 47: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

N-ary Relation Extraction

Text Transformati

on

N-ary Relation Extractio

nText Simplificatio

n GraphSerializatio

n

Taxonomy

Extraction

Storage

Rule-based

Rule-based

ML-based

47

 OpenIE, University of Washington

Argumentation Classification

Page 48: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Taxonomy Extraction

Text Transformati

on

N-ary Relation Extractio

nText Simplificatio

n GraphSerializatio

n

Taxonomy

Extraction

Storage

Rule-based

Rule-based

ML-based

48

 Representation and Extraction of Complex Category Descriptors, NLDB 2014

Argumentation Classification

Page 49: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

RST Classification

Text Transformati

on

N-ary Relation Extractio

nText Simplificatio

n GraphSerializatio

n

Taxonomy

Extraction

Storage

Argumentation Classification

Rule-based

Rule-based

ML-based

49

Page 50: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Rhetorical Structure Theory• cause:

e.g. “because scraping the bottom with a metal utensil will scratch the surface.”

• circumstance e.g. “After completing your operating system reinstallation,”

• concession e.g. “Although the hotel is situated adjacent to a beach,”

• condition e.g. “If you can break the $ 1000 dollar investment range,”

• contrast e.g. “but you can do better with 2.4ghz or 900mhz phones.”

• purpose e.g.“in order for the rear passengers to get in the vehicle.”

• …50

Page 51: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Argumentation Representation•Supports/Attack•Rhetorical Structure Theory (RSTs)

•Informal Logic•Argumentation Schemes (Walton et al.)•Pragmatic Categories

Retrieval

Reasoning

Page 52: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

QueryingKnowledge Graphs

Page 53: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

With DSMs our graph supports semantic approximations as a first-class operation

Page 54: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Approach Overview

Query Planner

Ƭ-Space(embedding

graphs)

Commonsense knowledge

RDF

Core semantic approximation &

composition operations

Query AnalysisQuery Query Features

Query Plan

54

Corpus

Page 55: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Core Principles Minimize the impact of Ambiguity, Vagueness,

Synonymy. Address the simplest matchings first (semantic

pivoting).

Semantic Relatedness as a primitive operation.

Distributional semantics models as commonsense knowledge representation.

Lightweight syntactic constraints.55

Page 56: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

•Now let’s answer the query

“Who is the daughter of Bill Clinton married to?”

Question

56

Page 57: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

•Step 1: Determine answer type

Who is the daughter of Bill Clinton married to? (PERSON)

•Using POS Tags

Query Pre-Processing (Question Analysis)

57

Page 58: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

•Step 2: Semantic role labeling.

Who is the daughter of Bill Clinton married to?

•NER, POS Tags Rules-based: POS Tag + IDF

Query Pre-Processing (Question Analysis)

58

(INSTANCE) (PROPERTY)

(PROPERTY)

(CLASS)

(PERSON)

Page 59: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Query Pre-Processing (Question Analysis)

Bill Clinton

daughter married to

(INSTANCE)

Person

ANSWER TYPE

QUESTION FOCUS59

• Step 3: Put in a structured pseudo-logical form Rules based.• Remove stop words.• Merge words into entities.• Reorder structure from core entity position.

Page 60: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

• Step 3: Put in a structured pseudo-logical form Rules based.• Remove stop words.• Merge words into entities.• Reorder structure from core entity position.

Query Pre-Processing (Question Analysis)

Bill Clinton

daughter married to

(INSTANCE)

Person

(PREDICATE) (PREDICATE) Query Features

60

Page 61: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

• Map query features into a query plan.• A query plan contains a sequence of:

Search operations. Selection operations.

Query Planning

(INSTANCE) (PREDICATE) (PREDICATE) Query Features

(1) INSTANCE SEARCH (Bill Clinton) (2) DISAMBIGUATE ENTITY TYPE (3) GENERATE ENTITY FACETS (4) p1 <- SEARCH RELATED PREDICATE (Bill Clintion, daughter) (5) e1 <- GET ASSOCIATED ENTITIES (Bill Clintion, p1) (6) p2 <- SEARCH RELATED PREDICATE (e1, married to) (7) e2 <- GET ASSOCIATED ENTITIES (e1, p2) (8) POST PROCESS (Bill Clintion, e1, p1, e2, p2)

Query Plan

61

Page 62: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Core Entity SearchBill Clinton

daughter married to Person

:Bill_Clinton

Query:

KB:

Entity search

62

Page 63: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional Semantic SearchBill Clinton

daughter married to Person

:Bill_Clinton

Query:

:Chelsea_Clinton

:child

:Baptists:religion

:Yale_Law_School:almaMater...

(PIVOT ENTITY)

(ASSOCIATED TRIPLES)

63

KB:

Page 64: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional Semantic SearchBill Clinton

daughter married to Person

:Bill_Clinton

Query:

:Chelsea_Clinton

:child

:Baptists:religion

:Yale_Law_School:almaMater...

sem_rel(daughter,child)=0.054

sem_rel(daughter,child)=0.004

sem_rel(daughter,alma mater)=0.001

Which properties are semantically related to ‘daughter’?

64

KB:

Page 65: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional Semantic SearchBill Clinton

daughter married to Person

:Bill_Clinton

Query:

:Chelsea_Clinton

:child

65

KB:

Page 66: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional Semantic SearchBill Clinton

daughter married to Person

:Bill_Clinton

Query:

:Chelsea_Clinton

:child

(PIVOT ENTITY)

66

KB:

Page 67: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional Semantic SearchBill Clinton

daughter married to Person

:Bill_Clinton

Query:

:Chelsea_Clinton

:child:Mark_Mezvinsky

:spouse

67

KB:

Note the lazy disambiguation

Page 68: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

68

Page 69: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Relevance

Medium-high query expressivity / coverage

69

Accurate semantic matching for a

semantic best-effort scenario

Ranking in the second position in

average

Page 70: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Comparative Analysis

Better recall and query coverage compared to baselines with equivalent precision.

More comprehensive semantic matching.

70

Page 71: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

StarGraph•Open source NoSQL platform for building

and interacting with large and sparse knowledge graphs.

•Semantic approximation as a built-in operation.

•Scalable query execution performance.

71

Page 72: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Heuristics for the selection of the semantic pivot is critical!•Discussed here just superficially:

Information-theoretical justification.

How hard is the Query? Measuring the Semantic Complexity of Schema-Agnostic Queries, IWCS (2015).

Schema-agnositc queries over large-schema databases: a distributional semantics approach, PhD Thesis (2015).

On the Semantic Mapping of Schema-agnostic Queries: A Preliminary Study, NLIWoD (2015).

72

Page 73: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Reasoning for Text Entailment

Page 74: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Beyond Word Vector Models

engineer degree

universityθ

Distributional semantics can give us a hint about the concepts’ semantic proximity...

...but it still can’t tell us what exactly the relationship between them is

engineer

degree???

Page 75: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Beyond Word Vector Models

engineer

degree???

engineer

degree???

Page 76: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Beyond Word Vector Models: Intensional Reasoning

Representing structured intensional-level knowledge.

Creation of an intensional-level reasoning model.

76

Page 77: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Commonsense Reasoning

Selective (focussed) reasoning Selecting the relevant facts in the context

of the inference

Reducing the search space.Scalability

Page 78: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional semantic relatedness as a Selectivity Heuristics

Distributional heuristics

78

target

source answer

Page 79: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional semantic relatedness as a Selectivity Heuristics

Distributional heuristics

79

target

source answer

Page 80: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Distributional semantic relatedness as a Selectivity Heuristics

Distributional heuristics

80

target

source answer

Page 81: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

John Smith

EngineerInstance-level

occupation

Does John Smith have a degree?

Page 82: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 83: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 84: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 85: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 86: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 87: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 88: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 89: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 90: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 91: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 92: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 93: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 94: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 95: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 96: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Page 97: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

A Distributional Semantics Approach for Selective Reasoning on Commonsense Graph Knowledge Bases, NLDB (2015).

Page 98: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Intensional-level representation• Dictionary definitions

refinement: a highly developed state of perception

state perfection

differentia quality

developed highly

quality modifier

differentia quality

refinement

is a

Page 99: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Annotating and Structuring WordNet Glosses• lake_poets:

• refinement:

• redundancy:

• slender_salamander:

• genus_Salix:

• unstaple:

Page 100: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Semantic Roles for Lexical DefinitionsAristotle’s classic theory of definition introduced important aspects such as the genus-differentia definition pattern and the essential/non-essential property differentiation. Taking those principles as starting point and analyzing a sample of randomly chosen WordNet’s definitions, we derived the following semantic roles for definitions:

origin location

[role] particle

accessory determiner

accessory quality

associated fact

purpose

quality modifier

event location

event time differentia event

differentia quality

supertype

definiendum

has particle

modified by

has component

char

acte

rized

by

has type

adds

non

-ess

entia

l inf

o to

Page 101: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Bringing it into the Real World

Page 102: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Semeval 2017

Page 103: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Take-away Message• Choosing the sweet-spot in terms of semantic

representation is critical for the construction of robust QA systems. Work at a word-based representation instead of

a sense representation. Text simplification/clausal disembedding

critical for relation extraction. Need for a standardized semantic

representation for relations extracted from texts.

Representation needs to be convenient for information extraction and data consumers.

Page 104: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Take-away Message•Distributional semantics:

Robust, language-agnostic semantic matching.

Semantic pivoting strategy. Selective reasoning over commonsense KBs.

•Need to move to more fine-grained models: Robust intensional-level reasoning.

Page 105: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

Take-away Message•Role of Machine Learning:

Fundamental to cope with the long tail of linguistic phenomena.

More explicit interplay with convenient semantic representation models.

Interpretability/explanation over accuracy.

Page 106: Semantic Perspectives for Contemporary Question Answering Systems

NLP & Semantic Computing Group

http://www.slideshare.net/andrenfreitas

These slides: