66
Natural Language Processing: Interpretation, Reasoning and Machine Learning Roberto Basili (Università di Roma, Tor Vergata) dblp: http:// dblp.uni-trier.de/pers/hd/b/Basili:Roberto.html Google scholar: https:// scholar.google.com/citations?user=U1A22fYAAAAJ&hl=it&oi=sra Politecnico di Milano, 4-5 Maggio 2017

Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Embed Size (px)

Citation preview

Page 1: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Natural Language Processing: Interpretation, Reasoning and Machine Learning

Roberto Basili

(Università di Roma, Tor Vergata)

dblp: http://dblp.uni-trier.de/pers/hd/b/Basili:Roberto.html

Google scholar: https://scholar.google.com/citations?user=U1A22fYAAAAJ&hl=it&oi=sra

Politecnico di Milano, 4-5 Maggio 2017

Page 2: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Overview

• Artificial Intelligence, Natural Language & Speech

• Information, Representation, (re)current challenges, success(and unsuccess)ful stories

• Natural Language Processing: linguistic background

• break

• Natural Language Processing: Tasks, Models and Methods

• The Role of Machine Learning Technologies

• Lexicon Acquisition : Automatic Development of Dictionaries, Semantic Lexicons and Ontologies

• Statistical Language Processing: Semantic Role Labeling

• break

• Natural Language Processing : Results & Applications

• Semantic Document Management

• Web-based Opinion Mining Systems, Market Watch & Brand Reputation Management

• Human Robotic Voice Interaction

Page 3: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

ML in NLP … a prologue

Predicate

Arg. 0

Arg. M

S

N

NP

D N

VP

V Paul

in

gives

a lecture

PP

IN N

Rome

Arg. 1

• The syntax-semantic mapping

• Different semantic theories

(e.g. PropBank vs. FrameNet)

Page 4: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Linking syntax to semantics

S

N

NP

Det N

VP

VPolice

for

arrested

the man

PP

IN N

shoplifting

Authority

Suspect Offense

Arrest

• Police arrested the man for shoplifting

Page 5: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

A tabular vision

• Word Predicate Semantic Role

• Police - Authority

• arrested Target Arrest

• the - SUSPECT

• man - SUSPECT

• for - OFFENSE

• Shoplifting - OFFENSE

Page 6: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Using Framenet/PropBank

Page 7: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

NLP: linguistic levels

Page 8: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Language as a system of rules

(*) J.L.Borges, “L’aleph”, 1949.

… comincia qui la mia disperazione di

scrittore. Ogni linguaggio è un alfabeto di

simboli il cui uso presuppone un passato

che gli interlocutori condividono; come

trasmettere agli altri l’infinito Aleph che la

mia timorosa memoria a stento abbraccia?

Page 9: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

• ... Meaning is acquired and recognized within the daily pactices relatedto its usage

• The meaning of a word is to be defined by the rules for its use, not by the feeling that attaches to the words L. Wittgenstein's Lectures, Cambridge 1932-1935.

• Recognizing one meaning consists in the ability of mapping a linguisticexpression to an experience (praxis) through mechanisms such asanalogy or approximating equivalence functions or through the minimization of the risks of being wrong/inappropriate/obscure

• The interpretation process can be obtained through the induction of one (or more) decision function(s) from experience

A differentperspective

Page 10: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

The inductive process

AnnotazioneFenomeni

Osservazione1

Osservazione2

Learning Machine

Osservazione3

Osservazionen

Esempi

Modello

Page 11: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

The inductive process

AnnotazioniCitazioni

Parole

Sintagmi

SVM Learning

Alberi

FattiNoti

Testi Ann.

ModelloAnalisi

Page 12: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

The inductive process

AnnotazioneFenomeni

KernelParole

KernelSintagmi

SVM Learning

KernelTree

KernelFattiNoti

Testi

Modello

Riconoscimento

Annotazioni

Page 13: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

The inductive process

AnnotazioneFenomeni

KernelParole

KernelSintagmi

SVM Learning

KernelTree

KernelFattiNoti

Testi

Modello

Riconoscimento

Citazioni

Page 14: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

AnnotazioneFenomeni

KernelParole

KernelSintagmi

SVM Learning

KernelTree

KernelFattiNoti

Testi

Modello

Riconoscimento

Citazioni

The inductive process

Page 15: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

The inductive process

AnnotazioneFenomeni

KernelParole

KernelSintagmi

SVM Learning

KernelTree

KernelFattiNoti

Testi

Modello

Riconoscimento

Citazioni

Page 16: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

The inductive process

AnnotazioneFenomeni

KernelParole

KernelSintagmi

SVM Learning

KernelTree

KernelFattiNoti

Testi

Modello

Riconoscimento

Citazioni

Page 17: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

IBM Watson: between Intelligence and Data

• IBM’s Watson

• http://www-03.ibm.com/innovation/us/watson/science-behind_watson.shtml

17

Page 18: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Jeopardy!

Page 19: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Watson

Page 20: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Watson

Page 21: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Watson

Page 22: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Semantic Inference in Watson QA

Page 23: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

… Intelligence in Watson

23

Page 24: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Watson: a DeepQA architecture

24

Page 25: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Ready for Jeopardy!

25

Page 26: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

… about the Watson intelligence

• Strongly positive aspects

• Adaptivity of the overall workflow

• Significant exploitation of available data

• Huge volumes of knowledge involved

• Criticalities

• The encyclopedic knowledge needed for Jeopardy is quite different in nature from the domain expertise required in many applications

• Wason is based on Factoid Questions strongly rooted on objective facts, that are explicit and non subjective

• Formalizing the input knowledge, as it is done a priori for Watson, is very difficult to achive in cost-effective manner: sometimes such knowledge is even absent in an enterprise

• For many natural languages the amount of information and resources is notavailable, so that a purely data-driven approach is not applicable

Page 27: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Machine Learning: the weapons

• Rule and Patern learning from Data

• Frequent Pattern Mining (Basket analysis)

• Probabilistic Extensions of Grammars

• Probabilstic CFGs

• Stochastic Grammars

• Discriminative learning in neural networks

• SVM: perceptrons

• Kernel functions in implicit semantic spaces

• Bayesian Models & Graphical Models

Page 28: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

• POS tagging (Curch, 1989)

• Probabilistic Context-Free Grammars(Pereira & Schabes, 1991)

• Data Oriented Parsing (Scha, 1990)

• Stochastic Grammars (Abney, 1993)

• Lessicalizzati Modelli (C. Manning, 1995)

WeightedGrammars, tra Sintassi & Statistica

Page 29: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Bayesian & Graphical Models

Page 30: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Hidden Markov Models (HMM)

• States = Categories/Classes

• Obesrvations

• Emissions

• Transitions

• Applications:

• Speech Recognition

• Sequence Labeling (e.g. POS tagging)

Page 31: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 32: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 33: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 34: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 35: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 36: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 37: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 38: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 39: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

The forward algorithm: estimation

Page 40: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 41: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Viterbi decoding

Page 42: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Viterbi decoding

Page 43: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

NLP and HMM decoding

• The HMM sequence labeling approach can be applied to a variety of linguistic subtasks:

• Tokenization

• MWE recognition

• POS tagging

• Named Entity Recognition

• Predicate Argument Structure Recognition

• SRL: Shallow Semantic Parsing

Page 44: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 45: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 46: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

POS tagging

Page 47: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning
Page 48: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

NLP & Structured Prediction

• HMM Decoding is an example of a large class of structured prediction task

• Key elements:

• Transform a NLP task into into a sequence of classication problem.

• Transform into a sequence labeling problem and use a variant of the Viterbi algorithm.

• Design a representation (e.g. features and metrics), a prediction algorithm, and a learning algorithm for your particular problem.

Page 49: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

• Characterizes neural networks since the early Cybernetics (Minsky&Papert, 1956)

• Strongly rooted in the notion of

• Inner product that in turns characterizes the norms thus the distances in the space

• Use a vector space in Rn as a input representation space

• (not so) Recent Achievments

• Statistical Learning Theory and Support Vector Machines (Vapnik, 1987)

• Deep Learning (Bengio et al., 2001)

Discriminative Learning

Page 50: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

In the hyperplane equation :

is the vector describing the targeted input example

is the gradient of the hyperplane

Classification Inference:

Linear Classification (1)

bwxbwxxf n ,, ,)(

51

x

w

( ) sign( ( ))h x f x

Page 51: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

• Support Vector Machines (SVMs) are based on the Statistical Learning Theory [Vapnik, 1995]

• I t does not require the storage of all data but only a subset of discriminating instances (i.e. the support vectors, SV)

• The classifier is a linear combination of the SVs (i.e. it depends ONLY on the inner product with their vectors)

Support Vector Machines

52

Var1

Var2

Margin

Support Vectors

)sgn()sgn()(..1

bxxybxwxh jj

j

j

Support

Vectors

Page 52: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

• In R2, 3 points can always be separated (or shuttered) by a linear classifier but

• 4 punti do not (as VC=3) [Vapnik and Chervonenkis(1971)]

• Solution 1 (neural networks): complexify the classification function

• It needs a more complex architecture usually based on ensembles (ad es. multistrato) of neurons

• Risk of over-fitting on the training data that is dangerous for performance on test ones

Separability in Higher Dimensional spaces

?

Page 53: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

• Solution 2: Project instances in an higher dimensional space, i.e. a new feature space by using a projection function

• Basic idea from SLT: Feature space more complex are preferable to more complex functions as the risk of overfitting is minimized

Separability and High Dimensional Spaces (2)

Page 54: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

• If a specific function called kernel is available

such that k(xi,xj)=(xi) (xj),

there is no need to project the individual

examples htrough the projection function

(Cristianini et al., 2002)

• A structured paradigm is applied such that

• It is trained against more complex structures

• It moves the machine learning focus onto the

representation ((xj) )

• k(.,.) expresses a similarity (metrics) that can

account for linguistic aspects and depend on

the lexicon, sintax and/or semantics

Representation & Kernels

)),(sgn(

))()(sgn())(sgn()(

..1

..1

bxxky

bxxybxwxh

jj

i

j

jj

j

j

Support Vectors

))()(sgn())(sgn()(..1

bxxybxwxh jj

j

j

Page 55: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

• Given a tree we can see it as the occurrence of a joint event ….

Examples of Kernels sensitive to syntactic structures

NP

D N

VP

V

delivers

a talk

NP

D N

VP

V

delivers

a

NP

D N

VP

V

delivers

NP

D N

VP

V NP

VP

V

Page 56: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

NP

D N

a talk

NP

D N

NP

D N

a

D N

a talk

NP

D N NP

D N

VP

V

delivers

a talk

V

delivers

NP

D N

VP

V

a talk

NP

D N

VP

V

NP

D N

VP

V

a

NP

D

VP

V

talk

N

a

NP

D N

VP

V

delivers

talk

NP

D N

VP

V

delivers NP

D N

VP

V

delivers

NP

VP

V NP

VP

V

delivers

talk

The tree can be see it as the joint occurrence of all the following

subtrees:

Kernels & Syntactic structures: a collective view of the joint event

NP

D N

VP

V

delivers

a talk

NP

D N

VP

V

delivers

a

NP

D N

VP

V

delivers

NP

D N

VP

V NP

VP

V

Page 57: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

• The function in a tree kernel define a vector representing ALL subtrees of the input tree T. It naturally (i.e. without feature engineering) emphasizes:

• Lexical information (magazine)

• Coarse grain grammatical information (POS tags such as VBZ)

• Syntactic information (frammenti complessi)

• The inner product in the space of all substrees is proportional to the number of subtrees shared between two sentences

• The learning algorithm (e.g. SVM) will select discriminating examples in (infinite dimensional) space

Tree Kernels: the implicit metric space

manreads

magazine

S

VPNP

VBZ NP

NN

NN

T

)1,...,0,...,1,...,1,...,0,...,1,...,0,...,1,...,0()( T

man

S

VPNP

NN magazine

NP

NN

reads

magazine

VP

VBZ NP

NN

magazineVBZ

Page 58: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Application of distributional lexicons for Semantic Role Labeling @ UTV

• An important application of tree-kernl based SVMs isSemantic Role labeling wrt Framenet

• In the UTV system, a cascade of classification steps isapplied:

• Predicate detection

• Boundary recognition

• Argument categorization (Local models)

• Reranking (Joint models)

• Input: a sentence and its parse trees

• Adopted kernel: the combination of lexical (e.g. bow) and tree kernel (that is still a kernel)

Page 59: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Linking syntax to semantics

S

N

NP

Det N

VP

VPolice

for

arrested

the man

PP

IN N

shoplifting

Authority

Suspect Offense

Arrest

• Police arrested the man for shoplifting

Page 60: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Using Framenet/PropBank

Page 61: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Semantic Role Labeling via SVM Learning

• Two steps:

• Boundary Detection

• One binary classifier applied to the parse tree nodes

• Argument Type Classification

• Multi-classification problem, where n binary classifiers are applied, one for each argument class (i.e. frame element)

• They are combined in a ONE-vs-ALL scheme, i.e. the argument type that is categorized by an SVM with the maximum score is selected

Page 62: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

SRL in Framenet: Results

Page 63: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Framenet SRL: best results

• Best system [Erk&Pado, 2006]

• 0.855 Precision, 0.669 Recall

• 0.751 F1

• Trento (+RTV) system (Coppola, PhD2009)

Page 64: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Argument Classification (Croce et al., 2013)

• UTV experimented with a FrameNet SRL classification (gold standard boundaries)

• We used the FrameNet version 1.3: 648 frames are considered

• Training set: 271,560 arguments (90%)

• Test set: 30,173 arguments (10%)

[Bootleggers]CREATOR, then copy [the film]ORIGINAL [onto hundreds of VHS tapes]GOAL

Kernel AccuracyGRCT 87,60%

GRCTLSA 88,61%

LCT 87,61%

LCTLSA 88,74%

GRCT+LCT 87,99%

GRCTLSA+LCTLSA 88,91%

Page 65: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

Semantics, Natural Language & Learning

• From Learning to Read to Knowledge Distillation as a (integrated pool of) Semantic interpretation Task(s)

• Information Extraction

• Entity Recognition and Classification

• Relation Extraction

• Semantic Role Labeling (Shallow Semantic Parsing)

• Estimation of Text Similarity

• Structured Text Similarity/Textual Entailment Recognition

• Sense disambiguation

• Semantic Search, Question Classification and Answer Ranking

• Knowledge Acquisition, e.g. ontology learning

• Social Network Analysis, Opinion Mining

Page 66: Natural Language Processing: Interpretation, Reasoning and ...chrome.ws.dei.polimi.it/images/0/0e/Intro_AI_and_NLP_2017_ML... · Natural Language Processing: Interpretation, Reasoning

References

• AI & Robotics. «Robot Futures», Ilah Reza Nourbakhsh, MIT Press, 2013

• NLP & ML:

• «Statistical Methods for Speech Recognition», F. Jelinek, MIT Press, 1998

• «Speech and Language Processing”, D. Jurafsky and J. H .Martin, Prentice-Hall, 2009.

• “Foundations of Statistical Natural Language Processing, Manning & Schtze, MIT Press 2001.

• URLs:

• SAG, Univ. Roma Tor Vergata: http://sag.art.uniroma2.it/

• Reveal s.r.l.: http://www.revealsrl.it/