51
Dependency Grammar Samar Husain

Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Dependency Grammar

Samar Husain

Page 2: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Introduction

• HistoryHistory– Indian grammatical tradition: Panini

Medieval theories: Arabic grammatical tradition– Medieval theories: Arabic grammatical tradition– Classical and Slavic grammatical tradition

M d T i M l k– Modern: Tesniere, Melcuk

• No single dependency formulation– Diverse

Page 3: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Basic definition

• Syntactic structure consists of lexical elementsSyntactic structure consists of lexical elements linked by binary asymmetrical relations called dependenciesdependencies.

Page 4: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Sample Dependency Tree (Unlabelled)(Unlabelled)

Page 5: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Sample Dependency Tree (labelled)

Page 6: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Dependency Grammar Formalisms

• TesniereTesniere• Word Grammar

i l G i i i ( G )• Functional Generative Description (FGD)• Computational Paninian Grammar (CPG)• Extensible Dependency Grammar (XDG)

Page 7: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Core Characteristics

• Binary asymmetrical relationsy y

• Identifying dependency heads; H→D (in construction C)– H determines the syntactic category of C and often

replace Creplace C– H determines the semantic category of C; D gives

semantic specification.H i bli t d D ti l– H is obligatory and D optional

– H selects D and determines whether D is obligatory or optionalp

Page 8: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

• The form of D depends on H (Agreement etc.)p ( g )• Linear position of D is specified with reference

to H

• Prototypical case with max properties• Others have distinguished between dependent

typeM h l i l– Morphological

– Semantic– Syntactic– Syntactic

Page 9: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

• Head – Compliment vs Head – ModifierHead Compliment vs. Head Modifier– Endo-centric construction

• head can replace the whole without disrupting the• head can replace the whole without disrupting the syntactic structure

– Exo-centric construction• the head cannot readily replace the whole

– Valency: Requirement on its syntactic dependent y q y pthat reflects its interpretation as a semantic predicate

Page 10: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Flavors

• LayersLayers– Mono vs. Multi

Deep vs Surface– Deep vs. Surface• Representation

– Tree, Graph, etc.• Nodes• Dependency structure and word order

– Projectivityj y

Page 11: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Layers in MTT and FGD

Page 12: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Layers in PDT

Page 13: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Some non-projective structures in HindiHindi

Page 14: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Computational Paninian Grammar

• CPG is based on the Panini’s grammarCPG is based on the Panini s grammar– It is a dependency grammar

• CPG is primarily concerned with:– how the information is coded and– how it can be extracted

Page 15: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Core concepts

• aakaankshaaaakaankshaa– The verb’s core requirements in order for it to be

meaningful is the aakaankshaa of the verb. It can be roughly translated as the ‘argument structure’ of the verb.

• karakak h i i i h i ifi d b h– Karakas are the participants in the action specified by the

verb. These relations as mentioned earlier are syntactic-semantic in nature, in that they are syntactically grounded , y y y gbut also convey some meaning.

Page 16: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

• AbhihitaAbhihita– The notion of abhihita signifies the karaka expressed by a

verbal TAM. This can be roughly translated as ‘agreement’. So for example, the main verb can point to the karta karaka by agreeing with it.

• Vibhakti (Sup Ting)• Vibhakti (Sup, Ting)– Vibhakti is an abstract concept used for signifying the case

markings on the nouns and the tense, aspect and modality g , p yof verbs. The former is called sup and the later is called ting.

Page 17: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

• YogyataaYogyataa– Yogyataa can be roughly translated as semantic

selectional restriction of the verbselectional restriction of the verb.

Vi k h• Vivaksha– Signifies the intention of the speaker that lead to

diff i i f ti di d i th tdifference in information encoding during the act of speaking.

Page 18: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Salient Properties of CPG

• Layersy– Morphological– Local dependencies– karaka (participants in an actions)

• Representation of dependency– Labeled tree– Node

• Le ical/phrasal• Lexical/phrasal

– Labels• Syntactico-semantic (+others)

• Non-projectivity– Allowed

Page 19: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Layers in CPG

Page 20: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Morphological Layer (SSF representation)representation)

rAma ne mohana ko nIlI kiwAba xI‘Ram’ ERG ‘Mohan’ ACC ‘blue’ ‘book’ ‘gave’‘Ram gave Mohan a blue book’

Page 21: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially
Page 22: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially
Page 23: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially
Page 24: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Local dependencies (POS and chunks)

Page 25: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Dependency Tree (SSF representation)

Page 26: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially
Page 27: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially
Page 28: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Expanded Dependency Tree

rAma ne mohana ko nIlI kiwAba xI‘Ram’ ERG ‘Mohan’ ACC ‘blue’ ‘book’ ‘gave’Ram ERG Mohan ACC blue book gave‘Ram gave Mohan a blue book’

Page 29: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Selected dependency labels [TOTAL = 43 ]

k1 karta (similar to agent/doer)k2 karma (similar to patient/theme)k3 instrumentk4 beneficiaryk4 beneficiaryk5 sourcek7t temporal locationk7p spatial locationk1s noun complement k2p destinationpk1, mk1, jk1 Causer, mediator-causer, causee rh causert purposert purposersp durationadv adverb (manner)

f f ( l di )pof Part-of (complex predicates)ccof conjunctionfragof fragment-of

Page 30: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

k1 karta (similar to agent/doer)k2 karma (similar to patient/theme)

The first is the class of all k-

l b l i l d

k3 instrumentk4 beneficiaryk5 source

labels; includes k1-k7*

k7t temporal locationk7p spatial locationk1s noun complementk1s noun complement k2p destinationpk1, mk1, jk1

Causer, mediator-causer, causee jk1rh causert purposersp durationnmod Noun modificationpof Part-of (complex predicates)p ( p p )ccof conjunctionfragof fragment-of

Page 31: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

k1 karta (similar to agent/doer)k2 karma (similar to patient/theme)k3 instrumentk4 beneficiaryk5 sourcek7t temporal locationk7p spatial locationk1s noun complementk1s noun complement k2p destinationpk1, mk1, jk1

Causer, mediator-causer, causee

Second is the class of r-labels,

jk1rh causert purpose

including rh, rt, rsp, rd etc.

rsp durationnmod Nmod (noun modificationpof Part-of (complex predicates)p ( p p )ccof conjunctionfragof fragment-of

Page 32: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

k1 karta (similar to agent/doer)k2 karma (similar to patient/theme)

•Third, the class of modifier labels: i l d d

k3 instrumentk4 beneficiaryk5 source

includes nmod, vmod, jjmod

k7t temporal locationk7p spatial locationk1s noun complementk1s noun complement k2p destinationpk1, mk1, jk1

Causer, mediator-causer, causee jk1rh causert purposersp durationnmod Noun modificationpof Part-of (complex predicates)p ( p p )ccof conjunctionfragof fragment-of

Page 33: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

k1 karta (similar to agent/doer)k2 karma (similar to patient/theme)

•Finally, the class of non-

k3 instrumentk4 beneficiaryk5 source of non

dependencies consisting of pof,

k7t temporal locationk7p spatial locationk1s noun complement g p

fragof, ccof k1s noun complement k2p destinationpk1, mk1, jk1

Causer, mediator-causer, causee jk1rh causert purposersp durationnmod noun modificationpof Part-of (complex predicates)p ( p p )ccof conjunctionfragof fragment-of

Page 34: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

•We get a 4-way k1 karta (similar to agent/doer)k2 karma (similar to patient/theme)

classification:-k-labelsr labels

k3 instrumentk4 beneficiaryk5 source -r-labels

-modifier labels-other (non

k7t temporal locationk7p spatial locationk1s noun complement other (non

dependencies)k1s noun complement k2p destinationpk1, mk1, jk1

Causer, mediator-causer, causee jk1rh causert purposersp durationnmod noun modificationpof Part-of (complex predicates)p ( p p )ccof conjunctionfragof fragment-of

Page 35: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Intransitive verb

rAma bETA hE‘Ram’ ‘sit’ PRES‘Ram is sitting’

Page 36: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Transitive verb

rAma Kira KAwA hE‘Ram’ ‘sweet’ ‘eat’ PRES‘Ram eats sweets’

Page 37: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Transitive verb

rAma rojZa eka seba KAwA hEj‘Ram’ ‘daily’ ‘an’ ‘apple’ ‘eat’ PRES‘Ram eats an apple everyday’

Page 38: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Complex Predicate

ve rAjanIwi para carCA kara rahe We‘They’ ‘politics’ ON ‘discussion’ ‘do’ CONT PRES‘They were discussing politics’

Page 39: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Clausal Complementvaha mAnawA hE ki rAma buxXimAna hE‘He’ ‘believe’ PRES ‘that’ ‘Ram’ ‘intelligent’ ‘is’‘He believes that Ram is intelligent’He believes that Ram is intelligent

Page 40: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

ParticiplerAma ne KAnA Kakara pAnI piyA‘Ram’ ERG ‘food’ ‘having eaten’ ‘water’ ‘drank’‘Having eaten the food Ram drank water’Having eaten the food Ram drank water

Page 41: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Coordination

rAma seba KAwA hE Ora siwA dUXa pIwI hEp‘Ram’ ‘apple’ ‘eat’ PRES ‘and’ ‘sita’ ‘milk’ ‘drink’ PRES ‘Ram eats apples and Sita drinks milk’

Page 42: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Coordination

rAma Ora SyAma skUla jAwe hE‘Ram’ ‘and’ ‘Shyam’ ‘school’ ‘go’ PRES‘Ram and SyAma go to school’

Page 43: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Relative Clausejo ladZakA vahAz bETA hE vaha mere skUla meM paDZawa hEjo ladZakA vahAz bETA hE vaha mere skUla meM paDZawa hEREL boy ‘there’ ‘sit’ PRES ‘he’ ‘my’ ‘school’ IN ‘study’ PRES‘The boy who is sitting there studies in my school’

Page 44: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Gapping

rAma xilli gayA Ora SyAma AgarA‘Ram’ ‘Delhi’ ‘went’ ‘and’ ‘Shyam’ ‘Agra’Ram Delhi went and Shyam Agra‘Ram went to Delhi and Shyam Agra.’

Page 45: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Phrase Structure Grammar

Page 46: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Phrase Structure Grammar and Dependency GrammarDependency Grammar

• PSGPSG– Constituency

• Similar syntactic enviornment– Eg. NP followed by a verb

• Can be moved around together, but cannot be broken• Substitution• Coordination

– Functional categories– Knowledge of language relies on the structural relationship

in the sentence rather than on the sequence of the words– Notion of headNotion of head

Page 47: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

• Indirect access to relationsIndirect access to relations• Indirect access to heads

Page 48: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

• DGDG– Direct access of relations between elements– Transparent argument structure of the verbTransparent argument structure of the verb– Intuitive analysis

• Extremely popular in the CL/NLP community since few yearsy– Especially for free word order morphologically

rich languages

Page 49: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Issues

• Do DG and PSG complement each other?Do DG and PSG complement each other?• Are there some generalizations that are not

captured by individual grammaticalcaptured by individual grammatical frameworks?C bi h l i f DS d PSG• Can we combine the analysis of DS and PSG to get to a better understanding of structures

d h i l ?and phenomenons in a language?• Are some phenomenon better understood in

specific grammar ?

Page 50: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

Generative capacity: PSG and DG

Page 51: Dependency Grammar - IIIT Hyderabadltrc.iiit.ac.in/iasnlp2011/content/data/SS/DG-samar-IASNLP2011.pdf · • Extremely popular in the CL/NLP community since few years – Especially

References

- A. Bharati, D. M. Sharma, S. Husain, L. Bai, R. Begam and R. Sangal. 2009 A C T B k f I di L G id li f2009. AnnCorra: TreeBanks for Indian Languages, Guidelines for Annotating Hindi TreeBank.

- A. Bharati, V. Chaitanya and R. Sangal. 1995. Natural Language Processing: A Paninian Perspective. Prentice-Hall of India, New Delhi.

- S. Kubler, R. McDonald and J. Nivre. 2009. Dependency parsing. Morgan and Claypool.

- Marco Kuhlmann. 2007. Dependency Structures and Lexicalized Grammars PhD thesis Saarland University Saarbrücken GermanyGrammars. PhD thesis, Saarland University, Saarbrücken, Germany.

- J. Nivre, 2005. Dependency Grammar and Dependency Parsing. MSI report05133. Växjö University: School of Mathematics and Systems Engineering.

- Z. Žabokrtský. 2005. Resemblances between Meaning-Text Theory and F i l G i D i i I P d f 2 d I lFunctional Generative Description. In Proceedings of 2nd International Conference of Meaning-Text Theory, Moscow, 2005.

- http://ufal.mff.cuni.cz/pdt2.0/doc/pdt-guide/en/html/ch02.html