58
Introduction Metagrammar engineering as methodology The Grammar Matrix & CLIMB CLIMB Projects CLIMB Tools Conclusion References CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35

CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

  • Upload
    others

  • View
    24

  • Download
    0

Embed Size (px)

Citation preview

Page 1: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

CLIMB: Code generation for grammardevelopment

Antske Fokkens

Saarland University

Antske Fokkens 1 / 35

Page 2: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Outline

1 Introduction

2 Metagrammar engineering as methodology

3 The Grammar Matrix & CLIMB

4 CLIMB Projects

5 CLIMB Tools

6 Conclusion

Antske Fokkens 2 / 35

Page 3: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Outline

1 Introduction

2 Metagrammar engineering as methodology

3 The Grammar Matrix & CLIMB

4 CLIMB Projects

5 CLIMB Tools

6 Conclusion

Antske Fokkens 3 / 35

Page 4: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Multiple hypotheses

Theories (whether scientific or philosophical) aim toexplain phenomenaBut what if there is more than one hypothesis that canexplain the phenomenon?Ideally, all possible hypotheses are maintained untilevidence is found that exclude themThe scenario of multiple hypotheses is common in syntaxand grammar engineering; maintaining them until evidenceis found to make a decision is not

Antske Fokkens 4 / 35

Page 5: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

How to know what analysis to pick?

Ability to account for dataInteraction with other analysesTheoretical soundness: how well does the analysis fit togeneral theoretical assumptionsElegance/simplicityEfficiency

Antske Fokkens 5 / 35

Page 6: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar development

A1

A2

A3

B1

B2

Antske Fokkens 6 / 35

Page 7: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar development

A2

B1

C1

C4

C3

Antske Fokkens 6 / 35

Page 8: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar development

A2

B1

C3

D1

D3

F1

F2

D2 E1

F4

Antske Fokkens 6 / 35

Page 9: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar development

A2

B1

C3

D2 E1

F4

Practice: Select best analysis according to criteria given currentknowledge

Antske Fokkens 6 / 35

Page 10: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Interaction

Often, there is no conclusive evidence indicating what “the”correct analysis isPhenomena interact: what if an analysis chosen in thepast excludes the optimal solution for a new phenomenonto be added?Analyses can be revised based on new evidence, but thisbecomes less and less likely as time passes (chosenanalysis deeply embedded, alternatives forgotten)

⇒ The order in which phenomena are treated may have amajor impact on the resulting grammar

Antske Fokkens 7 / 35

Page 11: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Outline

1 Introduction

2 Metagrammar engineering as methodology

3 The Grammar Matrix & CLIMB

4 CLIMB Projects

5 CLIMB Tools

6 Conclusion

Antske Fokkens 8 / 35

Page 12: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Metagrammars for systematic exploration (Fokkens,2011)

Can we keep track of choices made in the past and preservealternative solutions?

Instead of directly implementing a grammar, analyses canbe stored in a metagrammarThe metagrammar can generate grammars with alternativeanalyses that cover the same phenomenaDifferent alternatives from the past can be tried out, whennew phenomena are added to the grammar

Antske Fokkens 9 / 35

Page 13: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Possibilities

A1

A2

A3

B1

B2

Antske Fokkens 10 / 35

Page 14: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Possibilities

A1

A2

A3

B1

B2

C1

C2

C3

C4

Antske Fokkens 10 / 35

Page 15: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Possibilities

A1

A2

A3

B1

B2

C1

C2

C3

C4

D1

D2

F1

F2

F3

F4

D3

E1

F5

Antske Fokkens 10 / 35

Page 16: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Possibilities

A1

A2

A3

B1

B2

C1

C2

C3

C4

D1

D2

F1

F2

F3

F4

D3

E1

F5

Antske Fokkens 10 / 35

Page 17: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Possibilities

A1

A2

A3

B1

B2

C1

C2

C3

C4

D1

D2

F1

F2

F3

F4

D3

E1

F5

Antske Fokkens 10 / 35

Page 18: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Possibilities

A1

A2

A3

B1

B2

C1

C2

C3

C4

D1

D2

F1

F2

F3

F4

D3

E1

F5

Antske Fokkens 10 / 35

Page 19: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Possibilities

A1

A2

A3

B1

B2

C1

C2

C3

C4

D1

D2

F1

F2

F3

F4

D3

E1

F5

Antske Fokkens 10 / 35

Page 20: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Possibilities

A1

A2

A3

B1

B2

C1

C3

C4

D1

D2

F1

F2

F3

F4

D3

E1

F5

Antske Fokkens 10 / 35

Page 21: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Outline

1 Introduction

2 Metagrammar engineering as methodology

3 The Grammar Matrix & CLIMB

4 CLIMB Projects

5 CLIMB Tools

6 Conclusion

Antske Fokkens 11 / 35

Page 22: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

The LinGO Grammar Matrix (Bender et al., 2002,2010)

Questionnaire(accepts user

input)

Questionnairedefinition

Choices file

Validation

Customization

Customized grammar

Core grammar

HTMLgeneration

Storedanalyses

Elicitation of typologicalinformation

Grammar creation

Figure: Schematic system overviewAntske Fokkens 12 / 35

Page 23: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar Matrix Workflow

choices

matrix.tdl

Antske Fokkens 13 / 35

Page 24: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar Matrix Workflow

choices

matrix.tdl

Antske Fokkens 13 / 35

Page 25: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar Matrix Workflow

choices

matrix.tdl

iverbs, tverbs

case

word order

Antske Fokkens 13 / 35

Page 26: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar Matrix Workflow

choices

matrix.tdl

iverbs, tverbs

case

word order

Antske Fokkens 13 / 35

Page 27: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar Matrix Workflow

matrix.tdlword order

case

iverbs, tverbs

Antske Fokkens 13 / 35

Page 28: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar Matrix Workflow

matrix.tdlword order

case

iverbs, tverbs

Antske Fokkens 13 / 35

Page 29: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar Matrix Workflow

matrix.tdlword order

case

iverbs, tverbs

Antske Fokkens 13 / 35

Page 30: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Grammar Matrix Workflow

matrix.tdlword order

case

iverbs, tverbssubordinates

Antske Fokkens 13 / 35

Page 31: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Metagrammar engineering

choices

matrix.tdl

case

verbssubordinates

word order

Antske Fokkens 14 / 35

Page 32: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Metagrammar engineering

choices

matrix.tdl

case

verbssubordinates

word order

Antske Fokkens 14 / 35

Page 33: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Metagrammar engineering

choices

matrix.tdl

verbs

case

word order

word order

subordinates

Antske Fokkens 14 / 35

Page 34: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Metagrammar engineering

choices

matrix.tdl

verbs

case

word order

word order

subordinates

Antske Fokkens 14 / 35

Page 35: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Metagrammar engineering

choices

matrix.tdl

verbs

case

word order

word order

subordinates

case

verbssubordinates

word order

Antske Fokkens 14 / 35

Page 36: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Metagrammar engineering

choices

matrix.tdl

verbs

case

word order

word order

subordinates

case

verbssubordinates

word order

Antske Fokkens 14 / 35

Page 37: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Metagrammar engineering

choices

matrix.tdl

verbs

case

word order

word order

subordinates

case

verbssubordinates

word order

Antske Fokkens 14 / 35

Page 38: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

The Grammar Matrix and CLIMB

The Grammar Matrix and CLIMB (Comparative Libraries ofImplementations with a Matrix Basis) (Fokkens et al., 2012)complement each other

Grammar Matrix CLIMB- Support to start new grammar - Supports long term

development- Accessible to new users - For expert developers- Focus on wide typological - Focus on high coverage

coverage in one or few language(s)

Antske Fokkens 15 / 35

Page 39: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Advantages of CLIMB

Systematic comparison of implementationsFacilitates maintainance of different versions of thegrammar (e.g. for different applications)Phenomena-based organization of the grammarCan provide direct feedback to Grammar Matrix

Antske Fokkens 16 / 35

Page 40: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Outline

1 Introduction

2 Metagrammar engineering as methodology

3 The Grammar Matrix & CLIMB

4 CLIMB Projects

5 CLIMB Tools

6 Conclusion

Antske Fokkens 17 / 35

Page 41: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Germanic CLIMB, Goals

Test impact of the methodology on long(ish) term grammardevelopmentCompare analyses for word order and auxiliaries inGermanKeeping the complete Grammar Matrix set-up: whatchanges are required in the customization system(extensions versus revisions)

Antske Fokkens 18 / 35

Page 42: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Germanic CLIMB, Current state

Coverage of Cheetah’s development set obtained in 6months (compared to 1 year reported in Cramer (2011))Efficiency (preliminary result) between GG and CheetahRevisions of several analyses in matrix.tdl:

adjectivesmodificationwh-questionslong distance dependenciesrelative clausesadpositions

Antske Fokkens 19 / 35

Page 43: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Slavic CLIMB, Motivation

The initial set-up of SlaviCore (Avgustinova and Zhang,2009):

matrix.tdlslavic.tdl (static file with types useful for Slavic languages)my_slavic_language.tdl

Adapting SlaviCore to CLIMB:Increase flexibility for sharing analyses across SlaviclanguagesThrough automatic generation of parts of the hierarchy,Avgustinova (2007)’s theory on Slavic grammar analysescan be tested based on observation in individual languages

Antske Fokkens 20 / 35

Page 44: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Slavic CLIMB, current state

The Russian Research Grammar (Avgustinova and Zhang,2010, RRG) has been ported into CLIMBObservation: much could be done by changing ‘choices’:⇒ customization system could be exploited more!Many implementations are currently purely technicalLinguistically motivated organisation has started takingimplementations from BURGER (Osenova, 2010) intoaccount‘Clean’ implementation for case

Antske Fokkens 21 / 35

Page 45: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Chinese CLIMB (future project)

Combine analyses from independently created grammarsShared development at different sites (Uds/DFKI and NTU)

Antske Fokkens 22 / 35

Page 46: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Outline

1 Introduction

2 Metagrammar engineering as methodology

3 The Grammar Matrix & CLIMB

4 CLIMB Projects

5 CLIMB Tools

6 Conclusion

Antske Fokkens 23 / 35

Page 47: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Feature path abbreviation

Slavic CLIMB contains a basic algorithm that allows usersto abbreviate paths (writing out the full path in TDL)The algorithm goes backwards through the path until itfinds the preceeding feature, or hits signIt complains when the path is ambiguousThe first element of a list needs to be stated explicitly(future work: allow users to define defaults)

Antske Fokkens 24 / 35

Page 48: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Feature path abbreviations, basic example

The Head Feature Principle:[HEAD 1

HEAD-DTR.HEAD 1

]Not possible (SUBJ has list as value):

SUBJ.HEAD nounMust be (for now):

SUBJ.FIRST.LOCAL.HEAD noun

Antske Fokkens 25 / 35

Page 49: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Code generation for larger grammars

Can be helpful for large revisions in the grammarThe basic idea:

A library specifies unique types for old and new analysis⇒ which types should be removed or insertedThis library also specifies types that changeAlgorithm goes through type hierarchy removing, insertingand changing types based on the chosen analysis

All changes and revisions related to the new analysis arein one place: easier to reviseBasic algorithm to flip back and forth between analysesremains: easier to compare analyses at different stages

Antske Fokkens 26 / 35

Page 50: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Spring cleaning (Fokkens et al., 2011)

The spring cleaning algorithm goes through a typehierarchy and removes types that do not have any impacton the grammar, i.e. types that

are not instantiated typesdo not define a feature or value of an instantiated typedo not define a lower bound between two relevant types

The original structure of the grammar is maintained

Antske Fokkens 27 / 35

Page 51: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Spring cleaning (future work)

Intermediate stages between spring cleaning and grammarcompression

Yi Zhang implemented an algorithm that identifies whichtypes are computationally relevant for competenceBecause features are moved up to supertypes, this canhave a negative impact on performance (Petter Haugereid)

Grammar comparision:The spring cleaning algorithm takes a close look at thestructure of the type hierarchyIt can be extended to compare two type hierarchies that arehighly similarGoal: identify changes made to the grammar after revision⇒ create an interface supporting metagrammardevelopment

Antske Fokkens 28 / 35

Page 52: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Outline

1 Introduction

2 Metagrammar engineering as methodology

3 The Grammar Matrix & CLIMB

4 CLIMB Projects

5 CLIMB Tools

6 Conclusion

Antske Fokkens 29 / 35

Page 53: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Conclusion

CLIMB allows engineers to:compare implementations systematicallyorganize implementations according to phenomena (nottype)easily maintain different versions of the grammar

There are two CLIMB projects, plus one more planned:Germanic CLIMBSlavic CLIMB (initial stages)Chinese CLIMB (future work)

CLIMB comes with some basic algorithms:Path abbreviationCode generation for larger grammarsSpring cleaning, plans for grammar comparison

Antske Fokkens 30 / 35

Page 54: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

This Summit: CLIMB Tutorial

TutorialBasic introduction to CLIMBCLIMB implementation & organization

Discussion:Lowering the hurdle to start using CLIMBArchitecture, basic set-up

Antske Fokkens 31 / 35

Page 55: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Acknowledgments

Thanks to:Tania Avgustinova, Emily M. Bender, Francis Bond, BartCramer, Rebecca Dridan, Dan Flickinger, MichaelGoodman, Joshua Growgey, Laurie Poulson, Ron Kaplan,Sanghoun Song, Hans Uszkoreit, David Wax, Yi Zhangyou for your attention ¨

Antske Fokkens 32 / 35

Page 56: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Bibliography I

Avgustinova, T. (2007). Language Family Oriented Perspective in MultilingualGrammar Design. Linguistik International: Band 17. Peter Lang -Eurpopäischer Verlag der Wissenschaft, Frankfurt am Main, Germany.

Avgustinova, T. and Zhang, Y. (2009). Parallel grammar engineering forSlavic languages. In Proceedings of GEAF, Singapore.

Avgustinova, T. and Zhang, Y. (2010). Conversion of a Russian dependencytreebank into HPSG derivations. In Proceedings of TLT’9.

Bender, E. M., Drellishak, S., Fokkens, A., Poulson, L., and Saleem, S.(2010). Grammar customization. Research on Language & Computation,8(1):23–72.

Antske Fokkens 33 / 35

Page 57: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Bibliography II

Bender, E. M., Flickinger, D., and Oepen, S. (2002). The grammar matrix: Anopen-source starter-kit for the rapid development of cross-linguisticallyconsistent broad-coverage precision grammars. In Carroll, J., Oostdijk, N.,and Sutcliffe, R., editors, Proceedings of the Workshop on GrammarEngineering and Evaluation at the 19th International Conference onComputational Linguistics, pages 8–14, Taipei, Taiwan.

Cramer, B. (2011). Improving the feasibility of precision-oriented HPSGparsing. PhD thesis, Saarland University.

Fokkens, A. (2011). Metagrammar engineering: Towards systematicexploration of implemented grammars. In Proceedings of the 49th AnnualMeeting of the Association for Computational Linguistics: HumanLanguage Technologies, pages 1066–1076, Portland, Oregon, USA.Association for Computational Linguistics.

Antske Fokkens 34 / 35

Page 58: CLIMB: Code generation for grammar development · CLIMB: Code generation for grammar development Antske Fokkens Saarland University Antske Fokkens 1 / 35. Introduction Metagrammar

IntroductionMetagrammar engineering as methodology

The Grammar Matrix & CLIMBCLIMB Projects

CLIMB ToolsConclusionReferences

Bibliography III

Fokkens, A., Avgustinova, T., and Zhang, Y. (2012). CLIMB grammars: threeprojects using metagrammar engineering. In Chair), N. C. C., Choukri, K.,Declerck, T., Dogan, M. U., Maegaard, B., Mariani, J., Odijk, J., andPiperidis, S., editors, Proceedings of the Eighth International Conferenceon Language Resources and Evaluation (LREC 2012), Instanbul, Turkey.European Language Resources Association (ELRA).

Fokkens, A., Zhang, Y., and Bender, E. M. (2011). Spring cleaning andgrammar compression: Two techniques for detection of redundancy inHPSG grammars. In Proceedings of the 25th PACLIC, Singapore,Singapore.

Osenova, P. (2010). BUlgarian Resource Grammar â Efficient and Realistic(BURGER).

Antske Fokkens 35 / 35