Annotating pedagogy: implementing language teaching and learning-oriented annotation on corpora
TaLC 08 Workshop
Pascual Pérez-Paredes, José M. Alcaraz
Universidad de Murcia, Spain

Aims of this workshop

Workshop Structure
1. Introducing the workshop background
2. Overview of the annotation tool: SACODEYL Annotator
3. Getting down to annotation: annotating a text (1)
Coffee Break
3. Getting down to annotation: annotating a text (2)
4. Exploiting our annotations
5. Discussion and debate

1. Introduction
Braun (2005, 2007): pedagogically motivated corpora
(a) provide a more systematic range of material than individual texts or scattered collections of activities and, if well-designed, (b) offer a wider range of idiolects than the average material.
Braun (2006) states that thematic annotation, including topic keys and section titles, is particularly useful in the implementation of pedagogically motivated corpora.
Pérez-Paredes & Alcaraz (2009, forthcoming)
For the time being, the natural corpus playground continues to be tertiary education.
Our proposal: XML >> CL in the language classroom. The resulting annotated corpus can be seen as integrating language data and annotated pedagogy. The former is the liquid, the latter is the container. The interesting aspect here is that pedagogy can be annotated and, subsequently, accessed by corpus users.
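
As a rough illustration of the liquid/container idea, the following Python sketch wraps stretches of language data in XML elements that carry pedagogical categories, and then retrieves the text through those categories. The element and attribute names (`corpus`, `section`, `category`, `text`), the category labels and the sample sentences are all hypothetical and invented for this example; they are not the schema that SACODEYL actually uses.

```python
import xml.etree.ElementTree as ET


def annotate_section(text, categories):
    """Wrap a stretch of text in a <section> that lists its pedagogical categories.

    All element and attribute names here are hypothetical, for illustration only.
    """
    section = ET.Element("section")
    for name in categories:
        ET.SubElement(section, "category", {"name": name})
    body = ET.SubElement(section, "text")
    body.text = text
    return section


def sections_with(corpus, category):
    """Return the text of every section annotated with the given category."""
    return [
        sec.findtext("text")
        for sec in corpus.findall("section")
        if any(cat.get("name") == category for cat in sec.findall("category"))
    ]


# Invented sample data: the language is the "liquid", the annotation the "container".
corpus = ET.Element("corpus")
corpus.append(annotate_section("I play football every weekend.", ["Topics/Sports"]))
corpus.append(annotate_section("I would like to be a chef.", ["Topics/Future plans"]))

print(ET.tostring(corpus, encoding="unicode"))
print(sections_with(corpus, "Topics/Sports"))
```

The point of the sketch is only that a corpus user can reach the language data through the pedagogical annotation rather than through a word search.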
[Diagram: Corpus, Language Data, Annotation, Language, Metadata, Pedagogy]
[Diagram: The possibilities scenario (Alcaraz and Pérez-Paredes 2008). Linguistics comes first: linguistic analysis of interest in FLT -> DDL materials, concordances and corpus. Roles: researcher/linguist, end user.]
[Diagram: The feasibility scenario (Alcaraz and Pérez-Paredes 2008). Pedagogy comes first: pedagogical analysis (and annotation) of language corpora -> pedagogy-driven DDL. Roles: material developer, teacher, learner, end user.]

2. Overview of the annotation tool: SACODEYL Annotator

Summary
What is SACODEYL Annotator?
How can I install SACODEYL Annotator?
Overview
  Corpus Management
  Category Management
  Annotation Management
  Metadata Management

What is SACODEYL Annotator?
SACODEYL Annotator:
Is a free and open source tool for annotating texts
Has been developed from a pedagogical perspective (not oriented to morphosyntactic tagging)
Offers support for pedagogical annotation in texts
Supports customization of the annotation tags according to the needs of the teacher

How can we install it?
From the SACODEYL web page: www.um.es/sacodeyl
Platform-independent
(Remember to download the examples too)

Where has it been installed?
A) On the desktop
B) In Program Files, SACODEYL Tools

Overview
Categories Management
Menus and Corpus Management
Annotation Area
Metadata Management

Corpus Management
We can:
Create a new corpus
Open an existing corpus
Save the current annotation changes

Creating a new corpus I

Creating a new corpus II
A new corpus ALWAYS has a default category tree.
The default tree was developed by the teachers of the SACODEYL project.

What categories does this default category tree contain?
Topics
Grammatical
Lexical
Style
CEF Level
…

How can I add a new document to the current corpus?
1. Press in …
2. Select the text format/encoding
3. Select the new document

What does the text format mean?
This is the manner in which the text is laid out in the file.
Three main text formats are supported:
Plain text (written), .txt
Oral text in SACODEYL Transcriptor format
XML text in TEI standard format (text in special XML files)

What does the text encoding mean?
This is the form in which the text is stored (related to multilingual support).
(In Windows, ANSI by default)

Adding our example
Our example file is en17.txt
This file is in SACODEYL Transcriptor format
It has been encoded with a multilingual encoding (UTF-16)
Save the changes!
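
To see why the encoding has to be selected correctly when a document is added, the following Python sketch writes a short text as UTF-16 and then reads it back twice: once with the matching encoding and once as if it were Windows "ANSI". The sample string and file name are invented, and treating "ANSI" as code page 1252 is an assumption (the actual ANSI code page depends on the Windows locale).

```python
import os
import tempfile

# Invented sample text containing non-ASCII characters.
sample = "Pérez-Paredes: ¿Qué tal?"

# Write the sample as UTF-16, the encoding the slides give for en17.txt.
path = os.path.join(tempfile.mkdtemp(), "sample_utf16.txt")
with open(path, "w", encoding="utf-16") as handle:
    handle.write(sample)

# Reading with the matching encoding recovers the text.
with open(path, encoding="utf-16") as handle:
    decoded = handle.read()

# Reading the same bytes as "ANSI" (assumed here to be cp1252) does not.
with open(path, "rb") as handle:
    raw = handle.read()
wrong = raw.decode("cp1252", errors="replace")

print(decoded == sample)  # the round trip with UTF-16 succeeds
print(wrong == sample)    # the cp1252 reading is garbled
```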

Selecting the text to work with
We can now insert a large number of texts into the same corpus.
We can select one and annotate it.
1. Press …
2. Select the document

Friendly Appearance
Pedagogical orientation

Information shown in the working document
Section number
Categories applied to this section (annotations)
Speaker (only in oral text)
Transcription

What is a section?
A section is a stretch of text that is pedagogically motivated.
A fragment that could be useful in a classroom.
A section can be established in any kind of text (oral or written) by inserting the special character (#), which divides a text into sections.
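
The division of a text into sections with the # character can be sketched in a few lines of Python. This is only an illustration of the idea under simple assumptions: the function name `split_sections` and the sample text are invented, and how SACODEYL Annotator itself handles edge cases (for example a literal # inside the text) is not covered by the slides.

```python
def split_sections(text, marker="#"):
    """Split a text into numbered sections at each marker character."""
    parts = [part.strip() for part in text.split(marker)]
    # Drop empty pieces, e.g. when the text starts or ends with the marker.
    return list(enumerate((part for part in parts if part), start=1))


# Invented sample text with two section markers.
sample = "# Do you like football? Yes, I do. # And can you cook? A little."
for number, section in split_sections(sample):
    print(number, section)
```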

Intuitive Annotation Process
Drag and drop to annotate a section

Keywords
SACODEYL Annotator supports the annotation of keywords at any level
Just select text and apply a category by right-clicking

What is a Keyword?
“… [a] keyword is a stretch of language (a word, more than one word or a whole paragraph) that the annotator associates to a category…”
Pérez-Paredes and Alcaraz, ReCALL 21(1), Jan. 2009 (forthcoming)
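
One possible way to represent a keyword in the sense of this definition is as a character span plus a category. The sketch below is a hypothetical data structure, not the Annotator's internal model; the class name `Keyword`, the helper `add_keyword`, the category label and the sample sentence are all invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Keyword:
    """A stretch of text (start/end character offsets) tied to one category."""
    start: int
    end: int
    category: str


def add_keyword(text, phrase, category):
    """Annotate the first occurrence of `phrase` in `text` with `category`."""
    start = text.find(phrase)
    if start == -1:
        raise ValueError(f"{phrase!r} not found in text")
    return Keyword(start, start + len(phrase), category)


# Invented sample sentence and category label.
text = "I would like to be a chef when I finish school."
kw = add_keyword(text, "would like to", "Grammatical/Conditional")
print(text[kw.start:kw.end], "->", kw.category)
```

Because the span can cover one word, several words or a whole paragraph, the same structure serves for all three cases mentioned in the definition.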

Selective View
Offers a selective view of the information to make it easier to organize.

Annotating a Title
Drag and drop the special “Title” category onto the desired section.
The title is shown in a tooltip when the cursor is placed over the section. (No tooltip = no title)

Multilanguage
Supports genuinely multilingual annotation
Texts in any language may be used

Extensible
Supports customization of the annotation
Users can add their own annotation taxonomy or remove any annotation category

How can I add a new category?
Select the parent category (e.g. Topics)
Press the Add Cat. button
Fill in the dialog

How can I remove a category?
Select the category to remove (e.g. Topic)
Be careful:
All its child categories will also be removed
All annotations using those tags will also be removed
Press the Delete Cat. button

How can I reorder the categories?
Select the category to reorder (e.g. Topic)
Press Up Cat. or Down Cat. to move it
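
The add, remove and reorder operations on the category tree can be pictured with a small tree structure. The following Python sketch is a hypothetical model for illustration only: the `Category` class and its methods are invented, the "Sports" subcategory is an invented example, and the removal of annotations that use a deleted tag is not modelled.

```python
class Category:
    """A node in a hypothetical category tree."""

    def __init__(self, name):
        self.name = name
        self.children = []

    def add(self, name):
        """Add a child category and return it."""
        child = Category(name)
        self.children.append(child)
        return child

    def remove(self, name):
        """Remove a child category; its whole subtree goes with it."""
        self.children = [c for c in self.children if c.name != name]

    def move_up(self, name):
        """Move a child category one position up among its siblings."""
        names = [c.name for c in self.children]
        i = names.index(name)
        if i > 0:
            self.children[i - 1], self.children[i] = self.children[i], self.children[i - 1]

    def names(self):
        return [c.name for c in self.children]


# Top-level names taken from the default tree listed in the slides.
root = Category("root")
for top in ["Topics", "Grammatical", "Lexical", "Style", "CEF Level"]:
    root.add(top)

root.children[0].add("Sports")  # add a child under Topics
root.move_up("Lexical")         # reorder: Lexical moves above Grammatical
root.remove("Style")            # remove a category and its children
print(root.names())
```
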
How can I customize a category?
Select the category to customize (e.g. Topic)
Double-click it

Can I manage metadata for the working text?

What if I find mistakes?
Supports editing of the inserted texts.
Uses the XML TEI standard for encoding corpora.

Multilingual Interface
Available in many languages
You can also add your own language for the interface

Has a detailed help tutorial

Integration
SACODEYL Annotator has been integrated with:
SACODEYL Transcriptor
SACODEYL Search
SACODEYL VRP

Resource Management
Offers enrichment of text with external resources such as HTML links, videos and audio.

Where is the information stored?
Remember: all the information is stored in one file, the corpus file that you have created.
[Diagram: Corpus, Language Data, Annotation, Language, Metadata, Pedagogy]

Advanced Features
There are more advanced features:
How can I establish the default category tree for a new corpus?
When I have a lot of documents in the same corpus, how can I reorder the documents in the list of available texts?
…
Visit our web site for updates: www.um.es/sacodeyl

3. Getting down to annotation: annotating a text

Annotation Aim
Why are you annotating?
What is the purpose of your annotation?
What use will you make of your annotation?
We assume you are annotating a text for your own students’ language learning purposes
The best way to learn to walk is to walk!
Watch the video while you read the transcript of an interview which belongs to the English SACODEYL Corpus.
A talk with Mike, 15, about his interest in football and art, his cooking skills and plans for the future.
Video length: 10:47

4. Exploiting our annotations

What is the output of SACODEYL Annotator?
A corpus stored in XML TEI format that contains:
Texts (one text in our case)
Annotations over the text
The category tree (shared by all the texts)
Resources linked to the texts
Metadata associated with the texts

What is XML TEI format?
TEI: Text Encoding Initiative
A format for storing corpora
Has been promoted by the OTA (Oxford Text Archive)
Is a continuously growing format (more than 50 versions released so far; currently TEI P5)
Is spreading rapidly among the available tools
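
Because the corpus is stored as XML, any XML-aware tool can read it. As a minimal sketch, the Python code below parses a small hand-written document in the TEI P5 namespace with the standard library. The sample document is invented for illustration and is not actual SACODEYL Annotator output, whose exact element layout is not shown in these slides.

```python
import xml.etree.ElementTree as ET

# Namespace used by TEI P5 documents.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# A hand-written, minimal TEI P5-style document (not SACODEYL output).
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt><title>Sample interview</title></titleStmt>
      <publicationStmt><p>Unpublished sample</p></publicationStmt>
      <sourceDesc><p>Invented for illustration</p></sourceDesc>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <div n="1"><p>Do you like football?</p></div>
      <div n="2"><p>Can you cook?</p></div>
    </body>
  </text>
</TEI>"""

root = ET.fromstring(SAMPLE)

# Metadata lives in the header; the language data lives in the text body.
title = root.findtext(
    "tei:teiHeader/tei:fileDesc/tei:titleStmt/tei:title", namespaces=TEI_NS
)
divs = {
    div.get("n"): div.findtext("tei:p", namespaces=TEI_NS)
    for div in root.iterfind(".//tei:div", TEI_NS)
}
print(title)
print(divs)
```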

TEI Tools (Research)
TeiPublisher
“This tool is an XML-based repository that allows the publication of TEI corpora to the public community and offers a search tool.”
Dexter
“This is another annotation tool that uses TEI as the format for the annotated files.”
Oxygen XML Editor and XMLSpy
“These are XML editors that allow modification of the TEI files without any limitation.”
(These are complex for non-advanced users)
TAPoR (http://portal.tapor.ca/)
“The Text Analysis Portal for Research (TAPoR) is a gateway to tools for sophisticated analysis and retrieval, along with representative texts for experimentation.”
TokenX (http://www.unl.edu/libr/etext/tokenx.shtml)
“Is a text visualization, analysis, and play tool”
WordHoard (http://wordhoard.northwestern.edu/userman/index.html)
“Is a tool for annotating or tagging texts by morphological, lexical, prosodic, and narratological criteria and for determining frequency information”
XAIRA
XAIRA (XML Aware Information Retrieval Architecture) is an open source tool for constructing high-quality linguistically-motivated search interfaces to large collections of XML documents.

The XAIRA search with our example!

TEI Tools (Research)
CorpusReader (http://panini.u-paris10.fr/~sloiseau/CR/)
“Is a tool for extracting subcorpora, KWIC and quantitative information from arbitrarily large corpora in the TEI vocabulary. It intends to provide ways for processing corpora containing milestoned annotation. It provides mechanism for merging several XML documents together”.
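
For readers unfamiliar with KWIC (keyword in context) output, the following Python sketch shows the basic idea on plain text. It is a simplified illustration and not how CorpusReader or any of the tools above is implemented; the function `kwic`, its `width` parameter and the sample sentence are invented for this example.

```python
import re


def kwic(text, keyword, width=3):
    """Return (left, keyword, right) tuples with `width` words of context."""
    # Very simple tokenizer: words, optionally with an internal apostrophe.
    tokens = re.findall(r"\w+(?:'\w+)?", text)
    lines = []
    for i, token in enumerate(tokens):
        if token.lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, token, right))
    return lines


# Invented sample text.
sample = "I like football. My brother does not like cooking, but I like art."
for left, hit, right in kwic(sample, "like"):
    print(f"{left:>25} [{hit}] {right}")
```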

TEI Tools (Classroom)
A more interesting orientation.
How can I use the annotation in the classroom?
SACODEYL Search Tool: www.um.es/sacodeyl
Don't miss “The SACODEYL Search Tool: Exploiting corpora for language learning purposes”. Room A3, 18:00, Friday, 4

5. Discussion and debate

Aims of this workshop revisited

Discussion and debate
Pedagogical annotation vs. morphological tagging paradigm
Learner-centered vs. researcher-oriented
Indirect applications of language corpora vs. direct applications
Constraints of traditional CL in the language classroom
Cognitive demands of traditional CL in the language classroom: learner as a researcher and as a traveller
Is CL an extra hassle in language classrooms? (Mauranen 2004)
Customization of language corpus/collection of texts
Mediation role of corpus-based resources in the FLT classroom
Authenticity issues (Widdowson)

References and further reading
Braun, S. 2005. “From pedagogically relevant corpora to authentic language learning contents”. ReCALL 17/1: 47-64.
Braun, S. 2006. “ELISA - a pedagogically enriched corpus for language learning purposes”. In Corpus Technology and Language Pedagogy: New Resources, New Tools, New Methods, Frankfurt M: Peter Lang. (eds) 25-47.
Braun, S. 2007. “Integrating corpus work into secondary education: from data-driven learning to needs-driven corpora”. ReCALL 19/3: 307-328.
Mauranen, A. 2004. “Spoken - general: Spoken corpus for an ordinary learner”. In How to Use Corpora in Language Teaching, Sinclair, J. McH. (ed.), 89-105.
Pérez-Paredes, P. and Alcaraz, J.M. 2009. “Developing annotation solutions for online data-driven learning”. ReCALL 21/1 (forthcoming).
Römer, Ute. (Forthcoming). “Corpora and Language Teaching”. In Corpus Linguistics. An International Handbook, Lüdeling, Anke & Merja Kytö (eds.). Berlin: Mouton de Gruyter.
Widdowson, H.G. 2003. Defining Issues in English Language Teaching. Oxford: Oxford University Press.

Thanks for your attention