Corpora and its use in elt

CORPORA AND ITS USE IN ELT

What is a corpus?

Corpus, plural corpora is a collection of linguistic data, either compiled as written texts or as a transcription of recorded speech.

“Any body of text” that is, any collection of recorded instances of spoken or written language.

Corpus linguistics adherents believe that reliable language analysis occurs on field-collected samples, in natural contexts and with minimal experimental interference.

A landmark in modern corpus linguistics was the publication by Henry Kucera and W. Nelson Francis of Computational Analysis of Present-Day American English in 1967, a work based on the analysis of the Brown Corpus

Henry Kučera (15 February 1925 – 20 February 2010), born Jindřich Kučera, was a Czech linguist who was a pioneer in corpus linguistics and linguistic software.

John McHardy Sinclair (June 14, 1933 – March 13, 2007), Professor of Modern English Language at Birmingham University, 1965 to 2000. He pioneered work in corpus linguistics, discourse analysis, lexicography, and language teaching.

John Sinclair was a first-generation modern corpus linguist and the founder of the COBUILD project.

Types of corpora

Monolingual

Curpus

Written

Spoken

General

Specialized

Multilingual

Parallel Corpus

A corpora can be composed by texts in a single language or texts in more than one language. If the texts are in the same language such in translations, the corpora is called Parallel Corpus. In this kind of corpora the direction of the translation is not relevant.

Comparable Corpus

The goal of this type of corpora is to compare the languages or varieties presented in similar circumstances of communication.

Sublanguage Corpora

This Corpora include texts from a particular dialect, or variety of a language.

The General Corpora

Is formed by general texts that do not belong to single field, or register.

Corpora and its use in elt

Education

ELT-03 English as an International Language ELT-03... · might constitute International English and its value globally, while Peter Strevens and John Norrish challenge attitudes to

Assessment tools and learner corpora - Hypotheses.org · Assessment tools and learner corpora Angel Chan • Assessment Tools – Mandarin Receptive Vocabulary Test • Learner Corpora

Corpora in Indian Languages

Wizards of the Coast launches Magic: The Gathering Arena ... docs/case-study-wizards-o… · • ELT Queueing. As part of its ELT solution, Deci-sive Data created a queueing process

ELT Professional Learning Courses and Seminarselt.nysut.org/~/media/files/elt-nysut/elt-files/171003_2017fall... · ELT Professional Learning Courses and Seminars ... NYSUT’s Education

Comparing Corpora

1 Text-based typology Corpora, corpora of elicited texts and parallel corpora (based on STUF 2007) МД

Multimodal Corpora: How Should Multimodal Corpora Deal ...michaelkipp.de/publication/MultimodalCorpora2012-proceedings.pdf · How Should Multimodal Corpora Deal with the Situation?

Russian multimodal corpora

Corpora Games Book 103

Comparable Corpora for Terminology

6 Language corpora Liang Maocheng. 7.1 Introduction 7.2 Empiricism, corpus linguistics, and electronic corpora 7.3 Applications of corpora in applied

Corpora 22

Web Corpora

Corpora translation

Corpora from a sociolinguistic perspective · Corpora from a sociolinguistic perspective Corpora sob uma perspectiva sociolinguística Tyler Kendall* University of Oregon Eugene

Workshop Programme Multimodal Corpora From Multimodal ... Corpora... · "Multimodal Corpora From Multimodal Behaviour Theories to Usable Models" ... Analysis of gesture expressivity

Annotation of corpora

Using corpora in instruction

Motor in Corpora Do