24
Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

Embed Size (px)

Citation preview

Page 1: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

Gary HoltonANLC

E-MELD WorkshopAugust 2002

Alaska Native Language Archive

Page 2: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Alaska Native Language Center

Page 3: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Alaska Native Language Center Established in 1972 by state legislation

as a center for documentation and cultivation of the state's 20 Native languages

Staff includes language teachers, linguists, and language specialists

Archiving is central to both the documentation and pedagogical missions

Page 4: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Alaska Native Languages Eskimo-Aleut (5) Athabascan-Eyak-

Tlingit (13) Haida (1) Tsimshian (1)

Page 5: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Endangerment Status

Numbers of speakers Central Yup'ik 10,000 Inupiaq 3100

(+71,500 in Canada and Kalaallisut) Eyak 1

Age of youngest speaker <2 (Siberian Yupik) >80 (Holikachuk, Deg Xinag, Haida, Eyak ... )

Page 6: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Documentation Status comprehensive published dictionaries for

4 of the 20 languages grammars for 3 languages dissertations on 5 other languages

Page 7: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Alaska Native Language Archive Primary linguistic data archive for the 20

Alaska Native languages Comprehensive -- nearly everything

written in or about Alaska Native langs Primary focus on unpublished manuscripts

and field notes Items include:

print (~10,000 items), audio (~4700 tapes), digital data (??)

Page 8: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Archive Mission preservation

long-term storage and maintenance digital archiving of print and audio materials

access controlled but straighforward access by

community members educators linguists

Page 9: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Community-driven primary users of archive are members of

Native language communities communities also taking a lead in

preservation and access projects Eyak Language Digitization Project Unangan Tape Archive

Page 10: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Types of linguistic data field notes texts manuscripts pedagogical materials lexica

comparative wordlists etymological wordlists placenames

Page 11: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Formats Historically a non-digital (paper and tape)

archive, but increasingly have to deal with digital formats image files (pdf) raw text files word processor database (FoxPro, Access) audio (wav, aif)

Page 12: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Goals map archive metadata and expose via

OLAC-compliant data provider done

digitize existing resources in progress

create framework for archiving new digital data go E-MELD!

Page 13: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Digital Lexical Data at ANLC unstructured text files (Eyak, Inupiaq) structured text files

Lexware (Koyukon) Shoebox (Tanacross, Holikachuk) other "standard format" (Alutiiq)

relational databases Access (Athabaskan-Eyak-Tlingit

Comparative Lexical Database)

Page 14: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Two Examples Koyukon Athabaskan Dictionary Athabaskan-Eyak-Tlingit Comparative

Lexical Database (AET-CLD)

Page 15: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Koyukon Athabascan Dictionary Eliza Jones & Jules Jetté,

ed. by James Kari stored as structure text file, formatted

using Bob Hsu's Lexware project began ca. 1979 (1898)

printed dictionary published 2000 electronic version in progress ...

Page 16: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

Athabascan Morphology almost exclusively prefixing form of stem varies with TAM stem best represented by abstract lemma

or "root" lexeme consists of root plus one or more

(possibly discontinuous) prefixes

Page 17: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

.rt ts'eyh$1pa ch$w'-tag wind blows..th (P+pp#)de+0+ts'eyh

ex hedeets'eyheng it is windy

(blowing on the area)...n bet'o deets'eye...n mek'oodaats'eeye

..th P+pp#(#)de+\+ts'eyh

..th P+pp#(#)de+0+ts'eyh

..th P+e#k'e+de+\+ts'eyh

..th 0+ts'eyh

...an menedaa\ts'eeye

..n,i e\ts'eeyh, -e\ts'eeye'...n E\ts'eeyh Zo'@...n k'ets'e e\ts'eeye...n e\ts'eeyh yeege'...n e\ts'eeyh doyeege'....n e\ts'eebaaye

Root

Subentry

Sub-subentry

Example

Page 18: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

AET-CLD Jeff Leer, Giulia Oliverio, & Gary Holton project begun ca. 1997 comparative data at level of:

lexeme morpheme phoneme

hierarchical interactive, dynamic database

Page 19: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

E-MELD August 2002

AET-CLD Structure

Cogset

Lex

Morph

Phone

Page 20: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

Database structure (portion)

Page 21: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

Cogset table

Page 22: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

Lex table

Page 23: Gary Holton ANLC E-MELD Workshop August 2002 Alaska Native Language Archive

Morph table (1)