Digitalized Dialect Studies: North-Western Romanian Sheila M. Embleton, Dorin Uritescu & Eric S. Wheeler York University, Toronto, Canada


Page 1: Digitalized Dialect Studies: North-Western Romanian

Digitalized Dialect Studies: North-Western Romanian

Sheila M. Embleton, Dorin Uritescu & Eric S. Wheeler

York University, Toronto, Canada

Page 2: Digitalized Dialect Studies: North-Western Romanian

Context

Page 3: Digitalized Dialect Studies: North-Western Romanian

Noul Atlas lingvistic român. Crişana
• Crişana region in north-west Romania
• Hard-copy atlas by Stan and Uritescu (1996, 2003)
• Digitize to make it more accessible

Page 4: Digitalized Dialect Studies: North-Western Romanian

RODA: Romanian Online Dialect Atlas
• Digitize and present the hard-copy atlas
• Mostly graduate students in Canada and Romania
• Enter data from maps into text files
• When complete, it will be posted to the Internet for general use

Page 5: Digitalized Dialect Studies: North-Western Romanian

Objective
Use Information Technology to permit a broad range of scholars to access the data, select the data appropriately, and present the data clearly, and so gain greater understanding of its significance.

Page 6: Digitalized Dialect Studies: North-Western Romanian

Other Digital Atlases

Page 7: Digitalized Dialect Studies: North-Western Romanian

Other Digital Atlases
Salzburg
  H. Goebl
    • phonetic dialect atlas of Dolomitic Ladinian (since 1985)
  Edgar Haimerl
    • ‘Visual DialectoMetry’ (VDM) (ca. 2000)
Netherlands
  Heeringa et al.; de Vriend et al.
    • Dialectometric and cartographic software

Page 8: Digitalized Dialect Studies: North-Western Romanian

Other Dialect Atlases
Japan
  D. Long, others (http://nihongo.human.metro-u.ac.jp/~long/maps/perceptmaps.htm)
    • Japanese area maps

Page 9: Digitalized Dialect Studies: North-Western Romanian

Related endeavours
Google Earth
    • Available mapping software
    • Images world-wide
Dialect studies with databases
    • e.g. Iran: National survey for 2009
Visualization software
    • e.g. T. Pi, Atlas of Dialect Topography
    • http://dialect.topography.chass.utoronto.ca/dt_atlas.php

Page 10: Digitalized Dialect Studies: North-Western Romanian

Overall challenges:
• Digitize data
• Accessible interface to data
    • Search
    • Analyze
• Presentation of data
    • As data
    • As maps

Page 11: Digitalized Dialect Studies: North-Western Romanian

RODA as linguistic technology

Page 12: Digitalized Dialect Studies: North-Western Romanian
Page 13: Digitalized Dialect Studies: North-Western Romanian

The technology allows one to:
• View the data
• Search for data and count it
• Interpret the data or the counts
• Analyze the data (e.g. MDS)
• See the results as maps
• Save the maps as .jpg pictures
• Save the results for later use
• Hear samples of the data

Page 14: Digitalized Dialect Studies: North-Western Romanian

RODA: function
Custom-defined maps
    • You select the data
    • You see the result as a map
Programmable access to the whole set of digitized data
    • You ask about data spread over many maps
    • You can customize what you search for (not just the editor’s choice)

Page 15: Digitalized Dialect Studies: North-Western Romanian

RODA: selection of data
Context of search becomes important
    • Word-final vs non-final vs either
    • Plain character vs accented character
    • Character vs (superposed) alternate
Choice of fields to search
    • E.g. with nouns: sg. vs pl. entries
    • Variations heard by field workers
    • Flags to mark special situations (e.g. hesitation)
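The search options listed above (word-final vs non-final, plain vs accented character, choice of field) can be illustrated with a minimal Python sketch. The record layout, the `matches` helper and the sample forms are assumptions made for illustration only; they are not RODA's actual file format or search interface.

```python
import unicodedata

def strip_accents(text):
    """Remove combining diacritics, so that 'â' compares equal to 'a'."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def matches(form, target, position="either", plain=False):
    """Test whether `target` occurs in `form` in the requested context.

    position: "final", "non-final" or "either".
    plain:    if True, ignore accents on both the form and the target.
    """
    if plain:
        form, target = strip_accents(form), strip_accents(target)
    else:
        form = unicodedata.normalize("NFC", form)
        target = unicodedata.normalize("NFC", target)
    if position == "final":
        return form.endswith(target)
    if position == "non-final":
        # An occurrence counts only if it ends before the last character.
        return target in form[:-1]
    if position == "either":
        return target in form
    raise ValueError(f"unknown position: {position!r}")

# Invented sample records: one transcribed form per map, location and field.
entries = [
    {"map": 1, "location": 137, "field": 1, "form": "cântu"},
    {"map": 1, "location": 220, "field": 1, "form": "cânt"},
    {"map": 2, "location": 137, "field": 2, "form": "ochiu"},
]

# Word-final /u/, restricted to field 1.
hits = [e for e in entries
        if e["field"] == 1 and matches(e["form"], "u", position="final")]
print(hits)
```

Restricting the search to one field mirrors the "Choice of fields to search" option; flags such as hesitation could be handled as one more key on each record.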

Page 16: Digitalized Dialect Studies: North-Western Romanian

Examples from RODA

Page 17: Digitalized Dialect Studies: North-Western Romanian

Crisana, Romania

Page 18: Digitalized Dialect Studies: North-Western Romanian

Crisana, Romania

(from RODA)

Page 19: Digitalized Dialect Studies: North-Western Romanian

Seeing Words Change

Word-final /u/ in Latin and non-Latin words

Page 20: Digitalized Dialect Studies: North-Western Romanian

Word-final /u/ from Latin

Latin             Romanian (standard      Dialectal variation
                  and most dialects)      (vowel present)    (non-syllabic)
canto ‘I sing’    cânt                    cântu              cântu
oculum ‘eye’      ochi                    ochiu              ochiu

Page 21: Digitalized Dialect Studies: North-Western Romanian

Is word-final /u/ random?
• Look for a geographic pattern over all potential occurrences
• The maps for single examples, such as /ochi/ and others, are in the hard-copy dialect atlas,
• but the total data for all examples is spread widely over many maps.

Page 22: Digitalized Dialect Studies: North-Western Romanian

Word-final /u/

Data from:
• 407 maps
• Field 1

Size of cross shows the number of occurrences
• Horizontal = syllabic
• Vertical = non-syllabic
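The cross symbols described above encode two counts per location. A minimal Python sketch of that tally follows; the hit list and the `cross_sizes` name are invented for illustration and do not come from RODA.

```python
from collections import Counter

# Invented search hits: (location number, kind of word-final /u/).
hits = [
    (137, "syllabic"),
    (137, "syllabic"),
    (137, "non-syllabic"),
    (141, "syllabic"),
    (220, "non-syllabic"),
]

def cross_sizes(hits):
    """Return {location: (horizontal, vertical)} arm lengths for each cross.

    Horizontal arm = number of syllabic hits,
    vertical arm   = number of non-syllabic hits.
    """
    syllabic = Counter(loc for loc, kind in hits if kind == "syllabic")
    non_syllabic = Counter(loc for loc, kind in hits if kind == "non-syllabic")
    locations = sorted(set(syllabic) | set(non_syllabic))
    return {loc: (syllabic[loc], non_syllabic[loc]) for loc in locations}

print(cross_sizes(hits))
```

A map-drawing step would then scale each arm by its count; when only one feature is searched, both arms can be given the same count, as in the slides marked "horizontal = vertical".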

Page 23: Digitalized Dialect Studies: North-Western Romanian

Word-final, syllabic /u/

Data from:
• 407 maps
• Field 1
• word-final only
• (horizontal = vertical)

Locations 137, 141, 146 show the most examples

Page 24: Digitalized Dialect Studies: North-Western Romanian

Word-final, syllabic /u/

Can review the data

Page 25: Digitalized Dialect Studies: North-Western Romanian

Word-final, syllabic /u/

Data from:
• selected maps
• Field 1
• word-final only
• removed non-vocalic /u/, def. art., some clusters + /u/
• (horizontal = vertical)

Locations 137, 141, 146 show the most examples

Page 26: Digitalized Dialect Studies: North-Western Romanian

/u/ Pattern
There is a pattern:
• Word-final /u/ is retained in central and north-eastern areas
• It is syllabic mostly in parts of the central area
• The locations with the most frequent syllabic final /u/ do not form a continuous area

Page 27: Digitalized Dialect Studies: North-Western Romanian

Raised word-final /e/

Page 28: Digitalized Dialect Studies: North-Western Romanian

Raised, word-final /e/

Data from:
• 407 maps
• Field 1

Horizontal = vertical

Raised /e/ is widespread

Page 29: Digitalized Dialect Studies: North-Western Romanian

Raised, word-final /e/ vs schwa

Data from:
• 407 maps
• Field 1

Raised /e/ (horizontal)

Raised schwa (vertical)

Raised schwa is also widespread but does not always coincide with raised /e/ (cf. 158, 159)

Page 30: Digitalized Dialect Studies: North-Western Romanian

High /e/ and schwa

Page 31: Digitalized Dialect Studies: North-Western Romanian

High /e/ and schwa

Page 32: Digitalized Dialect Studies: North-Western Romanian

Retained /u/ versus raised /e/

•Syllabic word-final /u/ (horizontal)

•Raised word-final /e/ (vertical)

•Zoom-in view of central area

137, 141, 146 have both

Page 33: Digitalized Dialect Studies: North-Western Romanian

Retained /u/ versus raised schwa

•Syllabic word-final /u/ (horizontal)

•Raised word-final schwa (vertical)

•Zoom-in view of central area

137, 146 (not 141) have both

Page 34: Digitalized Dialect Studies: North-Western Romanian

Conclusion
The raising of final mid vowels and the weakening of final high vowels are distinct natural lenition processes.

Page 35: Digitalized Dialect Studies: North-Western Romanian

Non-palatalized dentals before front vowels

Page 36: Digitalized Dialect Studies: North-Western Romanian

Non-palatalized dentals before front vowels

Crişana: dentals before front vowels are palatalized.

• Are they restructured as palatals?
• If the process is no longer productive, there may be non-palatalized dentals before front vowels.
• If so, where, in what forms, and with what frequency?

Page 37: Digitalized Dialect Studies: North-Western Romanian

Non-palatalized dentals before front vowels

•Examples everywhere.

•(As is well-known, dentals are not palatalized in Oaş, except for 220.)

•Map shows where and how many examples.

Page 38: Digitalized Dialect Studies: North-Western Romanian

/st/ before front vowels

Page 39: Digitalized Dialect Studies: North-Western Romanian

/t/ but not /st/ before /e/ and /i/

•407 maps, field 1

•/te/ (horizontal)

•/ti/ (vertical)

•values all scaled x 3 to make more visible

Page 40: Digitalized Dialect Studies: North-Western Romanian

/t/ but not /st/ before /e/ and /i/

Shown as an interpretive map

•407 maps, field 1

•/te/ (red)

•/ti/ (black)

Map is automatically drawn from the previous searches

Page 41: Digitalized Dialect Studies: North-Western Romanian

/t/ before /e/ or /i/

•See the examples that were found and counted.

•See the source map number and location number of each.

•Can delete “exceptions” from the count.
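Reviewing the hits and removing "exceptions" before counting, as listed above, could look like the following minimal Python sketch. The hit records, the map numbers and the `count_by_location` helper are hypothetical; they only illustrate keeping the source map number and location number with each example.

```python
from collections import Counter

# Invented hits: each keeps its source map number and location number.
hits = [
    {"map": 12, "location": 137, "form": "frate"},
    {"map": 40, "location": 137, "form": "dinte"},
    {"map": 40, "location": 141, "form": "dinte"},
]

def count_by_location(hits, exceptions=frozenset()):
    """Count hits per location, skipping any (map, location) pair
    that the reviewer has marked as an exception."""
    kept = [h for h in hits if (h["map"], h["location"]) not in exceptions]
    return Counter(h["location"] for h in kept)

# Count everything, then recount after discarding one reviewed example.
print(count_by_location(hits))
print(count_by_location(hits, exceptions={(12, 137)}))
```

Keeping the exceptions as a separate set, rather than deleting records, leaves the original search result intact and makes the count repeatable.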

Page 42: Digitalized Dialect Studies: North-Western Romanian

Non-palatalized dentals before front vowels

There are examples everywhere (not only in Oaş)

Here we establish a result with the location and frequency of examples.

Can view the examples that support the conclusion.

Page 43: Digitalized Dialect Studies: North-Western Romanian

/e, i/ after /ts, z, s/

With digital data and tools, we easily discover significant patterns.

Here, we see the conservation of front vowels after velarizing consonants.

We see
• frequency and areas
• phonological context

Page 44: Digitalized Dialect Studies: North-Western Romanian

/e, i/ after /ts, z, s/

Page 45: Digitalized Dialect Studies: North-Western Romanian

MDS

Page 46: Digitalized Dialect Studies: North-Western Romanian

MDS process
• Multidimensional Scaling (MDS) uses the “linguistic distance” between N+1 locations to place them in an N-dimensional space.
• Then, the N-space is projected onto a 2-space (a map) such that the distances among the points are preserved as well as possible.
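The slides do not say which MDS variant was used. As one possible illustration, the sketch below implements classical (Torgerson) MDS in plain Python: the squared distances are double-centred, and the two leading eigenvectors, found here by simple power iteration, give the 2-D map coordinates. The function names and the choice of power iteration are assumptions for this sketch, not the authors' implementation.

```python
import math
import random

def double_center(dist):
    """Turn a symmetric distance matrix into the centred matrix B = -1/2 J D^2 J."""
    n = len(dist)
    sq = [[d * d for d in row] for row in dist]
    row_mean = [sum(row) / n for row in sq]
    total_mean = sum(row_mean) / n
    return [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + total_mean)
             for j in range(n)] for i in range(n)]

def mat_vec(matrix, vec):
    return [sum(a * b for a, b in zip(row, vec)) for row in matrix]

def top_eigenpair(matrix, iterations=500, seed=0):
    """Approximate the dominant eigenvalue and eigenvector by power iteration."""
    rng = random.Random(seed)
    vec = [rng.random() for _ in matrix]
    for _ in range(iterations):
        product = mat_vec(matrix, vec)
        norm = math.sqrt(sum(x * x for x in product))
        if norm == 0.0:
            break
        vec = [x / norm for x in product]
    eigenvalue = sum(v * p for v, p in zip(vec, mat_vec(matrix, vec)))
    return eigenvalue, vec

def classical_mds(dist, dims=2):
    """Return one coordinate tuple per location, in `dims` dimensions."""
    b = double_center(dist)
    n = len(b)
    axes = []
    for _ in range(dims):
        eigenvalue, vec = top_eigenpair(b)
        scale = math.sqrt(max(eigenvalue, 0.0))
        axes.append([scale * v for v in vec])
        # Deflate: remove the component just found before the next pass.
        b = [[b[i][j] - eigenvalue * vec[i] * vec[j] for j in range(n)]
             for i in range(n)]
    return [tuple(axis[i] for axis in axes) for i in range(n)]

# Four invented locations whose distances happen to fit a 3-by-4 rectangle.
example = [[0, 3, 5, 4], [3, 0, 4, 5], [5, 4, 0, 3], [4, 5, 3, 0]]
for point in classical_mds(example):
    print(point)
```

For real dialect data the distances are generally not exactly embeddable in two dimensions, so the 2-D picture is only an approximation; a library routine with a proper eigen-solver would be preferable for hundreds of locations.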

Page 47: Digitalized Dialect Studies: North-Western Romanian

MDS and dialects
• Embleton and Wheeler have used an MDS process on
    • English dialects
    • Finnish dialects
• Dialect roughly correlates with geography

Page 48: Digitalized Dialect Studies: North-Western Romanian

Dialect groupings
• Began with a hypothesis about dialect groupings in Crisana
• Analyzed all data in 407 maps using the MDS method
• Identity is exact match; any difference is a difference of 1.
• Distance is the sum of differences.
• We see the groupings on a map.
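The distance rule stated above (an exact match counts 0, any difference counts 1, and the distance is the sum over maps) can be written out as a minimal Python sketch. The sample responses are invented, and the sketch assumes every location has exactly one form for each map; how missing data was handled is not stated in the slides.

```python
def linguistic_distance(forms_a, forms_b):
    """Number of maps on which two locations differ (mismatch = 1, match = 0)."""
    return sum(1 for a, b in zip(forms_a, forms_b) if a != b)

def distance_matrix(responses):
    """Return (locations, matrix) with pairwise distances between locations."""
    locations = sorted(responses)
    matrix = [[linguistic_distance(responses[p], responses[q])
               for q in locations] for p in locations]
    return locations, matrix

# Invented data: one form per map, listed in the same map order per location.
responses = {
    137: ["cântu", "ochiu", "frate"],
    141: ["cântu", "ochi", "frate"],
    220: ["cânt", "ochi", "frati"],
}

locations, matrix = distance_matrix(responses)
print(locations)
for row in matrix:
    print(row)
```

The resulting symmetric matrix is the kind of input an MDS procedure takes to place the locations on a map.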

Page 49: Digitalized Dialect Studies: North-Western Romanian

MDS map: All groups
• South-east and South-west are distinct.
• The rest are less so.
• Suggests the dialect unity of the region
--> refine groupings

Page 50: Digitalized Dialect Studies: North-Western Romanian

MDS map: Refined groupings
• Still, considerable overlap or closeness
• More groups could be identified, e.g.:
    • Several divisions in the West
    • Two areas in Oaş
• Oaş is close to southern areas
• Still, its distinctness is clear (cf. also Uritescu 1984a).

Page 51: Digitalized Dialect Studies: North-Western Romanian

MDS map: Refined groupings

Page 52: Digitalized Dialect Studies: North-Western Romanian

MDS map: Refined groupings

Page 53: Digitalized Dialect Studies: North-Western Romanian

MDS
• For large quantities of data, MDS needs RODA’s digitized data.
• MDS provides another understanding of the data.
• MDS is only one of many possible quantitative tools (e.g. factor analysis, cluster analysis).

Page 54: Digitalized Dialect Studies: North-Western Romanian

Hear the Data

•Selected clips from source data in over 40 locations

•From map, pick location and play

•(Sound data is large; needs to be packaged separately for easy downloading)

Page 55: Digitalized Dialect Studies: North-Western Romanian

Bigger challenge

Page 56: Digitalized Dialect Studies: North-Western Romanian

Access to Data
• In the humanities,
    • Large amounts of data
    • Diverse ways of selecting it
• Information Technology
    • Has the technology
    • May not understand the needs
• Need to learn how to apply IT to our discipline effectively

Page 57: Digitalized Dialect Studies: North-Western Romanian

Development Process
• Requirements gathering
    • Prototypes
    • Cycles of propose-and-revise
• User testing
    • Test versions on web
    • User feedback is important
• Explore technology
    • Changes fast
    • Much to learn

Page 58: Digitalized Dialect Studies: North-Western Romanian

Bridging the Gap
• IT specialist: the challenge is to make IT accessible to non-IT users
• Humanist: go after the technology
    • Plan for it. It needs careful thought
    • Use it. It is powerful
• Dialectologist and Romanist: RODA

Page 59: Digitalized Dialect Studies: North-Western Romanian

Future Directions
• Digitalize future volumes (3-5)
• Create digital interpretive maps from hard-copy Atlas
• Apply MDS
• Enhance the sound and multimedia aspects of the online atlas
    • Play sound and see a transliterated text

Page 60: Digitalized Dialect Studies: North-Western Romanian

Summary
• Data will soon be available
• You are invited to apply your techniques to the data
• Digital data and IT methods permit:
    • Widely accessible data
    • Flexible searching and custom presentation
    • Repeatable processing

Page 61: Digitalized Dialect Studies: North-Western Romanian

Contacts
• Sheila Embleton
• Dorin Uritescu
• Eric Wheeler

Test sites: ericwheeler.ca/test