1
Objective The goal of this study is to model geographic variation of UK pronunciation based on the pronunciation of the vowel /aɪ/. Thousands of tokens of the vowel in the word ‘five’ were mined from the Audio British National Corpus and compared across seventeen regions of the UK. This study was designed with an expectation of a spectrum of dialects existing across the country with monophthongal realizations (/i/ as in ‘feeve’) primarily found in the North, i.e., some Scottish varieties, and more diphthongization (/aɪ/ as in ‘five’) occurring in the South. Methodology Data Mining 8,567 tokens of ‘five’ were retrieved from the BNC. “Bad data” was excluded (whispery, overlapping, or exceptionally high-pitched speech), yielding 1,773 tokens to manually trim and analyze. Functional Data Analysis (see fig. 1) Orthogonal cubic polynomials were fitted to each F1 and F2 track. Tokens with slopes that lined up with expectations (falling F1 and rising F2 values) were included in the study, yielding 1,049 tokens total. Modal Analysis (see fig. 2) Speaker medians for each word were binned and then modally plotted in increments of 50 Hz for F1 Max and 100 Hz for F2 Min. Distinct areas of high vowel quality concentration suggest that our dataset of /aɪ/ F1 Max and F2 Min values is bimodal. Geographic Analysis (see figs. 3, 4) Distance from Orkney (northernmost location) was established as an independent variable. For each location, median geographic coordinates were loaded, and Distance from Orkney calculated. A correlation was found between Distance from Orkney and F2 Min. References 1. John Coleman, Ladan Baghai-Ravary, John Pybus, and Sergio Grau (2012) Audio BNC: the audio edition of the Spoken British National Corpus. Phonetics Laboratory, University of Oxford. http://www.phon.ox.ac.uk/AudioBNC 2. Ferragne, Emmanuel, and François Pellegrino. “Formant Frequencies of Vowels in 13 Accents of the British Isles.” Journal of the International Phonetic Association, vol. 40, no. 1, 2010, pp. 1–34. 3. Kretzschmar, William A. Language and Complex Systems. New York: Cambridge University Press, 2015. Print. 4. Morrison, Geoffrey S. Vowel Inherent Spectral Change. Springer, 2015. Print. 5. Scottish Corpus of Texts & Speech. 2007. Department of English Language, University of Glasgow, Scotland, UK. 6. Wells, J. C. Accents of English. Cambridge, Cambridgeshire. New York: Cambridge University Press, 1982. Print. Acknowledgements This project was made possible by the University of Georgia’s Center for Undergraduate Research Opportunities summer fellowship and the Oxford University Phonetics Laboratory. I would also like to thank mentors Prof. John Coleman and Dr. William A. Kretzschmar, along with GIS librarian Meagan Duever, who dedicated their time and expertise to this project. Modeling Geographic Variation in Pronunciation of United Kingdom English How are British accents distributed across the UK? Katie Henley, University of Georgia – [email protected] Prof. John Coleman, University of Oxford – [email protected] Fig. 2: The contour map of ‘five’ speakers’ median frequencies illustrates clusters of F1/F2 values and suggests a multimodal distribution of vowel qualities. Fig. 3: The F2 minimum variable is significantly correlated with Distance from Orkney: (N= 229, “p = .000,” 1-tailed). The North-to-South decline in F2 minimum indicates successive lowering of the vowel, from [i:] to [eI] to [aI], illustrating the historical "Great Vowel Shift" mapped out in UK geography. Fig. 4: This map illustrates the geographic distribution of 230 ‘five’ speaker’s average F2 Min values across seventeen representative locations in the UK. While these values are not the same at any location, the increase in proportion of lighter colored points as latitude decreases suggests that F2 Min values are continuous across the space and not sharply bounded as by isoglosses. Fig. 1: F1 and F2 frequencies of the vocalic portion (/aɪ/ and transition into the /v/) of a representative token of ‘five’. Solid line: observed data; dashed line: cubic polynomial of the form y = a 1 x 3 +a 2 x 2 +a 3 x+a 4 . Coefficients represent amount of ~-shaped wiggle, breadth of curvature, slope (steepness and direction), and average height, respectively. (Hz) Representative location F2 Minimum F1 Maximum

Modeling Geographic Variation in Pronunciation of United ... · to manually trim and analyze. ... Phonetics Laboratory ... Modeling Geographic Variation in Pronunciation of United

  • Upload
    buikiet

  • View
    215

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Modeling Geographic Variation in Pronunciation of United ... · to manually trim and analyze. ... Phonetics Laboratory ... Modeling Geographic Variation in Pronunciation of United

ObjectiveThegoalofthisstudyistomodelgeographicvariationofUKpronunciationbasedonthepronunciationofthevowel/aɪ/.Thousandsoftokensofthevowelintheword‘five’wereminedfromtheAudioBritishNationalCorpusandcomparedacrossseventeenregionsoftheUK.Thisstudywasdesignedwithanexpectationofaspectrumofdialectsexistingacrossthecountrywithmonophthongal realizations(/i/asin‘feeve’)primarilyfoundintheNorth,i.e.,someScottishvarieties,andmorediphthongization(/aɪ/asin‘five’)occurringintheSouth.

MethodologyDataMining• 8,567tokensof‘five’wereretrievedfromtheBNC.• “Baddata”wasexcluded(whispery,overlapping,or

exceptionallyhigh-pitchedspeech),yielding1,773tokenstomanuallytrimandanalyze.

FunctionalDataAnalysis(seefig.1)• OrthogonalcubicpolynomialswerefittedtoeachF1and

F2track.• Tokenswithslopesthatlinedupwithexpectations

(fallingF1andrisingF2values)wereincludedinthestudy,yielding1,049tokenstotal.

ModalAnalysis(seefig.2)• Speakermediansforeachwordwerebinnedandthen

modallyplottedinincrementsof50HzforF1Maxand100HzforF2Min.

• Distinctareasofhighvowelqualityconcentrationsuggestthatourdatasetof/aɪ/F1MaxandF2Minvaluesisbimodal.

GeographicAnalysis(seefigs.3,4)• DistancefromOrkney(northernmostlocation)wasestablishedasanindependentvariable.

• Foreachlocation,mediangeographiccoordinateswereloaded,andDistancefromOrkneycalculated.

• AcorrelationwasfoundbetweenDistancefromOrkneyandF2Min.

References1. JohnColeman,LadanBaghai-Ravary,

JohnPybus,andSergioGrau(2012)AudioBNC:theaudioeditionoftheSpokenBritishNationalCorpus.PhoneticsLaboratory,UniversityofOxford.http://www.phon.ox.ac.uk/AudioBNC

2. Ferragne,Emmanuel,andFrancoisPellegrino.“FormantFrequenciesofVowelsin13AccentsoftheBritishIsles.”JournaloftheInternationalPhoneticAssociation,vol.40,no.1,2010,pp.1–34.

3. Kretzschmar,WilliamA.LanguageandComplexSystems.NewYork:CambridgeUniversityPress,2015.Print.

4. Morrison,GeoffreyS.VowelInherentSpectralChange.Springer,2015.Print.

5. ScottishCorpusofTexts&Speech.2007.DepartmentofEnglishLanguage,UniversityofGlasgow,Scotland,UK.

6. Wells,J.C.AccentsofEnglish.Cambridge,Cambridgeshire.NewYork:CambridgeUniversityPress,1982.Print.

AcknowledgementsThisprojectwasmadepossiblebytheUniversityofGeorgia’sCenterforUndergraduateResearchOpportunitiessummerfellowshipandtheOxfordUniversityPhoneticsLaboratory.IwouldalsoliketothankmentorsProf.JohnColemanandDr.WilliamA.Kretzschmar,alongwithGISlibrarianMeaganDuever,whodedicatedtheirtimeandexpertisetothisproject.

ModelingGeographicVariationinPronunciationofUnitedKingdomEnglishHowareBritishaccentsdistributedacrosstheUK?

KatieHenley,UniversityofGeorgia– [email protected],UniversityofOxford– [email protected]

Fig.2:Thecontourmapof‘five’speakers’medianfrequenciesillustratesclustersofF1/F2valuesandsuggestsamultimodaldistributionofvowelqualities.

Fig.3:TheF2minimumvariableissignificantlycorrelatedwithDistancefromOrkney:(N=229,“p=.000,”1-tailed).TheNorth-to-SouthdeclineinF2minimumindicatessuccessiveloweringofthevowel,from[i:]to[eI]to[aI],illustratingthehistorical"GreatVowelShift"mappedoutinUKgeography.

Fig.4:Thismapillustratesthegeographicdistributionof230‘five’speaker’saverageF2MinvaluesacrossseventeenrepresentativelocationsintheUK.Whilethesevaluesarenotthesameatanylocation,theincreaseinproportionoflightercoloredpointsaslatitudedecreasessuggeststhatF2Minvaluesarecontinuousacrossthespaceandnotsharplyboundedasbyisoglosses.

Fig.1:F1andF2frequenciesofthevocalicportion(/aɪ/andtransitionintothe/v/)ofarepresentativetokenof‘five’.Solidline:observeddata;dashedline:cubicpolynomialoftheformy=a1x3 +a2x2 +a3x+a4 .Coefficientsrepresentamountof~-shapedwiggle,breadthofcurvature,slope(steepnessanddirection),andaverageheight,respectively.

(Hz)

Representativelocation

F2Minimum

F1Maximum