Phonetic convergence in an immersive game-based taskwraplab.co/publications/Biro-Toscano-Viswanathan-ASA-Fall2016.pdf · Phonetic convergence in an immersive game-based task INTRODUCTION

0

50

100

150

200

250

300

350

Voiced Voiceless

Aver

age

Vow

el L

engt

h (m

s)

Initial Stop Consonant

Vowel LengthHigh Engagement TaskLow Engagement Task

0100200300400500600700800900

1000

Voiced Voiceless

Aver

age

F1 (H

z)


F1 OnsetHigh Engagement Task

Low Engagement Task

0

50

100

150

200

250

300

350

Voiced Voiceless

Aver

age

f0 (H

z)


F0 OnsetHigh Engagement …Low Engagement …

High Engagement Low Engagement

●

●

●

●

●

●

●

●

●●

●

●

●

● ●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

80

100

120

20

25

30

35

VoicelessVoiced

0.25 0.5 0.75 1 0.25 0.5 0.75 1

Normalized Trial Number (Proportion of total trials)

Voic

e on

set t

ime

(ms)

Stimulus Voicing: ● Voiceless Voiced

Initial VOT: ● ●Long Short

Voice Onset Time

β = 3.55, SE = 1.86,χ2(1) = 3.37, p = 0.073

β = 1.62, SE = 0.85,χ2(1) = 3.31, p = 0.073


●

●

●

●

●

●

●●

●●

●

●

●

●

●

●

●●

●

●

●

●

●

●

●

●

●●

●

●

●

●

●●

●

●●

●

●

●

●

●

●●

150

200

250

150

200

250

300

VoicelessVoiced

0.25 0.5 0.75 1 0.25 0.5 0.75 1


Vow

el le

ngth

(ms)


Initial VL: ● ●Long Short

Vowel Length

β = -8.78, SE = 2.72,χ2(1) = 7.82, p = .0053

β = -6 .17, SE = 3.18,χ2(1) = 3.51, p = .063

β = -3 .72, SE = 1.76,χ2(1) = 4.43, p = .042


●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

150

175

200

225

150

175

200

225

VoicelessVoiced

0.25 0.5 0.75 1 0.25 0.5 0.75 1


f0 o

nset

(Hz)


Initial f0 onset: ● ●High Low

Phonetic convergence in an immersive game-based task

INTRODUCTION

METHOD

RESULTS DISCUSSION

ACKNOWLEDGEMENTS & REFERENCES

Tifani Biro ([email protected])

Department of Speech-Language-Hearing: Sciences and Disorders

University of Kansas

Joe Toscano ([email protected])

Department of PsychologyVillanova University

Navin Viswanathan([email protected])

Department of Speech-Language-Hearing: Sciences and Disorders

University of Kansas

Statistical AnalysesThree linear mixed effects models were used to examine:

The effect of trial number, task engagement, initial VOT, and their interactions on the produced VOTs for voiced or voiceless tokens separately. A trial number x task engagement x initial VOT interaction would indicate convergence

The effect of initial VOT, trial number, and their interactions on VOTs of voiced or voiceless tokens separately in each individual task. A initial VOT x trial number interaction would indicate convergenceThe effect of trial number on the VOTs of voiced or voiceless tokens for each of the participants in each task

Similar models examined these effects on VL, F1, and f0

Summary of ResultsElicited mean VOT were longer than what had been previously reported (voiced VOT: 25 ms; voiceless VOT: 101 ms; Lisker & Abramson, 1964). Lexical factors (Baese-Berk & Goldrick, 2009) and task difficulty (Schertz, 2013) could contribute to these differencesHowever, participants’ productions were not significantly different between the low and high engagement tasks

Convergence was observed along certain acoustic dimensions (VOT, VL, F1 onset), but not others (f0). Interestingly, the extent of convergence was affected by the level of task engagement. Overall, subjects in the high engagement task converged more, whereas subjects in the low engagement task were less likely to converge. This finding was especially the case for VL and F1 onset

These preliminary results suggest that engaging, naturalistic tasks may yield results that more accurately reflect real-world phonetic variation than traditional laboratory experiments

Future DirectionsFuture studies will use these communicative tasks to look at convergence among speakers of different native languages

Participants completed either a high- or low-engagement task30 word-initial voicing minimal pairs provided key information that interlocutors had to provide to each otherPhonetic convergence was tracked for 4 acoustic dimensions (VOT, VL, F1, f0) over the course of the one hour experiment

AcknowledgementsThank you to our coders: Anne Marie Crinnion, Sarah Welsh, Jacklyn Coelho, Nicole Johnson, John Michael Kay, Rakshana Selvarajan, and Christopher Burley; thanks to David Saltzman and Michael Phelan for help programming the Minecraft puzzles, and thanks to Emma Folk for assistance with running subjects. TMB was supported by a Villanova Graduate Student Fellowship.

ReferencesBabel, M. (2012). Evidence for phonetic and social selectivity in spontaneous phonetic

imitation. Journal of Phonetics, 40(1), 177-189.Baese-Berk, M., & Goldrick, M. (2009). Mechanisms of interaction in speech production.

Language and cognitive processes, 24(4), 527-554.Buxó-Lugo, A., Toscano, J. C., & Watson, D. G. (2016). Effects of participant

engagement on prosodic prominence. Discourse Processes, (just-accepted).Lisker, L., & Abramson, A. S. (1964). A cross-language study of voicing in initial stops:

Acoustical measurements. Word, 20, 384-422.Olmstead, A. J., Viswanathan, N., Aivar, M. P., & Manuel, S. (2013). Comparison of

native and non-native phone imitation by English and Spanish speakers. Frontiers in psychology, 4.

Pardo, J. S. (2010). Expressing oneself in conversational interaction. Expressing Oneself/Expressing One's self: Communication, Cognition, Language, and Identity, 183-196.

Schertz, J. (2013). Exaggeration of featural contrasts in clarifications of misheard speech in English. Journal of Phonetics, 41(3), 249-263.

Mapping Acoustic Signals to Phonetic Categories

Stimuli (subset)

Phonetic category membership is indicated by multiple dimensions in the acoustic signal. However, acoustic dimensions are variable and context-dependent, which can lead to ambiguities between two speech sounds that only differ in one acoustic attribute. One phenomenon that may affect this variability is phonetic convergence (the observation that speech patterns of interlocutors become more similar during a conversation)

Procedure

1. Production Measures

The extent talkers can converge on productions outside of their native language range may be limited (Olmstead et al., 2013). What dimensions talkers converge on also varies across studies (Babel, 2012; Pardo, 2010). However, since convergence is a phenomenon occurring conversationally within the natural world, typical laboratory tasks may fail at fully eliciting it

2. Convergence Measures


●

●

●

●

●

●

●

●

●

●

●

●

●

●

●

●●

●

●

●

●

●

●

●

●

●

●

●

●● ●

●

●

●

●

●

●

●

●

●

●

●

●

●

600

700

800

500

550

600

VoicelessVoiced

0.25 0.5 0.75 1 0.25 0.5 0.75 1


F1 o

nset

(Hz)


Initial F1 onset: ● ●High Low

β = -25.56, SE = 12.63,χ2(1) = 3.99, p = .05

β = 14.18, SE = 6.01,χ2(1) = 4.48, p = .03

Factors such as engagement have been found to have an effect on language production (Buxó-Lugo et al., 2016)How might task engagement affect speech production and phonetic convergence?

020406080

100120140160

b d g p t k

Aver

age

VOT

(ms)

Stop Consonant

VOT DistributionsLisker & Abramson (1964)

High Engagement Task

Low Engagement Task

Natural Setting• Conversational partner• Less control• Highly engaging

• Lack of interlocutor•More control• Typically boring

Laboratory Setting

Voice Onset Time (VOT)

Release Burst

f1 onset

f0 onset

Vowel Length

/b/

Time

Freq

Amp

Maze Participant 1 Participant 2

1 Listener Talker

2 Talker Listener

3 Listener Talker

4 Talker Listener

5 Listener Talker

6 Talker Listener

Voiced Voiceless

Velar got cot

goat coatghost coast

Alveolar dent tent

dip tipdart tart

Bilabial bat pat

bear pearbark park

gap fig

goat coat muck

luck sigh

high rig cap

say rare done pat yard

doe nun ten bet sad

pay heart frown glass try

goat muck fig high cap

Start

End

VOT

Conversation

/b/1 /b/2 /p/1 /p/2

VOT

/b/1 /b/2 /p/1 /p/2

F1 Onset Fundamental Frequency Onset

1

2

3

3

3

β = -8.67, SE = 3.97, χ2(1) = 4.73, p = .03

1β = -18.77, SE = 7.51,χ2(1) = 6.20, p = .01

2

Low Engagement Listener High Engagement Listener Low Engagement Talker High Engagement Talker

Documents

Phonetic convergence in an immersive game-based taskwraplab.co/publications/Biro-Toscano-Viswanathan-ASA-Fall2016.pdf · Phonetic convergence in an immersive game-based task INTRODUCTION