103
Sociopragmatic development in the preschool years: the links with prosodic and gestural abilities and Theory of Mind Mariia Pronina PhD Research Plan Supervised by: Pilar Prieto ICREA - Department of Translation and Language Sciences, Universitat Pompeu Fabra February 11th, 2019

Sociopragmatic development in the preschool years: the ...prosodia.upf.edu/home/arxiu/tesis/doctorat/Mariia Pronina PhD research plan.pdfconjuntament prosòdia, gestos i contingut

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Sociopragmatic development

in the preschool years:

the links with prosodic and gestural abilities

and Theory of Mind

Mariia Pronina

PhD Research Plan

Supervised by:

Pilar Prieto

ICREA - Department of Translation and Language Sciences,

Universitat Pompeu Fabra

February 11th, 2019

Mariia Pronina

PhD Research Plan

ACKNOWLEDGMENTS

First and foremost, I would like to give a special thanks to my supervisor Pilar Prieto for

all her time, support, guidance and the encouragement during the course of this work. Pilar

Prieto together with Iris Hübscher originated some of the projects presented in the current PhD

proposal. I would also like to thank Iris Hübscher for her tangible help with preparing of the

experiments: designing and filming some materials, and running the pilots with me. I owe my

gratitude to Joan Borràs-Comes for helping with statistics. Special thanks to Patrick Rohrer for

proofreading my work. And thanks to all members of the Prosodic Studies Group, Florence

Baills, Peng Li, Patrick Rohrer, Ïo Valls, Ingrid Vilà-Giménez, and Yuan Zhang for their

comments and support. I am also very thankful to the committee member for my PhD research

plan defense Wolfram Hinzen, Alfonso Igualada and Núria Esteve-Gibert for their valuable and

insightful comments and suggestions that I benefited enormously from.

I am grateful to the schools where the experiments were conducted (Escola Antoni Brusi

and Escola Bogatell in Barcelona and Escola Verd in Girona) as well as to the children who

participated in the study and their parents. I am also indebted to research assistants Eva Castillo,

Anna Massanas and Ainhoa David for their help with the administration of the tests. I would

also like to thank Ernesto Rodríguez for the illustrations of the APT and Alona Lukina and

Valentin Sotskov for providing me with the python script for group distribution. I would like

to particularly thank Anna Massanas for her help with the recording of the materials, for

carrying out the training sessions, and for her willingness to work with us on this project.

Mariia Pronina

PhD Research Plan

TABLE OF CONTENTS

ABSTRACT .............................................................................................................................. I

RESUM ................................................................................................................................... III

RESUMEN ............................................................................................................................... V

1. INTRODUCTION ................................................................................................................ 1

1.1. OBJECT OF ANALYSIS ...................................................................................................... 1

1.2. PRIOR WORK.................................................................................................................... 1

1.2.1. Multimodal foundations of sociopragmatic development in the first language 1

1.2.1.1. Early preverbal period .................................................................................... 1

1.2.1.2. Verbal period ................................................................................................... 8

1.2.2. ToM development and its links with other cognitive and linguistic abilities .. 21

1.2.2.1. ToM development .......................................................................................... 21

1.2.2.2. The links between ToM and language ......................................................... 23

1.2.2.3. The links between ToM and emotion understanding ................................. 29

1.2.2.4. Training studies to promote ToM ................................................................ 31

2. GOALS OF THE DISSERTATION ................................................................................. 34

3. THEORETICAL FRAMEWORK .................................................................................... 35

3.1. THE USAGE-BASED THEORY OF LANGUAGE ACQUISITION ........................................... 35

3.2. EMBODIED COGNITION .................................................................................................. 37

4. HYPOTHESES ................................................................................................................... 39

4.1. STUDY 1 ......................................................................................................................... 39

4.2. STUDY 2 ......................................................................................................................... 40

4.3. STUDY 3 ......................................................................................................................... 41

4.4. STUDY 4 ......................................................................................................................... 42

5. EXPERIMENTAL STUDIES ........................................................................................... 43

5.1. STUDY 1: SOCIOPRAGMATIC COMPETENCE IN PRESCHOOLERS AND ITS LINK WITH

OTHER ABILITIES .................................................................................................................. 43

5.1.1 Research question .................................................................................................. 43

5.1.2. Participants ........................................................................................................... 43

5.1.3. Materials ................................................................................................................ 44

5.1.4. Procedure .............................................................................................................. 46

Mariia Pronina

PhD Research Plan

5.1.5. Expected results .................................................................................................... 49

5.2. STUDY 2: A CORRELATIONAL EXAMINATION OF THE LINK BETWEEN CHILDREN’S

IMITATION SKILLS AND THEIR SOCIOPRAGMATIC ABILITIES ............................................. 50

5.2.1. Research question ................................................................................................. 50

5.2.2. Participants ........................................................................................................... 50

5.2.3. Materials ................................................................................................................ 50

5.2.4. Set up and procedure ........................................................................................... 51

5.2.5. Coding .................................................................................................................... 52

5.2.6. Summary of results ............................................................................................... 53

5.3. STUDY 3: DEVELOPMENTAL TRAJECTORIES OF PROSODY AND GESTURE IN THEIR

EXPRESSION OF SOCIOPRAGMATIC MEANINGS: A CROSS-SECTIONAL STUDY .................... 54

5.3.1 Research question .................................................................................................. 54

5.3.2. Participants ........................................................................................................... 54

5.3.3. Materials and procedure ...................................................................................... 54

5.3.4. Results .................................................................................................................... 54

5.4. STUDY 4: THE ROLE OF ENACTMENT IN THE DEVELOPMENT OF PRAGMATIC ABILITIES

AND TOM: A TRAINING STUDY ............................................................................................. 58

5.4.1. Research question ................................................................................................. 58

5.4.2. Participants ........................................................................................................... 58

5.4.3. Materials ................................................................................................................ 58

5.4.4. Experimental procedure ...................................................................................... 59

5.4.5. Expected results .................................................................................................... 63

6. WORK PLAN ..................................................................................................................... 64

7. SELECTED REFERENCES ............................................................................................. 66

8. FULL BIBLIOGRAPHY ................................................................................................... 69

Appendix A: The Audiovisual Pragmatic Test (APT) ........................................................ 88

Appendix B: Sample story of the training materials (Study 4) .......................................... 91

Appendix C: Script for group distribution (Study 4) .......................................................... 92

Appendix D: Annotation system of online recoding of training sessions (Study 4) .......... 93

Mariia Pronina

PhD Research Plan

i

ABSTRACT

By its very nature, human communication is multimodal and relies on a dynamic

information exchange involving not only speech content but also prosodic and gestural features.

While a substantial amount of research has shown that gesture and prosody play a bootstrapping

role at early stages of language acquisition (Goldin-Meadow, 1998, 2014; see de Carvalho,

Dautriche, Millotte, & Christophe, 2018 for a review), little is known about the role played by

gesture and prosody in children’s later linguistic and sociopragmatic development (see

Hübscher, 2018 for a review).

The main goal of the current PhD thesis is to explore preschoolers’ sociopragmatic

development and its relation to multimodal prosodic and gestural abilities, as well as to Theory

of Mind (ToM), the capacity to attribute mental states – e.g., beliefs, intentions, knowledge –to

others, and emotion understanding. To date, the complex relationship between these

components – namely sociopragmatic abilities, multimodal prosodic and gestural abilities,

ToM, and the ability to recognize emotions – remains unclear (see Bosco, Tirassa, & Gabbatore,

2018 for a review). Moreover, the present thesis will also assess whether sociopragmatic

abilities, ToM and emotion comprehension skills can be trained through an embodied

intervention involving multimodal enactment of language.

The present PhD thesis is comprised of four empirical studies with preschool children.

Study 1 will assess whether sociopragmatic abilities in a group of 3- to 4-year-old preschool

children are correlated with ToM, emotion understanding and language skills (semantics,

syntax, narrative abilities). Study 2 will assess whether sociopragmatic abilities in a group of 3

to 4-year-old preschool children are tied to multimodal imitation abilities (understood as the

ability to jointly imitate prosody, gesture and lexical content). Study 3 has the goal of jointly

exploring prosodic and gestural developmental trajectories in their expression of

sociopragmatic meanings in two age groups, namely 3- to 4-year-old children and 5- to 6-year-

old children. Finally, Study 4 will assess the effects of training 3- to 4- year-old children to use

prosodic and gestural patterns while enacting mental states and emotions on sociopragmatic

and ToM development.

All in all, this PhD thesis aims to explore sociopragmatic development in typically

developing preschool children and its links with ToM, multimodal production abilities and

other language skills. The results of the four studies will assess the relation between these

capacities and the role that gesture and prosody play in this picture. The main underlying

hypothesis of this thesis is that children’s multimodal abilities are strongly related to their

sociopragmatic competence, and that secondarily these abilities are related to overall linguistic

Mariia Pronina

PhD Research Plan

ii

development and mind-reading abilities. Moreover, given this relationship, we hypothesize that

strengthening perspective-taking abilities through embodied and language training

interventions will serve to boost both ToM as well as linguistic capacities that require

perspective-taking skills. This PhD thesis will not only lead to widening the body of literature

on sociopragmatic development, ToM, and multimodal but will also develop a tool to aid

preschool teachers in incorporating aspects of embodiment into their classroom.

Mariia Pronina

PhD Research Plan

iii

RESUM

Per la seva pròpia naturalesa, la comunicació humana és multimodal i es basa en un

intercanvi dinàmic d’informació que inclou no només la parla sinó també elements prosòdics i

gestuals. Mentre una gran quantitat de recerca ha demostrat que gest i prosòdia representen un

paper promotor en etapes primerenques d’adquisició del llenguatge (Goldin-Meadow, 1998,

2014; v. de Carvalho, Dautriche, Millotte, & Christophe, 2018 per una revisió), se sap poc sobre

el paper del gest i la prosòdia en el desenvolupament lingüístic i sociopragmàtic més tardà (v.

Hübscher, 2018 per una revisió).

L’objectiu principal d’aquesta tesi és investigar el desenvolupament sociopragmàtic dels

nens d’edat preescolar i la seva relació amb les habilitats prosòdiques i gestuals, com també

amb la Teoria de la Ment (ToM), entesa la capacitat d’atribuir estats mentals – e.g., creences,

intencions, coneixement – a d’altres persones, i amb la comprensió d’emocions. A hores d’ara,

encara s’està discutint a complexa relació entre aquests components, és a dir, l’habilitat

sociopragmàtica, les habilitats multimodals prosòdiques i gestuals, ToM, i l’habilitat de

reconèixer emocions no està clara (v. Bosco, Tirassa, & Gabbatore, 2018 per una revisió). A

més, aquesta tesi també analitzarà si les habilitats sociopragmàtiques, ToM i la capacitat de

comprensió d’emocions poden entrenar-se mitjançant una intervenció corporeïtzada que

potencia l’ús de la multimodalitat en l’aprenentatge del llenguatge.

Aquesta tesi inclou quatre estudis empírics amb nens d’edat preescolar. L’estudi 1

investigarà si les habilitats sociopragmàtiques dels nens de 3 i 4 anys es correlacionen amb la

ToM, amb la comprensió d’emocions i amb altres habilitats lingüístiques (semàntica, sintaxi,

habilitats narratives). L’estudi 2 examinarà si les habilitats sociopragmàtiques dels nens de 3 i

4 anys estan lligades amb l’habilitat d’imitació multimodal (entesa com l’habilitat d’imitar

conjuntament prosòdia, gestos i contingut lèxic). L’estudi 3 té com a objectiu explorar

trajectòries de desenvolupament conjunt de la prosòdia i el gest en la seva expressió

sociopragmàtica en dos grups d’edat, específicament, en nens de 3 a 4 anys i en nens de 5 a 6

anys. Finalment, l’estudi 4 avaluarà l’efecte de 8 sessions d’entrenament de representació

d’emocions i estats mentals usant patrons prosòdics i gestuals en la millora de les habilitats

sociopragmàtiques i de ToM en nens d’edat preescolar de 3 i 4 anys.

En resum, la finalitat d’aquesta tesi és investigar el desenvolupament sociopragmàtic en

nens d’edat preescolar i la seva relació amb ToM, amb les habilitats de producció multimodal

i d’altres habilitats lingüístiques. Els resultats dels quatre estudis avaluaran la relació entre

aquestes capacitats i el paper que juguen la prosòdia i el gest en aquest camp. La hipòtesi general

que guia la tesi és que les habilitats multimodals de nens estan estrictament relacionades amb

Mariia Pronina

PhD Research Plan

iv

la competència sociopragmàtica i, de forma secundaria, aquestes habilitats estan relacionades

amb el desenvolupament lingüístic general i l’habilitat de presa de perspectiva. Així mateix,

donada aquesta relació, proposem les habilitats de presa de perspectiva dels nens es poden

enfortir mitjançant una intervenció d’entrenament corporeïtzat i basat en la conversa. Aquesta

tesi no només conduirà a l’ampliació del nostre coneixement sobre el desenvolupament

sociopragmàtic i multimodal i el desenvolupament de la ToM, sinó també desenvoluparà eines

per ajudar als mestres de l'etapa infantil a incorporar aspectes de multimodalitat a l’aula.

Mariia Pronina

PhD Research Plan

v

RESUMEN

Per su propia naturaleza, la comunicación humana es multimodal y se basa en un

intercambio dinámico de información que incluye no solo el habla sino también elementos

prosódicos y gestuales. Mientras muchas investigaciones han demostrado que gesto y prosodia

juegan un papel precursor del lenguaje en etapas tempranas de la adquisición (Goldin-Meadow,

1998, 2014; v. de Carvalho, Dautriche, Millotte, & Christophe, 2018 por una revisión), se sabe

poco sobre el papel del gesto y la prosodia en el desarrollo lingüístico y sociopragmático más

tardío (v. Hübscher, 2018 per una revisión).

El objetivo principal de esta tesis es investigar el desarrollo sociopragmático de los niños

de edad preescolar y su relación con las habilidades prosódicas y gestuales, como también su

relación con la comprensión de emociones y la Teoría de la Mente (ToM), entendida comola

capacitad de atribuir estados mentales – e.g., creencias, intenciones, conocimiento – a otras

personas. A día de hoy, todavía se está discutiendo la compleja relación entre estos

componentes, es decir, la habilidad sociopragmática, las habilidades multimodales prosódicas

y gestuales, la ToM, y la habilidad de reconocer emociones (v. Bosco, Tirassa, & Gabbatore,

2018 per una revisión). Además, esta tesis también analizará si las habilidades

sociopragmáticas, ToM y la capacidad de comprensión de emociones pueden entrenarse

mediante una intervención corporeizada que potencia el uso de la multimodalidad en el

aprendizaje del lenguaje.

Esta tesis incluye cuatro estudios empíricos llevados a cabo con niños de edad preescolar.

El estudio 1 investigará si las habilidades sociopragmáticas de los niños de 3 y 4 años se

correlacionan con la ToM, la comprensión de emociones y con otras habilidades lingüísticas

(semántica, sintaxis y habilidades narrativas). El estudio 2 examinará si las habilidades

sociopragmáticas de los niños de 3 y 4 años se correlacionan con sus habilidades de imitación

multimodal (entendidos como la habilidad de imitar conjuntamente prosodia, gestos y

contenido léxico). El estudio 3 tiene como objetivo explorar las trayectorias de desarrollo

conjunto de la prosodia y el gesto en su expresión sociopragmática en dos grupos de edad,

específicamente, en niños de 3 a 4 años y en niños de 5 a 6 años. Finalmente, el estudio 4

evaluará el efecto de 8 sesiones de entrenamiento potenciador de la actuación multimodal de

las emociones y estados mentales por parte de los niños en edad preescolar en la mejora de sus

habilidades sociopragmáticas y de ToM.

En resumen, la finalidad de esta tesis es investigar el desarrollo sociopragmático en niños

de edad preescolar y su relación con la ToM, con las habilidades de producción multimodal y

con otras habilidades lingüísticas. Los resultados de los cuatro estudios evaluarán la relación

Mariia Pronina

PhD Research Plan

vi

entre estas capacidades y el papel que juegan la prosodia y el gesto en este campo. La hipótesis

general que guía la tesis es que las habilidades multimodales de niños están estrictamente

relacionadas con la competencia sociopragmática y, de forma secundaria, estas habilidades

están relacionadas con el desarrollo lingüístico general y la habilidad de toma de perspectiva.

Asimismo, dada esta relación, proponemos las habilidades de toma de perspectiva de los niños

se pueden fortalecer mediante una intervención de entrenamiento corporeizado y basado en la

conversación. Esta tesis no solo conducirá a la ampliación de nuestro conocimiento sobre el

desarrollo sociopragmático y multimodal y el desarrollo de la ToM, sino también desarrollará

herramientas para ayudar a los maestros de la etapa infantil a incorporar aspectos de

multimodalidad en el aula.

Mariia Pronina

PhD Research Plan

1

1. INTRODUCTION

1.1. Object of analysis

The research presented in this PhD thesis broadly examines the interface between the

sociopragmatic and linguistic faculties in preschoolers, as well as their capacity to attribute

mental states to others (e.g., beliefs, intentions, knowledge) – also called theory of mind (ToM),

and, importantly, the role played by gestural and prosodic development in this picture.

Following Leech (1983), Thomas (1981, 1983) and more recently Culpeper (2010) and many

pragmatic textbooks, the present PhD thesis will use the term sociopragmatics to reflect the fact

that the field of pragmatics not only covers the study of more general pragmatic meanings, but

that a central concern of the pragmatic field also involves the speakers’ social identities (see for

example the politeness section in 1.2.1.2.). As Andrés-Roqueta & Katsos (2017) claim,

sociopragmatic skills allow children to comprehend sentences appropriately to the

conversational context and enable them to produce sentences appropriate to communicative

situation. The use of the term sociopragmatics thus highlights the important role played by

social factors and their influence in pragmatic aspects of language.

A set of four independent empirical studies will be conducted to explore sociopragmatic

development in preschoolers and their links with language, ToM and multimodal production

and perception abilities. The first two studies will test the potential correlation between

childrens’ sociopragmatic abilities on one hand, and (a) a specific linguistic skills and ToM

capacity (Study 1), and (b) prosody and gesture imitation abilities (Study 2) on the other. The

third study will provide a detailed examination of the developmental path of prosodic and

gestural cues used by preschoolers to encode sociopragmatic meanings. The last study will use

an embodied intervention (i.e., the active involvement of the body, prosody and voice) to

investigate its beneficial effect on the development of perspective-taking skills, linguistic

capacities, and sociopragmatic abilities in preschoolers.

1.2. Prior work

1.2.1. Multimodal foundations of sociopragmatic development in the first language

1.2.1.1. Early preverbal period

By its very nature, human communication is multimodal and relies on a dynamic

information exchange involving not only speech content but also prosodic and gestural features.

It is therefore not surprising that children become capable communicators well before they start

to speak. Communication in the preverbal stage of life constitutes one of the foundations for

Mariia Pronina

PhD Research Plan

2

the later development of linguistic and pragmatic skills. As Tomasello, Carpenter, &

Liszkowski’s 2007 pointed out, “children’s early linguistic skills are built on this already

existing platform of prelinguistic communication”. This section will review the main stages of

early communicative development by highlighting the strong multimodal components which

compose these early stages (focusing on prosodic and gestural patterns).

During this first period of pragmatic acquisition, the infant communication system

strongly relies on nonverbal cues such as gaze, facial expression and prosody. Early patterns of

face perception and imitation, as well as understanding that faces can convey significant

communicative information, is fundamental for early communicative development. Newborns

prefer to look at faces (Johnson & Morton, 1991) and almost immediately after birth, infants

can imitate certain facial gestures, namely tongue protrusions and mouth opening gestures

(Meltzoff & Moore, 1989). Furthermore, by six weeks of age, infants are able to imitate facial

expressions 24 hours after they were originally presented (Meltzoff & Moore, 1994). In addition

to visual preference for faces, infants orient attention to eye-like stimuli and, crucially, display

a preference for eye contact (Batki, Baron-Cohen, Wheelwright, Connellan, & Ahluwalia,

2000). Moreover, they prefer direct gaze rather than averted gaze (Farroni, Csibra, Simion, &

Johnson, 2002). These early communicative behaviors underlie future linguistic and social

development. For instance, it has been demonstrated that infants’ gaze-following behavior

predicts their later vocabulary skills (Brooks & Meltzoff, 2005).

Further, infants show early use of multimodal cues in order to express their own emotions.

By 2 months of age, they start to control prosodic parameters of the speech to signal affective

meaning and can distinguish positive and negative emotion (e.g., use of variations in duration,

pitch range, and pitch peak; laughter for positive emotion and crying for negative emotion)

(Oller et al., 2013; Scheiner, Hammerschmidt, Jürgens, & Zwirner, 2002). At around 4 months,

infants can express such emotions as sadness or enjoyment through facial expression, the ability

to express different emotional states continues to develop, so for example, at 12 months,

children can signal more complex emotion such as surprise and fear (Sullivan & Lewis, 2003).

These outlined studies suggest an early mapping of audiovisual cues onto emotional states.

Towards the sixth to eighth week of life, babies often produce their first smiles and, with

them, the ability to exchange emotions and interact face to face with adults for longer periods

of time progressively comes. According to the usage-based approach to language acquisition

proposed by Tomasello (2005) (see section 3.1.), the social interaction experience is the basis

for linguistic development. The central tenet of this approach is that language structure is

learned through language use. In accordance with a usage-based approach, prelinguistic infants

Mariia Pronina

PhD Research Plan

3

develop skills which are fundamental for later language acquisition and do so through

communicative interactions (Tomasello et al., 2007).

Dyadic interaction refers to early interaction which involves infant and caretaker

communicating face-to-face in a bidirectional way and represents one of the foundations of

language development (Nwokah, 2014). During the period of dyadic interaction, infants show

preference for affect (Trevarthen, 1979) which also constitutes a crucial component of

subsequent language development since experience of affect enables children “to appreciate

that others are similar to, responsive to, and engaged with them” (Markova & Legerstee, 2008,

p. 27). Within this period, infants develop perceptive prosodic skills related to affect. By 5

months of age, infants are shown to rely on the audiovisual information in the perception of

emotional states. They can discriminate between affective vocal expressions (approval,

negative affect), show different reactions to different affective stimuli and are able to match

facial expressions with corresponding acoustic properties of the speech (Fernald, 1993;

Vaillant-Molina, Bahrick, & Flom, 2013). These abilities continue to develop later on. For

example, the study by Hoicka & Wang (2011) tested sensitivity to vocal cues in 15-months-old

infants and found that they can differentiate positive emotional vocal cues from humorous ones

and match them to intentional actions.

It is well established that at around nine months of age, a shift occurs from dyadic to

triadic communication (e.g., the so-called ‘attentional triad’, see Meltzoff, 2002; Tomasello,

1995; Tomasello, Carpenter, Call, Behne, & Moll, 2005). In triadic communications, apart from

infant and caregiver, some external object or event becomes involved in communication. The

coordination of the attention of two individuals to the same object is defined as joint attention

(Carpenter, 2012), since both individuals need to realize to some extent they are sharing

attention. Joint attention is crucial for communicative and language development. The first

signs of joint attention behaviors are observed in infants at around 9–12 months. As described

in the seminal work by Carpenter, Nagell, & Tomasello (1998), infants first show competence

in sharing attention to objects, even though they can point or follow adult’s gaze, the main

characteristic of these joint attention episodes is infant’s alternation of the gaze between object

and adult. Between 12 and 15 months of age, infants are able to follow attention to objects by

gaze and point following. They also begin to make use of information about adult gaze direction

to predict others’ future actions (Phillips, Wellman, & Spelke, 2002). Finally, infants become

able not only to follow adult’s gaze and gesture but also to direct the adult’s attention to objects

in which they themselves are interested by pointing at them. Over time, joint attention increases

in amount and changes its form and focus, and notably develops into the third year of life and

Mariia Pronina

PhD Research Plan

4

beyond as children become able to sustain complex conversations (Adamson & Dimitrova,

2014).

Parental scaffolding is particularly important during this period. Although early infant

vocalizations lack illocutionary force (Austin, 1962; Searle, 1969) being perlocutionary acts,

caregivers tend to attribute intentions to such utterances and seek out conversational structure

and turn-taking sequences (e.g., Snow, 1977). Caregivers lead the communication; however,

infant’s behavior and preferences show that they also are motivated to communicate. For

example, as noted above, infants have preferences for face-like and eye-like stimuli and show

imitative behavior. Infants also demonstrate a preference for contingency (Toda & Fogel, 1993)

and attempt to continue communication by smiling and vocalizing, something which is essential

for communicative development.

A wealth of studies has shown that children’s pointing gestures are central to the

expression of triadic communication and are strongly tied to later lexical development. For

example, Brooks & Meltzoff (2008) demonstrated that infants’ gaze following and pointing

skills at the age of 10 and 11 months predict their productive vocabulary at two years of age.

The study by Bavin et al. (2008) supported these findings, showing a significant correlation

between early communicative behaviors and vocabulary development in a large sample of

children. They analyzed gesture and object use at one year of age and found that this was a

predictor of vocabulary development at age two years. Caselli, Rinaldi, Stefanini, & Volterra

(2012) explored early actions and “gesture vocabulary”, i.e., gestures referring to lexical items,

in Italian infants (8–18 months) and provided evidence that early gesture correlates not only

with word production but also with word comprehension. Further, Rowe & Goldin-Meadow

(2009) observed infant’s interaction with their caregivers and found that children’s gesture

vocabulary at 18 months, specifically various meanings conveyed in gesture, predicts verbal

vocabulary size at 42 months. Moreover, Iverson & Goldin-Meadow (2005) conducted a

longitudinal stufy with children between the ages of 10 and 24 months and showed that lexical

items (e.g., ball, cup, flower) that a child produces in gesture appear earlier in the child’s verbal

lexicon.

One of the important skills that emerges between 7 and 9 months of age is intentionality

and intention reading. Infants start to demonstrate the ability to differentiate between means

and goals in their own and others’ productions. According to Tomasello (1995), this entails that

infants are able to distinguish between an action, the mean, and the intention underlying this

action, the goal. Previous research has shown that infants begin to comprehend actions as goal-

directed and can discern intentional and accidental actions (Hauf, 2007). New behaviors that

Mariia Pronina

PhD Research Plan

5

children demonstrate at the end of the first year of life are a signal of their emerging

understanding that other people are intentional agents. In addition to understanding intentions,

importantly, children’s communicative interaction is collaborative. Children are not only

capable of communicating but also of showing interest and enthusiasm to do so and of sharing

goals and intentions with others (Tomasello et al., 2005). This core component of human

communication, referred to as shared intentionality, is what sets human cognition apart.

Warneken, Chen, & Tomasello (2006) tested infants’ and chimpanzees’ engagement in a joint

activity with an adult. When the adult stopped participating in the activity, the chimpanzees did

not continue communication, while children attempted to reengage the adult at least once,

displaying awareness of mutual goals.

Pointing gestures are also an important token in the development of intentionality. As

indicated by Liszkowski (2005), infant pointing draws on the understanding of others as

attentional beings. Bates, Camaioni, & Volterra (1975) established two types of communicative

pointing gestures carrying different intentions: proto-imperatives and proto-declaratives. For

example, children can produce proto-imperative pointing gestures to request the adult to fetch

an object. They can produce proto-declarative pointing gestures (e.g., showing an object to the

adult) to engage the adult in communication and share the experience. Therefore, it is claimed

that two types of pointing do not imply the same cognitive abilities: “whereas imperative

pointing relies on a simple expectation that people will function as causal agents, declarative

pointing implies the capacity to influence the other person’s attentional state about aspects of

the environment and, at the same time, to perceive the other person as capable of understanding

one’s communicative intentions” (Camaioni, Perucchini, Bellagamba, & Colonnesi, 2004, p.

305). Indeed, in a later study, Camaioni et al. (2004) demonstrated that children who are better

in intention understanding produce more declarative pointing, while no relation was found

between imperative pointing and intention inferring.

Other authors have further analyzed the pragmatic intentions that an infant can express

with pointing gestures. For example, Tomasello, Carpenter, & Liszkowski (2007) proposed to

divide proto-declarative gesture into expressive gestures, that are used with purpose of sharing

an attitude towards an object, and informative gestures, that are used with purpose of providing

the lacking information to the adult. Similarly, Kovács, Tauzin, Téglás, Gergely, & Csibra

(2014) proposed that infants can point to an event to express two intentions: either to share their

appreciation of it with the adult or to exhibit the information about it from the adult (epistemic

request).

Mariia Pronina

PhD Research Plan

6

In answer to the question of how infants understand the intentions of others, it has been

suggested that they importantly rely on act-preceding information of joint attention scenes

(Tomasello et al., 2007). However, recent studies have also established the relevance of

multimodal cues in infant intention inferring (e.g., Behne, Carpenter, & Tomasello, 2005;

Camaioni et al., 2004). For example, Behne et al. (2005) tested whether 14- to 24-month-old

children were able to understand the intention of the adult (i.e., to inform about the location of

the toy) in the context of a hiding game when an adult was ostensively gazing or pointing to a

container with a hidden toy. In the non-communicative condition, the adult gestured toward the

box in a non-communicative manner. The findings indicated that children can distinguish

communicative gestures and facial expressions from non-communicative ones, and, moreover,

they could use these multimodal communicative cues to infer the location of the toy. In another

study, Camaioni et al. (2004) demonstrated that infants differentiate between imperative and

declarative intentions, as they reacted differently to imperative and declarative pointing. Other

studies have shown that infants rely strongly on prosodic cues when detecting intentions.

Carpenter, Akhtar, & Tomasello (1998) examined whether 14- to 18-month-old infants were

able to understand that the caregivers’ speech is intentional only on the basis of speech cues.

The experimenter performed two types of actions: intentional and accidental. The types of

actions differed only in the verbal message. For intentional actions, the expression was ‘There!’

(with appropriate prosody), for accidental actions the expression was ‘Whoops!’ (with

appropriate prosody). The authors found that infants imitated the intentional actions more than

the accidental ones, which led them to the conclusion that infants rely on the prosodic cues to

detect intentional actions. These findings were corroborated by Sakkalou & Gattis (2012). In

their first experiment, they used the action imitation paradigm developed by Carpenter et al.

(1998) in which intentionality is marked with lexical and prosodic cues. Similar to this study,

they found that infants imitated more intentional than accidental actions. In the second

experiment, they removed the lexical cues (the lexical information was presented in Greek).

They found that in the second experiment, older children imitated more intentional actions than

younger infants. The results of both studies showed that infants copied the intentional actions

more often, which suggests that children are able to detect intentionality solely on the basis of

prosodic cues accompanying the action. Similarly, a more recent study by Esteve-Gibert et al.

(2017) demonstrated that 12-month-old children use specific multimodal act-accompanying

cues, in particular prosody and hand shapes, in order to distinguish the expressive, imperative

and informative intents behind their caregiver’s pointing gestures.

Mariia Pronina

PhD Research Plan

7

As far as the relationship between prosody and language acquisition is concerned, a body

of studies have looked at the bootstrapping role of prosody for speech stream segmentation into

words (for a review, see Thorson, 2018) and syntactic parsing (de Carvalho, Dautriche, Millotte,

& Christophe, 2018). When interacting with infants, adults usually use Infant-Directed Speech

(henceforth, IDS) which is characterized by slower speech rate and exaggerated pitch variations

(e.g., Fernald & Simon, 1984). These prosodic particularities of IDS help infants to build the

phoneme repertoire (e.g., Werker et al., 2007) and better learn new words (e.g., Ma, Golinkoff,

Houston, & Hirsh-Pasek, 2011), among others. On the syntactic side, prosodic bootstrapping

theories (Gleitman, 1990; Morgan & Demuth, 1996) propose that infants acquire syntax by

processing and acquiring prosodic information. Further, it has been demonstrated that using co-

speech gestures in IDS can also facilitate verbal development (Goodwyn, Acredolo, & Brown,

2000).

With regard to prosody production to express intentionality, research on the early use of

prosodic cues to express pragmatic meanings has shown that in the second half of the first year

of life, children can use different prosodic patterns to signal their intentions (e.g., Esteve-Gibert

& Prieto, 2013; Papaeliou, Minadakis, & Cavouras, 2002; Papaeliou & Trevarthen, 2006, see

Esteve-Gibert & Prieto, 2018 for a complete review). Papaeliou et al. (2002) showed that 7- to

11-month-old infants use distinct prosodic cues (duration, F0) to produce emotional

vocalization and vocalizations with pragmatic intent. Another study by Papaeliou & Trevarthen

(2006) complemented these findings by identifying different prosodic patterns used by the

babbling infants to produce communicative versus investigative vocalizations. Further, Esteve-

Gibert & Prieto (2013) analyzed a longitudinal corpus of Catalan-babbling infants and found

that infants can express intentionality by using distinct prosodic patterns. Their findings showed

that pitch range and duration patterns of vocalizations signal their communicative and

pragmatic intentions. So, unlike investigative vocalizations, communicative vocalizations were

shorter and had a wider pitch range. In turn, the pitch and duration patterns of communicative

vocalizations depended on the intention (request or expression of discontent versus response or

statement).

Furthermore, the joint production of pointing gestures and accompanying vocalizations

was examined in several studies (Aureli et al., 2017; Grünloh & Liszkowski, 2015; Murillo &

Capilla, 2016; Murillo, Ortega, Otones, Rujas, & Casla, 2018). Aureli et al. (2017) investigated

how infants accompany their intentional gestures with prosody and demonstrated that 12-

month-old Italian-learning infants already display pointing-vocal coupling and that the match

between these two modalities gradually becomes more attuned. Grünloh & Liszkowski (2015)

Mariia Pronina

PhD Research Plan

8

examined point-accompanying characteristics (vocalizations, hand shape) elicited in

laboratory-controlled situations produced by Dutch 14-month-olds. They found that the

prosodic characteristics of infants’ vocalizations distinguish requestive acts from expressive

and informative ones, in the same way pointing hand shapes do. The authors also reported that

when requesting, pointing infants use distinct vocalizations depending on the distance of the

referent. Relatedly, Murillo & Capilla (2016) analyzed communicative abilities in infants in an

integrated fashion. Their results showed that with declarative pointing, Spanish infants

produced a flat intonation and with requestive pointing, they produced a rising intonation.

The coordination between the acoustic and visual language dimensions emerges by the

end of the babbling period and is particularly important in language development. Esteve-Gibert

& Prieto (2014) found that at this stage there is temporal coordination between gesture and

speech in an adult-like way: gesture onset occurs before speech onset. Interestingly, Igualada,

Bosch, & Prieto (2015) found that the ability to successfully use gesture–speech integration

(i.e., the ability to produce simultaneous gesture–speech combinations) is also related to later

lexical and grammatical development. Taken together, these findings show that children’s use

of prosody and pointing gestures, as well as the ability to combine them, predict later linguistic

outcomes.

All in all, the results reviewed in this section suggest that at the preverbal stages, children

heavily rely on multimodal aspects of communication reflecting pragmatic meanings such as

prosody, facial expression, eye gaze and manual gestures. Alongside with joint attention skills

and the ability for shared intentionality, gestural and prosodic cues play an important role in

children’s pragmatic development, and, moreover, constitute a bootstrapping step for acquiring

language. The ability to access and express pragmatic meanings through prosody and gesture

continues to develop over a long period. The next section will review the studies focusing on

the later stages sociopragmatic development of children and its multimodal foundations.

1.2.1.2. Verbal period

Even though less research has been carried out on the role of gesture and prosody in later

stages of sociopragmatic development, evidence points to the fact that they continue to boost

language processing and learning. For example, Igualada, Esteve-Gibert & Prieto (2017)

demonstrated that preschoolers benefit from beat gestures in a word learning task recalling the

focused word better when it is accompanied by a beat gesture (see Hübscher & Prieto, submitted

for review). Looking at syntactic acquisition, research has investigated the relation between

gesture-plus-word combination and two-word combinations. Studies by Iverson & Goldin-

Mariia Pronina

PhD Research Plan

9

Meadow (2005) and Özçalişkan & Goldin-Meadow (2005) found that the use of gesture-word

combinations conveying a word and a supplementary gesture in a proposition predicts the

transition to the two-word stage. That is, the child who first produces this particular type of

gesture-speech combination is also first to produce two-word combinations. In addition, Rowe

& Goldin-Meadow (2009) also showed that the number of gesture-speech combinations is a

predictor of later verbal sentence complexity. While most of the studies explore infant’s

pointing gestures, fewer studies have investigated other types of gesture. Importantly, there is

another frequent type of gesture that emerges in children between 1 and 2 years of age, e.g., the

so-called refusal gestures. Refusal gestures appear as the response to questions by the middle

of the second year before the emergence of the corresponding verbal expressions (Volterra &

Antinucci, 1979). By two years of age, children use conventional refusal gestures, such as head

shakes, and start to combine them with vocalizations, and then with words (Guidetti, 2000,

2005). In a longitudinal study, Guidetti (2002) analyzed gestures used by infants between 16

and 36 months and found that gestures of refusal and agreement were the most frequent

alongside pointing in children’s interactions with their mothers. For example, Kettner &

Carpendale (2013) provided evidence that between 13 and 15 months, children start to shake

the head for no, and later, between 16 and 18 months, they are already able to nod the head

for yes. Together with pointing gesture, refusal gesture has been shown to act as precursors of

language development (e.g., Beaupoil-Hourdel, Morgenstern, & Boutet, 2015; Guidetti, 2000,

2005; Morgenstern, Beaupoil-Hourdel, Blondel, & Dominique, 2016), specifically in early

development of negation. For instance, it has been shown that the first negative constructions

emerge with early gestures of rejection and avoidance (Beaupoil-Hourdel et al., 2015).

Similarly, Morgenstern et al. (2016) analyzed multimodal path into negation in both hearing

and deaf children, and although there were individual differences, they found that gesture is an

important component in the itineraries. Thus, children enter verbal modality of negation through

the use of gesture and afterwards children use negation gestures to reinforce or complement

spoken utterances.

In relation to iconic gestures, one study by Özçalişkan, Gentner, & Goldin-Meadow

(2012) found that, unlike the onset of pointing gestures, iconic gestures appear much later,

generally at around 25 months, when children have already started to produce their first words.

The findings indicate that children first acquire the verb system and then are able to use iconic

gesture to increase their repertoire of action meanings.

This section will review research that provides evidence that children continue relying

strongly on multimodal aspects of communication after they begin to produce first words, while

Mariia Pronina

PhD Research Plan

10

highlighting the importance of the precursory role of prosodic and gestural cues for later

sociopragmatic development. The subsections below review a variety of pragmatic areas that

develop during preschool years, namely speech act marking, focus and informational structure,

epistemic and emotional states, politeness, pragmatic inferences and irony, and discourse

abilities such as narratives and turn-taking (see also Hübscher & Prieto submitted for a review).

Speech acts

Speech act development can be considered one of the pragmatic milestones achieved in

infancy. While proto-declaratives serve as the antecedents of later statements and questions,

proto-imperative serve as antecedents of emerging orders and demands (Ninio, 2014). Ninio &

Wheeler (1984) developed a pragmatic coding taxonomy (i.e., the Inventory of Communicative

Acts, INCA) which has been widely applied by a number of researchers (see Cameron-

Faulkner, 2014 for a summary). Their findings established the gradual emergence of the speech

acts. Noteworthy, many scholars still point out that the classification of speech act categories

of young children’s utterances is often challenging.

As mentioned in the preceding section, studies on early pragmatic development have

brought evidence that infants are able to distinguish between the communicative intentions of

others (e.g., imperative, expressive, informative) by associating specific prosodic and gestural

cues with specific speech acts (e.g., Behne, Liszkowski, Carpenter, & Tomasello, 2012; Esteve-

Gibert et al., 2017; see Liszkowski, 2014 for a review). Moreover, infants can also express

proto-speech acts (e.g., proto-imperative and proto-declarative, see Bates et al., 1975; see also

Esteve-Gibert & Prieto, 2013; Tomasello et al., 2007). Proto-speech acts appear at around 10

months and they are first expressed through gesture and intentional vocalizations, suggesting

that gestural and prosodic cues play a key role in the development of speech acts.

Studies on intonational development confirm that intonation acts might be a precursor of

speech act marking expressed verbally. By the age of 2 years, infants produce two-word

combinations and have a full repertoire of pitch accents and boundary tones (Rusiewicz &

Esteve-Gibert, 2018) to express speech acts such as assertion, question or request. Prieto,

Estrella, Thorson, & Vanrell (2012) analyzed a large longitudinal corpus from children between

0;11 and 2;4 acquiring Catalan and Spanish, and assessed the pragmatic meaning of the

intonation patterns. They provided evidence that from the onset of the speech children use a

variety of phonologically distinct intonation contours for particular pragmatic meanings and

over time children master the prosodic properties of the nuclear configurations. It is important

to note that from the beginning the pragmatic meanings conveyed by intonation patterns are

Mariia Pronina

PhD Research Plan

11

also adult-like. For instance, Catalan children use target-like intonation patterns for vocatives

and interrogatives (rising contours) and produce complex intonation patterns in an adult-like

way to express request, insistence or discontent. Similarly, Frota, Matos, Cruz, & Vigário

(2016) found that Portuguese children at the age between 1 and 2 years appropriately produce

phonologically distinct intonation patterns which reflect basic speech acts such as statements,

vocatives and requests.

On the whole, gestural and prosodic cues to speech acts develop first, and then children

acquire language-specific intonational patterns used to express particular types of speech acts,

showing early development of the intonational grammar of the ambient language.

The development of speech act understanding continues in the preschool years. By the

age of 3 years children demonstrate understanding of the underlying features of speech acts

(Rakoczy & Tomasello, 2009). This ability has been proposed to be associated with Theory of

Mind abilities since the interpretation of some speech acts entails inferential processes (see

section 1.2.2. for more details). Concerning indirect speech acts, i.e., speech acts that represent

a mismatch between form and function, contradictory findings are found in the developmental

literature. A common view is that at the age of 3 years children can understand indirect speech

acts, especially indirect requests (e.g., Bucciarelli, Colle, & Bara, 2003; Shatz, 1978). In

general, the preschool period is characterized by the stabilization of the control of prosodic

features (Rusiewicz & Esteve-Gibert, 2018) and by the acquisition of more complex pragmatic

meanings, as the ones that will be reviewed in the following section.

Focus and information structure

The ability to understand specific pragmatic meanings such as the informational structure

of an utterance also has a long developmental trajectory. It has been argued that the

comprehension of focus prosody takes time to develop. More specifically, contrastive

interpretation of pitch prominence is observed in children only after the age of six (Ito, Bibyk,

Wagner, & Speer, 2014; Ito, Jincho, Minai, Yamane, & Mazuka, 2012; Speer & Ito, 2009; for

an updated review, see Ito, 2018). Children’s difficulty in interpreting contrastive prosody

cannot be explained by limited perceptive abilities given their sensitivity to prosodic features

from the very first stages of life (Christophe, Mehler, & Sebastián-Gallés, 2001; Decasper &

Spence, 1986; Nazzi, Bertoncini, & Mehler, 1998, among many others; see also section

1.2.1.1.). In a recent study, Kurumada & Clark (2017) argued that this difficulty may be

attributed to an underdeveloped ability to identify possible alternative expressions to create a

contrast. However, the authors found that when an alternative is contextually provided, 4-year-

Mariia Pronina

PhD Research Plan

12

old children are capable to understand intonational meaning intended and thus arrive at the

implicature. Based on their findings, it can be concluded that preschool children can identify

relevant information and appropriately interpret informational status of referents by means of

prosodic cues. Moreover, there is other evidence that focus may be acquired much earlier. For

example, it has been shown that younger children, specifically under the age of 2, rely on

prosodic cues to focus in word learning, thus, prosodic prominence expressing focus boost

language acquisition (e.g., Fernald & Mazzie, 1991; Thorson & Morgan, 2014).

On the production side, along the same lines, mixed results have been reported regarding

the age at which children start to use prosodic prominence to mark focus, ranging from 3 to 6

(see Chen, 2018 for a review). However, studies on early intonational development have

demonstrated that even younger children (under the age of 2) are able to use prosodic cues to

mark emphasis and novelty (e.g., Frota & Butler, 2018; Murillo & Capilla, 2016; for an

overview, see Hübscher & Prieto, submitted). Hübscher & Prieto (submitted) and Snow (2017)

proposed that intonation and gesture “act as sister systems in the signaling of information

focus”, suggesting that even young infants can express focus by pointing and pitch prominence

(see also results of Igualada et al., 2015). In later stages, between 4 and 5 years of age, gestural

patterns change. Children of this age begin to use manual beat gestures (i.e., rhythmic non-

representational movements of hand, arm or head associated with prosodic prominence) to

highlight information (Hübscher & Prieto, submitted). The use of prosodic prominence

accompanied by beat gestures is a step forward in the development of gesture-speech

integration to mark focus information. Contrastive focus marking takes longer to develop. A

study by Esteve-Gibert, Lœvenbruck, Dohen, & D’Imperio (2017) showed that before starting

to use prosodic means to mark contrastive focus, preeschol-age children are already able to use

gestural cues to express it (e.g., head nods).

In short, research suggests that although the development of focus is a long process, even

preverbal children can comprehend and express infromational structure to a certain extent. On

the perception side, they benefit from prosodic cues in lexical learning and in contrastive focus

comprehension. On the production side, they are able to combine prosodic and gestural cues to

mark focus since early stages of communicative development, and older children continue

developing the ability to integrate multimodal cues (early use of beat gestures) to express focus.

Epistemic states

Looking at the children’s acquisition of knowledge state (i.e., epistemic meanings),

previous studies indicate that by the age of 2, children can understand the epistemic state of

Mariia Pronina

PhD Research Plan

13

others (e.g., knowledgeable versus ignorant partners, see Liszkowski, Carpenter, & Tomasello,

2008) and by the age of 3 years, they are able to distinguish between reliable and unreliable

information (Koenig & Harris, 2005). However, only later, at around 4 years of age, children

can understand speaker knowledge state encoded by lexical markers, either mental verbs or

modal expressions (Moore, Bryant, & Furrow, 1989; Moore, Pure, & Furrow, 1990; see also

Hübscher & Prieto, submitted).

In this context, research has shown that multimodal cues act as epistemic precursors of

later lexical marking of epistemic meanings (Armstrong, Esteve-Gibert, Hübscher, Igualada, &

Prieto, in press; Bartz, 2017; Hübscher, Esteve-Gibert, Igualada, & Prieto, 2017; Kim, Paulus,

Sodian, & Proust, 2016). Some evidence has supported that even young children can signal

their own epistemic states through gestural cues. For instance, in a longitudinal study with

infants from 14 to 42 months, Bartz (2017) showed that at the age of 2, children start to signal

their ignorance with gestures (e.g., flip their palms to the side), moreover, verbal cues to

ignorance appear later. Furthermore, Kim et al. (2016) assessed the presence of uncertainty

gestures in preschool children (3–4-year-old) and found that while they were not able to express

their knowledge state verbally, they could do it through gesture (e.g., shrugging shoulders,

looking away, shaking the head). A recent study conducted by Hübscher et al. (2017) examined

the relative contributions of lexical and multimodal cues to preschoolers’ uncertainty

comprehension. Three- to five-year-old children participated in a knowledge state

comprehension task in which they were asked to decide about speaker belief state. Crucially,

the materials contained lexical, intonational and gestural markers. The results showed that

children detected uncertainty better when gestural cues were presented. Moreover, younger

children were found to be more sensitive to prosodic markers of speaker uncertainty than to

lexical markers. The authors suggested that prosodic and gestural cues play a bootstrapping role

in children early epistemic development. By the same token, Armstrong et al. (2018) found that

preschoolers better understood speaker’s incredulity, another type of belief state, when encoded

in both prosody and gesture, suggesting that the two cues develop in a parallel way.

A follow-up study by Hübscher, Vincze, & Prieto (under review) investigated preschool

children’s production of uncertain knowledge. Similar to the previous experiment, prosodic,

gestural and lexical markers were examined. In a cross-sectional production study with 3- and

5-year-olds, the results showed that the younger group of children (3-year-olds) first used

multimodal cues (i.e., prosodic and gestural cues) to signal their uncertainty, but not lexical

markers.

Mariia Pronina

PhD Research Plan

14

To sum up, recent research has provided evidence that epistemic state information is first

expressed through multimodal markers, and that preschoolers first rely on gestural and prosodic

cues in understanding the knowledge states of others. These findings suggest that epistemic

multimodal cues play a precursory role in pragmatic development (see also Armstrong &

Hübscher, 2018; Esteve-Gibert & Prieto, 2018).

Emotional states

While infants have been shown to interpret emotion online from facial expression (see

section 1.2.1.1. above), the ability to label emotions develops later. It has been shown that by

two years of age, some children are able to match some names of some emotions and the

appropriate facial expression (e.g., Izard, 1971). Older children can recognize emotions such as

happiness, sadness, fear, anger and disgust in facial expression (e.g., Harrigan, 1984, among

many others). Moreover, although prosodic abilities related to emotions develop very early in

infancy (see section 1.2.1.1. above), surprisingly, children start to use their speech perceptual

skills to understand speaker’s affective state much later (Nelson & Russell, 2011; Quam &

Swingley, 2012). Nelson & Russell (2011) found that preschoolers seem to label emotion on

the basis of facial and postural cues and the ability to understand prosodic emotional cues takes

longer to develop. By the same token, Quam & Swingley (2012) showed that while 4- and 5-

year-old children consistently used happy or sad prosody to interpret a situation or an emotion,

2- and 3-year-olds exploited facial and body-language cues. Furthermore, when prosodic cues

compete with lexical cues, young children seem to rely on lexical ones, in contrast to adults

(e.g., Friend & Bryant, 2000; Morton & Trehub, 2001). As has been proposed by Fernald

(1989), children pay more attention to what is said rather than how it is said. In a similar fashion,

Friend (2001) suggested that a transition from affective to linguistic meaning in the

comprehension of prosody occurs, therefore while infants rely on prosodic cues, older children

rely more on lexical content. However, it should be noted that children are unlikely to confront

situations with discrepant cues.

In relation to prosody, younger children have been shown to implicitly use acoustic cues

to infer the emotional state of others. In a series of studies, Berman and colleagues (Berman,

Chambers, & Graham, 2010, 2016; Berman, Graham, Callaway, & Chambers, 2013; Berman,

Graham, & Chambers, 2013) explored the children’s ability to relate emotional prosody to

emotion by using both explicit (pointing responses) and implicit online (eye-tracking)

methodologies. When investigating children’s sensitivity to acoustic cues (happy-, sad- or

neutral-sounding prosody) to detect affect in a context of referential ambiguity (e.g., broken

Mariia Pronina

PhD Research Plan

15

doll and intact doll), Berman et al. (2010) found that 4-year-olds’ pointing responses did not

show understanding of emotional prosody, but their gaze behavior did. Berman, Graham, &

Chambers (2013) reported that 5-year-old children succeeded in both explicit and implicit tasks.

Similarly, in Berman, Graham, Callaway, et al. (2013) 5-year-old children showed sensitivity

to acoustic cues bearing emotional meaning through both pointing and eye gaze, while 4-year-

olds resolved it only in their eye movements. The results of a more recent study (Berman et al.,

2016) showed that while only older children at the age of 5 can explicitly match acoustic and

visual cues, young children at the age 3 are able to do it implicitly in case of negative affect

(sad-sounding intonation and sad-looking face). These findings were echoed by Khu,

Chambers, & Graham (2018) who found that eye gaze measures of 4-year-old children indicate

that they use the interlocutor’s emotional prosody to infer the interlocutor’s emotional state.

Taken together, these studies indicate that children heavily rely on multimodal cues in

emotion recognition. While they exploit gestural cues very early on, the ability to link prosodic

cues to emotion starts to emerge in an implicit manner around age 3, and only later (around age

4 or 5) in a more explicit way. Finally, it is noteworthy to highlight that whereas a growing

body of studies focuses on recognition of emotions on the basis of facial expression and

prosodic cues, much less is known about how children use prosody to express their emotional

states.

Polite stances

It has been shown that listeners expect the speaker’s affect to be consistent with his or

her polite stance (Camras, Pristo, & Brown, 1985). However, research on children’s awareness

of politeness has primarily focused on the acquisition of grammatical constructions and lexical

cues and have neglected to address the issue from multimodal point of view. To our knowledge,

only a couple of studies have looked at the role of the prosodic cues in the understanding of

politeness in preschool-aged and school-aged children (Bates, 1976; Shochi, Erickson,

Sekiyama, Rilliard, & Aubergé, 2009). Bates (1976) reported that children first identify the use

of please (3-year-olds) as polite, then they can recognize the use of question intonation (4-year-

olds) as more polite, and later they perceive the conditional form as more polite (6-year-olds).

Yet, a recent study by Hübscher, Prieto, & Wagner (in press) has shown that 3-year-old children

can broadly rely on prosodic and visual cues in inferring politeness meaning. In this between-

subject study, 3-year-old children participated in a comprehension task and were asked to judge

the speaker’s polite stance. Across all conditions, the lexical cue to polite stance please was the

same and prosodic and facial features varied across conditions. The results of the experiment

Mariia Pronina

PhD Research Plan

16

revealed that preschool children can make use of intonation and/or facial expression exclusively

to access speaker’s polite meaning. These findings contradict prior research on children’s use

of prosodic cues when inferring politeness stance (Bates, 1976), but at the same time are in line

with other studies showing that preschoolers use multimodal cues to infer emotional and

pragmatic meanings (see Armstrong & Hübscher, 2018 for a review).

The ability to recognize politeness meanings is an important step in children’s

sociopragmatic development and emerges relatively early. Another no less important ability to

communicate politeness appears to develop more slowly, that is children’s own evaluation of

the social pragmatic conditions in the discourse context that should trigger polite behavior. To

be appropriately polite, children must be able to evaluate the social relationship between the

speaker and the hearer by assessing in conversation factors such as social distance and power

among interlocutors, as well as the cost of the action (Brown & Levinson, 1987). Children are

generally taught to use appropriate polite forms by adults who provide them a model (Küntay,

Nakamura, & Ateş-Şen, 2014). But the process of acquisition of politeness signaling takes place

over the first five years of life. The developmental literature states that the ability to use target

polite forms is acquired around 5 years of age or older (e.g., Baroni & Axia, 1989). According

to Bates (1976), until 4 years children primarily use direct questions and imperatives to request

something, then, from 5 to 6, they can use proper syntactic forms, but only later, at around age

7, children are able to use appropriate to the content requesting form.

Along the same lines, previous work on children’s acquisition of register conditions in

politeness marking has mainly focused on the lexical (and also morphosyntactic) strategies to

signal politeness meanings, though there are a few exceptions. For example, a recent cross-

sectional study (Hübscher, 2018) sheds light on the children’s ability to use multimodal cues

(prosody, gesture, body posture) to express politeness and their role in development. A group

of 3- to 5-year-olds was prompted to request high-cost and low-cost objects in situations

embedded in high and low social distance conditions. The results showed that preschoolers use

a wide range of prosodic mitigation strategies (e.g., rising intonation, more breathiness, slower

speech rates), gestural mitigation strategies (e.g., eyebrow raises, smiles, adaptors) and body

signals (e.g., raised shoulders, trunk lateral leanings) to request a high cost object and/or asking

an interlocutor with higher social distance. Further, in contrast with 3-year-olds, the older

children showed a more extended use of indirect polite constructions which indicates that the

ability to exploit lexical and morphosyntactic markers to communicate politeness develops in

a more slow way during the preschool period.

Mariia Pronina

PhD Research Plan

17

The studies reviewed in this subsection show that gesture and prosody continue to play

a precursory role in pragmatic development, specifically in the development of politeness,

beyond the infancy period.

Pragmatic inferences and irony

Inferring beyond the literal meaning of discourse has proved to be a difficult area for

young children’s pragmatic development. Children acquire gradual understanding of pragmatic

and nonliteral language and continue to improve it throughout childhood. For example, it has

been argued that preschoolers do not use contextual information to interpret utterances in the

same way that adults do (Snedeker & Trueswell, 2004). Given that, children’s ability to derive

scalar implicatures has been a lively area of research. In scalar implicatures, a weaker term of

a scale is used while a stronger alternative is excluded. An example would be the use of “some”

instead of “not all” in the sentence “some of the animals are sleeping” (example extracted from

Barner, Brooks, & Bale, 2011). Therefore, “some” implies “not all” (“not all the animals are

sleeping”) and the listener needs to generate an enriched meaning and conclude what the

speaker did not say but could have said. Research has shown that children younger than 5-and-

a-half years old fail to draw scalar implicatures at the same rate as adults (e.g., Barner et al.,

2011; Katsos & Bishop, 2011; Noveck, 2001, among others). However, some authors propose

that preschool children do indeed start to have the necessary pragmatic competence to derive

implicatures. They argue that difficulties can be explained by the lack of processing resources

(Pouscoulous, Noveck, Politzer, & Bastide, 2007) or vocabulary size (Barner et al., 2011).

Next to the aforementioned studies, getting a grasp on nonliteral meanings (humor,

irony) also requires the understanding of the intentions of others. Infants’ intention-reading

skills allow them to detect jokes at a very early stage. Children begin to appreciate humor as

young as 1 year and first stimuli that elicit infant’s laughter are nonverbal (auditory, tactile,

social, visual) (Sroufe & Wunsch, 1972; also see Hoicka & Wang, 2011 above). The study

conducted by Hoicka and Gattis (2008) demonstrated that 2- and 3-year-old children detect

jokes on the basis of laughter that may or may not accompany a joke action. Later in

development, children largely rely on visual and auditory cues in grasping humor. When humor

is verbal, children face difficulties in understanding certain type of jokes until 8 years (Shultz,

1974).

Shifting to irony, despite some divergences among findings, as a whole, it is agreed at

between 5 and 11 years children begin to understand speaker’s ironic intent (e.g., Demorest,

Meyer, Phelps, Gardner, & Winner, 1984; Filippova & Astington, 2010; Harris & Pexman,

2003). Recent studies have shown that audiovisual cues can facilitate children’s detection of

Mariia Pronina

PhD Research Plan

18

irony (González-Fuente, 2017; Laval & Bert-Erboul, 2005). Laval & Bert-Erboul (2005)

showed that 5-year-old children build their interpretation of sarcastic utterances on intonation.

Furthermore, González-Fuente (2017) investigated the role that prosodic and gestural cues play

in children’s early understanding of verbal irony. Five-, eight- and eleven-year-old children

participated in a comprehension task which involved the detection of irony on the basis of

audiovisual cues to emotion: strongly mismatching, weakly mismatching and matching

prosodic and gestural cues. The results showed that strongly mismatching multimodal cues lead

to better irony understanding in 5-year-olds.

In sum, some complex pragmatic skills such as ability to derive scalar implicatures, grasp

humor and comprehend irony are not acquired until after the preschool years. These pragmatic

contexts require the appreciation of the speaker’s intent and assessment of multiple cognitive

and emotional cues which implies an inference process. Children master their inferring ability

as they grow over and this is related to their developing cognitive skills (see section 1.2.2.2. on

ToM and pragmatics below).

Discourse abilities: turn-taking and narrative development

In order to use language efficiently, children must learn the pragmatic conventions that

are acquired through social interactions (Carmiol & Sparks, 2014). The knowledge of

communication rules is a part of pragmatic competence and includes the understanding of the

timing of social exchanges. During the preschool years, children develop turn-taking skills

learning how to get into and maintain a conversation, make repairs and evaluate what is relevant

(Casillas, 2014). At around three and four years of age, children can regularly use conventional

fillers in conversation, and by age four, children are able to time turn-taking in an adult-like

way (Casillas, 2014). Interestingly, it has been speculated that children rely more on prosodic

cues to identify upcoming turn-end boundaries as compared to adults (Casillas & Frank, 2012).

By age five, children can anticipate and repair communication breakdowns (Casillas, 2014). By

age six, children master other conversational abilities such as breaking into ongoing

conversation (Ervin-Tripp, 1979). On the whole, children rapidly acquire turn-taking skills over

the first years of life.

However, other pragmatic aspects take more to develop. Even though the ability to string

sentences together to convey extended discourse such as narratives begins to emerge during the

preschool years (e.g., Applebee, 1980), this period is considered transitional as children take

their first steps in narrative development. The early narratives produced by 3- to 4-year-olds are

characterized by the lack of important features such as having a goal-based structure. Children

Mariia Pronina

PhD Research Plan

19

do not produce ‘true narratives’ until the age of 5 or 6 (Applebee, 1978). Only at this age do

children start to use discourse markers. Syntactic complexity and thematic coherence of their

narratives increase, as well as the complexity of informational structure (Berman & Slobin,

1994; Diessel & Tomasello, 2005; McCabe & Peterson, 1991). Moreover, the narratives

become more adult-like in the prosodic domain (Kallay & Redford, 2016).

Research has shown that gesture plays an important role in narrative development. Thus,

several studies reported that gesture and speech continue to develop simultaneously (Colletta

et al., 2015; Graziano, 2014; Sekine & Kita, 2015). For example, Sekine & Kita (2015) analyzed

elicited narratives produced by 3-, 5- and 9-year-old children and found that gesture and speech

developed in parallel at the sentence and discourse levels. In this way, as narratives become

more complex, the gestures used by children also develop. In a similar vein, Graziano (2014)

indicated that the appearance of particular types of gestures (e.g., palm presentation gesture) is

tied to the developing ability to structure a narrative. Colletta et al. (2015) explored narrative

development in preschool-age (5-year-olds) and school-age (10-year-olds) children, and also

pointed out the co-development of speech and co-speech gestures. A longitudinal study by

Demir, Levine & Goldin-Meadow (2015) demonstrated that gesture use at the age of 5 predicts

later narrative structure in speech.

Furthermore, other studies suggest that gestures have a boosting role in narrative

development. Some studies have found such evidence with respect to discourse comprehension

(Clark, Hutcheson, & Buren, 1974; Goldin-Meadow & Wagner, 2005; Llanes-Coromina, Vilà-

Giménez, Kushch, Borràs-Comes, & Prieto, 2018; Macoun & Sweller, 2016; McNeil, Alibali,

& Evans, 2000; McNeill, 2000). For example, McNeil, Alibali, & Evans (2000) tested

preschooler’s speech comprehension, and established that when the spoken messages were

complex, reinforcing gestures gave external support for language comprehension since they

guide children’s attention toward the meaning of the message. Macoun & Sweller (2016)

reported benefits of iconic and deictic gesture in narrative comprehension. A more recent study

by Llanes-Coromina et al. (2018) found that beat gestures also help children to recall discourse

information and facilitate comprehension. In addition to fostering comprehension in children,

gestures can be used as a scaffold to produce more elaborate narrative discourse (Vilà-Giménez

& Prieto, 2018). For instance, in Vilà-Giménez & Prieto (2018), 5- to 6-year-old children were

shown to produce better narratives in terms of narrative structure and fluency after a training

session in which they were encouraged to perform beat gestures.

Mariia Pronina

PhD Research Plan

20

To summarize, the work reviewed in this section shows that gesture, prosody, and

pragmatic abilities develop in a parallel way, and gesture alongside with prosody continue to

play a key precursory role in sociopragmatic development beyond the early infancy period.

Although traditionally the studies on pragmatics focused more on lexical and morphosyntactic

cues, recent research has highlighted the importance of multimodal resources. Thus, children

are found to be sensitive to multimodal markers when inferring particular pragmatic meanings

(see sections on nonliteral language, epistemic meanings and polite stances), even if at first they

do so implicitly (see section on emotional states). Although the comprehension of some

pragmatic meanings (e.g., focus) may lag behind, there is recent evidence for earlier sensitivity

to prosodic information in interpreting them (see section on focus and informational structure).

On the production side, preschoolers have already acquired basic intonational inventory of the

ambient language and move to acquire specific prosodic features to express complex pragmatic

meanings (e.g., information structure). At the same time preschoolers continue to refine their

speech motor control (see section on speech acts). Likewise, preschoolers are shown to use

different gestural and prosodic strategies to express complex pragmatic meanings (see sections

on epistemic meanings and polite stances). Finally, although the studies on narrative

development usually focus on older children (starting from the age of 5), the co-development

of speech and co-speech gestures has been indicated also for preschool period (see section on

narrative development).

All in all, while the large and growing body of research has studied the co-development

of the gesture, prosody and pragmatics at the earliest stages (Goldin-Meadow, 1998, 2014),

much less is known about the role played by gesture and prosody in children’s later linguistic

and sociopragmatic development. Gesture is often disregarded in research on children’s later

language development, particularly after the emergence of speech when children acquire

complex lexical and grammatical features of language. However, gesture indeed continues to

form an integral part of the language-learning process, as can be clearly seen from the

developmental evidence on narrative abilities. Along with prosody, both are continuously

developing in form and function throughout childhood. To date, studies on the mapping

between prosody and meaning are limited, specifically in the preschool period, with

intonational development being the main focus of research.

Though the studies reviewed in this section highlight that gesture, prosody and pragmatics

go hand-in-hand in later development and insinuate the role of prosody and gesture as scaffolds,

studies on preschool-age children’s use and comprehension of multimodal cues in their

pragmatic development remain sparse in comparison to the research on the multimodal

Mariia Pronina

PhD Research Plan

21

foundations of language development in infants. Specifically, no research to date has assessed

the relationship between gestural and prosodic imitation abilities and pragmatic abilities in

preschool children (see proposed Study 2 below). Moreover, although specific aspects of

pragmatic development have been analyzed from multimodal perspective (see Hübscher, 2018),

no study has completely assessed children’s use of prosodic and gestural cues to express

sociopragmatic meanings (see proposed Study 3 below).

1.2.2. ToM development and its links with other cognitive and linguistic abilities

1.2.2.1. ToM development

Successful sociopragmatic development require children to obtain the understanding that

another person may have a different perspective and thus to be able to take others’ point of

view. The capacity to attribute mental states (e.g., beliefs, intentions, desires, knowledge) to

others has been widely studied in the developmental literature of what has been called Theory

of Mind (ToM) (a term coined by Premack & Woodruff, 1978; see also Perner (1991), Wellman

(2014) for a review). ToM, also known as ‘mind-reading’, ‘mentalizing’ or perspective-taking,

refers to a cognitive ability to impute one’s own or others’ mental states and to make predictions

about the behavior of other individuals (Premack & Woodruff, 1978, and many others).

Mental attributions could be verbal or non-verbal (Goldman, 2012). In verbal tasks,

children are required to make explicit judgments about others’ behavior. The child’s ability to

attribute false-belief mental states has been of particular interest. The so-called false-belief tests

(Wimmer & Perner, 1983) constitute the traditional way to measure verbal ToM in

developmental research. Let us take as an illustration the standard Sally and Anne paradigm

task (Baron-Cohen, Leslie, & Frith, 1985). In the Sally and Anne tasks, the child observes a

character (Sally) place an object in location A and leave. In the absence of the character, the

object is moved to a new location B by another character (Anne). The child is asked to predict

where Sally will first look for the object when she comes back. The child succeeds in the task

if she/he realizes that Sally will act according to her mistaken belief and therefore will first look

for the object where she left it before leaving (location A), rather than in its real place (location

B). Another standard explicit false-belief test widely used across studies is unexpected content

task such as the “Smarties task” (Astington & Gopnik, 1988; Perner, Leekam, & Wimmer,

1987) in which the child is shown a box of a well-known brand of candy, Smarties, and asked

what is inside to which she/he answers “Smarties”. Then the child opens the box and finds

pencils inside. Once the pencils are returned and the box is closed, the child is asked (a) what

was in the box, (b) what had she/he thought was in the box, and (c) what would her/his friend

Mariia Pronina

PhD Research Plan

22

think is in the box. In order to succeed, the child must be able to reconstruct their own previous

false belief and the false belief of others.

The tasks described are examples of first-order false-belief task. These tasks assess a

child’s understanding that other individuals’ beliefs differ from each other and that one acts in

accordance with one’s own mistaken belief. While these tasks involve the understanding of

what an individual thinks about real events (first-order beliefs), second-order false-belief tasks

involve the understanding of what an individual thinks about other individual’s thoughts (Perner

& Wimmer, 1985). So, second-order tasks require recursive understanding of other people’s

belief about another’s belief such as “John thinks Mary thinks...”. The second-order belief tasks

are usually used to measure ToM in older children.

A large body of research has demonstrated that critical development of ToM occurs

during the preschool period showing a fairly consistent pattern in the child’s development of

ToM (Perner, 1991; Perner & Roessler, 2012; Wellman, Cross, & Watson, 2001; Wimmer &

Perner, 1983). Thus, children younger than 4 years of age are more likely to fail on false belief

tasks. Later on in the preschool years (e.g., between 4 and 5 years of age) children become able

to ascribe false beliefs and anticipate the behavior of others.

Yet, a growing number of studies have argued that ToM abilities emerge much earlier,

even beginning at the first year of life (see for an overview Baillargeon, Scott, & Bian, 2016;

Baillargeon, Scott, & He, 2010; Scott & Baillargeon, 2017). These investigations provided

evidence that false belief understanding is presented already in infants and toddlers. In order to

explore early mind-reading abilities, usually nonverbal implicit tasks are used, which are less

cognitively demanding and more suitable for testing very young children. In contrast to

traditionally used verbal tests, nonverbal tasks do not require an explicit judgment by the child.

Thus these studies often use eye-tracking techniques to demonstrate that infants can correctly

reason about different false-belief scenarios and make predictions. For example, infant’s

looking time is measured in violation-of-expectation tasks showing that children look longer

when agents act inconsistently with their false beliefs (e.g., Kovács, Téglás, & Endress, 2010;

Oniski & Baillargeon, 2005; Yott & Poulin-Dubois, 2012). Another measure is that of

anticipatory looking, which shows that young children anticipate that an agent holding a false

belief will look for a desired object in the erroneous location (e.g., He, Bolz, & Baillargeon,

2012; Sodian et al., 2016; Surian & Geraci, 2012). Some studies with slightly older children

use the measure of preferential looking while children are listening to a false belief story and at

the same time looking at a picture book with matching and non-matching pictures (Scott, He,

Baillargeon, & Cummins, 2012). Another type of spontaneous behavioral response is that of

Mariia Pronina

PhD Research Plan

23

anticipatory pointing, which is produced by the infant to inform an agent who mistakenly looks

for the target about its real location (Knudsen & Liszkowski, 2012a, 2012b). In other words,

children expect an agent to commit a mistake and spontaneously intervene by pointing the true

location. Another paradigm applied in the research on precocious ToM capacities is the use of

elicited-intervention tasks in which the measures of child’s spontaneous helping behavior to a

false belief story character are used (Buttelmann, Carpenter, & Tomasello, 2009; Buttelmann,

Suhrke, & Buttelmann, 2015; Southgate, Chevallier, & Csibra, 2010). Nonverbal tasks also

vary in the false belief scenarios they use (e.g., false belief about location, presence, identity,

etc.) and in their linguistic demand: the tasks used with infants are generally nonverbal, while

the tasks with toddlers may have a linguistic component.

Nonetheless, while verbal false belief results can be easily replicated, a recent study on

nonverbal ToM has shown no convergent validity of false belief processing between different

types of tasks (Dörrenberg, Rakoczy, & Liszkowski, 2018). In this study, four implicit ToM

measures were analyzed including looking times, anticipatory looking, pupil dilation and

spontaneous communicative interaction. The results indicated no robust evidence for both

replicability and convergent validity of these tasks suggesting that the empirical foundation of

implicit ToM is still insufficient to ultimately prove it. Further, some doubts have been raised

as to whether the spontaneous looking and elicited interventions can be taken as strict evidence

of the ToM capacities (Heyes & Frith, 2014; Heyes, 2014). For instance, Heyes (2014) has

argued that these early behaviors can be attributed to perceptual novelty and other low-level

processes such as imaginal novelty, delay‐related memory limitations, or retroactive

interference; thus, doubt could be casted on the existence of implicit ToM ability in infants.

1.2.2.2. The links between ToM and language

Success on the explicit false belief tasks has been shown to be related to language

development. Research in both typically developing and atypical populations has extensively

investigated the nature of the relationship between children’s linguistic abilities and developing

ToM and has revealed clear links between the two (e.g., Astington & Jenkins, 1999; de Villiers

& de Villiers, 2000; de Villiers, 2005; Happé, 1995; Milligan et al., 2007; Tager-Flusberg &

Joseph, 2005). Significant positive correlations were found between ToM and general language

ability (e.g., Astington & Jenkins, 1999; Jenkins & Astington, 1996), semantics (e.g., Astington

& Baird, 2005b) and syntax (e.g., de Villiers & Pyers, 2002; Slade & Ruffman, 2005).

The direction of the correlation between false belief and language has been widely

debated (Astington & Baird, 2005; Astington & Jenkins, 1999; Milligan et al., 2007; Slade &

Mariia Pronina

PhD Research Plan

24

Ruffman, 2005). Many researchers argue that language ability is a predictor of false belief and

thus language plays a causal role in the developing of false belief understanding (Astington &

Baird, 2005). The empirical arguments for this position come from both longitudinal studies

(Astington & Jenkins, 1999; de Villiers & Pyers, 2002) and training studies (to be described in

section 1.2.2.4. below). The results reported in Slade & Ruffman (2005) confirm that linguistic

ability contributes to later ToM. However, they defend that the direction from language to false

belief (and not vice-versa) might be due to statistical reasons. As they pointed out, language

measures often include a wider range of scores compared to those assessing false belief abilities,

which leads the language score to predict false believe performance. The authors further state

that the relation between language and ToM is bidirectional since there is some evidence that

mental state understanding promotes semantic development (Sabbagh & Baldwin, 2001).

Further, Milligan et al. (2007) conducted a quantitative meta-analysis of 104 studies on

the relation between language ability and false-belief understanding in children under age 7.

The meta-analysis showed that in longitudinal studies, earlier language ability scores predicted

later false belief performance, and vice-versa, earlier false belief performance predicted later

language skills. Thus, these findings support the view of Slade & Ruffman (2005) that the

relation between language and ToM is bidirectional. However, when comparing effect sizes,

significant differences between them were found, suggesting significantly stronger effect from

language ability to false-belief performance than the opposite. Although Milligan et al. (2007)

combines results from a substantial amount of work, several limitations of this study need to be

considered. First, only studies conducted in English were analyzed. Second, the analysis was

restricted to general language ability, as well as semantic and syntactic measures, leaving

behind the pragmatic aspect of the language. The alleged reason for excluding pragmatics was

that the pragmatic ability is often assessed by conversational measures (i.e., child’s use of

language is dependent on an interlocutor) while in this study, only standardized language tests

and experimental language tasks were taken into account.

Numerous studies have looked at determining which specific language components

promote ToM. More specifically, previous research has argued that syntactic abilities are

particularly important for ToM (e.g., Astington & Jenkins, 1999; de Villiers, 1995, 2000; de

Villiers & de Villiers, 2000; de Villiers & Pyers, 2002; Smith, Apperly, & White, 2003).

Specifically, de Villiers (1995, 2000), de Villiers & de Villiers (2000) have argued that syntactic

development – in particular the acquisition of sentential complements – promotes false belief

understanding since it provides children with a necessary representational format for reasoning

false beliefs. In sentential complements, the clause that may be false can be embedded in the

Mariia Pronina

PhD Research Plan

25

main clause that may be true. Importantly, the truth-value of the embedded clause does not

depend on the entire sentence. Therefore, the very structure of complement clauses allows to

handle misrepresentation and serves as a bootstrap for false belief understanding. This

assumption was tested in a longitudinal study (de Villiers & Pyers, 2002). Indeed, the results

showed that the mastery of the sentential complements (namely, production of and memory for

complements) is a precursor of false belief understanding, and, as claimed by the authors, a

possible prerequisite for success in false belief. Interestingly, the meta-analysis conducted by

Milligan et al. (2007) revealed that the largest effect sizes were obtained between false belief

performance and the memory for complements test (de Villiers & Pyers, 2002). The role of

other types of clausal sentences in the development of false belief is less clear. However,

relative clause sentences, which require the handling of meta-representation, were also found

to correlate with false belief performance in 4-year-old children (Smith et al., 2003; although

see Hale & Tager-Flusberg, 2003 below).

Unlike the abovementioned studies that highlight the role of syntax, others have argued

about the importance of semantics (e.g., Moore et al., 1990; Olson, 1988; Ruffman, 2000;

Ruffman et al., 2003). For example, Olson (1988) emphasized the role of metacognitive terms

used to refer to mental states (see section 1.2.2.4. below). Similarly, Moore et al., (1990)

showed that false belief performance was related to children's understanding of the modal

expression of speaker certainty and uncertainty. Ruffman (2000) suggested that false belief is

related to general language ability which includes syntax and semantics as they provide children

with the needed terminology for reasoning about mental states. Ruffman et al. (2003) argued

that it is difficult to make a clear distinction between syntax and semantic measures since

semantic abilities are required to understand syntactic tests. In a correlational study with 3- and

5-year-old children, Ruffman et al. (2003) tried to isolate the potential effects of syntax and

semantics by administering a series of specific language measures. The results showed that

semantics but not syntax predicted unique variance in false belief understanding and that the

composite score of syntax and semantics accounted for more variance than syntax alone.

Nevertheless, the authors suggested that one language ability is a clue for another, as well as

the opposite, and argued that ToM is rather related to general language ability in general.

Albeit considerable attention has been devoted to investigate the nature of the relation

between language and ToM and the role played by different linguistic abilities in the

development of the latter, less is known about the relationship between ToM and children's

pragmatic abilities. The relationship between the two stems from the fact that pragmatic

interpretation requires attribute intentions to the speakers and implies reasoning about their

Mariia Pronina

PhD Research Plan

26

mental states (Sperber & Wilson, 2002). As a result, some scholars even suggest that pragmatics

and ToM are equivalent (see the ‘pragmatic theory of mind’ recently proposed by Frank, 2018).

Some other theoretical works defend a pragmatic account of ToM which in part relies on the

Gricean theory of pragmatics such as communicative intentions and implicatures (Grice,

1975). For example, Sperber & Wilson (2002) regard pragmatics as a sub-module of ToM.

However, several authors have suggested distinguishing between different types of pragmatic

inferences depending upon whether or not they include a ToM component. For instance,

recently, Andrés-Roqueta & Katsos (2017) proposed a distinction between “linguistic-

pragmatics” and “social-pragmatics”. While “linguistic-pragmatics” refers to the context that

being understood only requires structural language and pragmatic knowledge, “social-

pragmatics” also involves ToM competence. Though pragmatics and ToM are conceptually

related and both seem to require inferences, it is important to highlight that no one can be

reduced to the other (Bosco et al., 2018). Bosco et al. (2018) discussed in detail the overlap

between pragmatic ability and ToM and make the case that, as a faculty, pragmatics is distinct

from ToM and despite the close relation between them they are still separable constructs. The

current PhD thesis will adopt this latter view.

The developmental relation between ToM and pragmatics is still open to debate (e.g.,

Westra & Carruthers, 2017). Empirical studies that investigated how ToM is related to

pragmatics have mostly focused on certain pragmatic aspects of nonliteral language. More

precisely, a large part of work has concentrated on irony comprehension (e.g., Filippova &

Astington, 2008; Happé, 1993), as well as on understanding of metaphor (e.g., Lecce, Ronchi,

Del Sette, Bischetti, & Bambini, 2018) and jokes (e.g., Leekam, 1991). Several studies have

suggested that second-order beliefs in particular are correlated with the children’s

understanding of irony (Hancock, Dunham, & Purdy, 2000; Nilsen, Glenwright, & Huyder,

2011; Winner & Leekam, 1991). However, in a recent study by Bosco & Gabbatore (2017) no

effect of second-order ToM was found. Additionally, Angeleri & Airenti (2014) showed that,

even though irony understanding correlates with ToM, there is no direct relation between them

given that this correlation is due to the shared effect of language on both ToM and irony

comprehension. Further, children’s capacity to distinguish between jokes and lies has been

found to be correlated with their ToM competence. Leekam (1991) tested children’s

comprehension of intentional falsehood and jokes and found that children who were able to

distinguish between jokes and lies also understood speaker’s second-order belief. Concerning

metaphors, it has been suggested that first-order ToM is necessary for metaphor comprehension

Mariia Pronina

PhD Research Plan

27

(Happé, 1993). Hence metaphorical abilities and their link with ToM are usually tested in older

children (middle childhood).

As claimed by some authors, some studies aimed at investigating the relationship between

pragmatics and ToM have faced some methodological confusion related to the specific tasks

used to assess both abilities. Bosco et al. (2018) and Frank (2018) indicated that a set of previous

studies administered pragmatic tasks in order to assess ToM abilities such as the widely used

Strange Stories test (Happé, 1994). This test consists of stories comprising Pretence, Joke, Lie,

White Lie, Misunderstanding, Persuasion, Appearance/Reality, Figure of Speech, Irony,

Double Bluff, Contrary Emotions, and Forgetting. As Bosco et al. (2018) noted,

Misunderstanding, Persuasion, Figure of Speech and Irony may be considered to be pragmatic

tasks. Bosco et al. (2018) thus argue that using tasks that involve both pragmatic and ToM

competence is a methodological confusion and that these two abilities should be assessed with

distinct tests. The same holds for other tests developed to assess ToM (e.g., Champagne-Lavau

& Charest, 2015).

The relationship between prosodic and gestural abilities, on the one hand, and ToM, on

the other, has been investigated much more sparsely compared to the research on ToM and

other linguistic abilities. Some studies have recently pointed out the importance of prosodic,

and more importantly, visual cues in children’s comprehension of mental state concepts and its

relationship to ToM abilities (Armstrong, 2014; Armstrong et al., 2018). For example,

Armstrong (2014) found that 4- to 6-year-old children can use prosodic information when it is

the only cue available. In this study, children participated in a disbelief comprehension task in

which mental states were encoded trough intonation. Children of all tested ages were able to

perceive intonationally-encoded disbelief, though 6-year-olds were significantly more accurate.

Interestingly, while doing the task, some 6-year-old tended to produce facial gestures

corresponding to the perceived stimuli (e.g., movement of head backwards for incredulity). This

finding suggests that at this age there is an association between the three components: the

intonational pattern, the meaning and the facial gestures. Another study by Armstrong et al.

(2018) examined the comprehension of disbelief in Central Catalan in 3- to 5-year-old children

with a similar task but the information was presented in three conditions: audio, visual and its

combination. Results showed very poor performance of 3-year-old children when there was no

visual information. There was a great variability in the performance of the 4- year-olds but 5-

year-olds showed close to ceiling accuracy. The author suggested that younger children depend

on visual scaffolding to understand disbelief, by around 4 years children still continue to rely

on visual cues but move towards the acquisition of linguistic meaning, and by around 5 years

Mariia Pronina

PhD Research Plan

28

they have already acquired the linguistic meaning and do not need visual cues anymore.

Interestingly, the children also were administered a Sally and Anne task, and it was found that

the children who succeeded in the false belief task were those who were the most successful in

the comprehension task.

Yet, apart from Armstrong et al. (2018), to the author’s knowledge, no study has

investigated the correlation between ToM abilities and prosodic and gestural abilities. We

hypothesize that the two skills will be positively correlated. This state of affairs is partly due to

the fact that the situation with assessment tools of prosodic abilities differs from that of

standardized vocabulary and grammar tests. If one reviews the set of currently available

prosodic assessment tools and protocols in children, namely Prosody Profile (PROP) (Crystal,

1992), Prosody Voice Screening Profile (PVSP) (Shriberg, Kwiatkowski, Rasmussen, Lof, &

Miller, 1990), Diagnostic Analysis of Nonverbal Accuracy 2 (DANVA 2) (Nowicki & Duke,

1994), Profiling Elements of Prosody in Speech-Communication (PEPS-C) (Peppé & McCann,

2003), Perception of Prosody Assessment Tool (PPAT) (Klieve, 1998), and Minnesota Tests of

Affective Processing (MNTAP) (Lai, Hughes, & Shapiro, 1991), it quickly becomes apparent

that they are not optimal to comprehensively assess pragmatic prosodic skills in young typically

developing children (see Table 1 for a comparison of features).

First, all six tests primarily focus on children with atypical language development. While

the PROP and the PVSP were designed exclusively for clinical use and the PPAT and the

MNTAP were used for research purposes in diverse clinical populations, the DANVA 2 and

the PEPS-C (which were initially developed for both clinical and research purposes) have been

used to assess clinical groups as well as typically developing children.

Second, most of the prosodic tests focus only on receptive abilities. While the PROP and

the PVSP do evaluate expressive prosody in terms of its acoustic dimensions such as pitch,

tempo, stress, loudness, laryngeal quality and resonance, neither of these tests covers pragmatic

prosody in a comprehensive way. Perhaps the PEPS-C is the only one of these instruments that

takes into account the pragmatic function of prosody. Yet it only assesses a few communicative

aspects of prosody, namely, the ability to place contrastive stress and express affective stances

(only two, liking and disliking), as well as the production of neutral questions and statements.

Importantly, most of the tests were designed for children aged 5 or older, and only two of

them are appropriate for children aged 3 or 4 (the PVSP and DANVA 2). Finally, the

administration of most of these tests is fairly time consuming (the DANVA 2, PEPS-C and

Mariia Pronina

PhD Research Plan

29

PPAT take around one hour, and the MNTAP takes more than two hours) making it difficult to

apply them to young children.

To summarize, while prosodic assessment tests for children do exist, (a) they are primarily

designed for clinical use or for research in diverse clinical populations; (b) they focus

principally either on receptive prosodic skills, or on very basic expressive prosodic skills and

do not fully integrate the pragmatic functions of prosody; (c) they are not designed to assess

preschool-aged children; and (d) they are time-consuming.

Test Purpose Target child

population

Prosodic skills assessed

Target

age

range

Administration

time

Clinical

use

Research

use

Typical

Atypical

Expressive

Receptive

PROP + - - + + - – –

PVSP + - - + + - 3-81 –

DANVA 2 + + + + - + 3-99 1 hour

PEPS-C + + + + + + 5-14 1 hour

PPAT - + - + - + 7-12 1 hour

MNTAP - + - + - + 6-11 2-3 hours

TABLE 1: Comparative table showing the main features of the existing prosodic assessment tools for children

All in all, no standard prosodic test to date is optimized to assess in a comprehensive way

young children’s expressive prosodic abilities in relation to pragmatic contextual situations.

There is thus a need for a standard elicitation test which allows researchers to understand the

acquisition of pragmatic prosody during the preschool years. While some research has been

done on the early development of intonation (for a review see Frota & Butler, 2018), our

understanding of how intonation is acquired in the preschool years and beyond is still patchy.

The first aim of Study 3 is to create a tool to comprehensively assess young children’s

expressive prosodic and gestural abilities in relation to pragmatic contextual situations. This

tool will be used in Study 1 which has the goal of assessing the relationship between a variety

of linguistic skills – with special attention paid to pragmatic and prosodic skills – and ToM

abilities.

All in all, this section has shown that while a large body of research has focused on

determining the direction and nature of the relationship between ToM and such language

abilities as syntax and semantics, much remains to be investigated with respect to the role of

pragmatic and multimodal abilities.

1.2.2.3. The links between ToM and emotion understanding

Besides the link between linguistic abilities (semantics, syntax, pragmatics) and ToM,

there also has been a focus on exploring how ToM is related to other developmental domains.

One area that has received particular interest is the link between ToM and emotion

Mariia Pronina

PhD Research Plan

30

understanding. Emotion understanding refers to children’s knowledge of the nature of

emotions, as well as knowledge of what causes them and how they can be regulated (Pons,

Harris, & de Rosnay, 2004). Prior research has demonstrated that affective perspective taking

develops gradually beginning in the preschool years (Wellman, Phillips, & Rodriguez, 2000)

and that this is related to the growing understanding of beliefs and desires (Bartsch & Wellman,

1995). Both emotion understanding and ToM are crucial for social interaction and they are

conceptually connected (Dunn, 1995). Indeed, more recently, a body of empirical studies found

positive correlations between these two capacities in preschoolers and primary school children

(e.g., 3- to 5-year-olds (Harwood & Farrar, 2006); 4- to 6-year-olds (Ornaghi, Pepe, &

Grazzani, 2016) 4-and-a-half- to 6-and-a-half-year-olds (Weimer, Sallquist, & Bolnick, 2012);

5- to 7-year-olds (Bender, Pons, Harris, & de Rosnay, 2011); 3- to 8-year-olds (Grazzani,

Ornaghi, Conte, Pepe, & Caprin, 2018). Further, another study by (Kuhnert, Begeer, Fink, &

de Rosnay, 2017) with a longitudinal design reported a positive association between the

performance on emotion understanding by 5-year-old children and their later ToM abilities (7

years of age).

Further, performance on emotion tasks is also found to correlate with language tests

(Grazzani et al., 2018; Ruffman et al., 2003; Strand, Downs, & Barbosa-Leiker, 2016). For

example, contrary to some predictions by Astington & Jenkins (1999), Ruffman et al. (2003)

found a relation between syntax (understanding of word order and embedded clauses) and

emotion understanding performance in 3- to 5-year-old children. As noted by the authors, there

was no reason to expect this relation since, in contrast to false belief, emotion understanding

does not require meta-representational capacity. However, a recent longitudinal study by Strand

et al. (2016) confirms that there is an association between emotion understanding and linguistic

abilities. In this study, the Peabody Picture Vocabulary (PPVT), a measure of receptive

vocabulary, was administered. It was found that the performance on this task and on the emotion

expression recognition test are related in older preschoolers (49 to 67 months), but not in

younger (36 to 48 months). The authors suggested that initially emotion recognition skills are

insular but with the development of verbal abilities over the course of the preschool years they

become more influential. To put it another way, according to Strand et al. (2016), the nature of

the association between emotion-related processes and linguistic abilities changes over time:

whereas in the early stages this association is weak, later it becomes stronger and bidirectional

as both verbal skills and emotion understanding influence each other. Another study by

Grazzani et al. (2018) also found that language ability (PPVT) was significantly correlated with

Mariia Pronina

PhD Research Plan

31

both emotion understanding and ToM, and played a crucial role in mediating the relationship

between these two abilities and in explaining variance in ToM.

1.2.2.4. Training studies to promote ToM

Some studies have used a training methodology to pinpoint the kinds of linguistic

interaction that facilitate the development of ToM (Hale & Tager-Flusberg, 2003; Lohmann &

Tomasello, 2003; Lohmann, Tomasello, & Meyer, 2005). To this end, Lohmann & Tomasello

(2003) assigned 3-year-old children who failed a false belief test to four training conditions

involving adult–child interactions. In a first control training condition (no language training),

children were provided with a series of deceptive objects such as writing pen that resembled a

flower but not with accompanying linguistic commentary. In a second condition (discourse only

training), the children were shown the same training objects but the experimenter engaged the

children in perspective-shifting discourse using language but not mental state verbs nor

sentential complement constructions. In a third condition (sentential complement training only),

the deceptive aspect of the training objects was not highlighted but when talking about the

objects, the experimenter used mental state verbs and sentential complement constructions

which had been previously associated with false-belief understanding (see de Villiers, 2005).

The fourth condition (full training) included all mentioned factors: perspective-shifting

discourse (the second condition), and mental state verbs and sentential complements (the third

condition). The results showed that (a) deceptive experience alone is not sufficient; (b) both the

second and the third conditions facilitated children’s false belief understanding; (c) children

showed the largest improvement in the full training condition. It thus seems that language plays

a key role in the development of children’s false belief understanding (both discourse-shifting

and sentential complement syntax), and linguistic experience is facilitating and necessary

condition for the improvement of ToM. Another training study (Hale & Tager-Flusberg, 2003)

reported similar findings. In order to determine which structures play a causal role in the

development of ToM, Hale & Tager-Flusberg (2003) trained children on sentential

complements and on relative clauses (as a control condition). While the control group showed

no improvement in posttests, the group trained on sentential complements increased their

performance on false belief understanding. Moreover, training children in sentential

constructions has the same effect on false belief understanding as false belief training. The

outlined studies support the view of the de Villiers (2005) that sentential complements influence

false-belief understanding. However, the findings from Lohmann & Tomasello (2003) indicate

that syntax is not of sole importance for ToM development.

Mariia Pronina

PhD Research Plan

32

Meanwhile, other studies suggest that conversational interactions are fundamental for the

development of ToM and examine the beneficial effects of training with such interactions on

promoting of ToM (Appleton & Reddy, 1996; Janet Wilde Astington & Peskin, 2004; Guajardo

& Watson, 2002; Ornaghi, Brockmeier, & Grazzani, 2011; Slaughter & Gopnik, 1996). These

studies highlight the role of discourse and support the idea that that children’s acquisition of

ToM depends on exposure to conversations, as they provide insights into the beliefs and desires

of others (e.g., Bartsch & Wellman, 1995; Harris, 2005; Siegal, 1999; Tomasello, Striano, &

Rochat, 1999). As pointed out by Ornaghi, Brockmeier, & Grazzani (2011), “the relationship

between language and theory of mind is well embedded in one particular kind of language:

mental state talk” (p. 240). Mental state terms (also called inner state terms, metacognitive

language and mental lexicon) refer to mental states of belief, thoughts, intention, desire and

emotion. In the literature, the term “mental state” frequently refers exclusively to cognitive

terms (think, know) in contrast to desire (want, need) and emotional state terms (delighted,

angry). However, sometimes this term also references emotional states (e.g., LaBounty,

Wellman, Olson, Lagattuta, & Liu, 2008; Lemche, Kreppner, Joraschky, & Klann-Delius,

2007). Some researchers advocate for a more flexible use of the term. For example, Astington

& Peskin (2004) propose “mental terms” for all mental state terms and “metacognitive terms”

for belief states. This thesis will use “internal states” to describe both emotional and belief states

in, following the proposal by Armstrong & Hübscher (2018).

Children’s use of mental state terms such as know, think and believe is found to correlate

with their ToM understanding (e.g., Brown, Donelan-McCall, & Dunn, 1996; Dunn, Brown, &

Beardsall, 1991). Moreover, previous research has shown that mental state talk input facilitates

later understanding of mental state concepts in toddlers (e.g., Taumoepeau & Ruffman, 2006,

2008). For example, Taumoepeau & Ruffman (2006) found a longitudinal relation between

mother mental state language to 1-year-old children and children’s desire language and emotion

understanding at the age of 2. In a follow-up study, Taumoepeau & Ruffman (2008) expanded

their findings and reported that at 24 months of age, mothers’ talk about thoughts and

knowledge predicts children’s later mental state language at 33 months. The relation between

parental use of mental state language and later ToM was also found in preschool-age children

(LaBounty et al., 2008; Ruffman, Slade, & Crowe, 2002).

While the abovementioned studies showed that a child’s developing ToM is influenced

by parents’ conversational interaction, other studies used a training method to explore the role

of a child’s exposure to mental state language (Appleton & Reddy, 1996; Astington & Peskin,

2004; Guajardo & Watson, 2002; Ornaghi et al., 2011; Slaughter & Gopnik, 1996). The first

Mariia Pronina

PhD Research Plan

33

training studies (Appleton & Reddy, 1996; Slaughter & Gopnik, 1996) with 3-year‐old children

demonstrated that false belief understanding can be trained in focused conversations. These

studies used a task-specific approach, that is, the training was developed to teach children to

pass standard ToM tests and generally consisted in providing children with feedback according

to their performance on specific false-belief sequences. Examples include false belief about the

location of an object (Appleton & Reddy, 1996) or appearance-reality distinction (Slaughter &

Gopnik, 1996). More recent studies, however, involve an experimenter reading stories. For

example, Guajardo & Watson (2002) examined experimentally whether children’s mental state

input can facilitate ToM understanding. They found that 3- to 4-year-old children who

participated in training sessions (reading a children’s storybook and the subsequent discussion

of the mental state concepts) showed improved performance on false belief and deception tests

compared to children who did not participate in any training sessions. In a training study by

Astington & Peskin (2004), 4-year-old children were exposed to mental state language in story

texts. The same stories were read to children in the experimental and control conditions but

while for the training group the story was especially rich in metacognitive terms, for the control

group all metacognitive terms were removed though mentalistic concepts remained an integral

part of the stories. The children listened to the stories at home and at school during a 4-week

intervention period. The results of the posttests showed that children in the experimental group

produced significantly more metacognitive verbs in a storytelling task. Somewhat

unexpectedly, the training group did not improve their performance on the metacognitive

comprehension task; yet both groups showed improvement on the false belief explanation task,

the control group even significantly more than the experimental one. The authors suggested that

exposure to this kind of story, even if mentalistic concepts are represented implicitly, facilitates

ToM understanding. Hence children do not only passively perceive mentalistic concepts but

actively construct their own interpretations and thus acquire understanding of mentalistic

concepts.

In a more recent cross-sectional study, Ornaghi et al. (2011) explored the role of actively

using mental state lexical items in promoting ToM development and gaining advanced

understanding of mental state terms. Two age groups (3- and 4-year-old children) participated

in a 2-month intervention which involved a storyteller reading stories enriched with mental

terms. While in the control condition children were engaged in language games and

conversations, the experimental condition children were encouraged to use mental state terms.

There were no linguistic activities in the control condition. The findings can be summarised as

follows: (a) the training intervention had a significant effect on the metacognitive language

Mariia Pronina

PhD Research Plan

34

comprehension performance of both the younger (3-year-olds) and the older group (4-year-

olds); (b) a positive effect on the emotion understanding was found only in the younger

children; (c) a positive effect on the false belief understanding was found only in the older

group. Importantly, the pragmatic competence of the preschoolers was also assessed. In order

to do so, 12 stimuli from the Comprehensive Assessment of Spoken Language (Carrow-

Woolfolk, 1999), a test which evaluates the use of pragmatic language, was administered.

Although the intervention effect was not found for the pragmatic test, there was a significant

difference between pretest and posttest for pragmatic competence in the 3-year-old-children

experimental group and in both the 4-year-old-children experimental and control groups.

The training studies reviewed in this section have shown the beneficial effect of

conversational interventions on children’s cognitive development. However, little is known

about whether perceiving and enacting multimodal expressions of internal states and emotions

can contribute to enhancing perspective taking skills. The aim of Study 4 of this thesis will be

to analyze the effect of an embodied intervention based on (Ornaghi et al., 2011) on both

preschooler’s pragmatic competence and ToM.

2. GOALS OF THE DISSERTATION

The current PhD thesis has four main goals. The first goal is to examine the link between

the sociopragmatic skills of preschool children to their linguistic and ToM development. This

will enable us to have a clearer picture of the complex relationship between (a) sociopragmatic

abilities and (b) ToM skills, as well as (c) language skills and (d) emotion-detection abilities,

and will also contribute to the discussion about the overlap between these capacities. The

second goal is to explore the relationship between sociopragmatic abilities and ToM skills, on

the one hand, and multimodal imitation abilities on the other. Multimodal imitation ability is

understood as the ability to jointly imitate prosody, gesture and lexical content. The third goal

is to determine developmental trajectories of prosodic and gestural patterns that encode

sociopragmatic meanings in the preschool years. This will help assess how preschoolers master

their prosodic and gestural abilities and the role that prosody and gesture play in sociopragmatic

development. Finally, the fourth goal is to assess the role of enactment training with voice and

body on perspective-taking in children's sociopragmatic development.

All in all, the present PhD thesis aims to explore sociopragmatic development in

preschoolers and their links with language and ToM abilities and determining the role that

gesture and prosody play in this development.

Mariia Pronina

PhD Research Plan

35

To achieve the abovementioned goals, four empirical studies will be carried out. The

following research questions will be addressed in each study: (Study 1) Are sociopragmatic

abilities in children correlated with ToM and language skills (semantics, syntax, narrative

ability) in 3- to 4-year-old preschool children? (Study 2) Are sociopragmatic abilities in

children tied to multimodal imitation abilities (understood as the ability to jointly imitate

prosody, gesture and lexical content) in 3- to 4-year-olds? (Study 3) What are the

developmental trajectories of prosody and gesture in the expression of sociopragmatic

meanings in 3- to 4-year old and 5- to 6-year old children? (Study 4) Does an enactment training

intervention involving prosody and gesture have a beneficial effect in improving

sociopragmatic competence and ToM in 3- to 4-year-old preschoolers?

Our general hypotheses for each of the studies are the following: (1) Study 1:

Sociopragmatic competence in preschoolers will be positively correlated with ToM abilities, as

well as with other language skills and emotion detection skills; (2) Study 2: Children’s

sociopragmatic competence and ToM abilities will be linked to multimodal (gesture and

prosody) imitation abilities; (3) Study 3: Prosodic and gestural cues go hand-in-hand in

development and serve as precursors in the encoding of sociopragmatic meanings; and (4)

Study 4: “Embodied simulation” of children’s own and other’s mental and emotional states

will significantly affect ToM understanding and those linguistic capacities that require

perspective-taking skills such as sociopragmatic abilities.

3. THEORETICAL FRAMEWORK

3.1. The usage-based theory of language acquisition

This PhD thesis will follow a usage-based approach to language acquisition (see

Tomasello, 2000, 2003, 2009 for a review). Broadly, as the name indicates, this approach holds

that language is learned through language use. Drawing from extensive work on early

acquisition, this theory focuses on two sets of cognitive skills that one year-old children are

equipped with, namely being: intention-reading (functional dimension) and pattern-finding

(grammatical dimension) (Tomasello, 2003). Intention-reading includes the ability to share

attention with others, the ability to follow the attention and gestures of others, the ability to

direct the attention of other people to distal objects by using gestures (pointing, showing etc.),

and the ability to learn the actions of others by imitation (see also section 1.2.1.1.). These

preverbal abilities are indispensable for the acquisition of language, including the acquisition

of complex linguistic constructions. The other set of skills refer to pattern-finding, which is a

summary term for such processes as analogy, categorization and distributional analysis

Mariia Pronina

PhD Research Plan

36

(Tomasello, 2009). These abilities also start to emerge very early in infancy, some

prelinguistically. Importantly, both intention-reading and pattern-finding skills are domain-

general, in other words, they are not only applied in the linguistic domain (Tomasello, 2003).

Crucially, intention-reading also enables various cultural skills and practices, and pattern-

finding also allows for the categorization of various aspects of the world. While intention-

reading is the central construct in the socialpragmatic approach to language acquisition

(Carpenter & Tomasello, 2000), pattern-finding is the central assumption in the usage-based

approach to the acquisition of grammar.

The unique ability of intention-reading is crucial for Tomasello’s account. It allows the

child to understand what is intended with a speech act. In the comprehension of speech acts,

children largely rely on multimodal cues, and children express their communicative act trough

vocalization and gesture. As part of intentionality, gestural communication and pointing in

particular constitute a pillar of the usage-based approach. This thesis adopts this view and is

aimed at exploring later stages of development, when children become able to express more

specific and sophisticated intentions.

As noted in Tomasello (2009), the usage-based approach may be summarized as follows:

(1) meaning is use; (2) structure emerges from use. While the former involves a functional (or

semantic) dimension of communication, the latter involves a structural (or grammatical)

dimension. The usage-based theory suggests that the understanding of the nature of the

language and language acquisition can be gained only by looking to the process of

communication. The pragmatics of human communication is primary, and first communicative

interactions, as well as later conversations pave the way for language development. Although

language can be used in communication in a variety of ways, one common trait of language is

that it is acquired by understanding how other people use and interact with it.

All in all, the basic claim of the usage-based approach to language acquisition is

essentially that language structure emerges from language use and “universals of linguistic

structure derive from the fact that people everywhere have the same set of general cognitive

processes” (Tomasello, 2009, p. 85).

As a final remark, we wish to point out that, although for this PhD thesis the usage-based

approach will be followed, there are other possible explanations of the human linguistic

capacity. Within modern linguistic theories, a main contrasting approach to the language

acquisition is ‘nativist’ view which suggests that children are born with innate linguistic

knowledge rather than they acquire language through the social interaction experience (for a

review, see Cowie, 2017).

Mariia Pronina

PhD Research Plan

37

3.2. Embodied cognition

The theories of Grounded Cognition (Barsalou, 2008; for a review see Barsalou, 2010)

and of Embodied Cognition (Foglia & Wilson, 2013; Ionescu & Vasc, 2014; see Wilson &

Foglia, 2017 for a summary) hold that cognitive processes are highly dependent on sensory-

motor experience. Although the central tenet of both approaches states that the mind and body

influence each other, the former relates cognition to the environment, modal simulations and

bodily states, rejecting traditional views that knowledge is separate from brain’s modal systems

for perception, action and introspection.

Accounts of Grounded Cognition advocate that cognition is grounded in multiple ways,

including bodily states and, especially, simulations (or reenactment). For example, Tucker &

Ellis (1998) showed the perception of the handle of a cup activates a grasping simulation, in

this way, all multimodal experiences associated with the object become active. Much research,

particularly neuroimaging research, has confirmed the key role played by simulation in

cognitive processes (see Barsalou, 2008 for a review), providing evidence that the areas that

represent motor and perception properties of objects are activated when conceptual knowledge

about objects is represented. It has been shown that the simulation is ubiquitous in cognition,

including perception (e.g., perception of space), perception-action coordination (the example

described above), memory (e.g., memory effects), knowledge and conceptual processing (e.g.,

numerical cognition, language comprehension, and language learning). For example,

Pulvermüller, Hauk, Nikulin, & Ilmoniemi (2005) demonstrated a link between action and

language system in lexical processing. They found that when participants read the words for an

action (e.g., kick, pick), the motor system become active, action verbs thus produce simulation

in the corresponding areas of motor system. Their results suggest that language and motor

systems interact during the processing of lexical information. Similarly, Myung, Blumstein, &

Sedivy, (2006) demonstrated a priming effect of motor simulations on word recognition.

Specifically, they found that participants were faster to make a lexical decision for words that

were primed by a word with shared manipulation features (e.g., piano and typewriter) in

contrast to a word that was not related to the target word in terms of manipulation features (e.g.,

piano and blanket). This indicates that sensorimotor information is relevant to lexical semantics.

Furthermore, speakers retrieve words better if they produce gestures related to the meaning

(e.g., Krauss, 1998). Research on development also has shown that gesture can convey meaning

that cannot yet be expressed verbally (Goldin-Meadow, 2003). The abovementioned research

Mariia Pronina

PhD Research Plan

38

and many other studies (for a review see Barsalou, 2008) have demonstrated the importance of

simulation in language comprehension.

Similarly, Embodied Cognition theory claims that sensory-motor processes, our body’s

morphology, and internal states impact human cognition (Ionescu & Vasc, 2014). The

Embodiment Thesis claims that “many features of cognition are embodied in that they are

deeply dependent upon characteristics of the physical body of an agent, such that the agent's

beyond-the-brain body plays a significant causal role, or a physically constitutive role, in that

agent's cognitive processing.” (Wilson et al., 2017). Therefore, Grounded Cognition may be

regarded as an attempt to explain embodied cognition by the concept of grounding (Wilson et

al., 2017), i.e., human cognition is grounded in sensory-motor experiences, while the

Embodiment Thesis is more wide and consists in three assumptions: (1) body is a constraint on

cognition; (2) body is a distributor for cognitive processing; (3) body is a regulator of cognitive

activity. This implies that body does not merely transduce perception information to cognition,

but is its integral part.

Likewise, Embodied Cognition theory suggests that perception and action systems are

engaged to perform cognitive functions during language comprehension and language learning.

A bulk of neuroscience studies within the embodied cognition framework has provided

evidence for activation in brain areas that bear a systematic relation to the language content

during language comprehension (e.g., Tettamanti et al., 2005; for a review, see Glenberg, 2007).

For example, Tettamanti et al. (2005) found that listening to action-related sentences activates

the areas of the brain associated with perception and visual motion. Behavioral data has also

provided strong evidence for the Embodiment Thesis in various levels of language processing

(grammatical, semantical). For example, concerning the relationship between language and

action, it has been demonstrated that there is a strong link between concept knowledge and

situated action (Borghi, Glenberg, & Kaschak, 2004).

Another set of studies have focused on language learning. For example, Kiefer & Trumpp

(2012) showed that embodiment can facilitate reading and writing. Further, Maouene,

Sethuraman, Laakso, & Maouene (2011) provided evidence that embodiment can influence

children’s acquisition of verbs since bodily experiences can facilitates verb meaning

acquisition. For a review of developmental research on the role of gestures in language learning

see section 1.2.1.; see also Goldin-Meadow (2018) for a review.

Thus following these theories, cognition and the body are intimately related to each other,

bodily states activate cognitive processes, and the use of embodiment can promote language

comprehension and learning. All four studies of this thesis rely on this fundamental paradigm

Mariia Pronina

PhD Research Plan

39

but Study 4 directly tests the effect of an embodied intervention on cognitive (ToM) and

language abilities in preschool children. We hypothesize that the embodied training

intervention will specifically impact sociopragmatic ability since sociopragmatic learning is

embodied. It has been clearly shown for the early stages of communicative development since

children acquire language through situated experiences (Tomasello, 2009). For intention

reading, bodily states (gestures, facial expressions, eye gaze) are key components. As noted by

Bergen (2015), the usage-based approach proposed by Tomasello (2009) is the most complete

account to language acquisition from an embodied perspective.

4. HYPOTHESES

Based on the review of the literature discussed in the previous sections, we outline the list

of hypotheses to be tested in this PhD thesis. As explained earlier, the main underlying

hypothesis of this thesis is that sociopragmatic competence is a crucial component in children's

linguistic development which is related to mind-reading abilities and also to embodied language

(e.g., prosodic and gestural abilities) expressed through the body and the voice.

4.1. Study 1

Considerable attention has been drawn to investigate the link between ToM and language.

Syntactic and semantic abilities have become a special focus of interest (e.g., Astington &

Jenkins, 1999; de Villiers & de Villiers, 2000; de Villiers & Pyers, 2002; Milligan et al., 2007;

Slade & Ruffman, 2005; see section 1.2.2.2. for more detail). However, fewer studies have

dealt with pragmatic abilities, with the exception of studies which have primarily focused on

specific issues such as irony comprehension. Moreover, the relationship between

sociopragmatic abilities and ToM needs to be further investigated, given that (a) some scholars

suggest an overlap between these two capacities; and (b) some studies investigating the

relationship between pragmatics and ToM have faced a methodological confusion (for a review,

see Bosco et al., 2018). Following Bosco et al. (2018), the current PhD thesis will adopt the

view that as a language faculty, pragmatics is distinct from ToM and despite the close relation

between them they still are separable constructs.

The aim of Study 1 will be to examine the relationship between sociopragmatic abilities

in 3- to 4-year-old children and their mind-reading abilities (ToM) by applying a new

specifically designed Audiovisual Pragmatic Test (see proposed Study 3). Our hypothesis is

that these abilities will be positively correlated. This study will add to the literature by being

the first to directly investigate the correlation between general sociopragmatic competence and

Mariia Pronina

PhD Research Plan

40

ToM with 3- and 4-year-old children, and prosodic abilities and ToM. Moreover, the correlation

with the ability to detect emotions will be assessed. Furthermore, children's sociopragmatic and

ToM capacities will be investigated in regard to other language skills (semantics, syntax,

narrative abilities). We hypothesize positive correlations between them.

4.2. Study 2

Imitation is a complex skill that relies on different social, cognitive and motor abilities,

and plays a pivotal role in early language development. First, imitation is crucial for children’s

early language development, as it not only supports word learning but also serves as a

scaffolding mechanism for acquiring specific language features and grammatical structures (for

a review see Carpenter, 2014). Moreover, imitation has been shown to play a key role in social

communicative development (Carpenter, 2014). Numerous empirical and longitudinal studies

have examined various aspects of imitation and the mechanisms that underlie it and shown

imitative behaviors to be a naturally occurring interaction (Lieven & Stoll, 2013) throughout

infancy and early childhood (Carpenter, Nagell, et al., 1998; Carpenter, Tomasello, & Striano,

2005; Masur & Eichorst, 2002). For example, Carpenter & Tomasello, reported a relationship

between imitation and the early development of visual and oral aspects of language (Carpenter,

Nagell, et al., 1998; Carpenter et al., 2005). They found imitative learning to be predictive of

referential language, as well as of communicative gesture, in this case declarative pointing.

They also showed that the ability to imitate in role-reversal tasks correlated with the

comprehension and production of pronouns. In role- reversal tasks, a child observes an adult

performing an action directed at the child and then is asked to reverse roles by directing the

same action at the adult (in simple imitation tasks, both adult and child direct their actions at

the same external object, such as a ball). Likewise, in Masur & Eichorst (2002), infants’ early

spontaneous imitation of novel words was shown to be predictive of larger vocabularies later

in development. Research has pointed to the relevance of prosody imitation in prosodic

development since it requires both prosodic perception and production. Thus, imitation has been

shown to scaffold the production of prosodic contours (Gratier, 2014). Evidence from clinical

research confirms that, for example, Peppe, Cleland, Gibbon, O’Hare, & Castilla (2011) noted

that a difficulty in imitating prosody might be an obstruction for learning prosody. Thus, there

is evidence that imitation is linked to multimodal language abilities in particular and social

communication skills in general. Nevertheless, little is known about the association between

imitation abilities in children and their sociopragmatic skills.

Study 2 will analyze how multimodal language imitation abilities—understood as the

ability to jointly imitate prosody, gesture and lexical content—are related to sociopragmatic

Mariia Pronina

PhD Research Plan

41

competence and ToM abilities in preschool-aged children. This age group (from 3 to 4 years

old) is of particular interest given the evidence suggesting that developmental changes in the

fidelity of imitation are experienced during this period. Prior to age 3, though infants tend to

copy adult’ actions such as clapping, their performance is characterized by goal neglect and it

is therefore more accurate to label this behavior mimicry rather than imitation (Jones, 2007). In

contrast, older children show an understanding of the goal or intention of a specific action and

their imitative behavior is thus focused on outcomes. In fact, there is research showing a clear

progression from mimicry to imitation. For example, Dickerson, Gerhardstein, Zack & Barr

(2013) found that children between 1.5- and 2-years of age merely mimicked gestures, while

2.5- to 3.5-year-olds imitated them. The older children were also able to copy models more

closely, thus showing a greater degree of fidelity in their imitative behavior. The question of

imitative fidelity across ages even into adulthood was investigated in McGuigan, Makinson &

Whiten (2011), which concluded that “people may become more imitative as they mature”

(p. 1).

In line with previous findings, it is believed that both sociopragmatic competence and

ToM abilities will be positively correlated with imitation abilities.

This study will be carried out in collaboration with Dr. Iris Hübscher (URPP Language

and Space, University of Zurich, Switzerland) and Eva Castillo (an MA student in the

Department of Translation and Language Sciences, Universitat Pompeu Fabra).

4.3. Study 3

Research in the last few decades has shown that the prosodic features of language are

crucial in signaling sociopragmatic meanings in communication, such as speech act marking,

focus, or epistemic stance marking. In the developmental literature it has been shown that the

acquisition by children of increasingly complex prosodic skills goes hand-in-hand with their

sociopragmatic development (see Prieto & Esteve-Gibert, 2018, for a review). However,

relatively little is known about the developmental path followed by prosodic features in later

stages of development (Armstrong & Hübscher, 2018; Chen, 2018; Hübscher, 2018; Ito, 2018).

While tests that measure prosodic skills in children do exist at present, as yet there exists

no test that assesses children’s expressive prosodic skills within a variety of pragmatically

relevant discourse contexts (see section 1.2.2.2. for more details). Study 3 is an attempt to help

fill this gap. It has two main goals. The first is to present a new Audiovisual Pragmatic Test

(henceforth APT) designed for use with typically developing children starting from the age of

three. The APT uses a carefully controlled picture-supported set of Discourse Completion Task

Mariia Pronina

PhD Research Plan

42

(DCT) items which allow the user to assess prosody in relation to pragmatic social contexts.

The second is to explain the findings from an administration of this test to 3- to 4-year-old and

5- to 6-year-old Catalan-speaking children. Moreover, Study 3 will also investigate gestures

used by children while performing the test.

This study will add to the literature regarding intonational and gestural development of

preschoolers. Our general hypotheses are the following: (a) DCT elicitation methodology,

which has been successfully used for assessing adult intonational grammar, can be also

applicable to obtain expressive developmental data for 3- to 6-year-old children; (b) Prosodic

and gestural cues go hand-in-hand in development and serve as precursors in the encoding of

sociopragmatic meanings.

4.4. Study 4

Study 4 will investigate whether perceiving and enacting multimodal expressions of

mental states and emotions can contribute to enhancing perspective taking skills. This study is

carried out in collaboration with Dr. Judith Holler (Max Planck Institute for Psycholinguistics,

the Netherlands & Donders Institute for Brain, Cognition and Behaviour, Radboud University,

the Netherlands) and Dr. Iris Hübscher (URPP Language and Space, University of Zurich,

Switzerland).

Previous training studies have shown the beneficial effect of having language

conversations containing internal state terms on children’s cognitive development (e.g.,

Guajardo & Watson, 2002; Lohmann & Tomasello, 2003; Ornaghi et al., 2011). However, the

effect of embodied intervention has not been tested yet. We expect that children who reenact

multimodal language will show significant improvement. Further, it is believed that the training

aimed at promoting ToM and mental verb comprehension (the methodology and training

materials were adapted from the training study by Ornaghi et al. (2011)) will affect not only

ToM understanding, but also those linguistic capacities that require perspective-taking skills

such as sociopragmatic abilities.

The goal of Study 4 is to carry out a one-month between-subject intervention training

experiment with a subsequent comparison of pretest and posttest scores related to ToM and

mental state comprehension, as well as to emotional and sociopragmatic skills. Further, one

more objective of this study is to develop an educational website in order to provide preschool

educators with experimentally tested tools that can be used in classroom.

Mariia Pronina

PhD Research Plan

43

5. EXPERIMENTAL STUDIES

The following section will describe each of the studies in detail, covering the research

questions, methodology, and expected results for each study. All of the studies described have

already been partially carried out. As the data from Study 2 and 3 have been preliminary

analyzed, the corresponding sections will also include the summary of interim results.

5.1. Study 1: Sociopragmatic competence in preschoolers and its link with

other abilities

5.1.1 Research question

This study will explore the relationship between sociopragmatic competence in

preschool-age children and their ToM and emotion understanding, as well as linguistic abilities

(semantics, syntax, narrative abilities). Previous findings suggest a correlation between ToM

and semantic and syntactic abilities (e.g., Astington & Jenkins, 1999; de Villiers & de Villiers,

2000; de Villiers & Pyers, 2002; Milligan et al., 2007; Slade & Ruffman, 2005). The main aim

of Study 1 is to determine whether ToM also correlates with pragmatic abilities.

5.1.2. Participants

A total of 102 3- to- 4-year-old native Catalan-speaking children (45 male and 57 female;

mean age = 44.92 months, SD = 3.28 months; age range 39 to 51 months) participated in the

study. All participants were preschoolers at two Catalan public schools, located in the middle-

income district of Sant Martí within the metropolitan area of Barcelona, where the population

is largely Catalan-Spanish bilingual (96,3% understand Catalan, 73,6% speak Catalan) 1. The

main language of instruction in the target schools is Catalan (as opposed to Spanish). Prior to

the experiment, the children’s parents signed a participation consent form and completed an

occupational status questionnaire (mean ISEI score = 60.97, SD = 12.7 (Ganzeboom, De Graaf,

Treiman, & De Leeuw, 1992)), confirming middle class SES scores) as well as a language

questionnaire regarding the daily exposure of their child to Catalan (mean overall exposure time

= 56.8%, SD = 22.7). Data from a total of fifteen children had to be excluded from subsequent

analysis due to the fact that their mean exposure to Catalan was less than 20%. All children

were typically developing children and had no history of speech, language, or hearing

difficulties.

1 http://www.bcn.cat/estadistica/catala/dades/anuari/cap06/C0617010.htm

Mariia Pronina

PhD Research Plan

44

5.1.3. Materials

Screening Measure

A screening test was applied to all children (117) in order to exclude participants with a

lack of Catalan language proficiency from the experiment. The screening measure was the ELI,

L'avaluació del llenguatge infantil (Saborit Mallol, Julian Marza, & Navarro Lizandra, 2005),

an expressive one-word vocabulary test specially designed to measure vocabulary size of

Catalan speaking children aged 0 to 6 years old. It assesses lexical knowledge with a picture-

naming task. The stimuli consisted of 30 pictures of common objects such as a tree, a bulb, or

a coat. The participant was given credit for every correctly named item, and the total score

(from 0 to 30) was normalized to a 0–100 scale. A minimum grade of 20% was established for

participation in the study. Fifteen children were not included in the study due to not meeting

this threshold. None of the participants included in the study showed difficulties in speaking

Catalan or understanding instructions, which were given in Catalan.

Main Measures

Emotion Understanding

To assess children’s emotion knowledge, the Catalan adaptation of the Emotion Matching

Task originally designed for English-speaking preschool-aged children by (Izard, Haskins,

Schultz, Trentacosta, & King, 2003) was applied. This task measures the emotion

comprehension of 2- to 6-year-olds and focuses on the four basic emotions: happiness, sadness,

anger and fear/surprise. Only the two most discriminative parts of the Catalan EMT were

administered: expression-situation matching and expression labeling.

Mental verb comprehension

Children’s comprehension of metacognitive verbs was assessed with the Catalan version

of the Metacognitive Vocabulary Test (Astington & Pelletier, 1998) designed for 3- to 7-year-

old children. The test consists of a total of 12 short stories accompanied by images. Only the

first six stories were used due to the young age of the participants.

Theory of Mind

Children’s false-belief understanding was assessed with the two classical and most widely

used explicit false-belief tasks. The first one was the classical Sally and Anne task (Baron-

Cohen et al., 1985); an adapted version for Catalan was used (Armstrong et al., 2018). The

second false belief measure was the false-belief unexpected content “Smarties task” (Gopnik &

Mariia Pronina

PhD Research Plan

45

Astington, 1988). Both tasks were used in order to consider the ToM measure as a continuous

variable.

Pragmatics and Prosody

The APT is a test designed to assess pragmatic abilities in typically developing Catalan-

speaking children (Pronina, Hübscher, Vilà-Giménez, & Prieto, submitted). The general design

and elicitation procedure of the APT is based on a variety of currently used pragmatic tests for

children, e.g., the Test of Pragmatic Language (TOPL-2) (Phelps-Terasaki & Phelps-Gunn,

2007); the Clinical Evaluation of Language Fundamentals–5 (CELF-5) instrument (Wiig,

Semel, & Secord, 2013); and the Comprehensive Assessment of Spoken Language–2 (CASL–

2) tool (Carrow-Woolfolk, 2017). The elicitation procedure was based on the DCT method, in

which an everyday social context is described to which the participant is asked to respond as

naturally as possible.

A total of 47 items were designed for the APT tool that represented some context that

might plausibly occur in everyday life. All 47 items were accompanied by illustrations which

were specifically designed for the APT (see Figure 1 for an example).

FIGURE 1: Item number 22 of the APT showing text and illustration intended to elicit an expression of concern

for a friend.

For the purposes of the study, given that the test takers would be 3- to 4-year-old children,

only the first 35 items of the test were applied (see Appendix A). These 35 items can be

classified into five areas according to the pragmatic functions they are intended to elicit, namely

basic interaction skills (6 items), speech-act marking (6 items), affective stance marking (13

items), focus marking (3 items) and marking of epistemic bias (7 items). The items were

presented in a fixed order based on increasing pragmatic complexity and difficulty, with the

first eleven items of the test presumably non-challenging for 3-year-olds since they mostly

evaluate basic interaction skills.

“Your friend just tripped

and fell down. What would

you say?”.

Mariia Pronina

PhD Research Plan

46

Expressive vocabulary

The results from the ELI screening measure (i.e., the expressive one-word vocabulary

test, Saborit Mallol et al., 2005) were used to compute an expressive vocabulary score for each

child.

Expressive syntax

The expressive syntax test was designed to measure the oral expression of syntax in

Catalan-speaking children. The coverage of the test is based on previous sentence expression

tests developed for children such as the Comprehensive Assessment of Spoken Language-2

(CASL-2) (Carrow-Woolfolk, 2017).

Narrative competence

A Catalan adaptation of the Renfrew Bus Story test (Renfrew, 1997) was used to assess

children’s narrative skills. This test measures children’s ability to retell a continuous series of

events about a bus that escaped from its driver.

5.1.4. Procedure

The children were individually tested in a quiet classroom at the participating preschools.

All testing was performed in Catalan with each child by the author and three trained research

assistants. The tests were administered in a fixed order. The tests were divided into two blocks

in order make the assessment session shorter and facilitate children’s participation. The

Emotion Understanding, Narrative task, Mental verb comprehension and the Theory of Mind

tests were assessed in the first block, while expressive vocabulary, expressive syntax and the

APT were administered in the second. Each individual block lasted approximately 20-30

minutes and there were two sessions. For comparison purposes all test scores were normalized

to a 0–100 scale.

Emotion Understanding

The first part of the test (expression-situation matching) assesses emotion situation

knowledge. The child was shown four pictures of children expressing diverse emotions or

exhibiting a ‘neutral’ face and then was asked to indicate which one matches a described

situation, for instance, “Show me who has just been pushed away from the table”. The second

part (expression labeling) assesses expressive emotion knowledge; after being shown a picture,

the child was required to identify the emotion and say how the child in the picture feels.

Mariia Pronina

PhD Research Plan

47

Following standard procedure, children were awarded one point for each correct answer in the

first part of the test and were awarded one or two points, depending upon accuracy in labeling,

for each correct answer in the second part. The total scores range from 0 to 36.

Mental verb comprehension

The child was read a short illustrated story and then was asked to select which of two

offered metacognitive verbs correctly describes the main character’s state of mind, for example,

“Does John know it’s raining or does John remember it’s raining?”. Two additional training

items were administered before the test starts. One point was awarded for each correct verb

choice. The total scores range from 0 to 6.

Theory of Mind

The first false belief task was presented in video format and enacted by puppets (see

Figure 2). The child was asked a control question (“Where will the girl look for the ball?”) and

a ‘false belief’ test question (“Where is the ball, really?”). Each correct response received 1

point.

FIGURE 2: Set of pictures from false belief task.

In the second false belief task, instead of the Smarties container, the participant was

shown a Lacasitos tube, which is familiar to Catalan children. Then, three questions were asked

in a fixed order: a self-test question (“What did you think was in the box before you opened

it?”), an other-person-test question (“What would your friend think was inside the box before

it was opened?”) and the open-ended question (“Why will he/she think that?”). A score of 1

was given for each correct answer to the former two questions (self-test and other-person-test

questions). A composite ToM score ranging from 0 to 4 was calculated (a possible maximum

of 2 for the location-change task and a possible maximum of 2 for the unexpected content task).

Pragmatics

The children were tested individually in a quiet room at their respective preschools by the

examiner. The child faced the computer screen where the images for each item were presented.

At the beginning of the test, two additional familiarization trials were carried out. For each item,

Mariia Pronina

PhD Research Plan

48

in a lively fashion appropriate for speaking to a preschooler, the examiner described the social

situation represented in each item while the child looked at the illustration displayed on the

computer screen. The examiner then asked the child to respond appropriately as if he or she

was a character in the situation. It is anticipated that when the APT is given to very young

children, if the test taker seems to have difficulty understanding a situation or does not behave

as expected, the examiner should try contextualizing the situation by converting people who

are likely to be important to the child (such as a friend, parent or teacher) into the protagonist

of the situation. This technique was applied as needed during the administration of the tool.

Each child participant was exposed to 35 of the test items. Total duration of the full

procedure was between 15 and 20 minutes.

The scoring of each response was carried out online by the examiner at two levels, namely

at the levels of pragmatics and enactment.

The pragmatic appropriateness of responses was given a score from 0 to 2. While a score

of 0 was recorded when the child’s verbal response was either pragmatically inappropriate (for

example, saying “It wasn’t me” in response to the prompt shown in Figure 1), or completely

absent. A score of 1 was recorded if the child responded with a single word or a simple

construction that was nonetheless pragmatically appropriate to the situation (e.g., saying “Are

you okay?” in response to the prompt shown in Figure 1). Finally, if the answer was

pragmatically appropriate and a more complex set of constructions was used (e.g., saying “Are

you alright? Do you want me to go to the doctor with you?” in response to the prompt shown

in Figure 1), a score of 2 was recorded. The scores were then added, for a total ranging between

0 and 70 for pragmatic appropriateness (35 items × 2 points per item).

Enactment was only scored if the child’s verbal response to an item was pragmatically

appropriate, that is, if their pragmatic appropriateness score for that item was 1 or 2, not 0. If

the response was pragmatically appropriate in terms of utterance content, the examiner then

considered whether the enactment of the situation took place or not and recorded a score of 1

or 0, respectively. Thus, the response was scored as 1 if the child answered with the prosody

and gesture that would be appropriate if the situation was really happening at that moment and

used directed speech in first-person (e.g., to the question “What should you say?” the child

answered “Don’t cry!”). The response was given zero points if the child did not enact the

scenario, that is, if he or she did not take the perspective of the situation’s protagonist character

and used indirect speech (e.g., to the question “What should you say?” the child answered “That

he shouldn’t cry”). Also the enactment score was 0 if the response was pragmatically incorrect.

These scores were then added for a total ranging between 0 and 35 for enactment.

Mariia Pronina

PhD Research Plan

49

Expressive syntax

Expressive syntax was assessed with 16 test items accompanied by pictures and implying

the use of various syntactic structures. The child was presented with a picture on a computer

screen and asked a question about the depicted scene that elicits specific grammatical

constructions. The child must respond with one or more sentences. For example, the beginning

of a sentence was presented and the child had to complete it (e.g., Aquest nen està dret. Aquest

nen... [està assegut] (“This boy is standing. This boy... [is sitting]”). Prior to the beginning of

the test, 2 familiarization items were given. The task was stopped after 5 consecutive incorrect

responses. The child was awarded one point for each correct answer, the total scores range from

0 to 16.

Narrative competence

The child was told a story about a bus while looking at pictures on a computer screen

which depicts the scenes of the story. Then she/he was asked to retell it. The task was

discontinued either when the child either indicated that she/he had finished or when there was

a pause in speech for 10-15 seconds and she/he did not comment when asked whether she/he

wanted to add anything else. The narratives produced by children were coded in terms of

fluency (coding system adapted from Vilà-Giménez & Prieto, 2018) and narrative structure

(coding system adapted from Vilà-Giménez, Igualada, & Prieto, 2018). Thus, each child was

given a fluency score, ranging from 7 (minimum) to 1 (maximum), and narrative structure

score, ranging from 0 (minimum) to 6 (maximum).

5.1.5. Expected results

We expect that results on the APT test will positively correlate with ToM performance,

proving the relation between mind-reading capacities and pragmatics. We also expect a

correlation between sociopragmatic competence and emotion understanding. Finally, we

hypothesize that pragmatic ability will correlate with other linguistic abilities, namely,

expressive vocabulary, expressive syntax and narrative ability.

Mariia Pronina

PhD Research Plan

50

5.2. Study 2: A correlational examination of the link between children’s

imitation skills and their sociopragmatic abilities

5.2.1. Research question

This study will explore the relationship between imitation abilities, on the one hand, and

sociopragmatic competence and ToM abilities, on the other. Previous findings suggest strong

correlations between the ability to imitate and different language skills (vocabulary, particular

grammatical structures) (see section 4.2.). This study will determine whether or not the same

is true for sociopragmatic abilities. Moreover, this study will also investigate whether ToM

abilities correlate with imitation abilities.

5.2.2. Participants

The children for Study 2 were recruited at the same school as for Study 1 (see section

5.1.2. for the information about the schools). A total of 38 typically developing native Catalan-

speaking children (see section 5.1.3. for more detail about language screening measure) (20

male and 18 female, M age = 45.89 months, SD = 3.3 months; range 41–52 months) of middle-

class background from the Barcelona area participated in the experiment.

5.2.3. Materials

Gesture and prosody imitation task

In order to assess the children’s ability to imitate gesture and prosody, a special task was

created based on the Reciprocal Imitation Training (RIT) technique, which has been used in

several studies to improve the social communication skills of children and teenagers with

Autism Spectrum Disorder (ASD) (e.g., Ingersoll, 2008, 2012; Ingersoll & Lalonde, 2010). The

RIT is a naturalistic behavioral intervention designed to teach spontaneous imitation to young

children with ASD by means of play interactions with a partner.

The basic materials for the gesture and prosody imitation task used here were created on

the basis of examples coming from the RIT. A set of twelve video-recordings were joined

together to form a sequence of conversational prompts about or directed at a teddy bear called

Esmolet. The twelve videos each showed a Catalan-speaking actor addressing either the teddy

bear or the camera and producing an utterance in child-directed speech style. The twelve

utterances consisted of exclamatives (expressing affective intonational meanings and

greetings), questions (yes-no and wh-questions) or imperatives, mostly either directed at or

referring to the bear, and each accompanied by appropriate intonation and gestures. Examples

may be seen in Table 2. Gestures were either conventional, iconic or metaphoric.

Mariia Pronina

PhD Research Plan

51

Example

and gesture type

Gesture/ Utterance Example

and gesture type

Gesture/ Utterance

Conventional

Waving hand

Hola Esmolet!

‘Hello, Esmolet!’

Metaphoric

Open up palms and then

bring them together

Esmolet, abans d’anar a

dormir llegirem un llibre!

‘Esmolet, before going to

sleep we’ll read a book!’

Conventional

Finger to lips

Shhh, que l’Esmolet està

dormint

‘Shhh, Esmolet is sleeping!’

Metaphoric

Hands pressed together by

face, as if sleeping

Esmolet, ara anem a dormir!

‘Esmolet, let’s go to sleep!’

Conventional

Hands covering the eyes

Quina por que fa aquest

llangardaix!

‘This lizard is so scary!’

Iconic

Quick downward movement

with the right hand

Esmolet, anem al parc a

baixar pel tobogan?

‘Esmolet, shall we go to the

playground and slide down

the slide?’

TABLE 2: Examples of the video-recording prompts included in the gesture and prosody imitation task.

Audiovisual Pragmatic Test

In this study, the same Audiovisual Pragmatic Test was applied (see section 5.1.3 for

more details).

5.2.4. Set up and procedure

The two tests were carried out in a quiet room at the participating preschools and all the

sessions were videotaped. The children were assessed individually by the author and two

additional research assistants.

First, each child took the APT. The procedure was the same as described in section 5.1.4.

above.

Next, the child proceeded to the gesture and prosody imitation task. This involved first

watching a brief video with instructions and an introduction to Esmolet the teddy bear, which

was given to the child right before the first familiarization trial. The video playback was then

paused while the experimenter repeated the instructions in person, and this was followed by a

familiarization trial to make sure the child understood that they were supposed to imitate what

Mariia Pronina

PhD Research Plan

52

they saw modeled in each video clip. The actual imitation task was then begun, with the child

viewing a continuous sequence of twelve videos as described above, each video separated from

the next by a 7-second pause. After the child watched two repetitions of the same clip, the

experimenter paused the video, imitated the gestures, prosody and lexical content as in the

video, and then encouraged the child to do the same by saying “Ara tu!” (‘Now it’s your turn!’).

It was decided to have the adult intervene in this manner because, as has been noted previously

by several authors (e.g., Dickerson, Gerhardstein, Zack, & Barr, 2013; Flynn & Whiten, 2008;

Nielsen, Simcock, & Jenkins, 2008), children perform poorer on gestural imitation tasks when

actions are presented only in video format, a phenomenon that can be explained by the lack of

social contingency inherent in a video. Finally, the child proceeded to imitate the behaviors

depicted in the video clip, using Esmolet the bear him/herself in accordance with the model

when appropriate. Figure 3 shows still images of children performing the imitation task.

FIGURE 3: Still images from two children performing the imitation task.

5.2.5. Coding

Gesture and Prosody Imitation Task

As noted, in this task, the child was encouraged to imitate three separate elements, verbal

content, prosody and gesture. Since it was conceivable that the child would fail to reproduce

one or more of these elements, the three were evaluated separately. Thus, for each of the twelve

videos, a separate score from 0-2 was given for gesture, prosody and lexical content imitation

yielding a possible maximum of 6 points. A score of 0 points was given if the child either failed

to imitate the component altogether or did something completely at variance from the model.

A score of 1 was given if the child reproduced the modeled gesture, prosody or lexical content

only partially. For gesture or prosody, this meant that the gesture or prosody produced by the

child was similar but not identical to the model. In the case of lexical content, this meant that

the child produced only part of the target utterance. Finally, a score of 2 points was given when

the child accurately reproduced the gesture, prosody or lexical content exactly as displayed in

the video. The twelve scores obtained for each component were summed to yield gesture,

Mariia Pronina

PhD Research Plan

53

prosody lexical content imitation scores for each child, and then these three component scores

were summed to produce an overall imitation score per child.

Audiovisual Pragmatic Test

The coding system was the same as that described in section 5.1.4. In this study, only the

pragmatic abilities score was considered.

5.2.6. Summary of results

In order to examine the strength and direction of association between the pragmatic

abilities scores and the four imitation scores (prosody, gesture, lexical content, and general

imitation), four Pearson bivariate correlation analyses were performed. Table 3 shows the

results. In all cases, imitation and pragmatic skills scores were moderately and positively

correlated, with correlation coefficients ranging from .57 to .69, all of them highly significant,

at p < .001.

Imitation scores

Correlation with

pragmatic skills

scores

R p

Gesture imitation .570 .000

Prosody imitation .616 .000

Lexical content imitation .639 .000

Overall imitation score .694 .000

TABLE 3: Correlation scores (R correlation coefficients and p values) obtained between imitation and

pragmatic skills scores.

Regarding the first research question, the results revealed that children’s ability to imitate

gestures, prosody and lexical content positively correlate with their sociopragmatic

competence. The current study has thus extended the findings from previous studies which

showed strong correlations between the ability to imitate and different language skills

(vocabulary, particular grammatical structures), and has demonstrated a strong positive

association between imitating and pragmatic ability.

From a practical point of view, the findings of this study also have implications for the

development of interventions aimed at supporting and improving sociocommunicative skills in

children, normally developing or otherwise. Given the correlation shown here, it may well be

that active imitation training will lead to enhanced pragmatic skills. Future experimental

research involving training paradigms might be able to answer this question.

Mariia Pronina

PhD Research Plan

54

Further analysis will address the second research question about the link between

children’s imitation abilities and ToM abilities.

5.3. Study 3: Developmental trajectories of prosody and gesture in their

expression of sociopragmatic meanings: a cross-sectional study

5.3.1 Research question

This study has the goal of exploring children's joint prosodic and gestural developmental

trajectories to express sociopragmatic meanings. A tool that has been created to assess

pragmatic abilities in preschoolers, will be also used to comprehensively assess young

children’s expressive prosodic and gestural abilities in relation to pragmatic contextual

situations. This cross-sectional study with two separate groups of children (one group of 3- to

4-year-old children on the one hand and another group of 5- to 6-year-old children on the other)

has two main goals. First, to test whether DCT elicitation methodology, which has been

successfully used for assessing adult intonational grammar, can be also applicable to obtain

expressive developmental data for 3- to 6-year-old children. The second goal is to explore the

prosodic and gestural cues encoding sociopragmatic meanings in these two groups of preschool

children (3- to 4-year-olds and 5- to 6-year-olds).

5.3.2. Participants

This cross-sectional study will involve 3- to 4-year-old children on the one hand and 5-

to 6-year-old children on the other. Data collection is being carried out in two phases. First, 3-

to 4-year-old children from the Barcelona area of Catalonia were already administered the

pragmatic and prosodic test. The 102 native Catalan-speaking children from Study 1 also took

part in Study 3. In the future, the same pragmatic and prosodic test will be administered to one

hundred 5- to 6-year-old children.

5.3.3. Materials and procedure

In order to explore how preschool-age children express sociopragmatic meanings through

intonation and/or gesture, the APT was applied. The stimuli, procedure and coding system of

the APT are described in detail in sections 5.1.3. and 5.1.4.)

5.3.4. Results

The following section presents the findings from a first administration of this test to 102

3- to 4-year-old Catalan-speaking children. First, the feasibility of the APT for preschool

Mariia Pronina

PhD Research Plan

55

children (3- to 4-year-olds) and its prosodic appropriateness were analyzed. In the future, a

more in-depth analysis of the prosodic and gestural cues (see examples in Figure 4) used by

children to express various sociopragmatic meanings will be conducted.

FIGURE 4: Gestural cues used by children while performing the APT.

Feasibility of the APT for 3- to- 4-year-old children

Average pragmatic appropriateness scores by item indicated that the elicitation method

used by the test was feasible for use with 3- to 4-year old children. A large majority of the

children engaged in the activity to one degree or another, with only 4% of the group failing to

answer any item. Figure 5 shows the number of appropriate responses (i.e., scoring 1 or 2) on

the x-axis and the number of children (N) on the y-axis, in other words, how many children

were able to respond appropriately to a certain number of items. The results show that 51% of

the children (52 children) were able to respond appropriately to more than one-third of the test,

of which 15% responded appropriately to between half and two-thirds of the test and 9%

responded appropriately to more than the two-thirds of the items. The maximum number of

appropriate responses given by any one child was 28 out of 35.

FIGURE 5: Frequency distribution of pragmatically and prosodically appropriate responses by 3- and 4- year-

olds on the APT.

Mariia Pronina

PhD Research Plan

56

These results confirm the general appropriateness of the test for preschool children in

terms of the elicitation technique applied.

Prosodic appropriateness as assessed by the APT

Successful performance on the task require certain perspective-taking abilities that allow

children to be able to take the perspective of a scenario character and enact the situation. In this

way, the proper testing can be performed if the child understands the instructions and the

procedure of the test. It is important to highlight that the absence of answer is not always due

to the incapability to produce appropriate prosody but rather indicates that the child faces

difficulties with the enacting of a specific situation. Therefore, if the child consistently does not

respond to the test items and does not enact the situations, it is rather can be explained by the

lack of the ability to take perspective but it cannot be ascertained by this assay whether the child

is able or unable to produce prosody. The feasibility results show that in this age children start

to engage to the activity and some children are capable to enact the situations and therefore give

an appropriate answer in terms of pragmatics and prosody to the large number of items.

Therefore, we decided to focus on the results of a subgroup of 36 children who responded

appropriately in terms of prosody to at least 11 items, in order to be able to assess the prosodic

profile of children at this age.

Prosodically appropriate responses by these 36 children were separated into pragmatic

areas. Figure 6 shows a summary of the prosodically appropriate responses by these 36 children

separated into pragmatic areas. The results show that 65% of them produced appropriate

responses to scenarios that focused on basic interaction skills (e.g., greetings, farewells,

expressions of gratitude). As for basic speech acts (statements, questions, imperatives,

vocatives), 49% of this subgroup successfully produced the target prosodic outcome in these

contexts. Forty-one percent produced sentences to express different affective and emotional

states such as insistence, discontent, guilt or sympathy (e.g., scolding request, regret,

congratulatory sentence), and 37% managed to correctly express emphasis or focus. Finally,

more complex sentence types encoding epistemic biases like uncertainty and obviousness in

statements or confirmation in questions were the most difficult area for expressive prosody at

this age. Only 17% of the children were able to successfully produce the right prosody with

these items. This suggests that, in general, these 3- to 4-year-old children had trouble

understanding the situations conveying pragmatic meanings related to beliefs and epistemic

status.

Mariia Pronina

PhD Research Plan

57

FIGURE 6: Percentage of prosodically appropriate responses by children, separated by pragmatic areas.

The novelty of the APT tool lies in two main features, namely, (1) it provides

comprehensive coverage of socially appropriate pragmatic situations, which allows for the

assessment of pragmatic prosody; and (2) it uses a carefully controlled DCT elicitation method

which is enhanced by the use of illustrations. With regard to the prosodic skills of 3- to 4-year-

olds, our initial results revealed that, as expected, children at this age cope best with items

involving basic interaction skills, followed by basic speech act prosody, as well as prosody that

marks affective stance, information and contrastive focus, and least well with biased-sentences.

These results are in line with recent research on the developmental path of pragmatic prosody

production in preschool children (Armstrong & Hübscher, 2018). Though these results may be

regarded as an initial indication of the pragmatic prosodic skills available to 3- to 4-year-olds,

a more in-depth analysis of the development of pragmatic prosody in preschool-aged will be

carried out in the future.

Finally, our preliminary results suggest that, overall, the 35-item APT was usable with 3-

to 4-year old children, allowing the test administrator to obtain semi-spontaneous child speech

in a relatively short period of time. Though this preliminary version of the instrument was

written in Catalan, it can easily be adapted to other languages. This suggests that the APT has

the potential to be of great utility in future research across languages on the parallel

development of pragmatic and prosodic skills, particularly in very young children. Future

analyses will explore the obtained results in more detail from the point of view of prosody and

gesture. Concerning the older target group of this cross-sectional study, we expect 5- to 6-year-

old children to obtain higher scores and be able to express more pragmatic meanings such as

epistemically biased sentences.

Mariia Pronina

PhD Research Plan

58

5.4. Study 4: The role of enactment in the development of pragmatic

abilities and ToM: a training study

5.4.1. Research question

While previous training studies have shown the beneficial effect of language

interventions on children’s cognitive development (e.g., Guajardo & Watson, 2002; Lohmann

& Tomasello, 2003; Ornaghi et al., 2011), little is known about whether perceiving and enacting

multimodal expressions of internal states and emotions can contribute to enhancing perspective

taking skills. The main research question of Study 4 will be to test whether embodied training

of children's own and other's mental and emotional states can contribute to improving their ToM

abilities and sociopragmatic competence.

5.4.2. Participants

The 102 children from Study 1 also were recruited to participate in Study 4. A total of 83 3-

to 4-year-old children participated in the experiment (37 male and 46 female, M age = 44.75

months, SD = 3.27 months; ranging from 39 to 51 months at the time of pretest). For various

reasons, nineteen children had to be excluded. Two children missed school and dropped out of

the experiment and seventeen children did not want to collaborate during either the pretests or

posttest showing restlessness or refusing to pay attention and to participate. Parents were

informed about the experiment’s goal and signed a written consent.

5.4.3. Materials

Four pre- and posttest measures were used from various tasks including (1) Emotion

Understanding (Izard et al., 2003); (2) the Mental Verb Comprehension Test (Astington &

Pelletier, 1998); (3) false belief tests: a Catalan adaptation of the Sally and Anne task (Baron-

Cohen et al., 1985) and the “Smarties task” (Gopnik & Astington, 1988); (4) the APT, or the

pragmatic and prosodic assessment (see sections 5.1.3. and 5.1.4. for the detailed description

of the tasks).

Materials for the training sessions

The materials for the training sessions were taken from the book “The Adventures of Jack

and Theo” (Ornaghi, Orlandi, & Perego, 2007) that had been specifically created for the study

by Ornaghi et al. (2011). The book comprises 16 stories that narrate the adventures of two

characters, the dolphin Jack and the shark Theo. The former 8 stories, arranged in order of

increasing difficulty, were translated and adapted for Catalan (a sample is provided in Appendix

Mariia Pronina

PhD Research Plan

59

B). Then high-quality recordings of the stories read aloud in child-directed speech were made,

and videos with accompanying images from the original book were designed.

The eight target stories are enriched with internal lexicon terms, specifically expressing

mental state and emotional stances. A total of 12 metacognitive terms were used, namely a total

of 8 mental state terms and 4 emotional stance terms (see Table 4). In each of the 8 stories, two

internal states were highlighted: one mental state and one emotional state or two mental states.

Therefore across the 8 stories, 16 internal states were trained including 8 mental states that were

presented once and 4 emotional states that were presented twice, in two different stories.

Mental state terms Emotional stance terms

wanting

remembering

knowing

thinking

believing

deciding

doubting

wondering

getting delighted

getting upset

getting angry

getting scared/surprised

TABLE 4: Target mental lexicon terms selected for the training sessions.

5.4.4. Experimental procedure

Before and after the training intervention, each child was tested individually in a quiet

room by the author and three trained research assistants (see section 5.1.4. for the detailed

description of the procedure of each test).

Before training, the participants were assigned to three different condition groups, which

will be described in greater detail in the following subsection. In order to guarantee a

homogeneous distribution of children across the three conditions, a special algorithm was

written in Python. The algorithm was based on matching the participant’s scores from the four

pretest measures (emotion understanding, mental verb comprehension, ToM, and the APT).

The working principle of the algorithm is explained in Appendix C. Once the children were

distributed into groups according to the results of the algorithm, a separate one-way ANOVA

analysis (IBM SPSS Statistics v. 24) was run for each pretest measure to check that the three

groups were not statistically different. Results confirm that the different condition groups do

not significantly differ in any of the pretests (all p’s > .05). These results and descriptive

statistics are provided in Table 5.

Mariia Pronina

PhD Research Plan

60

Measures M SD p

Emotion Understanding Test

multimodal language condition

language condition

control condition

47.3

48.2

47.5

17.6

18.1

16.7

.98

Mental verb comprehension

multimodal language condition

language condition

control condition

48.7

53.7

47.8

16.3

17.5

21.8

.46

Theory of Mind

multimodal language condition

language condition

control condition

40.4

41.7

35.8

24.6

23.0

23.4

.62

The APT (Pragmatic score)

multimodal language condition

language condition

control condition

19.5

20.9

22.8

14.3

13.8

16.9

.72

The APT (Enactment score)

multimodal language condition

language condition

control condition

23.8

25.9

28.7

17.3

15.6

21.4

.60

TABLE 5: Descriptive statistics for pretest measures.

Training sessions

The intervention took place over a four-week period and was aimed at improving

conceptual understanding of mental states and thinking about alternative perspectives. During

this phase, children participated in 8 sessions lasting between 20 and 25 minutes. Small groups

of about 12 children were trained twice per week on nonconsecutive days. All training sessions

took place in a quiet classroom at the children’s school and were videotaped. A summary of the

experimental procedure can be seen in Figure 7.

FIGURE 7: Experimental procedure of Study 4.

Mariia Pronina

PhD Research Plan

61

The training contained three between-subject conditions: control condition, language

condition and multimodal language condition. In all conditions, at the beginning of each

training session, the children watched the story enriched with internal state lexicon (the whole

list of 12 trained terms is provided above in Table 4). The conditions differed in the activities

that were carried out after watching the story. Children in the control group carried out a non-

conversational activity related to the story, like drawing pictures and solving puzzles depicting

sea inhabitants. Children in the language condition were trained through a mini-conversation

which focused on target internal states and encouraged children to reflect about the internal

states of themselves and others. Children in the multimodal language condition were trained

through the mini-conversation but, importantly, children were also encouraged to enact the

internal states of the story characters.

Across all conditions, each training session followed a standard procedure and lasted the

same amount of time in all groups. In the language and multimodal conditions, a training script

was designed for each story (an example is given Table 6, see Appendix B for story sample).

All scripts followed the same scheme. First, a brief introduction of the story scene was presented

by the experimenter and then six questions were asked to the children in a fixed order. Four

question were the same across conditions (questions 1, 3, 4, 6 in black in Table 6), two question

differed across conditions (questions 2 and 4, in grey for language only condition (say), in green

for multimodal condition (say and do)).

Language only condition Multimodal condition

1. Do you remember that in the beginning of the story

Theo asked Sara Sea-Turtle: “What happened? Why

are you crying?” ... Today we are going to play using

the words “getting upset”.

Do you remember why Sara was upset and why did

she cry?

1. Do you remember that in the beginning of the story

Theo asked Sara Sea-Turtle: “What happened? Why

are you crying?” ... Today we are going to play using

the words “getting upset”.

Do you remember why Sara was upset and why did

she cry?

2.

What did Sara say when Theo found her stuck in a

hole?

2.

What did Sara do when Theo found her stuck in a

hole? Can you tell me what she was saying at the

same time?

3. What makes you upset? 3. What makes you upset?

4. What makes your friends upset? 4. What makes your friends upset?

5.

What would your friend say if his favorite toy broke?

5.

What would your friend say if his favorite toy broke?

What would he do in this situation? Can you show

me?

Mariia Pronina

PhD Research Plan

62

6. Well, children, you’ve been very good. We’ve

played using the word...? (Children finish the

sentence)

6. Well, children, you’ve been very good. We’ve

played using the word...? (Children finish the

sentence)

TABLE 6: Sample script of training session. Story: “Sara Sea-Turtle” (see Appendix B), target term: getting upset.

Concerning the content of the questions, first the children were asked a question about

the recalled episode and were introduced the target term to be focused on during the training

session. Afterwards the children were invited to work out the situation and the target word: in

the language condition, it was carried out through a conversational procedure, by contrast, in

the multimodal language children were trained not only through a conversational procedure but

the children were also guided to enact the internal states. Figure 8 shows photographs of the

experimenter conducting a training session. Then children were asked to reflect on their own

emotional internal states. Subsequently they were asked to reflect on internal states of others.

Then children were provided with one more contextualized example and were asked to discuss

it (language only condition) or to discuss and enact it (multimodal condition).

Lastly, the experimenter concluded the first part of the session and commended the

children. Then the experimenter established a link between the fragment of the story that had

just been focused on and a new story’s episode that contained the second target term of the

training session. The second target internal state’s questions followed the same scheme. Finally,

children were invited to return to the class.

Throughout the training session, the experimenter encouraged participants to use the

target term as much as possible, she also motivated children to participate actively in the

conversation and ensured that all of them were involved in the activity.

FIGURE 8: Training session photographs.

Notes. Images on the left: language condition; images on the right: multimodal language condition.

In order to validate the training materials and scripts, prior to the experiment, a pilot study

was conducted. Comprehensibility of the stories and children’s interest in the training were

Mariia Pronina

PhD Research Plan

63

tested with the 19 preschool children (3-year-old) from Girona who did not take part in the

study.

Each training session was conducted by two experimenters. Across all sessions, the same

experimenters worked with the children. The first experimenter performed the training activity.

The second controlled for the time of the training sessions and documented all events that took

place during the training session (e.g., the experimenter asks a question, children respond a

question). To do so, an annotation system of the online recoding of training sessions was

developed for the study. It included information about the experimenter’s actions (e.g., to repeat

a question, to reformulate a question, to give positive/negative feedback, to provide an example

etc.) and children’s actions (e.g., to comment something, to refuse to answer, to repeat the

experimenter’s multimodal actions). There were also special symbols for coding the type of

answer, feedback, and examples which could be verbal or multimodal (the whole annotation

system is provided in the Appendix D). In this way, for each group and each session,

information about the number of children’s answers, experimenter’s examples, and information

about the number of experimenter errors was collected. This data will be used to evaluate the

degree of active participation by children. After the completion of the session, each participant

was given an activity participation score by the two experimenters. The child’s participation,

concentration and adequacy of responses were given a separate mark. Then for each child an

average score on involvement in the training was calculated.

5.4.5. Expected results

We expect that children in both the language and multimodal conditions will significantly

improve their ToM, emotional, and perspective-taking performance in the posttests as

compared to children in the control group. Importantly, we hypothesise that training

intervention will affect not only ToM understanding but also those linguistic capacities that

require perspective-taking skills such as pragmatic and prosodic abilities.

Further, this study not only leads to widening the body of literature on sociopragmatic

development and ToM, but it also has the intention to develop an educational tool to aid

preschool teachers in incorporating multimodal aspects specifically designed to boost their

ToM skills into their classroom. An educational website with training materials and instructions

has been designed. The web page can be found at:

https://entrenemlesemocions.wordpress.com

Mariia Pronina

PhD Research Plan

64

6. WORK PLAN

Previous work (September 2017 – January 2019)

• Establish contact with schools for Studies 1, 2, 3 and 4

• Organization of experimental design for Study 1, 2, 3 and 4

• Stimuli preparation for Study 3

• Development of training sessions for Study 4 in collaboration with Dr. Judith Holler

(Max Planck Institute for Psycholinguistics, the Netherlands & Donders Institute for

Brain, Cognition and Behaviour, Radboud University, the Netherlands)

• Completion of stimuli materials for Study 2 in collaboration with Eva Castillo and Iris

Hübscher

o Filming, re-editing

• Carrying out of pilots for Study 1, 3 and 4 at Escola Verd, Girona

• Data collection for Study 1, 2, 3 and 4 at Escola Bogatell and Escola Antoni Brusi,

Barcelona

• Data preparation and preliminary data processing for Study 2 and 3

• Preliminary statistical analysis for Study 4

• Developing an educational web site for Study 4

• Attendance to the conference Speech Prosody 2018: Poznan, Poland, June 13-16, 2018.

• Oral presentation at Xe Workshop Sobre la Prosòdia del Català 2018: Barcelona,

Catalunya, June 28, 2018. Title: Avaluació d'habilitats pragmàtiques i prosòdiques dels

nens en edat preescolar. Co-authors: Pilar Prieto, Iris Hübscher & Ingrid Vilà-Giménez.

• Oral presentation at I Congreso Internacional sobre Didáctica de la Lengua Infantil:

Bilbao, Spain, November 14, 2018. Title: El efecto del entrenamiento multimodal en

potenciar habilidades de toma de perspectiva y competencia pragmática. Co-authors:

Pilar Prieto, Judith Holler, & Iris Hübscher.

• Writing and submission of the article based on Study 3 to the for the 19th International

Congress of Phonetic Sciences. Title: A new tool to assess pragmatic prosody in

children: evidence from 3- to 4-year-olds. Co-authors: Pilar Prieto, Iris Hübscher, &

Ingrid Vilà-Giménez.

• Writing and submission of the article based on the preliminary results of Study 2 to the

19th International Congress of Phonetic Sciences. Title: Children’s imitation skills are

Mariia Pronina

PhD Research Plan

65

positively correlated with sociopragmatic abilities. Co-authors: Pilar Prieto, Iris

Hübscher & Eva Castillo.

• Writing and submission of PhD research plan.

February 2019 – May 2019

• Defense of the PhD Research Plan (February 11th, 2019)

• Continue data processing for Study 2 and 3

• Data analysis for Study 1, 2 and 3

• Begin writing for Study 4

June 2019 – August 2019

• Attendance to the Workshop Sobre la Prosòdia del Català in Barcelona (Catalunya).

June, 2019.

• Attendance to the Phonetics and Phonology in Europe conference (PapE) in Lecce,

(Italy). 17-19 June, 2019.

• Attendance to the Workshop on Infant Language Development (WILD) in Potsdam,

(Germany). 13-15 June, 2019.

• Attendance to the International Congress of Phonetic Sciences (ICPhS) in Melbourne

(Australia). 4-10 August, 2019.

September 2019 – December 2019

• Continue data analysis for Study 1, 2 and 3

• Writing for Study 1

• Writing for Study 2

• Establish contact with schools for the second part of Study 3

• Data collection for the second part of Study 3

January 2020 – July 2020

• Data analysis for the second part of Study 3

• Attendance to the International Conference on Speech Prosody 2020 - Location and

Dates TBA

September 2020 – December 2020

Mariia Pronina

PhD Research Plan

66

• Writing for Study 3

• Begin writing for PhD thesis

January 2021 – April 2021

• PhD Thesis writing

May 2021 – July 2021

• PhD Thesis Defense

7. SELECTED REFERENCES

This book provides a comprehensive review of a broad range of topics in pragmatic

development research. Each chapter presents current theories and key empirical finding in the

respective domains. The topics include prelinguistic foundations of communication, the

acquisition of conventional language (language specific words and grammatical constructions),

non-literal language use (humor, metaphor and irony comprehension), and organising and

marking information. It also covers some overarching topics such as the assessment of

children’s pragmatic abilities. The volume is particularly important for the present thesis since

it laid the groundwork for all four studies that address main goals related to the role of

sociopragmatic abilities in children’s development.

This recent book edited by Prieto and Esteve-Gibert offers a complete state of the art on

prosodic development. The prosodic topics dealt with are very diverse in nature and are divided

in four sections, namely, early sensitivity to prosody, learning to produce prosody, moving to

meaning and interaction: prosody and pragmatic development, prosody in bilingualism and in

specific populations. The chapter “Set in time: Temporal coordination of prosody and gesture

Prieto, P., Esteve-Gibert, N. (Eds.). (2018). The Development of Prosody in First Language

Acquisition. Amsterdam: John Benjamins.

Matthews, D. (2014). Pragmatic Development in First Language Acquisition. Amsterdam:

John Benjamins.

Mariia Pronina

PhD Research Plan

67

in the development of spoken language production” by Rusiewicz and Esteve-Gibert discusses

the relationship between prosody and gesture which is particularly important for Study 3 of this

thesis. The third section on the interplay between prosody and pragmatics is also closely related

to the Study 3 which will add to these findings by applying a new tool to assess pragmatic

prosody in children. The chapter “Children’s development of internal state prosody” by

Armstrong and Hübscher presents an insightful literature survey on how children including

preschoolers use prosody to mark and comprehend affect and mental states of others which is

particularly important for Study 1 which aims to test the correlation between prosodic skills

and ToM abilities.

In this review article, Hübscher and Prieto show that gesture and prosody act as an

integrated system and play a scaffolding role in pragmatic acquisition through the analysis of

empirical results coming from developmental research in these domains. Importantly, the

evidence discussed in this paper demonstrates that children rely on gestural and prosodic cues

at different developmental stages. More specifically, the article discussed children’s

communicative behaviors between the ages of 1 and 5. This paper is of great interest for the

current PhD thesis, as it reports a summary of research on early pragmatic development

including preschool-aged children. It focuses on speech marking, information focus, epistemic

stance and politeness, pragmatic areas that will be explored in Study 3 from a multimodal

perspective. The authors advocate for the need of a multimodal approach in different domains

such as language intervention, which is crucial for Study 4.

The study by Bosco et al. (2018) aims to contribute to a further understanding of the

complex interplay between pragmatics and Theory of Mind abilities. It provides an exhaustive

overview on the correlation between these two capacities by taking into account both a

developmental and a clinical perspective. This article defends the position that pragmatics and

ToM constitute independent capacities. The authors indicate that pragmatics is often identified

Hübscher, I., Prieto, P. Gestural and prosodic development act as sister systems and jointly

pave the way for children’s sociopragmatic development. Submitted to Frontiers in

Psychology.

Bosco, F.M., Tirassa, M., Gabbatore, I. (2018). Why pragmatics and theory of mind do not

(completely) overlap. Front. Psychol. 9:1453.

Mariia Pronina

PhD Research Plan

68

with ToM in empirical research due to the fact that the tasks used as a measure of ToM involve

both pragmatic and ToM competences, which leads to methodological problems in exploring

the relationship between the two. This article is particularly important for Study 1, as its aim is

to investigate how sociopragmatic abilities in preschool children correlate with their ToM

abilities.

Ornaghi et al. (2011) is another important study that for the current PhD thesis. In line

with other training studies (e.g., Lohmann & Tomasello, 2003), this study demonstrated that a

training involving linguistic interaction leads to improved performance on ToM tasks and

facilitates conceptual understanding of mental terms in 3- to 4-year-old children. The

methodology, of this study, as well as part of the training materials, were adapted for Study 4.

The main difference is that Study 4 will also test the effects of a multimodal enacting training.

In this way, while Ornaghi et al. (2011) had two conditions, namely, control and language

conditions, Study 4 has three conditions, namely, control, language and multimodal language.

Also, in the original study only the use of mental state language was trained, whereas Study 4

also involves emotional stance training.

Ornaghi V., Brockmeier J., Grazzani I. (2011). The role of language games in children’s

understanding of mental states: a training study. J. Cogn. Dev. 12, pp. 239–259.

Mariia Pronina

PhD Research Plan

69

8. FULL BIBLIOGRAPHY

Adamson, L. B., & Dimitrova, N. (2014). Joint attention and language development. In P.

Brooks & V. Kempe (Eds.), Encyclopedia of language development (pp. 299–304). Los

Angeles, CA: Sage Publications.

Andrés-Roqueta, C., & Katsos, N. (2017). The contribution of grammar, vocabulary and Theory

of Mind in pragmatic language competence in children with Autistic Spectrum Disorders.

Frontiers in Psychology, 8, 996.

Angeleri, R., & Airenti, G. (2014). The development of joke and irony understanding: A study

with 3- to 6-year-old children. Canadian Journal of Experimental Psychology/Revue

Canadienne de Psychologie Expérimentale, 68(2), 133–146.

Applebee, A. N. (1978). The child’s concept of story: ages two to seventeen. Chicago, IL:

Chicago Press.

Applebee, A. N. (1980). Children’s narratives: new directions. The Reading Teacher, 34, 137–

142.

Appleton, M., & Reddy, V. (1996). Teaching three year-olds to pass false belief tests: a

conversational approach. Social Development, 5(3), 275–291.

Armstrong, M. (2014). Child comprehension of intonationally-encoded disbelief. In W. Orman

& J. Valleau (Eds.), Proceedings of the 38th Annual Boston University Conference on

Language Development (Vol. 2, pp. 25–38). Somerville, MA: Cascadilla Press.

Armstrong, M., Esteve Gibert, N., Hübscher, I., Igualada, A., & Prieto, P. (2018).

Developmental and cognitive aspects of children’s disbelief comprehension through

intonation and facial gesture. First Language, 38(6), 596–616.

Armstrong, M., & Hübscher, I. (2018). Children’s development of internal state prosody. In P.

Prieto & N. Esteve-Gibert (Eds.), The Development of Prosody in First Language

Acquisition (pp. 271–293). Amstersam: John Benjamins.

Astington, J. W., & Baird, J. A. (2005a). Introduction: Why language matters. In J. W.

Astington & J. A. Baird (Eds.), Why Language Matters for Theory of Mind (pp. 3 – 25).

New York: Oxford University Press.

Astington, J. W., & Baird, J. A. (2005b). Representational development and false-belief

understanding. In J. W. Astington & J. A. Baird (Eds.), Why language matters for theory

of mind (pp. 163–185). New York: Oxford University Press.

Astington, J. W., & Gopnik, A. (1988). Knowing you’ve changed your mind: Children’s

understanding of representational change. In J. W. Astington, P. L. Harris, & D. R. Olson

(Eds.), Developing theories of mind (pp. 193–206). New York, NY, US: Cambridge

University Press.

Astington, J. W., & Jenkins, J. M. (1999). A longitudinal study of the relation between language

and theory-of-mind development. Developmental Psychology, 35(5), 1311–1320.

Astington, J. W., & Pelletier, J. (1998). Metacognitive vocabulary test. Institute of Child Study,

University of Toronto, Unpublished.

Astington, J. W., & Peskin, J. (2004). Meaning and use: Children’s acquisition of the mental

lexicon. The Development of the Mediated Mind: Sociocultural Context and Cognitive

Development, 59–78.

Mariia Pronina

PhD Research Plan

70

Aureli, T., Spinelli, M., Fasolo, M., Garito, M. C., Perucchini, P., & D’Odorico, L. (2017). The

pointing–vocal coupling progression in the first half of the second year of life. Infancy,

22(6), 801–818.

Austin, J. L. (1962). How to do things with words. Cambridge, Mass.: Harvard University Press.

Baillargeon, R., Scott, R. M., & Bian, L. (2016). Psychological reasoning in infancy. Annual

Review of Psychology, 67(1), 159–186.

Baillargeon, R., Scott, R. M., & He, Z. (2010). False-belief understanding in infants. Trends in

Cognitive Sciences, 14(3), 110–118.

Barner, D., Brooks, N., & Bale, A. (2011). Accessing the unsaid: The role of scalar alternatives

in children’s pragmatic inference. Cognition, 118(1), 84–93.

Baron-Cohen, S., Leslie, A. M., & Frith, U. (1985). Does the autistic child have a “theory of

mind”? Cognition, 21(1), 37–46.

Baroni, M. R., & Axia, G. (1989). Children’s meta-pragmatic abilities and the identification of

polite and impolite requests. First Language, 9(27), 285–297.

Barsalou, L. W. (2008). Grounded cognition. Annual Review of Psychology, 59(1), 617–645.

Barsalou, L. W. (2010). Grounded cognition: past, present, and future. Topics in Cognitive

Science, 2(4), 716–724.

Bartsch, K., & Wellman, H. M. (1995). Children talk about the mind. New York: Oxford

University Press.

Bartz, D. T. (2017). Young children’s meta-ignorance (PhD Thesis). Harvard University.

Bates, E. (1976). Language and cognition: The acquisition of pragmatics. New York: Academic

Press.

Bates, E., Camaioni, L., & Volterra, V. (1975). The acquisition of performatives prior to speech.

Merrill-Palmer Quarterly of Behavior and Development, 21(3), 205–226.

Batki, A., Baron-Cohen, S., Wheelwright, S., Connellan, J., & Ahluwalia, J. (2000). Is there an

innate gaze module? Evidence from human neonates. Infant Behavior and Development,

23(2), 223–229.

Bavin, E. L., Prior, M., Reilly, S., Bretherton, L., Williams, J., Eadie, P., … Ukoumunne, O. C.

(2008). The Early Language in Victoria Study: Predicting vocabulary at age one and two

years from gesture and object use. Journal of Child Language, 35(3), 687–701.

Beaupoil-Hourdel, P., Morgenstern, A., & Boutet, D. (2015). A child’s multimodal negations

from 1 to 4: The interplay between modalities. In P. Larrivée & C. Lee (Eds.), Negation

and Polarity: Experimental Perspectives. Language, Cognition, and Mind. (Vol. 1, pp.

95–123). Cham: Springer.

Behne, T., Carpenter, M., & Tomasello, M. (2005). One-year-olds comprehend the

communicative intentions behind gestures in a hiding game. Developmental Science, 8(6),

492–499.

Behne, T., Liszkowski, U., Carpenter, M., & Tomasello, M. (2012). Twelve-month-olds’

comprehension and production of pointing. British Journal of Developmental Psychology,

30(3), 359–375.

Mariia Pronina

PhD Research Plan

71

Bender, P. K., Pons, F., Harris, P. L., & de Rosnay, M. (2011). Do young children

misunderstand their own emotions? European Journal of Developmental Psychology,

8(3), 331–348.

Bergen, B. (2015). Embodiment. In E. Dabrowska & D. Divjak (Eds.), Handbook of Cognitive

Linguistics (pp. 10–30). Walter de Gruyter GmbH & Co KG.

Berman, J. M. J., Chambers, C. G., & Graham, S. A. (2010). Preschoolers’ appreciation of

speaker vocal affect as a cue to referential intent. Journal of Experimental Child

Psychology, 107(2), 87–99.

Berman, J. M. J., Chambers, C. G., & Graham, S. A. (2016). Preschoolers’ real-time

coordination of vocal and facial emotional information. Journal of Experimental Child

Psychology, 142, 391–399.

Berman, J. M. J., Graham, S. A., Callaway, D., & Chambers, C. G. (2013). Preschoolers use

emotion in speech to learn new words. Child Development, 84(5), 1791–1805.

Berman, J. M. J., Graham, S. A., & Chambers, C. G. (2013). Contextual influences on children’s

use of vocal affect cues during referential interpretation. Quarterly Journal of

Experimental Psychology, 66(4), 705–726.

Berman, R., & Slobin, D. I. (1994). Relating events in narrative: A crosslinguistic

developmental study. Hillsdale, NJ, US: Lawrence Erlbaum Associates, Inc.

Borghi, A. M., Glenberg, A. M., & Kaschak, M. P. (2004). Putting words in perspective.

Memory and Cognition, 32(6), 863–873.

Bosco, F. M., & Gabbatore, I. (2017). Sincere, deceitful, and ironic communicative acts and the

role of the theory of mind in childhood. Frontiers in Psychology, 8, 21.

Bosco, F. M., Tirassa, M., & Gabbatore, I. (2018). Why pragmatics and Theory of Mind do not

(completely) overlap. Frontiers in Psychology, 9, 1453.

Brooks, R., & Meltzoff, A. N. (2005). The development of gaze following and its relation to

language. Developmental Science, 8(6), 535–543.

Brooks, R., & Meltzoff, A. N. (2008). Infant gaze following and pointing predict accelerated

vocabulary growth through two years of age: A longitudinal, growth curve modeling study.

Journal of Child Language, 35(1), 207–220.

Brown, J., Donelan-McCall, N., & Dunn, J. (1996). Why talk about mental states? The

significance of children’s conversations with friends, siblings, and mothers. Child

Development, 67(3), 836–849.

Brown, P., & Levinson, P. (1987). Politeness: Some universals in language usage. Cambridge,

UK: Cambridge university press.

Bucciarelli, M., Colle, L., & Bara, B. G. (2003). How children comprehend speech acts and

communicative gestures. Journal of Pragmatics, 35(2), 207–241.

Buttelmann, D., Carpenter, M., & Tomasello, M. (2009). Eighteen-month-old infants show

false belief understanding in an active helping paradigm. Cognition, 112(2), 337–342.

Buttelmann, F., Suhrke, J., & Buttelmann, D. (2015). What you get is what you believe:

Eighteen-month-olds demonstrate belief understanding in an unexpected-identity task.

Journal of Experimental Child Psychology, 131, 94–103.

Camaioni, L., Perucchini, P., Bellagamba, F., & Colonnesi, C. (2004). The role of declarative

pointing in developing a theory of mind. Infancy, 5(3), 291–308.

Mariia Pronina

PhD Research Plan

72

Cameron-Faulkner, T. (2014). The development of speech acts. In D. Matthews (Ed.),

Pragmatic development in first language acquisition (pp. 37–52). Amsterdam: John

Benjamins.

Camras, L. A., Pristo, T. M., & Brown, M. J. K. (1985). Directive choice by children and adults

- affect, situation, and linguistic politeness. Merrill-Palmer Quarterly-Journal of

Developmental Psychology, 31(1), 19–31.

Carmiol, A. M., & Sparks, A. (2014). Narrative development across cultural contexts. In D.

Matthews (Ed.), Pragmatic development in first language acquisition (pp. 279–296).

Amsterdam: John Benjamins.

Carpenter, M. (2012). Joint attention in humans and animals. In N. Seel (Ed.), Encyclopedia of

the Sciences of Learning (pp. 1663–1664). Berlin: Springer.

Carpenter, M. (2014). Imitation in communicative development. In P. Brooks & V. Kempe

(Eds.), Encyclopedia of Language Development (pp. 275–277). Los Angeles, CA: Sage

Publications.

Carpenter, M., Akhtar, N., & Tomasello, M. (1998). Fourteen-through 18-month-old infants

differentially imitate intentional and accidental actions. Infant Behavior and Development,

21(2), 315–330.

Carpenter, M., Nagell, K., & Tomasello, M. (1998). Social cognition, joint attention, and

communicative competence from 9 to 15 months of age. Monographs of the Society for

Research in Child Development, 63(4), i–vi, 1-143.

Carpenter, M., & Tomasello, M. (2000). Joint attention, cultural learning, and language

acquisition: Implications for children with autism. In M. Wetherby & B. M. Prizant (Eds.),

Autism spectrum disorders: A transactional perspective (pp. 31–54). Baltimore, MD, US:

Paul H Brookes Publishing.

Carpenter, M., Tomasello, M., & Striano, T. (2005). Role reversal imitation and language in

typically developing infants and children with autism. Infancy, 8(3), 253–278.

Carrow-Woolfolk, E. (1999). CASL: Comprehensive assessment of spoken language. Circle

Pines, MN: American Guidance Services.

Carrow-Woolfolk, E. (2017). Comprehensive assessment of spoken language, Second Edition

(CASL-2). Torrance, CA: Western Psychological Services.

Caselli, M. C., Rinaldi, P., Stefanini, S., & Volterra, V. (2012). Early action and gesture

“vocabulary” and its relation with word comprehension and production. Child

Development, 83(2), 526–542.

Casillas, M. (2014). Turn-taking. In D. Matthews (Ed.), Pragmatic development in first

language acquisition (pp. 53–70). Amsterdam: John Benjamins.

Casillas, M., & Frank, M. C. (2012). Cues to turn boundary prediction in adults and

preschoolers. In S. Brown-Schmidt, J. Ginzburg, & S. Larsson (Eds.), Proceedings of

SemDial 2012 (SeineDial): The 16th Workshop on the Semantics and Pragmatics of

Dialogue (pp. 61–69). Paris: Université Paris-Diderot.

Champagne-Lavau, M., & Charest, A. (2015). Theory of mind and context processing in

schizophrenia: The role of social knowledge. Frontiers in Psychiatry, 6, 98.

Chen, A. (2018). Get the focus right across languages: acquisition of prosodic focus-marking

in production. In P. Prieto & N. Esteve-Gibert (Eds.), The Development of Prosody in First

Language Acquisition (pp. 295–316). Amsterdam: John Benjamins.

Mariia Pronina

PhD Research Plan

73

Christophe, A., Mehler, J., & Sebastián-Gallés, N. (2001). Perception of prosodic boundary

correlates by newborn infants. Infancy, 2(3), 385–394.

Clark, R., Hutcheson, S., & Buren, P. Van. (1974). Comprehension and production in language

acquisition. Journal of Linguistics, 10(1), 39–54.

Colletta, J. M., Guidetti, M., Capirci, O., Cristilli, C., Demir, O. E., Kunene-Nicolas, R. N., &

Levine, S. C. (2015). Effects of age and language on co-speech gesture production: An

investigation of French, American, and Italian children’s narratives. Journal of Child

Language, 42(1), 122–145.

Cowie, F. (2017). Innateness and Language. In E. N. Zalta (Ed.), The Stanford Encyclopedia of

Philosophy.

Crystal, D. (1992). Profiling linguistic disability. London, UK: Edward Arnold.

Culpeper, J. (2010). Historical Sociopragmatics. In A. H. Jucker & I. Taavitsainen (Eds.),

Historical Pragmatics (pp. 69–96). Berlin: Mouton de Gruyter.

de Carvalho, A., Dautriche, I., Millotte, S., & Christophe, A. (2018). Early perception of phrasal

prosody and its role in syntactic and lexical acquisition. In P. Prieto & N. Esteve-Gibert

(Eds.), The Development of Prosody in First Language Acquisition (pp. 17–35).

Amsterdam: John Benjamins.

de Villiers, J. (1995). Steps in the mastery of sentence complements. Paper Presented at the

Biennial Meeting of the Society for Research in Child Development. Indianapolis, IN.

de Villiers, J. (2000). Language and theory of mind: What are the developmental relationships?

In S. Baron-Cohen, H. Tager-Flusberg, & D. J. Cohen (Eds.), Understanding other minds:

Perspectives from developmental cognitive neuroscience (pp. 83–123). New York, NY,

US: Oxford University Press.

de Villiers, J. (2005). Can language acquisition give children a point of view. In J. W. Astington

& J. A. Baird (Eds.), Why language matters for theory of mind (pp. 186–219). New York:

Oxford University Press.

de Villiers, J., & de Villiers, P. (2000). Linguistic determinism and the understanding of false

beliefs. In P. Mitchell & K. J. Riggs (Eds.), Children’s reasoning and the mind (pp. 191–

228). Hove, UK: Psychology Press.

de Villiers, J., & Pyers, J. E. (2002). Complements to cognition: a longitudinal study of the

relationship between complex syntax and false-belief-understanding. Cognitive

Development, 17(1), 1037–1060.

de Villiers, P. (2005). The role of language in theory-of-mind development: what deaf children

tell us. In J. W. Astington & J. A. Baird (Eds.), Why Language Matters for Theory of Mind

(pp. 266–297). New York: Oxford University Press.

Decasper, J., & Spence, M. (1986). Prenatal maternal speech influences newborns’ perception

of speech sounds. Infant Behavior and Development, 9(2), 133–150.

Demir, Ö. E., Levine, S. C., & Goldin-Meadow, S. (2015). A tale of two hands: Children’s early

gesture use in narrative production predicts later narrative structure in speech. Journal of

Child Language, 42(3), 662–681.

Demorest, A., Meyer, C., Phelps, E., Gardner, H., & Winner, E. (1984). Words speak louder

than actions: Understanding deliberately false remarks. Child Development, 55(4), 1527–

1534.

Mariia Pronina

PhD Research Plan

74

Dickerson, K., Gerhardstein, P., Zack, E., & Barr, R. (2013). Age-related changes in learning

across early childhood: A new imitation task. Developmental Psychobiology, 55(7), 719–

732.

Diessel, H., & Tomasello, M. (2005). A new look at the acquisition of relative clauses.

Language, 81(4), 882–906.

Dörrenberg, S., Rakoczy, H., & Liszkowski, U. (2018). How (not) to measure infant Theory of

Mind: Testing the replicability and validity of four non-verbal measures. Cognitive

Development, 46, 12–30.

Dunn, J. (1995). Children as psychologists: The later correlates of individual differences in

understanding of emotions and other minds. Cognition & Emotion, 9(2–3), 187–201.

Dunn, J., Brown, J., & Beardsall, L. (1991). Family talk about feeling states and children’s later

understanding of others’ emotions. Developmental Psychology, 27(3), 448–455.

Ervin-Tripp, S. (1979). Children’s verbal turn-taking. In E. Ochs & B. Schieffelin (Eds.),

Developmental Pragmatics (pp. 391–414). New York, NY: Academic Press.

Esteve-Gibert, N., Lœvenbruck, H., Dohen, M., & D’Imperio, M. P. (2017). The use of prosody

and gestures for the production of contrastive focus in French-speaking 4 and 5 year olds.

Proceedings of the Workshop on Abstraction, Diversity and Speech Dynamics. Munich.

Esteve-Gibert, N., & Prieto, P. (2013). Prosody signals the emergence of intentional

communication in the first year of life: evidence from Catalan-babbling infants. Journal

of Child Language, 40(05), 919–944.

Esteve-Gibert, N., & Prieto, P. (2014). Infants temporally coordinate gesture-speech

combinations before they produce their first words. Speech Communication, 57, 301–316.

Esteve-Gibert, N., & Prieto, P. (2018). Early development of the prosody-meaning interface. In

P. Prieto & N. Esteve-Gibert (Eds.), The Development of Prosody in First Language

Acquisition (pp. 227–246). Amsterdam: John Benjamins.

Esteve-Gibert, N., Prieto, P., & Liszkowski, U. (2017). Twelve-month-olds understand social

intentions based on prosody and gesture shape. Infancy, 22(1), 108–129.

Farroni, T., Csibra, G., Simion, F., & Johnson, M. H. (2002). Eye contact detection in humans

from birth. Proceedings of the National Academy of Sciences of the United States of

America, 99(14), 9602–9605.

Fernald, A. (1989). Intonation and communicative intent in mothers’ speech to infants: Is the

melody the message? Child Development, 60(6), 1497–1510.

Fernald, A. (1993). Approval and disapproval: Infant responsiveness to vocal affect in familiar

and unfamiliar languages. Child Development, 64(3), 657–674.

Fernald, A., & Mazzie, C. (1991). Prosody and focus in speech to infants and adults.

Developmental Psychology, 27(2), 209–221.

Fernald, A., & Simon, T. (1984). Expanded intonation contours in mothers’ speech to

newborns. Developmental Psychology, 20(1), 104–113.

Filippova, E., & Astington, J. W. (2008). Further development in social reasoning revealed in

discourse irony understanding. Child Development, 79(1), 126–138.

Filippova, E., & Astington, J. W. (2010). Children’s understanding of social-cognitive and

social-communicative aspects of discourse irony. Child Development, 81(3), 913–928.

Flynn, E., & Whiten, A. (2008). Cultural transmission of tool use in young children: A diffusion

chain study. Social Development, 17(3), 699–718.

Mariia Pronina

PhD Research Plan

75

Frank, C. K. (2018). Reviving pragmatic theory of theory of mind. AIMS Neuroscience, 5(2),

116–131.

Friend, M. (2001). The transition from affective to linguistic meaning. First Language, 21(63),

219–243.

Friend, M., & Bryant, J. B. (2000). A developmental lexical bias in the interpretation of

discrepant messages. Merrill-Palmer Quarterly (Wayne State University. Press), 46(2),

342–369.

Frota, S., & Butler, J. (2018). Early development of intonation. In P. Prieto & N. Esteve-Gibert

(Eds.), The Development of Prosody in First Language Acquisition (pp. 145–164).

Amsterdam: John Benjamins.

Frota, S., Matos, N., Cruz, M., & Vigário, M. (2016). Early prosodic development: Emerging

intonation and phrasing in European Portuguese. In M. Armstrong, M. Vanrell, & N. C.

Henriksen (Eds.), Issues in Hispanic and lusophone linguistics: Interdisciplinary

approaches to intonational grammar in Ibero-Romance (pp. 295–324). Philadelphia, PA:

Benjamins.

Ganzeboom, H. B. G., De Graaf, P. M., Treiman, D. J., & De Leeuw, J. (1992). A Standard

International Socio-Economic Index of Occupational Status. Social Science Research, 21,

1–56.

Gleitman, L. (1990). The structural sources of verb meanings. Language Acquisition, 1(1), 3–

55.

Glenberg, A. M. (2007). Language and action: Creating sensible combinations of ideas. In G.

Gaskell (Ed.), Oxford handbook of psycholinguistics (pp. 361–370). Oxford, UK: Oxford

University Press.

Goldin-Meadow, S. (1998). The Development of Gesture and Speech as an Integrated System.

New Directions for Child and Adolescent Development, 1998(79), 29–42.

Goldin-Meadow, S. (2003). Hearing gesture: How our hands help us think. Cambridge, MA:

Harvard University Press.

Goldin-Meadow, S. (2014). Widening the lens: What the manual modality reveals about

language, learning and cognition. Philosophical Transactions of the Royal Society B:

Biological Sciences, 369(1651), 20130295.

Goldin-Meadow, S. (2018). Taking a Hands-on Approach to Learning. Policy Insights from the

Behavioral and Brain Sciences, 5(2), 163–170.

Goldin-Meadow, S., & Wagner, S. M. (2005). How our hands help us learn. Trends in Cognitive

Sciences, 9(5), 230–241.

Goldman, A. I. (2012). Theory of Mind. In E. Margolis, R. Samuels, & S. P. Sitch (Eds.), The

Oxford Handbook of Philosophy of Cognitive Science. Oxford: Oxford University Press.

González-Fuente, S. (2017). Audiovisual prosody and verbal irony (PhD Thesis). Universitat

Pompeu Fabra, Barcelona.

Goodwyn, S. W., Acredolo, L. P., & Brown, C. A. (2000). Impact of Symbolic Gesturing on

Early Language Development. Journal of Nonverbal Behavior, 24(2), 81–103.

Gopnik, A., & Astington, J. W. (1988). Children’s understanding of representational change

and its relation to the understanding of false belief and the appearance-reality distinction.

Child Development, 59(1), 26–37.

Mariia Pronina

PhD Research Plan

76

Gratier, M. (2014). Vocal imitation. In P. Brooks & V. Kempe (Eds.), Encyclopedia of

Language Development (pp. 662–663). Los Angeles, CA: Sage Publications.

Graziano, M. (2014). The development of two pragmatic gestures of the so-called Open Hand

Supine family in Italian children. In M. Seyfeddinipur & M. Gullberg (Eds.), From

Gesture in Conversation to Visible Action as Utterance: Essays in Honor of Adam Kendon

(pp. 311–330). Amsterdam: John Benjamins.

Grazzani, I. G., Ornaghi, V., Conte, E., Pepe, A., & Caprin, C. (2018). The relation between

emotion understanding and theory of mind in children aged 3 to 8: The key role of

language. Frontiers in Psychology, 9, 724.

Grice, H. P. (1975). Logic and conversation. In P. Cole & J. L. Morgan (Eds.), Syntax and

Semantics, Vol. 3, Speech Acts (pp. 41–58). New York: Academic Press.

Grünloh, T., & Liszkowski, U. (2015). Prelinguistic vocalizations distinguish pointing acts.

Journal of Child Language, 42(06), 1312–1336.

Guajardo, N. R., & Watson, A. C. (2002). Narrative discourse and Theory of Mind

development. The Journal of Genetic Psychology, 163(3), 305–325.

Guidetti, M. (2000). Pragmatic study of agreement and refusal messages in young French

children. Journal of Pragmatics, 32(5), 569–582.

Guidetti, M. (2002). The emergence of pragmatics: Forms and functions of conventional

gestures in young French children. First Language, 22(3), 265–285.

Guidetti, M. (2005). Yes or no? How young French children combine gestures and speech to

agree and refuse. Journal of Child Language, 32(4), 911–924.

Hale, C. M., & Tager-Flusberg, H. (2003). The influence of language on theory of mind: A

training study. Developmental Science, 6(3), 346–359.

Hancock, J. T., Dunham, P. J., & Purdy, K. (2000). Children’s comprehension of critical and

complimentary forms of verbal irony. Journal of Cognition and Development, 1(2), 227–

248.

Happé, F. (1993). Communicative competence and theory of mind in autism: A test of relevance

theory. Cognition, 48(2), 101–119.

Happé, F. (1994). An advanced test of theory of mind: Understanding of story characters’

thoughts and feelings by able autistic, mentally handicapped, and normal children and

adults. Journal of Autism and Developmental Disorders, 24(2), 129–154.

Happé, F. (1995). The role of age and verbal ability in the theory of mind task performance of

subjects with autism. Child Development, 66(3), 843–855.

Harrigan, J. A. (1984). The effects of task order on children’s identification of facial

expressions. Motivation and Emotion, 8(2), 157–169.

Harris, M., & Pexman, P. M. (2003). Children’s perceptions of the social functions of verbal

irony. Discourse Processes, 36(3), 147–165.

Harris, P. L. (2005). Conversation, pretense, and theory of Mind. In J. W. Astington & J. A.

Baird (Eds.), Why Language Matters for Theory of Mind (pp. 70–83). New York: Oxford

University Press.

Harwood, M. D., & Farrar, M. J. (2006). Conflicting emotions: The connection between

affective perspective taking and theory of mind. British Journal of Developmental

Psychology, 24(2), 401–418.

Mariia Pronina

PhD Research Plan

77

Hauf, P. (2007). Infants’ perception and production of intentional actions. Progress in Brain

Research, 164, 285–301.

He, Z., Bolz, M., & Baillargeon, R. (2012). 2.5-year-olds succeed at a verbal anticipatory-

looking false-belief task. British Journal of Developmental Psychology, 30(1), 14–29.

Heyes, C. (2014). False belief in infancy: a fresh look. Developmental Science, 17(5), 647–659.

Heyes, C., & Frith, C. D. (2014). The cultural evolution of mind reading. Science, 344(6190),

1243091.

Hoicka, E., & Gattis, M. (2008). Do the wrong thing: How toddlers tell a joke from a mistake.

Cognitive Development, 23(1), 180–190.

Hoicka, E., & Wang, S. (2011). Fifteen-month-old infants match vocal cues to intentional

actions. Journal of Cognition and Development, 12(3), 299–314.

Hübscher, I. (2018). Preschoolers ’ pragmatic development : How prosody and gesture lend a

helping hand (PhD thesis). Universitat Pompeu Fabra.

Hübscher, I., Esteve-Gibert, N., Igualada, A., & Prieto, P. (2017). Intonation and gesture as

bootstrapping devices in speaker uncertainty. First Language, 37(1), 24–41.

Hübscher, I., & Prieto, P. (submitted). Gestural and prosodic development act as sister systems

and jointly pave the way for children’s sociopragmatic development. Frontiers in

Psychology.

Hübscher, I., Prieto, P., & Wagner, L. (accepted). Three-year-olds infer polite stance from

intonation and facial cues. Journal of Politeness Research Language Behaviour and

Culture.

Hübscher, I., Vincze, L., & Prieto, P. (under review). Children’s signaling of their uncertain

knowledge state: prosody, face and body cues come first. Journal of Language Learning

and Development.

Igualada, A., Bosch, L., & Prieto, P. (2015). Language development at 18 months is related to

multimodal communicative strategies at 12 months. Infant Behavior and Development, 39,

42–52.

Igualada, A., Esteve-Gibert, N., & Prieto, P. (2017). Beat gestures improve word recall in 3- to

5-year-old children. Journal of Experimental Child Psychology, 156, 99–112.

Ingersoll, B. (2008). The social role of imitation in autism: Implications for the treatment of

imitation deficits. Infants and Young Children, 21(2), 107–119.

Ingersoll, B. (2012). Brief report: Effect of a focused imitation intervention on social

functioning in children with autism. Journal of Autism and Developmental Disorders,

42(8), 1768–1773.

Ingersoll, B., & Lalonde, K. (2010). The impact of object and gesture imitation training on

language use in children with Autism Spectrum Disorder. Journal of Speech Language

and Hearing Research, 53(4), 1040–1051.

Ionescu, T., & Vasc, D. (2014). Embodied cognition: challenges for psychology and education.

Procedia - Social and Behavioral Sciences, 128, 275–280.

Ito, K. (2018). Gradual development of focus prosody and affect prosody comprehension. In P.

Prieto & N. Esteve-Gibert (Eds.), The Development of Prosody in First Language

Acquisition (Vol. 23, pp. 247–270). Amsterdam: John Benjamins.

Mariia Pronina

PhD Research Plan

78

Ito, K., Bibyk, S. A., Wagner, L., & Speer, S. R. (2014). Interpretation of contrastive pitch

accent in six- to eleven-year-old English-speaking children (and adults). Journal of Child

Language, 41(1), 82–108.

Ito, K., Jincho, N., Minai, U., Yamane, N., & Mazuka, R. (2012). Intonation facilitates contrast

resolution: Evidence from Japanese adults and 6-year olds. Journal of Memory and

Language, 66(1), 265–284.

Iverson, J. M., & Goldin-Meadow, S. (2005). Gesture Paves the Way for Language

Development. Psychological Science, 16, 368–371.

Izard, C. E. (1971). The face of emotion (New York:). Appelton Century Crofts.

Izard, C. E., Haskins, F. W., Schultz, D., Trentacosta, C. J., & King, K. A. (2003). Emotion

matching task. Newark, DE: University of Delaware.

Jenkins, J. M., & Astington, J. W. (1996). Cognitive factors and family structure associated

with theory of mind development in young children. Developmental Psychology, 32(1),

70–78.

Johnson, M. H., & Morton, J. (1991). Biology and cognitive development: The case of face

recognition. Oxford, UK: Blackwell Scientific Publications.

Jones, S. S. (2007). Imitation in infancy the development of mimicry. Psychological Science,

18(7), 593–599.

Kallay, J., & Redford, M. A. (2016). A longitudinal study of children’s intonation in narrative

speech. In Proceedings of the 17th Annual Conference of the International Speech

Communication Association (pp. 1079–1083). San Francisco, CA.

Katsos, N., & Bishop, D. V. M. (2011). Pragmatic tolerance: Implications for the acquisition of

informativeness and implicature. Cognition, 120(1), 67–81.

Kettner, V. A., & Carpendale, J. I. M. (2013). Developing gestures for no and yes : Head

shaking and nodding in infancy. Gesture, 13(2), 193–209.

Khu, M., Chambers, C. G., & Graham, S. A. (2018). When you’re happy and I know it: four-

year-olds’ emotional perspective taking during online language comprehension. Child

Development, 89(6), 2264–2281.

Kiefer, M., & Trumpp, N. M. (2012). Embodiment theory and education: The foundations of

cognition in perception and action. Trends in Neuroscience and Education, 1(1), 15–20.

Kim, S., Paulus, M., Sodian, B., & Proust, J. (2016). Young children’s sensitivity to their own

ignorance in informing others. PLoS ONE, 11(3), e0152595.

Klieve, S. (1998). Perception of prosodic features by children with cochlear implants: is it

sufficient for understanding meaning differences in language? Unpublished master’s

thesis. University of Melbourne.

Knudsen, B., & Liszkowski, U. (2012a). 18-month-olds predict specific action mistakes

through attribution of false belief, not ignorance, and intervene accordingly. Infancy,

17(6), 672–691.

Knudsen, B., & Liszkowski, U. (2012b). Eighteen- and 24-month-old infants correct others in

anticipation of action mistakes. Developmental Science, 15(1), 113–122.

Koenig, M. A., & Harris, P. L. (2005). Preschoolers mistrust ignorant and inaccurate speakers.

Child Development, 76(6), 1261–1277.

Kovács, Á. M., Tauzin, T., Téglás, E., Gergely, G., & Csibra, G. (2014). Pointing as epistemic

request: 12-month-olds point to receive new information. Infancy, 19(6), 543–557.

Mariia Pronina

PhD Research Plan

79

Kovács, Á. M., Téglás, E., & Endress, A. D. (2010). The social sense: Susceptibility to others’

beliefs in human infants and adults. Science, 330(6012), 1830–1834.

Krauss, R. M. (1998). Why do we gesture when we speak? Current Directions in Psychological

Science, 7(2), 54–54.

Kuhnert, R.-L., Begeer, S., Fink, E., & de Rosnay, M. (2017). Gender-differentiated effects of

theory of mind, emotion understanding, and social preference on prosocial behavior

development: A longitudinal study. Journal of Experimental Child Psychology, 154, 13–

27.

Küntay, A. C., Nakamura, K., & Ateş-Şen, B. (2014). Crosslinguistic and crosscultural

approaches to pragmatic development. Pragmatic Development in First Language

Acquisition, 317–341.

Kurumada, C., & Clark, E. V. (2017). Pragmatic inferences in context: Learning to interpret

contrastive prosody. Journal of Child Language, 44(4), 850–880.

LaBounty, J., Wellman, H. M., Olson, S., Lagattuta, K., & Liu, D. (2008). Mothers’ and fathers’

use of internal state talk with their young children. Social Development, 17(4), 757–775.

Lai, Z., Hughes, S., & Shapiro, E. (1991). Manual for the Minnesota Tests of Affective

Processing (MNTAP). Minneapolis, MN: University of Minnesota.

Laval, V., & Bert-Erboul, A. (2005). French-speaking children’s understanding of sarcasm: The

role of intonation and context. Journal of Speech Language and Hearing Research, 48(3),

610–620.

Lecce, S., Ronchi, L., Del Sette, P., Bischetti, L., & Bambini, V. (2018). Interpreting physical

and mental metaphors: Is Theory of Mind associated with pragmatics in middle childhood?

Journal of Child Language, Nov 16, 1–15.

Leech, G. N. (1983). Principles of Pragmatics. London: Longman.

Leekam, S. R. (1991). Jokes and lies: Children’s understanding of intentional falsehood. In A.

Whiten (Ed.), Natural theories of mind: Evolution, development and simulation of

everyday mindreading (pp. 159–174). Cambridge, MA, US: Basil Blackwell.

Lemche, E., Kreppner, J. M., Joraschky, P., & Klann-Delius, G. (2007). Attachment

organization and the early development of internal state language: A longitudinal

perspective. International Journal of Behavioral Development, 31(3), 252–262.

Lieven, E., & Stoll, S. (2013). Early communicative development in two cultures: A

comparison of the communicative environments of children from two cultures. Human

Development, 56(3), 178–206.

Liszkowski, U. (2005). Human twelve-month-olds point cooperatively to share interest with

and helpfully provide information for a communicative partner. Gesture, 5(1–2), 135–154.

Liszkowski, U. (2014). Two sources of meaning in infant communication: preceding action

contexts and act-accompanying characteristics. Philosophical Transactions of the Royal

Society of London. Series B, Biological Sciences, 369(1651), 20130294.

Liszkowski, U., Carpenter, M., & Tomasello, M. (2008). Twelve-month-olds communicate

helpfully and appropriately for knowledgeable and ignorant partners. Cognition, 108(3),

732–739.

Llanes-Coromina, J., Vilà-Giménez, I., Kushch, O., Borràs-Comes, J., & Prieto, P. (2018). Beat

gestures help preschoolers recall and comprehend discourse information. Journal of

Experimental Child Psychology, 172, 168–188.

Mariia Pronina

PhD Research Plan

80

Lohmann, H., & Tomasello, M. (2003). The role of language in the development of false belief

understanding: A training study. Child Development, 74(4), 1130–1144.

Lohmann, H., Tomasello, M., & Meyer, S. (2005). Linguistic communication and social

understanding. In J. W. Astington & J. A. Baird (Eds.), Why Language Matters for Theory

of Mind (pp. 245–265). New York: Oxford University Press.

Ma, W., Golinkoff, R. M., Houston, D. M., & Hirsh-Pasek, K. (2011). Word learning in infant-

and adult-directed speech. Language Learning and Development, 7(3), 185–201.

Macoun, A., & Sweller, N. (2016). Listening and watching: The effects of observing gesture

on preschoolers’ narrative comprehension. Cognitive Development, 40, 68–81.

Maouene, J., Sethuraman, N., Laakso, A., & Maouene, M. (2011). The body region correlates

of concret and abstract verbs in early child language. Cognition, Brain, Behavior: An

Interdisciplinary Journal, 15(Special Issue: Embodiment and Development), 449–481.

Markova, G., & Legerstee, M. (2008). How infants come to learn about the minds of others.

Zero to Three, 28(5), 26–31.

Masur, E. F., & Eichorst, D. L. (2002). Infants’ spontaneous imitation of novel versus familiar

words: Relations to observational and maternal report measures of their lexicons. Merrill-

Palmer Quarterly, 48(4), 405–426.

McCabe, A., & Peterson, C. (1991). Developing narrative structure. Hillsdale, NJ, US:

Lawrence Erlbaum Associates, Inc.

McGuigan, N., Makinson, J., & Whiten, A. (2011). From over-imitation to super-copying:

Adults imitate causally irrelevant aspects of tool use with higher fidelity than young

children. British Journal of Psychology, 102(1), 1–18.

McNeil, N. M., Alibali, M. W., & Evans, J. L. (2000). The role of gesture in children’s

comprehension of spoken language: Now they need it, now they don’t. Journal of

Nonverbal Behavior, 24(2), 131–150.

McNeill, D. (2000). Language and gesture. Cambridge University Press.

Meltzoff, A. N. (2002). Imitation as a mechanism of social cognition: Origins of empathy,

theory of mind, and the representation of action. In U. Goswami (Ed.), Blackwell handbook

of childhood cognitive development (pp. 6–25). Malden: Blackwell Publishing.

Meltzoff, A. N., & Moore, M. K. (1989). Imitation in newborn infants: Exploring the range of

gestures imitated and the underlying mechanisms. Developmental Psychology, 25(6), 954–

962.

Meltzoff, A. N., & Moore, M. K. (1994). Imitation, memory, and the representation of persons.

Infant Behavior and Development, 17(1), 83–99.

Milligan, K., Astington, J. W., & Dack, L. A. (2007). Language and Theory of Mind: Meta-

analysis of the relation between language ability and false-belief understanding. Child

Development, 78(2), 622–646.

Moore, C., Bryant, D., & Furrow, D. (1989). Mental terms and the development of certainty.

Furrow Source: Child Development, 60(1), 167–171.

Moore, C., Pure, K., & Furrow, D. (1990). Children’s understanding of the modal expression

of speakercertainty and uncertainty and its relation to the development of a

representational Theory of Mind. Child Development, 61(3), 722–730.

Mariia Pronina

PhD Research Plan

81

Morgan, J., & Demuth, K. (1996). Signal to syntax: An overview. In J. Morgan & K. Demuth

(Eds.), Signal to syntax: Bootstrapping from speech to grammar in early acquisition (pp.

1–22). Mahwah, N.J.: Lawrence Erlbaum Associates.

Morgenstern, A., Beaupoil-Hourdel, P., Blondel, M., & Dominique, B. (2016). A Multimodal

Approach to the Development of Negation in Signed and Spoken Languages: Four Case

Studies. In L. Ortega, A. E. Tyler, H. I. Park, & M. Uno (Eds.), The usage-based study of

language learning and multilingualism (pp. 15–36). Washington: Georgetown University

Press.

Morton, J. B., & Trehub, S. E. (2001). Children’s understanding of emotion in speech. Child

Development, 72(3), 834–843.

Murillo, E., & Capilla, A. (2016). Properties of vocalization- and gesture-combinations in the

transition to first words. Journal of Child Language, 43(4), 890–913.

Murillo, E., Ortega, C., Otones, A., Rujas, I., & Casla, M. (2018). Changes in the Synchrony of

Multimodal Communication in Early Language Development. Journal of Speech

Language and Hearing Research, 61(9), 2235–2245.

Myung, J. Y., Blumstein, S. E., & Sedivy, J. C. (2006). Playing on the typewriter, typing on the

piano: Manipulation knowledge of objects. Cognition, 98(3), 223–243.

Nazzi, T., Bertoncini, J., & Mehler, J. (1998). Language discrimination by newborns: toward

an understanding of the role of rhythm. Journal of Experimental Psychology: Human

Perception and Performance, 24(3), 756–766.

Nelson, N. L., & Russell, J. A. (2011). Preschoolers’ use of dynamic facial, bodily, and vocal

cues to emotion. Journal of Experimental Child Psychology, 110(1), 52–61.

Nielsen, M., Simcock, G., & Jenkins, L. (2008). The effect of social engagement on 24-month-

olds’ imitation from live and televised models. Developmental Science, 11(5), 722–731.

Nilsen, E. S., Glenwright, M., & Huyder, V. (2011). Children and adults understand that verbal

irony interpretation depends on listener knowledge. Journal of Cognition and

Development, 12(3), 374–409.

Ninio, A. (2014). Pragmatic Development. In P. Brooks & V. Kempe (Eds.), Encyclopedia of

Language Development (pp. 472–479). Los Angeles, CA: Sage Publications.

Ninio, A., & Wheeler, P. (1984). A manual for classifying verbal communicative acts in

mother- infant interaction. Working Papers in Developmental Psychology, No 1,

Jerusalem: The Martin and Vivian Levin Center, Hebrew University. Reprinted as

Transcript Analysis, 3, 1–82.

Noveck, I. A. (2001). When children are more logical than adults: Experimental investigations

of scalar implicature. Cognition, 78(2), 165–188.

Nowicki, S., & Duke, M. P. (1994). Individual differences in the nonverbal communication of

affect: The diagnostic analysis of nonverbal accuracy scale. Journal of Nonverbal

Behavior, 18(1), 9–35.

Nwokah, E. (2014). Dyadic interaction and early communicative development. In P. Brooks &

V. Kempe (Eds.), Encyclopedia of Language Development (pp. 168–170). Los Angeles,

CA: Sage Publications.

Oller, D. K., Buder, E. H., Ramsdell, H. L., Warlaumont, A. S., Chorna, L., & Bakeman, R.

(2013). Functional flexibility of infant vocalization and the emergence of language.

Proceedings of the National Academy of Sciences, 110(16), 6318–6323.

Mariia Pronina

PhD Research Plan

82

Olson, D. R. (1988). On the origins of beliefs and other intentional states in children.

Developing Theories of Mind, 6, 414–426.

Oniski, K. K., & Baillargeon, R. (2005). Do 15-month-old infants understand false beliefs?

Science, 308(5719), 255–258.

Ornaghi, V., Brockmeier, J., & Grazzani, I. G. (2011). The role of language games in children’s

understanding of mental states: A training study. Journal of Cognition and Development,

12(2), 239–259.

Ornaghi, V., Orlandi, I., & Perego, E. (2007). The adventures of Jack and Theo. Department of

Human Sciences, University of Milano Bicocca, Italy.

Ornaghi, V., Pepe, A., & Grazzani, I. G. (2016). False-belief understanding and language ability

mediate the relationship between emotion comprehension and prosocial orientation in

preschoolers. Frontiers in Psychology, 7, 1534.

Özçalişkan, Ş., Gentner, D., & Goldin-Meadow, S. (2012). Do iconic gestures pave the way for

children’s early verbs? Applied Psycholinguistics, 35(6), 1143–1162.

Özçalişkan, Ş., & Goldin-Meadow, S. (2005). Gesture is at the cutting edge of early language

development. Cognition, 96(3), B101–B113.

Papaeliou, C. F., Minadakis, G., & Cavouras, D. (2002). Acoustic patterns of infant

vocalizations expressing emotions and communicative functions. Journal of Speech

Language and Hearing Research, 45(2), 311.

Papaeliou, C. F., & Trevarthen, C. (2006, February 22). Prelinguistic pitch patterns expressing

“communication” and “apprehension.” Journal of Child Language. Cambridge University

Press.

Peppe, S., Cleland, J., Gibbon, F., O’Hare, A., & Castilla, P. M. (2011). Expressive prosody in

children with autism spectrum conditions. Journal of Neurolinguistics, 24(1), 41–53.

Peppé, S., & McCann, J. (2003). Assessing intonation and prosody in children with atypical

language development: the PEPS-C test and the revised version. Clinical Linguistics &

Phonetics, 17(4–5), 345–354.

Perner, J. (1991). Understanding the representational mind. Cambridge, MA: The MIT Press.

Perner, J., Leekam, S. R., & Wimmer, H. (1987). Three-year-olds’ difficulty with false belief:

The case for a conceptual deficit. British Journal of Developmental Psychology, 5(2), 125–

137.

Perner, J., & Roessler, J. (2012). From infants’ to children’s appreciation of belief. Trends in

Cognitive Sciences, 16(10), 519–525.

Perner, J., & Wimmer, H. (1985). “John thinks that Mary thinks that...” attribution of second-

order beliefs by 5- to 10-year-old children. Journal of Experimental Child Psychology,

39(3), 437–471.

Phelps-Terasaki, D., & Phelps-Gunn, T. (2007). The Test of Pragmatic Language (2nd ed.).

Austin, TX: Pro-Ed.

Phillips, A. T., Wellman, H. M., & Spelke, E. S. (2002). Infants’ ability to connect gaze and

emotional expression to intentional action. Cognition, 85(1), 53–78.

Pons, F., Harris, P. L., & de Rosnay, M. (2004). Emotion comprehension between 3 and 11

years: Developmental periods and hierarchical organization. European Journal of

Developmental Psychology, 1(2), 127–152.

Mariia Pronina

PhD Research Plan

83

Pouscoulous, N., Noveck, I. A., Politzer, G., & Bastide, A. (2007). A developmental

investigation of processing costs in implicature production. Language Acquisition, 14(4),

347–375.

Premack, D., & Woodruff, G. (1978). Does the chimpanzee have a theory of mind? Behavioral

and Brain Sciences, 1(4), 515–526.

Prieto, P., & Esteve-Gibert, N. (2018). The development of prosody in first language

acquisition. Amsterdam: John Benjamins.

Prieto, P., Estrella, A., Thorson, J., & Vanrell, M. D. M. (2012). Is prosodic development

correlated with grammatical and lexical development? Evidence from emerging intonation

in Catalan and Spanish. Journal of Child Language, 39(1), 1–37.

Pronina, M., Hübscher, I., Vilà-Giménez, I., & Prieto, P. (submitted). A new tool to assess

pragmatic prosody in children: Evidence from 3- to 4-year-olds.

Pulvermüller, F., Hauk, O., Nikulin, V. V., & Ilmoniemi, R. J. (2005). Functional links between

motor and language systems. European Journal of Neuroscience, 21(3), 793–797.

Quam, C., & Swingley, D. (2012). Development in children’s interpretation of pitch cues to

emotions. Child Development, 83(1), 236–250.

Rakoczy, H., & Tomasello, M. (2009). Done wrong or said wrong? Young children understand

the normative directions of fit of different speech acts. Cognition, 113(2), 205–212.

Renfrew, C. (1997). Bus story test—A test of narrative speech. Bicester: Winslow (4th ed.).

Bicester, England: Winslow Press.

Rowe, M. L., & Goldin-Meadow, S. (2009). Early gesture selectively predicts later language

learning. Developmental Science, 12(1), 182–187.

Ruffman, T. (2000). Nonverbal theory of mind: Is it important, is it implicit, is it simulation, is

it relevant to autism. In J. W. Astington (Ed.), Minds in the making: Essays in honor of

David R. Olson (pp. 250–266). Malden, MA: Blackwell.

Ruffman, T., Slade, L., & Crowe, E. (2002). The relation between children’s and mothers’

mental state language and theory-of-mind understanding. Child Development, 73(3), 734–

751.

Ruffman, T., Slade, L., Rowlandson, K., Rumsey, C., & Garnham, A. (2003). How language

relates to belief, desire, and emotion understanding. Cognitive Development, 18(2), 139–

158.

Rusiewicz, H. L., & Esteve-Gibert, N. (2018). Set in time: Prosody-Gesture coordination. In P.

Prieto & N. Esteve-Gibert (Eds.), The Development of Prosody in First Language

Acquisition (pp. 103–124). Amsterdam: John Benjamins.

Sabbagh, M. A., & Baldwin, D. A. (2001). Learning words from knowledgeable versus ignorant

speakers: Links between preschoolers’ Theory of Mind and semantic development. Child

Development, 72(4), 1054–1070.

Saborit Mallol, C., Julian Marza, J. P., & Navarro Lizandra, J. L. (2005). ELI : l’avaluacio del

llenguatge infantil. Castello de la Plana: Publicacions de la Universitat Jaume I.

Sakkalou, E., & Gattis, M. (2012). Infants infer intentions from prosody. Cognitive

Development, 27(1), 1–16.

Scheiner, E., Hammerschmidt, K., Jürgens, U., & Zwirner, P. (2002). Acoustic analyses of

developmental changes and emotional expression in the preverbal vocalizations of infants.

Journal of Voice, 16(4), 509–529.

Mariia Pronina

PhD Research Plan

84

Scott, R. M., & Baillargeon, R. (2017). Early false-belief understanding. Trends in Cognitive

Sciences, 21(4), 237–249.

Scott, R. M., He, Z., Baillargeon, R., & Cummins, D. (2012). False-belief understanding in 2.5-

year-olds: evidence from two novel verbal spontaneous-response tasks. Developmental

Science, 15(2), 181–193.

Searle, J. R. (1969). Speech acts: An essay in the philosophy of language. Cambridge:

Cambridge university press.

Sekine, K., & Kita, S. (2015). The parallel development of the form and meaning of two-handed

gestures and linguistic information packaging within a clause in narrative. Open

Linguistics, 1, 490–502.

Shatz, M. (1978). Children’s comprehension of their mothers’ question-directives. Journal of

Child Language, 5(1), 39–46.

Shochi, T., Erickson, D., Sekiyama, K., Rilliard, A., & Aubergé, V. (2009). Japanese children’s

acquisition of prosodic politeness expressions. In Proceedings of the Annual Conference

of the International Speech Communication Association (pp. 1743–1746). Brighton, Uk.

Shriberg, L. D., Kwiatkowski, J., Rasmussen, C., Lof, G. L., & Miller, J. F. (1990). The

prosody-voice screening profile. Tucson, AZ: Communication Skill Builders.

Shultz, T. R. (1974). Development of the appreciation of riddles. Child Development, 45(1),

100–105.

Siegal, M. (1999). Language and thought: The fundamental significance of conversational

awareness for cognitive development. Developmental Science, 2(1), 1–34.

Slade, L., & Ruffman, T. (2005). How language does (and does not) relate to theory of mind:

A longitudinal study of syntax, semantics, working memory and false belief. British

Journal of Developmental Psychology, 23(1), 117–141.

Slaughter, V. P., & Gopnik, A. (1996). Conceptual coherence in the child’s Theory of Mind:

Training children to understand belief. Child Development, 67(6), 2967–2988.

Smith, M., Apperly, I., & White, V. (2003). False belief reasoning and the acquisition of relative

clause sentences. Child Development, 74(6), 1709–1719.

Snedeker, J., & Trueswell, J. C. (2004). The developing constraints on parsing decisions: The

role of lexical-biases and referential scenes in child and adult sentence processing.

Cognitive Psychology, 49(3), 238–299.

Snow, C. E. (1977). The development of conversation between mothers and babies. Journal of

Child Language, 4(01), 1–22.

Snow, D. (2017). Gesture and intonation are “sister systems” of infant communication:

Evidence from regression patterns of language development. Language Sciences, 59, 180–

191.

Sodian, B., Licata, M., Kristen-Antonow, S., Paulus, M., Killen, M., & Woodward, A. (2016).

Understanding of goals, beliefs, and desires predicts morally relevant Theory of Mind: A

longitudinal investigation. Child Development, 87(4), 1221–1232.

Southgate, V., Chevallier, C., & Csibra, G. (2010). Seventeen-month-olds appeal to false beliefs

to interpret others’ referential communication. Developmental Science, 13(6), 907–912.

Speer, S. R., & Ito, K. (2009). Prosody in first language acquisition - Acquiring intonation as a

tool to organize information in conversation. Linguistics and Language Compass, 3(1),

90–110.

Mariia Pronina

PhD Research Plan

85

Sperber, D., & Wilson, D. (2002). Pragmatics, modularity and mind-reading. Mind and

Language, 17(1), 3–33.

Sroufe, L. A., & Wunsch, J. P. (1972). The Development of Laughter in the First Year of Life.

Child Development, 43(4), 1326–1344.

Strand, P. S., Downs, A., & Barbosa-Leiker, C. (2016). Does facial expression recognition

provide a toehold for the development of emotion understanding? Developmental

Psychology, 52(8), 1182–1191.

Sullivan, M. W., & Lewis, M. (2003). Emotional expressions of young infants and children: A

practitioner’s primer. Infants & Young Children, 16(2), 120–142.

Surian, L., & Geraci, A. (2012). Where will the triangle look for it? Attributing false beliefs to

a geometric shape at 17 months. British Journal of Developmental Psychology, 30(1), 30–

44.

Tager-Flusberg, H., & Joseph, R. M. (2012). How language facilitates the acquisition of false-

belief understanding in children with autism. In J. W. Astington & J. A. Baird (Eds.), Why

Language Matters for Theory of Mind (pp. 298–318). New York: Oxford University Press.

Taumoepeau, M., & Ruffman, T. (2006). Mother and infant talk about mental states relates to

desire language and emotion understanding. Child Development, 77(2), 465–481.

Taumoepeau, M., & Ruffman, T. (2008). Stepping stones to others’ minds: Maternal talk relates

to child mental state language and emotion understanding at 15, 24, and 33 months. Child

Development, 79(2), 284–302.

Tettamanti, M., Buccino, G., Saccuman, M. C., Gallese, V., Danna, M., Scifo, P., … Perani, D.

(2005). Listening to action-related sentences activates fronto-parietal motor circuits.

Journal of Cognitive Neuroscience, 17(2), 273–281.

Thomas, J. A. (1981). Pragmatic failure (M.A. dissertation). University of Lancaster.

Thomas, J. A. (1983). Cross-cultural pragmatic failure. Applied Linguistics, 4(2), 91–112.

Thorson, J. (2018). The role of prosody in early word learning: Behavioral evidence. In P. Prieto

& N. Esteve-Gibert (Eds.), The Development of Prosody in First Language Acquisition

(pp. 59–77). Amsterdam: John Benjamins.

Thorson, J., & Morgan, G. (2014). Directing toddler attention: Intonation and information

structure. In W. Orman & M. J. Valleau (Eds.), Proceedings of 38th Annual Boston

University Conference on Language Develooment. Somerville, MA: Cascadilla Press.

Toda, S., & Fogel, A. (1993). Infant response to the still-face situation at 3 and 6 months.

Developmental Psychology, 29(3), 532–538.

Tomasello, M. (1995). Joint attention as social cognition. In C. Moore & P. J. Dunham (Eds.),

Joint attention: Its origins and role in development (pp. 103–130). Hillsdale, NJ, US:

Lawrence Erlbaum Associates.

Tomasello, M. (2000). Do young children have adult syntactic competence? Cognition, 74(3),

209–253.

Tomasello, M. (2003). Constructing a language: A usage-based theory of language acquisition.

Cambridge, MA: Harvard University Press.

Tomasello, M. (2009). The usage-based theory of language acquisition. In E. L. Bavin (Ed.),

The Cambridge handbook of child language (pp. 69–87). Cambridge: Cambridge

University Press.

Mariia Pronina

PhD Research Plan

86

Tomasello, M., Carpenter, M., Call, J., Behne, T., & Moll, H. (2005). Understanding and

sharing intentions: The origins of cultural cognition. Behavioral and Brain Sciences,

28(5), 675–735.

Tomasello, M., Carpenter, M., & Liszkowski, U. (2007). A new look at infant pointing. Child

Development, 78(3), 705–722.

Tomasello, M., Striano, T., & Rochat, P. (1999). Do young children use objects as symbols?

British Journal of Developmental Psychology, 17(4), 563–584.

Trevarthen, C. (1979). Communication and cooperation in early infancy: A description of

primary intersubjectivity. Before Speech: The Beginning of Interpersonal Communication,

1, 530–571.

Tucker, M., & Ellis, R. (1998). On the relations between seen objects and components of

potential actions. Journal of Experimental Psychology: Human Perception and

Performance, 24(3), 830–846.

Vaillant-Molina, M., Bahrick, L. E., & Flom, R. (2013). Young infants match facial and vocal

emotional expressions of other infants. Infancy, 18, E97–E111.

Vilà-Giménez, I., Igualada, A., & Prieto, P. (2018). Observing storytellers who use rhythmic

beat gestures improves children’s narrative discourse performance. Developmental

Psychology, Advance online publication: http://dx.doi.org/10.1.

Vilà-Giménez, I., & Prieto, P. (2018). Encouraging children to produce rhythmic beat gestures

leads to better narrative discourse performances. In Proceedings from the 9th International

Conference on Speech Prosody. Poznań, Poland.

Volterra, V., & Antinucci, F. (1979). Negation in Child Language: A Pragmatic Study. In E.

Ochs & B. Schieffelin (Eds.), Developmental Pragmatics (pp. 281–303). New York:

Academic Press.

Warneken, F., Chen, F., & Tomasello, M. (2006). Cooperative activities in young children and

chimpanzees. Child Development, 77(3), 640–663.

Weimer, A. A., Sallquist, J., & Bolnick, R. R. (2012). Young children’s emotion comprehension

and Theory of Mind understanding. Early Education & Development, 23(3), 280–301.

Wellman, H. M. (2014). Making minds: How theory of mind develops. Oxford University Press.

Wellman, H. M., Cross, D., & Watson, J. (2001). Meta-analysis of theory-of-mind

development: the truth about false belief. Child Development, 72(3), 655–684.

Wellman, H. M., Phillips, A. T., & Rodriguez, T. (2000). Young children’s understanding of

perception, desire, and emotion. Child Development, 71(4), 895–912.

Werker, J. F., Pons, F., Dietrich, C., Kajikawa, S., Fais, L., & Amano, S. (2007). Infant-directed

speech supports phonetic category learning in English and Japanese. Cognition, 103(1),

147–162.

Westra, E., & Carruthers, P. (2017). Pragmatic development explains the Theory-of-Mind

Scale.

Wiig, E. H., Semel, E., & Secord, W. A. (2013). Clinical Evaluation of Language

Fundamentals–Fifth Edition (CELF-5). Bloomington, MN: NCS Pearson.

Wilson, R. A., Foglia, L., & Wilson, R. A. (2017). Embodied cognition. In E. N. Zalta (Ed.),

The Stanford Encyclopedia of Philosophy (pp. 319–325).

Mariia Pronina

PhD Research Plan

87

Wimmer, H., & Perner, J. (1983). Beliefs about beliefs: Representation and constraining

function of wrong beliefs in young children’s understanding of deception. Cognition,

13(1), 103–128.

Winner, E., & Leekam, S. (1991). Distinguishing irony from deception: Understanding the

speaker’s second-order intention. Britid Journal of Developmental l’ycbology, 9(2), 257–

270.

Yott, J., & Poulin-Dubois, D. (2012). Breaking the rules: Do infants have a true understanding

of false belief? British Journal of Developmental Psychology, 30(1), 156–171.

Mariia Pronina

PhD Research Plan

88

APPENDIX A: The Audiovisual Pragmatic Test (APT)

Familiarization item 1

This girl has a sister. And

what about you? Do you

have any sisters or

brothers?

Familiarization item 2

Imagine that your

grandmother surprisingly

visits you at home. She

knocks on the door and

you open it. What would

you say?

Test item 1

Imagine that you make a

new friend, you greet

each other shaking each

other’s hands. You want

to say to him “Hello!”

and tell him your name.

How would you say it?

Test item 2

Imagine that today you

have gone to the cinema

with your family, and

your little brother has

started talking loudly in

the middle of the movie in

a disrespectful manner.

What would you say to

make him be quiet?

Test item 3

Imagine that you mother

leaves for work. What

would you say as she is

walking out the door?

Test item 4

Imagine that your aunt is

cutting a cake. You are

very hungry and want to

ask her for a piece of

cake. What would you

say?

Test item 5

Imagine that you are

eating a piece of cake and

when you finish, your

aunt asks you “Do you

want more?”. What

would you say?

Test item 6

Imagine that you come

home and when you enter

the door you smell a pie,

it smells delicious. You

see your mother in the

kitchen and you know

that she just made your

favorite pie. What would

you say?

Test item 7

Imagine that your friend

has a muffin. You want a

little bit of it. What would

you say to your friend?

Test item 8

Imagine that your friend

gives you a half of his

muffin. What would you

say?

Test item 9

Imagine that you enter the

classroom in the

morning. What would

you say to your teacher?

Test item 10

Imagine that your friends

are playing with a ball.

You want to play with

them. What would you

say to your friends?

Mariia Pronina

PhD Research Plan

89

Test item 11

Imagine that you want to

watch TV and you know

that normally your

parents do not allow you

to. How would you ask

permission from your

parents?

Test item 12

Imagine that you are

having lunch with your

friend’s parents. Your

friend’s mother offers

desert to you but you are

already full. What would

you say?

Test item 13

Imagine that today you

need to carry a lot of

things to school. What

would you say to your

brother if you want him to

help you?

Test item 14

Imagine that you and

your mother are having

breakfast together and

you spill milk all over the

table. What would you

say to your mother?

Test item 15

Imagine that one day your

mother comes with a very

big bag. You are very

interested in the bag.

What would you say to

your mother?

Test item 16

Imagine that your friend

participated in a poetry

competition and just

found out that he has lost.

What would you say to

him?

Test item 17

You go to a shop to buy a

bottle of water. When you

are there, a man in the

shop asks you what you

want to buy. What would

you say/answer?

Test item 18

You have come to visit

Joan with one of your

friends. You entered the

room and you

immediately saw that his

favorite toy was near the

door. A moment later

your friend tells you that

he has lost his toy and

asks you whether you

have seen it. What would

you say?

Test item 19

Your teacher has got her

hair cut and you think that

it suits her very well.

How would you say it?

Test item 20

Imagine that you are in

your grandmother’s

house and she is a bit

deaf. You just told her

that you want a snack

because you are very

hungry but she has not

heard your well and she

asks you “Do you want to

go for a walk?”. How

would you tell her that

that’s not what you want,

you want a snack instead?

Mariia Pronina

PhD Research Plan

90

Test item 21

You have arrived at

school and your friend

Pau has not come. Maria

asks you if you know

why. Yesterday you saw

that Pau hardly kept his

eyes open and you think

that he might me ill. What

would you say to Maria?

Test item 22

Your friend just tripped

and fell down. What

would you say?

Test item 23

Imagine that there is a

new girl in your class.

You like music class very

much. One day you speak

with her and you want to

know if she is also taking

music class. How would

you ask her?

Test item 24

Imagine that you do not

like bananas. You are

having desert after lunch

and you mother gives you

one, sure that you like

them. She is very sure

about it. You want to tell

her that you do not like bananas. What would

say?

Test item 25

You and your friend Pau

come to school and you

think that you must hand

in homework that the

teacher asked you to do

but you are not sure.

What would you say to

the teacher?

Test item 26

Imagine that you have

two friends, Paula and

Marina, that did not know

each other. One day you

decide to present them to

each other. What would

you say?

Test item 27

Imagine that you enter

your friend Maria’s

house, but when you are

inside you cannot see her.

You think that she must

be in her room. Call her.

A few seconds go by and

nobody comes out. You

think that she might be

upstairs and you call her

once again.

Test item 28

Imagine that your friend

wants to invite you to

come over after school

but you also want to

invite him/her to come

over to your house. You

want to convince him/her

to come over to your

place. How would you

ask? What would you

say?

Test item 29

Imagine that your teacher

has asked you to paint

some pictures of the

fairytale “Little Red

Riding Hood” but it turns

out that she has made a

mistake and she is

showing you the fairytale

Test item 30

Imagine that your friend

Pere has participated in a

dress competition and just

found out that he has

won. What would you

say?

Mariia Pronina

PhD Research Plan

91

“Puss in Boots”. What

would you say to her?

Test item 31

Imagine that you are in a

park with your family and

your parents ask you to

watch after your little

sister. She runs and goes

far from the park. You are

worried because many

cars pass by there. What

would you say to her?

Test item 32

Imagine that somebody

tells you that your friend,

Pau, has won a swimming

competition. You cannot

believe it because you

know that Pau cannot

swim. Ask again who is

the person that won the

swimming competition.

Test item 33

Imagine that you are with

your friend Núria and it

has been a long time since

you have had lunch

together. Suddenly, you hear that she has a

growling stomach. You

want to know if she is

hungry. Ask her.

Test item 34

Imagine that you are at

some friend’s party and

some of the other guests’

brothers are also there.

You are watching a movie but the little

brothers are making too

much noise and you

cannot hear anything in

the movie. Ask them to be

quiet.

Test item 35

Imagine that your friend

tells you that she went to

a restaurant for lunch and

tells you, surely, that she

got cat for lunch instead

of rabbit. You cannot

believe her and you are

very surprised. Ask her

again what she got for

lunch.

APPENDIX B: Sample story of the training materials (Study 4)

Story: “Sara Sea-Turtle” (target terms: believing, getting upset).

One day, Theo Shark was swimming along the bottom of the sea.

Suddenly, he heard a voice calling for help and he thought: “Someone is in

trouble or in danger”. He looked around and decided to try to find that person

and help them.

Mariia Pronina

PhD Research Plan

92

He tried to figure out where the voice was coming from by swimming about

in all directions. When he came near some rocks he could hear the voice more

strongly. He went right up to the rocks and saw an opening between two of

them. He looked in and saw Sara Sea-Turtle who was crying. Theo asked her:

“What happened? Why are you crying?”.

Sara, upset and frightened, said: “I was gathering food for my little babies and

I saw some delicious shellfish in this hole. I thought that my little ones would

love them. I noticed that the hole was a bit small, but I decided to get in

anyhow and now I’m stuck inside here and can’t get out. Please help me!”.

Theo replied: “Of course, don’t worry, stay calm. I’ll go and call my friend

Jack Dolphin; he’s sure to know how to get you out of there”.

So Theo swum as fast as he could towards Jack’s house. When he got there,

he explained the situation to his friend, and Jack immediately had an idea: he

decided that they would take a very strong rope with them to pull the sea-turtle

out of the hole.

Together, Theo and Jack went back to Sara Sea-Turtle and when she saw

them she felt a lot calmer because she know that two friends would do their

best to get her out of the hole.

Theo Shark said: “We’re back, my friend Jack has brought a rope; now

you’ve got to hold it tightly between your teeth and we’ll try to pull you

out”. Sara, feeling much better now, said: “OK”. She thought to herself: “I’ve got

to try as hard as I can. I really hope it works!”.

Theo and Jack began to pull as strongly as they could, but no matter how hard

they tried, they could not get Sara out. “This isn’t working”, said Jack. “She’s

too stuck. We won’t be able to get her out on our own.”

So they decided to go and ask some other fish to help them.

“The more of us there are, the harder we will be able to pull!”, thought Jack.

After a while, Charlie Balloon Fish, Amadeus Hammerhead, Sally Eel and

even Diego Seahorse came to help too.

When Sara saw so many other fish coming to help her she felt better straight

away and tried even harder to free herself.

They all pulled on the rope as hard as they could, and after a number of

attempts, the sea-turtle finally popped out of the hole she had been stuck in.

Delighted to be free at last, she thanked Theo, Jack and all the other fish,

before rushing home to feed her babies.

From that day onwards, Sara never went through narrow openings again in

case she got stuck once more.

APPENDIX C: Script for group distribution (Study 4)

An algorithm was programmed to distribute children across the three conditions. This

algorithm measures the degree of similarity between different test scores and assigns the

participants to experimental conditions while taking into account children's class distribution

and preserving general similarity of the distributions by conditions.

Mariia Pronina

PhD Research Plan

93

The participants are allocated to a certain condition on the the basis on four test scores:

Emotion Understanding, Mental Verb Comprehension, ToM and the Pragmatic Test. The

division of participants into groups is undertaken in three steps. First, the distance between two

participants' scores is calculated. In order to do so, the cosine between four-dimentional vectors

(corresponding to the four tests) is measured. Then, the distance matrix of all class participants

is obtained. Thus, the closer the vectors are, the smaller the distantance is. Afterwards, a

minimum of all distances and its location (a vector) are calculated, which results in close vector

pairs. Finally, the algorithm stops when one of the groups contains half of the participants. The

remaining children are assigned to the other group.

APPENDIX D: Annotation system of online recoding of training sessions (Study 4)

Children's actions:

R1 – response of one child;

R2 – response of the second child;

R3 – response of the third child;

R4, R5...

R – response of children (all together);

RNeg – children don't want to answer a question;

R1Neg – a particular child doesn't want to answer a question;

R2Neg – the second child doesn't want to answer a question;

R3Neg, R4Neg...

Rm – multimodal response of children (enactment);

R1m, R2m, R3m....

Rv – verbal response to multimodal question;

R1v, R2v...

C1, C2, C3 – children's comments (a child decides to say something without been asked a question);

M – children repeat the experimenter multimodal actions.

Experimenter’s actions:

Q – experimenter asks a question to everyone;

2Q- experimenter repeats a question to everyone;

Q+ – experimenter reformulates a question to everyone;

Q1 – experimenter asks a question referring to a particular child;

Q2 – experimenter asks a question referring to another child (the second child);

Q3, Q4, Q5...

2Q1, 2Q2 – experimenter repeats a question and asks to answer a particular child;

FPos – positive feedback;

Fneg – negative feedback;

TRep – experimenter encourages children to repeat a term;

A – experimenter answers a question;

Ex – experimenter provides a verbal example;

Ex+ – experimenter provides an additional verbal example;

Exm – experimenter provides a multimodal example;

Ex+m – experimenter provides an additional multimodal example;

ME – experimenter does a multimodal action but not an example;

– experimenter invites children to stand up to enact;

– experimenter invites children to sit down to continue with the training;

End – experimenter concludes the training session and invites children to return to the class.