272
© 2011 Kathryn Cain Dickerson ALL RIGHTS RESERVED

© 2011 Kathryn Cain Dickerson ALL RIGHTS RESERVED

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

© 2011

Kathryn Cain Dickerson

ALL RIGHTS RESERVED

Multiple Memory Systems Involved in Human Probabilistic Learning

By: Kathryn Cain Dickerson

A Dissertation submitted to the Graduate School – Newark

Rutgers, The State University of New Jersey

in partial fulfillment of the requirements for the degree of

Doctor of Philosophy

Graduate Program in Behavioral and Neural Science

Written under the direction of

Professor Mauricio R. Delgado

and approved by:

Dr. Lila Davachi

Dr. Mauricio R. Delgado

Dr. Bart Krekelberg

Dr. Catherine Myers

Dr. Elizabeth Tricomi

Newark, New Jersey

October 2011

ABSTRACT OF THE DISSERTATION

Multiple Memory Systems Involved in Human Probabilistic Learning

By: Kathryn Cain Dickerson

Dissertation Director: Professor Mauricio R. Delgado

This dissertation investigated the nature of interactions between multiple

memory systems (MMS) in the human brain during probabilistic learning. How

MMS interact in the brain is highly debated in the literature, with evidence

supporting competition, cooperation, and parallel engagement observed across

several species and various experimental paradigms (e.g., Poldrack et al., 2001;

Poldrack and Rodriguez, 2004; Voermans et al., 2004; White and McDonald,

2002). In this dissertation, three functional neuroimaging experiments

investigated the relationship between the medial temporal lobes (MTL),

implicated in declarative learning, and the basal ganglia (BG), involved in

nondeclarative learning, during probabilistic learning. Specifically, tasks which

varied with respect to learning type (feedback, akin to nondeclarative learning

and observation akin to declarative learning) and cue difficulty (e.g., easy, hard)

were employed. Based on the evolutionary theory of MMS, neuroanatomical

connections between these regions, as well as connectivity with midbrain

dopaminergic centers involved in reward, memory, and reinforcement learning, it

ii

was hypothesized that rather than one specific relationship (e.g., competitive or

cooperative), MMS operate in parallel during probabilistic learning – at times

exhibiting parallel engagement, and at times interacting directly via cooperation

or competition depending on the context of the learning situation. Interactions

between memory systems were measured by the relative engagement of each

system during learning (as measured by BOLD responses), simple correlation

analyses, as well as functional and effective connectivity measures. Lastly, to

examine putative dopaminergic influences during learning, reinforcement

learning models were employed. Across all experiments, it was observed that a)

patterns of activation in MMS during learning varied depending on the learning

context; b) functional and effective connectivity existed across regions (MTL,

BG); and c) a dopaminergic learning signal correlated with activity in both the BG

and MTL. To conclude, the results of this dissertation support the hypothesis

that multiple memory systems operate in parallel during probabilistic learning in

humans. Understanding how such systems communicate during learning may

inform treatment of several neuropsychiatric disorders which affect learning and

memory in humans (e.g., Parkinson’s disease, schizophrenia, and MTL

amnesia).

iii

Dedication

To my mother, Christine Josephine Dickerson, for her constant support and love

To my father, Kenneth Gerard Dickerson, for his quiet and steady guidance

I would not be here without the two of you – I am eternally grateful for all you

have given me

&

To my grandparents, Dorothea Marie and Raymond James Cain – thank you for

teaching me the power of knowledge, hard work, and determination

You are deeply loved and missed

iv

Acknowledgments

There are many people who I would like to thank for their support and assistance

on my journey towards achieving my graduate degree. First I would like to thank

my advisor, Dr. Mauricio Delgado. He has been an excellent mentor and I am so

happy to have had the opportunity and pleasure of working with him. He has

taught me a great deal and I am very appreciative of his kindness and patience

over the years. I would also like to thank all the members of my dissertation

committee, both past and present, Dr. Elizabeth Tricomi, Dr. Bart Krekelberg, Dr.

Mark Gluck, Dr. Catherine Myers, and Dr. Lila Davachi. They have all been

extremely helpful and thoughtful in guiding me through my dissertation

experience. I am also very grateful for the advice and mentoring I received at the

University of Rochester, especially from Dr. Mark Mapstone, which helped me

decide to attend graduate school. My time at Rutgers would not have been the

same without my dear friends and colleagues in the Delgado lab. I am especially

thankful to Dominic Fareri, Laura Martin, Mike Niznikiewicz, Tony Porcelli, and

Swati Battacharya-Sharma with whom I began my journey as well as Kamila Sip,

Meredith Johnson, Andrea Lewis, Lauren Leotti and Anastasia Rigney, with

whom I am ending it. It has been such a joy and a pleasure knowing you all, and

I value your friendships enormously. Mike, Dominic, Tony and Laura especially

taught me so much and were wonderfully patient teachers. It was also my

pleasure to work with Victoria Lee, Martina Henderson, and Stephanie Herrera,

who were undergraduate students in the lab. Vicki especially has been such a

wonderful support, helping me with many aspects of my projects over the years.

v

I am very grateful to the Neuroscience Department, especially Dr. Ian Creese

and Dr. Steve Levison, who were the directors of the program for the majority of

my time at Rutgers, as well as the all of the faculty who taught me a great deal

about neuroscience. I am so happy I was accepted here and have very much

enjoyed being a part of the department as well as being affiliated with the

Psychology department. I am very grateful to all of my fellow neuroscience

students, especially Amber Ziegler, Kate Seip, Arielle Schmidt, Anushree Karnik,

Jessica Wright, Alex Kohl, Josh Callahan, Till Hartmann, Bengi and Tim Unal,

Julia Basso, and Liad and Alfonso Renart, for listening and providing excellent

comments on practice talks and for their general support and encouragement. I

would also like to thank the staff of the Neuroscience Department, Ann Kutyla,

Wayne Brown, and Connie Sadaka as well as Sandra Smith of the Psychology

Department for their assistance throughout the years. The staff at the UMDNJ

Advanced Imaging Center: Pat Singh, Bobby Singh, Evelyn Oliviera, and Melissa

Ortiz were most helpful during my time of data collection and I am very

appreciative of their assistance. I am also grateful to all of the students who

volunteered their time participating in my studies. I am so lucky to have the

support of my close friends Molly Reed, Lidza Kalifa, Megan Sweeny, Avinash

Reddy, Usman Khan, and my entire extended family, who have supported me

every step of the way. I am also so lucky to have the best parents and siblings –

Mom, Dad, Nate, Lilly, and Kate I truly could not have done this without you, your

love has given me so much encouragement when I needed it most. Lastly I

vi

would like to thank my partner Alon Amir for his love and support. I am so happy

that we had this journey together; it would not have been the same without you.

vii

Table of Contents

Abstract of the Dissertation ii

Dedication iv

Acknowledgments v

Table of Contents viii

List of Figures xii

List of Tables xiv

List of Appendices xvi

Chapter 1: Introduction 1

1.1 General Introduction 1

1.2 The Neuroanatomy of Multiple Memory Systems 4

1.2.1 Medial temporal lobes and declarative memory 5

1.2.2 Basal ganglia and nondeclarative memory 8

1.2.3 Anatomical circuitry of the MTL and BG 10

1.2.4 Neuroanatomical connections between memory systems 12

1.3 Interactions between Multiple Memory Systems – Functional Significance 16

1.3.1 Evidence supporting competitive interactions 17

1.3.2 Evidence supporting cooperative interactions 19

1.3.3 Evidence supporting parallel engagement 21

1.4 Description of Dissertation Experiments 25

1.5 Significance 29

viii

Chapter 2: General Materials, Methods, & Data Analyses 30

2.1 General Materials & Methods 30

2.1.1 Experimental designs 30

2.1.2 Participants 31

2.1.3 fMRI acquisition 32

2.2 General Data Analyses 33

2.2.1 General linear model 33

2.2.2 Granger causality analyses 35

2.2.3 Prediction error analyses 36

2.2.4 Cluster lever statistical threshold estimator 40

2.2.5 Sequential Bonferroni correction 41

2.2.6 Note regarding test phase neuroimaging results 42

Chapter 3: Experiment 1 43

3.1 Introduction: Background & Rationale 43

3.2 Materials & Methods 47

3.2.1 Experimental design 47

3.2.2 Data analysis 51

3.3 Results 54

3.3.1 Behavioral results 54

3.3.2 Neuroimaging results 57

3.4. Discussion 62

ix

Chapter 4: Experiment 2 69

4.1 Introduction: Background & Rationale 69

4.2 Materials & Methods 77

4.2.1 Experimental design 77

4.2.2 Data analysis 80

4.3 Results 86

4.3.1 Behavioral results 86

4.3.2 Neuroimaging results 95

4.3.3 Exploratory analyses 107

4.4 Discussion 116

Chapter 5: Experiment 3 131

5.1 Introduction: Background & Rationale 131

5.2 Materials & Methods 142

5.2.1 Experimental design 142

5.2.2 Data analysis 148

5.3 Results 153

5.3.1 Behavioral results 153

5.3.2 Neuroimaging results 158

5.4 Discussion 166

x

Chapter 6: General Discussion 176

6.1 Purpose & Summary of Experiments 176

6.2 Limitations 184

6.3 Conclusions & Future Directions 187

References 189

Figures 208

Tables 223

Appendices 236

Vita 255

xi

List of Figures

Figure 3.1 Schematic of the Task used in Experiment 1 208

Figure 3.2 Experiment 1 Behavioral Results 209

Figure 3.3 Experiment 1 Neuroimaging Results: Main Effect of Cue Difficulty 210

in the Striatum and Hippocampus

Figure 3.4 Experiment 1 Neuroimaging Results: Granger Causality Analysis 211

Functional Connectivity in the Caudate Nucleus using the Hippocampus

as the Seed Region

Figure 3.5 Experiment 1 Neuroimaging Results: Prediction Error Analysis in 212

the Hippocampus and Striatum

Figure 4.1 Schematic of the Task used in Experiment 2 213

Figure 4.2 Experiment 2 Behavioral Results 214

Figure 4.3 Experiment 2 Neuroimaging Results: Learning Phase Main Effect 215

of Cue Difficulty in the Striatum and MTL – Feedback Group

Figure 4.4 Experiment 2 Neuroimaging Results: Learning Phase Main Effect 216

of Cue Difficulty in the Striatum and MTL – Observation Group

Figure 4.5 Experiment 2 Neuroimaging Results: Update Phase Main Effect 217

of Cue Difficulty in the Striatum and MTL – Feedback and Observation

Groups

Figure 4.6 Experiment 2 Neuroimaging Results: Granger Causality Analysis 218

Effective Connectivity using the Midbrain as the Seed Region

Figure 4.7 Experiment 2 Neuroimaging Results: Prediction Error Analysis 219

Figure 5.1 Schematic of the Task used in Experiment 3 220

xii

Figure 5.2 Experiment 3 Behavioral Results 221

Figure 5.3 Experiment 3 Neuroimaging Results: ROI Analysis during the 222

Stimulus Presentation Period

xiii

List of Tables

Table 1.1 Evidence Supporting Interactions between Multiple 25

Memory Systems

Table 3.1 Experiment 1 Learning Phase ANOVA: Brain Regions Showing 223

a Main Effect of Cue Difficulty

Table 3.2 Experiment 1 Learning Phase ANOVA: Brain Regions Showing 224

a Main Effect of Learning Type

Table 3.3 Experiment 1 Test Phase Contrast of Previously Studied 225

vs. Novel

Table 4.1 Experiment 2 Learning Phase ANOVA: Brain Regions Showing 226

a Main Effect of Cue Difficulty – Feedback Group

Table 4.2 Experiment 2 Learning Phase ANOVA: Brain Regions Showing 227

a Main Effect of Cue Difficulty – Observation Group

Table 4.3 Experiment 2 Test Phase 1: Contrast of Previously Studied vs. 228

Novel – Feedback Group

Table 4.4 Experiment 2 Test Phase 1: Contrast of Previously Studied vs. 229

Novel – Observation Group

Table 4.5 Experiment 2 Update Phase ANOVA: Brain Regions Showing 230

a Main Effect of Cue Difficulty – Feedback & Observation Groups

Combined

Table 4.6 Experiment 2 Update Phase ANOVA: Brain Regions Showing 231

a Main Effect of Learning Type – Feedback & Observation Groups

Combined

xiv

Table 4.7 Experiment 2 Update Phase ANOVA: Brain Regions Showing 232

a Main Effect of Group

Table 4.8 Experiment 2 Test Phase 2: Contrast of Previously Studied vs. 233

Novel – Feedback Group

Table 4.9 Experiment 2 Test Phase 2: Contrast of Previously Studied vs. 234

Novel – Observation Group

Table 5.1 Experiment 3 Test Phase: Contrast of Previously Studied 235

vs. Novel

xv

List of Appendices

Appendix 1 – Neuroimaging Test Phase Results for Experiments 1-3 236

Appendix 2 – Experiment 3 Correlations Observed across the 238

Independent Region of Interest Analysis

Appendix 3 – Behavioral Questionnaires Administered in Experiments 1-3 241

xvi

1

Chapter One: Introduction

1.1 General Introduction

Accumulating evidence from both animal and human-based research

supports the existence of multiple memory systems that rely on distinct neural

substrates in the human brain (Sherry and Schacter, 1987; Squire, 1992b; Squire

and Zola, 1996). An important division is between a declarative memory system,

which relies on the integrity of the medial temporal lobes (MTL), and a

nondeclarative memory system, which engages many brain regions including the

basal ganglia system (BG; Sherry and Schacter, 1987; Squire, 1992b). How

these two regions interact during learning, however, remains unclear and is a

topic of debate in the literature. While some evidence points towards competitive

interactions, such that when one system is engaged it suppresses or blocks the

other (Poldrack et al., 2001; Poldrack and Packard, 2003; Foerde et al., 2006;

Lee et al., 2008), other research supports cooperative interactions (Dagher et al.,

2001; Voermans et al., 2004; Cincotta and Seger, 2007; Sadeh et al., 2011).

Moreover, a third theory posits that the MTL and BG operate in parallel and may

interact in either a competitive or a cooperative manner depending on the

learning scenario (White and McDonald, 2002; Albouy et al., 2008; Atallah et al.,

2008). Considering the conflicting evidence regarding the nature of the

interactions between these two major learning and memory systems,

understanding how they interact remains a question of great interest. The goal of

the experiments presented in this dissertation was to investigate the interactions

between multiple memory systems, specifically the basal ganglia and the medial

2

temporal lobe, during probabilistic learning using functional magnetic resonance

imaging (fMRI) of healthy human participants.

One way of approaching this debate is to consider the evolutionary theory

supporting multiple memory systems as well as neuroanatomical and functional

connectivity between the MTL and BG. It has been suggested that multiple

learning and memory systems evolved to play distinct roles in the brain (Sherry

and Schacter, 1987). Sherry and Schacter propose that natural selection may

have driven the development of multiple memory systems in the brain, when

functional incompatibility, or the inability of one memory system to solve a

problem in a particular learning environment, lead to the development of multiple

systems with distinct functions. Furthermore, it has been theorized that these

memory systems may operate in parallel, encoding unique information (White

and McDonald, 2002). Anatomical considerations state that these structures

communicate directly via a unidirectional anatomical projection from the

hippocampus, part of the MTL, to the nucleus accumbens (Kelley and Domesick,

1982) as well as the medial, ventral, rostral, and caudal aspects of the caudate

nucleus/putamen, all subcomponents of the BG (Krayniak et al., 1981; Sorensen

and Witter, 1983; Swanson and Kohler, 1986; Groenewegen et al., 1987). The

MTL and BG also communicate indirectly via interconnectivity with dopaminergic

centers in the midbrain (Swanson, 1982; Bolam et al., 2000; Poldrack and

Rodriguez, 2004). Dopamine is a neurotransmitter that has been implicated in

playing a fundamental role in learning and memory processes (Wise, 2004).

Dopaminergic neural activation is observed in the initial acquisition of reward

3

contingencies (Schultz, 1997, 2002) and is associated with changes in blood

oxygen level dependent (BOLD) signals in the human striatum during processes

such as receipt of rewards and punishments, monitoring goal outcomes, and

prediction error encoding (for review see O'Doherty et al., 2004; Balleine et al.,

2007; Delgado, 2007). Moreover, dopamine neuronal activity is linked with long-

term memory formation (Frey et al., 1990; Frey et al., 1991; Li et al., 2003),

known to recruit the hippocampus (for review see Bliss and Collingridge, 1993)

and recently midbrain reward-related activation has been linked to the

enhancement of long-term memory in the hippocampus (Wittmann et al., 2005).

Furthermore, functional coupling of the midbrain and hippocampus enhances

memory formation (Adcock et al., 2006; Shohamy et al., 2008). Based on

evidence in the literature, it is hypothesized that this functional neuroconnectivity

with dopamine plays an integral role in facilitating interactions between multiple

memory systems.

Therefore, drawing from the evolutionary theory of multiple memory

systems and functional neuroconnectivity with dopamine, it is theorized that the

MTL and BG operate in parallel during probabilistic learning with the ability to

complement each other, which may manifest itself as both cooperation and

competition. For the purpose of this dissertation, the following definitions of

parallel, cooperative, competitive interactions are employed, modified for

interpreting neuroimaging data based on White and McDonald’s theory (2002) as

well as interpretations of current neuroimaging studies in the field (Poldrack et

al., 2001; Voermans et al., 2004; Foerde et al., 2006; Doeller et al., 2008; Sadeh

4

et al., 2011). Parallel engagement is defined as the engagement of multiple

memory regions simultaneously during learning, while displaying unique patterns

of BOLD activation, suggesting that they may be encoding distinct information in

parallel. Cooperative interactions are defined as the exhibition of similar patterns

of BOLD signal in both the MTL and BG during learning as well as positive

correlations across distinct areas. Lastly, competitive interactions are defined as

the exhibition of opposing patterns of BOLD signal in both the MTL and BG

during learning and negative correlations across distinct regions.

1.2 The Neuroanatomy of Multiple Memory Systems

Over the past two decades the predominant theory regarding human

memory suggests that memory is not a unitary process, but rather consists of

multiple systems that rely on distinct neural substrates (Sherry and Schacter,

1987; Squire, 1992b; Squire and Zola, 1996). One system is referred to as

declarative memory, which encompasses episodic (memory for distinct episodes)

and semantic memory (facts and events), both of which consist of flexible

associations that may be rapidly acquired (Squire et al., 2004). Recalling one’s

college graduation day (episode) and knowing that George Washington was the

first President of the United States (fact) are examples of declarative memory. A

second system is referred to as nondeclarative memory, and is a broader

collection of many types of memory, which are more inflexible in nature and take

longer to acquire (Reber et al., 1996; Squire et al., 2004). Habit and procedural

memory, emotional memory, as well as priming and perceptual memory are all

5

types of nondeclarative memory (Squire et al., 2004). Remembering how to ride

a bicycle is an example of a nondeclarative memory.

1.2.1 Medial temporal lobes and declarative memory: Declarative and

nondeclarative memory are thought to differ not only in the types of memory they

encompass and their respective properties, but also in the neural substrates they

engage. Numerous studies, spanning rodent, primate, and human-based

research, document that declarative memory relies on the integrity of the medial

temporal lobes (see Squire, 1992a for review). A significant portion of rodent-

based research focuses on spatial maze learning tasks (Packard et al., 1989;

Packard and White, 1989; Packard and White, 1991; Packard and McGaugh,

1992, 1994). In these maze learning tasks, a rodent is trained to navigate a

maze to retrieve a food reward or escape water. One example is the radial maze

task, which has two versions termed the win-shift and the win-stay radial maze

(Packard et al., 1989). The win-shift radial maze task requires animals to visit

each arm of the maze only once to retrieve a food reward. It is theorized to

represent declarative learning because the animal must store rapidly acquired

information based on a single experience, e.g., it must remember which arms of

the maze it has and has not visited in order to obtain all of the food rewards while

not making any errors (re-entering an arm where the food has already been

obtained; White and McDonald, 2002). Lesions to the fimbria-fornix region, the

fiber bundle which arises from the subiculum and projects to the hypothalamus,

impair performance on the win-shift version of the radial maze task (Packard et

al., 1989) and a water maze paradigm (Packard and McGaugh, 1992). Because

6

lesions to the fimbria-fornix impair performance on the win-shift radial maze task,

it is thought that the MTL plays an important role in episodic-like learning in

rodents. Additionally, lesions to the CA1 and CA3 subfields of the hippocampus

produce spatial and associative memory deficits in rodents (Stubley-Weatherly et

al., 1996).

In primate-based research, evidence suggesting that the MTLs are

important for declarative learning comes from a host of discrimination related

tasks (such as pattern discrimination, delayed-nonmatching-to-sample, delayed

retention of object discrimination, and concurrent discrimination). These studies

have shown that lesions to the medial temporal lobes impair discrimination tasks

(Zola-Morgan and Squire, 1984, 1985; Zola-Morgan and Squire, 1986), but leave

motor and cognitive skill learning intact (Zola-Morgan and Squire, 1984).

Furthermore, studies using human MTL amnesic patients also support the theory

that the medial temporal lobes are necessary for declarative memory, as patients

who are missing part or all of their MTLs suffer profound declarative memory

deficits (Scoville and Milner, 1957; Penfield and Milner, 1958; Corkin et al., 1984;

Rempel-Clower et al., 1996). Lastly, fMRI and positron emission tomography

(PET) studies corroborate the theory that the MTLs are involved in encoding and

recalling declarative memories (for review see Schacter and Wagner, 1999;

Eldridge et al., 2000; Davachi and Wagner, 2002; Zeineh et al., 2003; Law et al.,

2005).

Specifically, in learning paradigms the hippocampus is believed to play a

role in encoding episodes (see Gluck et al., 2003 for review), stimulus-stimulus

7

representations (Gluck and Myers, 1993; Bunsey and Eichenbaum, 1995;

Eichenbaum and Bunsey, 1995), as well as flexibly adapting information to use in

novel contexts (Myers et al., 2003; Shohamy and Wagner, 2008). In category

learning, it has been suggested by Gluck and Myers (Gluck and Myers, 1993;

Gluck et al., 2003) that the hippocampus functions to compress overlapping

stimulus information and differentiate between distinct stimulus information.

These compressed and differentiated representations in the hippocampus are

formed over time through exposure to many trials during training, and are

subsequently communicated to cortical regions. It has also been suggested that

the connections of the MTL with the cortex are designed for rapid learning of

individual events (O'Reilly and Munakata, 2000). This type of learning is

particularly important for learning arbitrary category associations, for example in

unstructured categorization tasks which do not have an exemplar, do not involve

information integration, do not use prototype distortion, nor require learning of a

specific rule (Seger and Miller, 2010). Such unstructured categorization tasks

have been previously shown to engage the MTL (Poldrack et al., 1999; Poldrack

et al., 2001; Seger and Cincotta, 2005; Foerde et al., 2006). In non-human

primates, neurons in the hippocampus and temporal cortex have been shown to

respond to category-specific information (Hampson et al., 2004), suggesting that

category-specific information may be encoded within this region. In accord with

this theory, the hippocampus is thought to be involved in encoding episodes,

which may occur in as little as one learning instance. It is believed that the

hippocampus encodes a conjunction of many features which comprise the

8

episode, rather than encoding the individual features distinctly (Gluck et al.,

2003).

1.2.2 Basal ganglia and nondeclarative memory: In contrast,

nondeclarative memory is thought to recruit a host of distinct brain regions

including the basal ganglia (procedural or habit memory), the amygdala

(emotional memory), and the neocortex (priming and perceptual memory) (Squire

et al., 2004). For the purposes of this dissertation, the focus was on the role of

the basal ganglia in human probabilistic learning. Research using both animal

models and human studies demonstrates that the basal ganglia are involved in

several aspects of nondeclarative learning and memory. In animal models,

rodent maze studies reveal that lesions of the caudate nucleus, part of the

striatum, impair performance on a win-stay version of the radial maze task. This

version of the task requires animals to approach only certain cued arms of the

maze to retrieve a food reward, which is thought to be akin to stimulus-response

or habit learning (Packard et al., 1989; Packard and McGaugh, 1992).

Accumulating evidence suggests that the basal ganglia are also involved in many

aspects of cognitive learning (see Middleton and Strick, 2000; and Packard and

Knowlton, 2002 for review). In human based studies, recent fMRI research has

implicated this area in several other learning related functions including: habit

learning (see Yin and Knowlton, 2006 for review), skill learning (Atallah et al.,

2007), instrumental conditioning (O'Doherty et al., 2004), as well as reward

learning and decision-making (Tricomi et al., 2004; Balleine et al., 2007;

Delgado, 2007). It has become well-documented in the literature that the basal

9

ganglia are not solely a structure important for motor learning, but that they are

also critical for many types of nondeclarative memory and cognitive functioning.

The basal ganglia system has also been implicated in playing a

fundamental role in category learning in humans (Knowlton et al., 1994; Knowlton

et al., 1996; Poldrack et al., 1999; Poldrack et al., 2001; Shohamy et al., 2004b;

Shohamy et al., 2004a; Seger and Cincotta, 2005; Cincotta and Seger, 2007). It

has been suggested that the basal ganglia may be involved in encoding goal

outcomes (Tricomi and Fiez, 2008; Carter et al., 2009), error correction learning

(O'Doherty et al., 2004; O'Doherty, 2004; Abler et al., 2006; Pessiglione et al.,

2006), stimulus-response associations (Rolls, 1994; Packard and Knowlton,

2002; Ashby and Spiering, 2004), as well as mediating action selection (Seger,

2008). Work by Seger has suggested the involvement of multiple corticostriatal

loops, the visual (caudate nucleus body/tail), motor (putamen), executive (head

of caudate nucleus), and motivational (ventral striatum) loop, which play distinct

roles in category learning (2008). The author has proposed that the visual

corticostriatal loop may be involved in action selection, while the motor loop may

be involved in action selection as well as automatizing sequences. The

executive and motivational corticostriatal loops are engaged in feedback

processing, with the executive loop encompassing the head of the caudate

nucleus which may be involved in updating behavior via error correction.

Consistent with the actor-critic model, it has been theorized that the dorsal

striatum (head of caudate nucleus) acts as the actor and may be involved in

deciding which action to take, while the ventral striatum acts as the critic tracking

10

whether or not an excepted reward has been received (Joel et al., 2002;

O'Doherty et al., 2004).

1.2.3 Anatomical circuitry of the MTL and BG: It is important to consider

the anatomical circuitry of the MTL and BG individually before considering how

they may interact. Both the MTL and BG are large structures, consisting of

multiple subregions. The MTL is comprised of the parahippocampal cortex

(PHC), perirhinal cortex (PRC), entorhinal cortex (EC), and the hippocampal

complex (HC) (Squire et al., 2004). The parahippocampal and perirhinal cortices

receive information from unimodal and polymodal association areas and send

this information to the EC. The entorhinal cortex has multiple sublayers and

serves as the input unit to the hippocampus. The hippocampus itself also

consists of multiple layers, including the dentate gyrus, and the Cornu Ammonis

(CA) fields, referred to as CA1, CA2, CA3 and CA4 (Andersen, 1971; Amaral,

1993). The predominant loop arises as input from layer two of the EC, which

sends projections to the mossy fibers of the dentate gyrus and layers CA2 and

CA3, referred to as the perforant path (there is also a projection from the EC

layer 3 to CA1 and the subiculum). The mossy fibers of the dentate gyrus then

project to the pyramidal cells of the CA3 field, which project via the Schaffer

collaterals to the pyramidal cells in CA1 in a unilateral circuit. CA1 then projects

to the subiculum and deep layers of the EC. This flow of information from the

perforant path → dentate gyrus → CA3 → CA1 is referred to as the trisynaptic

circuit. The CA1 field is a sparse network, which functions well in pattern

separation, while CA3 has many interconnections, and functions well in pattern

11

completion. The output unit of the hippocampal unit is the subiculum and fimbria-

fornix fiber bundle.

Turning to basal ganglia circuitry, the basal ganglia system is comprised of

the striatum, the globus pallidus, the subthalamic nucleus, and the substantia

nigra (Packard and Knowlton, 2002; Yin and Knowlton, 2006). The striatum

consists of the caudate nucleus, putamen, and nucleus accumbens. The

caudate nucleus and putamen are often referred to collectively as the dorsal

striatum, and are implicated in learning tasks; while the nucleus accumbens is

referred to as the ventral striatum, and is more heavily involved in reward and

conditioning tasks (O'Doherty et al., 2004; Balleine et al., 2007; Delgado, 2007).

The BG also contains the globus pallidus, which is divided into an internal and

external component, as well as the subthalamic nucleus. In addition, the

substantia nigra (SN), a midbrain structure, is involved in basal ganglia circuitry.

The SN is divided into two main components, the SN reticulata (SNr) and the SN

compacta (SNc), the latter of which contains dopamine neurons that project to

the striatum. As a note, dopaminergic neurons in the ventral tegmental area

(VTA) also project to the striatum (Haber, 2003). The BG is generally

categorized into containing two main circuits, referred to as the direct pathway

and the indirect pathway. The direct pathway is as follows:

Cortex (glutamatergic) → striatum (GABAergic) → SNr-GPi (GABAergic) →

thalamus (glutamatergic) → cortex, forming a functional loop. Strong excitatory

input from the cortex results in strong inhibitory input from the striatum to the

SNr-GPi complex, which then weakly inhibits the thalamus, resulting in a stronger

12

excitatory projection to the cortex. The end result of this loop therefore is

excitation of the cortex.

The indirect circuit is as such:

Cortex (glutamatergic) → striatum (GABAergic) → GPe (GABAergic) →

subthalamic nucleus (glutamatergic) → SNr-GPi (GABAergic) → thalamus

(glutamatergic) → cortex. Excitatory input from the cortex results in strong

inhibition of the GPe from the striatum. The GPe then weakly inhibits the STN,

which then strongly excites the SNr-GPi complex, inhibiting the thalamus, which

can then only weakly excite the cortex. The end result of this loop therefore is

weak excitation of the cortex. Consequently, the direct and indirect pathways

have antagonistic functions in the brain.

1.2.4 Neuroanatomical connections between memory systems: In order to

better understand how two major learning and memory systems operate in the

human brain, much recent research has focused on examining the interactions

between the medial temporal lobes and the basal ganglia. Anatomically, these

regions are connected by a unilateral projection from the hippocampus to the

nucleus accumbens (Kelley and Domesick, 1982) and the medial, ventral, rostral,

and caudal aspects of the caudate nucleus/putamen (Krayniak et al., 1981;

Sorensen and Witter, 1983; Swanson and Kohler, 1986; Groenewegen et al.,

1987). Lisman and Grace (2005) formulated a theory of how the striatum and

hippocampus may interact via a loop with the ventral tegmental area. This model

is as follows:

13

Layer II of the EC → dentate gyrus and CA3 (glutamatergic) → CA1 (which also

receives input from EC layer III; glutamatergic) → subiculum (glutamatergic) →

nucleus accumbens (GABAergic) → ventral pallidum (GABAergic) → VTA

(dopaminergic) → CA1, layer III of the EC, and the nucleus accumbens.

Lisman and Grace propose that this model is involved in novelty detection

and encoding of information into long term memory. Newly detected information

in the hippocampus is sent as a novelty signal via the subiculum, nucleus

accumbens and ventral pallidum to the VTA in the midbrain. Subsequent DA

release from the VTA projects back up to the hippocampus, where it is involved

in facilitating long-term potentiation. Of critical importance for the purposes of

this dissertation, is the fact that the VTA also projects to the nucleus accumbens,

thereby effectively communicating with both the hippocampus and the ventral

striatum.

In addition to the Lisman and Grace model, Devan and White (1999) have

presented four anatomical links between the hippocampal complex and the

dorsomedial striatum:

1) CA1 and subiculum via the fimbria/fornix → ventral striatum and

caudate nucleus/putamen (Groenewegen et al., 1987; Brog et al.,

1993)

2) Perirhinal cortex → dorsal CA1, EC, and NAcc/medial caudate

nucleus/putamen (Krayniak et al., 1981; Sorensen and Witter, 1983;

Swanson and Kohler, 1986; McGeorge and Faull, 1989; Burwell et al.,

1995; Liu and Bilkey, 1996)

14

3) Hippocampal output → posterior cingulate cortex → medial caudate

nucleus/putamen (Domesick, 1969; McGeorge and Faull, 1989)

4) Hippocampal output → medial prefrontal cortex → ventral and

dorsomedial striatum (Swanson, 1981; Gerfen, 1984; Swanson and

Kohler, 1986; Jay et al., 1989)

Therefore it is possible that the hippocampus and striatum may also

communicate via a third region, such as the prefrontal cortex (Poldrack and

Rodriguez, 2004) or even the cingulate cortex (Devan and White, 1999).

Evidence from animal literature indicates that lesions to the cingulate cortex

(Sutherland et al., 1988) and medial prefrontal cortex (Sutherland et al., 1982;

Kolb et al., 1994) impair performance on water maze tasks in rodents, suggesting

that the connections between these regions is functionally important. Recent

research in particular has emphasized that connections with the prefrontal cortex

may be facilitating interactions between the MTL and the BG. In one study using

dynamic causal modeling, the dorsolateral prefrontal cortex (DLPFC) was found

to be the sole structure involved in receiving reward-related information directly,

which subsequently influenced activity in the midbrain (VTA) and ventral striatum

(NAcc) (Ballard et al., 2011). The authors suggest that the DLPFC integrates

information about reward and then sends this information to the ventral striatum

and midbrain dopaminergic regions, resulting in the initiation of motivated

behavior. As the midbrain projects to both the MTL and ventral striatum, it is

possible that these areas may interact via the PFC or the midbrain. The work of

15

Ballard and colleagues is consistent with a previous model posited by Poldrack

and Rodriguez (2004) which implicated the prefrontal cortex in mediating

interactions between the BG and MTL. In this theory, Poldrack and Rodriguez

posit that the MTL and BG most likely do not interact in a direct manner, but may

interact via the prefrontal cortex or via the action of neuromodulators. Both the

BG and MTL form anatomical loops with frontal cortical areas (Middleton and

Strick, 2000; Thierry et al., 2000; Seger, 2006). In terms of potential

neuromodulators, the authors nominate a couple of candidates, one of which is

dopamine. In addition to playing a role in reward learning and memory

functioning, dopamine also has an important influence on successful

classification learning (Beninger et al., 2003). Beninger and colleagues

conducted an experiment where schizophrenic patients were administered either

typical or atypical antipsychotic medications and asked to perform a probabilistic

classification learning task. Those on typical antipsychotic medication, which is

believed to block D2 receptors in the dorsal striatum, were impaired at

classification, whereas those on atypical medication, which does not block

dopamine receptors in the dorsal striatum, and healthy control participants,

performed normally. This study highlights the role dopamine plays in normal

acquisition of category learning information.

In accord with this general theory, a distinct experiment examining the

neural correlates of instructed knowledge observed that regions of the

ventromedial prefrontal cortex (vmPFC), ventral striatum, and MTL were involved

in reward-related learning and were functionally negatively correlated with a

16

region of the DLPFC during the instructed learning session when positive

outcomes were received (Li et al., 2011). The authors concluded that DLPFC

may be modulating action-outcome responses in reward-related structures

dependent on the presence and usefulness of explicit knowledge (instruction)

compared with only action-outcome information (no instruction). Lastly,

computational models exploring interactions among distinct neural substrates

mediating learning have suggested that the PFC may work together with the

hippocampus during rule-governance. In particular, Doll and colleagues (2009)

posit two hypotheses regarding how the MTL and BG interaction. The first

hypothesis suggests that the PFC and hippocampus (HC) may bias what the

striatum learns (bias model), while a second hypothesis posits that the BG learns

contingencies independently from the PFC/HC, but later the PFC/HC overrides

these learned contingencies (override model). Both of these models have been

shown to be valid (Doll et al., 2009); therefore further research is required to

explore how these systems interact. Thus, converging evidence suggests that

one manner by which the MTL and BG may interact is via the PFC.

1.3 Interactions between Multiple Memory Systems – Functional Significance

Summarizing the recent work of many researchers suggests three main

conflicting hypotheses regarding the nature of interactions between the basal

ganglia and the medial temporal lobe: 1) memory systems compete during

learning, 2) memory systems cooperate during learning, or 3) memory systems

17

operate in parallel during learning. Evidence in favor of each of these

hypotheses is described below:

1.3.1 Evidence supporting competitive interactions: Initial evidence

suggested that these memory systems interact in a competitive manner, such

that when one region is engaged, it suppresses or decreases activity in the other

region via either direct or indirect connections (Poldrack et al., 2001; Packard

and Knowlton, 2002; Foerde et al., 2006; Lee et al., 2008). Such studies have

ranged across species, from rodent-based lesion studies to human-based fMRI

experiments involving probabilistic category learning. Evidence of competitive

interactions in animal-based work has been observed during lesion experiments

when knocking out or pharmacologically modulating one region, for example the

caudate nucleus of the striatum results in an enhancement in performance on a

MTL dependent task and vice versa (Mitchell and Hall, 1988; Packard et al.,

1989; McDonald and White, 1993; Packard, 1999; Lee et al., 2008). Most

recently, Lee and colleagues showed a double dissociation between

hippocampal and striatal-based learning paradigms in mice. Disrupting striatal

functioning, via either lesions or inhibiting a transcription factor impaired cued-

learning (dependent on the striatum) and enhanced spatial-learning

(hippocampus-dependent). Likewise, lesions to the hippocampus impaired

spatial-learning while enhancing cued-learning. Based on these findings, the

authors concluded that the memory systems interact in a competitive manner.

In human fMRI studies, competition has been shown using probabilistic

category learning tasks (Poldrack et al., 2001; Foerde et al., 2006; Seger and

18

Cincotta, 2006). For example, Poldrack and colleagues (2001) employed a

between-subjects, blocked-fMRI design and a popular category learning task, the

weather prediction task. In the weather prediction task, participants are required

to learn to predict the outcome of an event (rainy or sunny weather) based on the

presentation of 1-4 cue cards (modified from the original medical symptom

classification task of Gluck and Bower, 1988). Each card is associated with a

certain probability of predicting either rain or sun (e.g., 20, 40, 60 or 80% rain).

This version of the task contained two different variations. The first was a

feedback task, designed to be analogous to nondeclarative learning, as the

probabilistic nature of the cards and trial and error approach to learning would

presumably make it difficult for participants to outwardly memorize the

associations (Knowlton et al., 1996). The second version was an observational

task, similar to declarative learning, as the cue card and outcome were presented

simultaneously, presumably making it easier for participants to directly memorize

the associations. One group of participants completed the feedback task and a

separate group completed the observational task. The authors found the

feedback version produced activation in the basal ganglia and deactivation in the

MTL. The MTL exhibited greater BOLD responses to the observational

compared with the feedback version of the task. Furthermore, when the authors

conducted a functional connectivity analysis, they found a negative correlation

between the MTL and a region in the right caudate nucleus across participants.

Additionally, in a separate event-related fMRI experiment using only the feedback

version of the task, the authors reported that the MTL was initially activated and

19

the caudate nucleus deactivated during the task. Over time, the MTL became

deactivated and the caudate nucleus became activated, suggesting a temporal

element to the engagement of the regions. Based on this evidence, the authors

concluded that the memory systems interact in a competitive manner during

learning.

A recent study replicated Poldrack and colleagues’ 2001 result by

reporting negative correlations between the hippocampus and specifically the

head of the caudate nucleus during a rule-learning task (Seger and Cincotta,

2006). The authors concluded that the antagonistic interaction between these

two regions is probably not direct, but rather is mediated by frontal cortical areas.

1.3.2 Evidence supporting cooperative interactions: A second hypothesis

regarding how these regions interact during learning suggests they cooperate,

such that they work together to facilitate learning (Dagher et al., 2001; Voermans

et al., 2004; Cincotta and Seger, 2007; Sadeh et al., 2011). Human-based fMRI

evidence supporting this theory was observed in a relatively recent experiment

which examined specifically how responses in the striatum are modified when

learning via feedback versus learning via observation (Cincotta and Seger,

2007). In this block-design study, the authors used an information integration

task and reported that BOLD responses in the caudate nucleus and the

hippocampus were analogous in nature. Contrary to Poldrack and colleagues’

2001 results where MTL and BG activation negatively correlated and the regions

were active at alternating time points, this observation suggests that the two

regions are at times active simultaneously and show similar patterns of activity

20

during learning. Similar patterns of activity within distinct memory regions may

be interpreted as cooperative engagement of these areas during learning.

Additionally, a recent study demonstrated cooperation between the hippocampus

and putamen during a declarative memory encoding task (Sadeh et al., 2011).

Activation within the hippocampus and putamen was associated with successful

memory encoding and critically, activity within these two regions was correlated

in participants for later remembered, but not forgotten words. Lastly, the strength

of the correlation between the putamen and hippocampus predicted successful

memory, such that the stronger the correlation across regions, the more words

participants subsequently remembered.

This observation is further supported by fMRI studies which posit that the

MTL and BG may cooperate in order to maintain normal functioning in

compromised brain states (Dagher et al., 2001; Voermans et al., 2004). Dagher

and colleagues (2001) investigated planning in Parkinson’s disease patients and

healthy control patients using the Tower of London task in a PET study. Results

indicated that Parkinson’s patients performed as well as healthy control

participants, but engaged a different network of brain regions during the task.

Healthy control participants primarily engaged areas of the PFC and the caudate

nucleus, while PD patients engaged similar prefrontal areas, but no caudate

nucleus activation was observed. Critically, activation within the hippocampus

was observed in PD patients, but was diminished in healthy control participants.

These results suggest that within PD patients, where striatal functioning is

compromised, hippocampal activation increased in order to maintain overall

21

successful planning in an executive function task. A similar result was observed

in a different disease population using a separate learning paradigm. In a study

by Voermans and colleagues (2004), Huntington’s disease (HD) patients were

required to perform a route navigation task. Results indicated that increasing

MTL BOLD responses helped compensate for decreased striatal functioning in

order to maintain successful route recognition in HD patients. In healthy control

participants, no such increase in MTL activity was observed. In fact, healthy

control participants engaged greater caudate nucleus activation, while HD

patients elicited greater hippocampus activation during route recognition.

Furthermore, healthy control participants exhibited greater functional connectivity

between the hippocampus and caudate nucleus than HD patients. The authors

suggest that in HD patients, this connectivity may change such that the MTL may

adapt to have a more independent and compensatory role due to compromised

striatal function. Such cooperation between learning systems suggests they may

work together to facilitate a normally functioning brain state. Converging

evidence across multiple disease states (PD, HD) employing distinct

experimental tasks (Tower of London, route navigation) provides strong evidence

that the medial temporal lobe and striatum do cooperate during some learning

scenarios, specifically when such cooperation maintains overall successful

learning and performance levels.

1.3.3 Evidence supporting parallel engagement: Lastly, a third hypothesis

describing how the MTL and BG interact posits that the memory systems operate

in parallel, complementing each other’s function in the brain, and therefore have

22

the ability to either cooperate or compete during learning (McDonald and White,

1994; White and McDonald, 2002; Albouy et al., 2008; Doeller et al., 2008;

Tricomi and Fiez, 2008). White and McDonald (2002) outline their theory of three

parallel memory systems in the rodent brain: the hippocampus, the dorsal

striatum, and the amygdala. They suggest that in a normal brain, these regions

process and store information simultaneously and in parallel. The authors also

state that the memory systems at times cooperate and at times compete during

learning depending on the specific learning situation. Evidence supporting this

theory in animal studies is seen in radial maze experiments which demonstrate

that similar behaviors, such as certain types of place learning, may be acquired

via either the dorsal striatum, particularly the medial dorsal section, or the

hippocampus (McDonald and White, 1995; Devan and White, 1999).

Furthermore, recent fMRI studies involving various, distinct paradigms

support parallel engagement of the medial temporal lobe and striatum. Doeller

and colleagues (2008) employed an environmental learning paradigm, adapted

from traditional animal maze tasks, to investigate interactions between the

hippocampus and striatum. Participants were required to learn the location of

several objects in a virtual environment, using a distinct environmental landmark

or a boundary for orientation purposes. Results indicated that the posterior

hippocampus was engaged in learning and remembering locations via

environmental boundaries, whereas the caudate nucleus was engaged when

learning and remembering locations via single, intramaze landmarks. These

results support the recruitment of the dorsal striatum during trial and error,

23

associative learning via distinct landmarks, and the engagement of the

hippocampus during boundary-based, incidental learning. Furthermore, the

authors employed a dynamic causal modeling analysis to investigate interactions

between these regions during learning. The results from this connectivity

analysis supported independent and parallel engagement of the hippocampus

and striatum, rather than direct interactions between the regions of interest. The

hippocampus and striatum therefore supported distinct types of learning and

were engaged independently and in parallel during learning of the environment in

this paradigm.

A second, unrelated study examined engagement of the striatum and

hippocampus during a declarative word pair learning task (Tricomi and Fiez,

2008). Participants were required to learn arbitrary word pairs via feedback, with

the same pairs of words being presented over three rounds. Importantly,

feedback was administered in each round, but on the first round, as participants

were randomly guessing the answer, feedback simply indicated a correct guess

and not a correct memory, as occurred in rounds 2 and 3. The authors reported

that both the caudate nucleus and the hippocampus were engaged during

learning, but exhibited distinct patterns of activity. The caudate nucleus

displayed an interaction of round by time, showing greater activation in the last 2

rounds, during the feedback presentation period. Furthermore, the caudate

nucleus differentiated between correct and incorrect feedback in the later rounds,

when feedback was meaningful. The hippocampus was also engaged, exhibiting

a main effect of time, initially decreasing after trial onset and subsequently

24

increasing after feedback presentation. These results are inconsistent with

purely competitive interactions between the BG and MTL as the two regions were

simultaneously active and did not negatively correlate. Furthermore the results

support parallel engagement of the MTL and BG as these areas were engaged

during the learning process, playing distinct roles during the acquisition of

declarative information. The authors concluded that the caudate nucleus may be

representing goal achievement during declarative learning.

Lastly, parallel engagement between the BG and MTL has been observed

in a fMRI motor sequence learning and memory task (Albouy et al., 2008). Both

the hippocampus and regions of the striatum, specifically the caudate nucleus

and putamen, were involved in initial motor sequence learning. Interestingly,

hippocampal activation during training predicted behavioral performance

improvement observed the next day. Additionally, a functional connectivity

analysis indicated competitive interactions between the hippocampus and the

putamen during learning. However, this interaction became cooperative 24

hours following sequence training in participants who were fast learners. This

result elegantly illustrates the parallel nature of the interactions between the MTL

and BG, showing that the two regions may both compete and cooperate during

learning and memory processes.

To summarize therefore, there are three conflicting hypotheses regarding

the nature of interactions between multiple memory systems. Evidence

supporting each theory is summarized in the table below:

25

Table 1.1 Evidence Supporting Interactions between Multiple Memory SystemsCompetitive Interactions Cooperative Interactions Parallel Engagement Mitchell and Hall, 1988 Dagher et al., 2001 McDonald and White, 1994Packard et al., 1989 Voermans et al., 2004 Devan and White, 1999 Packard 1999 Cincotta and Seger, 2007 Atallah et al., 2008 McDonald and White, 1993 Sadeh et al., 2011 Albouy et al., 2008 Poldrack et al., 2001 Tricomi and Fiez, 2008 Foerde et al., 2006 Doeller et al., 2008 Seger and Cincotta, 2006 Lee et al., 2008 For review see Poldrack and Packard 2003

1.4 Description of Dissertation Experiments

Thus it is clear there is an ongoing debate regarding the nature of

interactions between the medial temporal lobes and basal ganglia. The purpose

of this dissertation therefore was to investigate how these two memory systems

interact during learning. Based on the evolutionary theory of multiple memory

systems as well as anatomical and functional connectivity with dopaminergic

midbrain centers (SN/VTA), it was hypothesized that these distinct neural regions

will act in parallel during probabilistic learning, exhibiting signs of parallel

engagement, as well as competitive and cooperative interactions. Furthermore,

based on the compilation of evidence documenting dopamine’s involvement in

learning and memory, it was hypothesized that dopamine may play a role in

facilitating interactions between the MTL and BG during reward-related learning.

The goal of the first experiment was to investigate the relative

engagement and interactions between the MTL and the BG during probabilistic

learning using a within-subjects, event-related fMRI design and a novel, simple

probabilistic learning task. The purpose of this experiment was to explore in a

26

basic manner how each system, the BG and MTL, is engaged individually and

how these distinct systems interact during multiple types of probabilistic learning

(feedback and observation). It was hypothesized that the MTL and BG will

operate in parallel during learning, exhibiting parallel engagement as well as

potentially both cooperative and competitive interactions, which were measured

by relative activation within each region as well as functional and effective

connectivity between these regions during learning. It was further hypothesized

that such parallel engagement may be mediated by midbrain dopaminergic

connectivity, as indirectly assessed by the application of a reinforcement learning

model postulated to reflect dopaminergic activity during trial and error learning

(Schultz, 1997; O'Doherty et al., 2002). The results of this experiment provided a

fundamental understanding of the involvement of the MTL and BG during

different types of probabilistic learning.

The goal of the second experiment was to examine interactions between

the MTL and BG during re-learning or updating of probabilistic information

caused by a reversal in learning contingencies. The purpose of this experiment

was to examine how the MTL and BG interact during learning, using a novel

manner of addressing this question, via the employment of multiple types of

learning and a reversal in probabilistic cue contingencies. It was hypothesized

that immediately following reversal (cue contingencies switch) both the MTL and

BG will act in parallel to process new contingencies. Specifically, it was

predicted that initial inconsistencies between the old and new information will

diminish as the old contingencies are updated, regardless of whether updating

27

occurs within the same learning type (e.g., feedback-feedback or observation-

observation) or across learning types (e.g., feedback-observation or observation-

feedback). It was theorized that both competitive and cooperate interactions may

be observed during the updating process in accord with the parallel theory of

engagement among multiple memory systems. These interactions were

measured by the relative engagement of each system during initial learning and

reversal as well as functional and effective connectivity across the brain during

reversal. Lastly, in order to examine putative dopaminergic involvement in

mediating these interactions, a reinforcement learning model was applied. The

results of this experiment provided evidence of how multiple memory systems

interact when learning types evolve and learning contingencies reverse.

The goal of the third experiment was to probe a specific learning condition

in which the MTL and BG may exhibit cooperative interactions. The purpose of

this experiment was to test a direct hypothesis that was developed by observing

the engagement of the MTL and BG in experiments 1 and 2. Specifically, a

declarative memory recognition task was employed in order to engage the medial

temporal lobe (akin to a “behavioral knockout” of one region of interest, the MTL).

Concurrent with this interfering memory task, participants performed a

probabilistic feedback learning task, which contained both an easy/consistent as

well as a hard/inconsistent learning condition. Additionally, control conditions

were employed which did not contain the interfering declarative memory task,

and thereby did not aim to engage the MTL. The effect of the interfering memory

task was observed on behavior as well as brain regions engaged during

28

probabilistic feedback learning. It was hypothesized that multiple memory

systems will cooperate when information is inconsistent and therefore

challenging, specifically when both systems are available to be involved in

feedback probabilistic learning (control conditions), while memory systems will

operate more independently when information is more consistent and less

challenging, in support of a parallel processing model of learning. The relative

engagement of distinct memory regions was examined across learning sessions

(control sessions and declarative memory interference session) and interactions

across regions were examined by the application of simple correlation analyses.

The results of this experiment provided evidence of the engagement and

interactions between multiple memory systems in “normal” (single-task) versus

dual-tasking learning scenarios.

The experiments presented in this document were event-related and

mostly within-subjects fMRI designs which employed a novel probabilistic

learning task adapted from Delgado et al., 2005. Event-related fMRI designs

have certain advantages over block designs previously used (e.g., Poldrack et

al., 2001; Cincotta and Seger, 2007) in that they allow the experimenter to track

the BOLD signal changes to specific events. Additionally, when it is feasible,

within-subject designs are also preferable as they eliminate possible sources of

variance (across-subject comparisons). However, sometimes between-subjects

comparisons are necessary (experiment 2 for example) based on the theoretical

question and experimental design. While much important work has been

conducted using the popular weather prediction task (WPT) a novel probabilistic

29

categorization task was chosen for these experiments for the following two

reasons. First, the tasks designed in this thesis are simpler than the weather

prediction task, in that they contained only 1 learnable cue per stimulus, as

opposed to multiple cue cards. Second, the task is based on a card guessing

game, which has been previously shown to robustly engage the striatum in a trial

and error learning variant – one of the key regions of interest for these

experiments (Delgado et al., 2005). Other than these two distinctions, the tasks

employed in this thesis and the WPT are similar and therefore results from the

experiments presented here may be somewhat comparable to studies that

employ the weather prediction task.

1.5 Significance

The results from the experiments outlined in this dissertation aimed to

provide a richer understanding of how the MTL and BG interact during human

probabilistic learning. By better understanding how these systems are engaged

during learning and memory, and how they interact, a more complete picture of

how healthy humans solve probabilistic learning tasks was obtained.

Understanding interactions among human memory systems is critical, as these

neural areas are implicated in many neurodegenerative and neuropsychiatric

diseases. Therefore, these results may eventually aide research efforts directed

towards understanding and improving treatment for Parkinson’s disease,

Huntington’s disease, Alzheimer’s disease, MTL amnesia, and schizophrenia.

Additionally results from these experiments may shed some light on individual

30

differences in learning, such as individuals who learn better via observation via

those who learn better via trial and error experiences.

Chapter Two: General Materials, Methods, & Data Analyses

2.1 General Materials & Methods

This section provides a brief overview of the common elements in

experiments 1-3. Details containing information specific to individual

experiments are presented in the materials and methods section of the

appropriate chapter.

2.1.1 Experimental designs: Each task was programmed using E-Prime

Version 2.0 (PST, Pittsburgh). Participants inside the MRI machine saw the task

via use of a back projection system using a mirror affixed to the head coil. A MRI

compatible button box was used to record behavioral responses during the

experiments. For all experiments, participants were given detailed instructions

and completed a short practice version outside of the MRI prior to beginning the

actual task (different stimuli were used during practice sessions). In experiment

2, which was a between-subjects design, additional instructions were

administered while the participants remained inside the MRI machine after the

first test session was completed in order to train them on the type of learning they

did not experience during the learning phase (e.g. trained on feedback learning if

the participant was in the observation learning group). Monetary compensation

consisted of a minimum of $25 per hour upon arrival at the imaging center for

completion of the study, with additional incentives based on participants’

31

performance during the task for experiments 1 and 2. Specifically, participants

were awarded additional money based on a paper test administered after

completion of the MRI portion of the experiment (see Appendix 3 for the

questionnaires administered in experiments 1-3). In experiment 1, total

compensation ranged between $50 and $65; in experiment 2, total compensation

ranged between $50 and $56. In experiment 3, participants were told they would

be paid additional money based on how well they learned the cues and how well

they discriminated between old and new natural scenes. However, all

participants who completed the study were paid $60 and were told they

performed well in the task, irrespective of how well they actually performed in the

experiment. Participants were paid the same amount in experiment 3 solely to

simplify the payment process.

2.1.2 Participants: All participants were right handed adults over the age

of 18 years. Every participant was screened for head injury and a history of

psychiatric or neurological impairments as well as contraindications to MRI

(metal implants, claustrophobia). All females were administered a pregnancy

test, which was verified to be negative, prior to beginning the MRI session.

Informed consent was obtained from each participant before beginning the

experiment. All studies were approved by the Institutional Review Boards of

Rutgers University and the University of Medicine and Dentistry of New Jersey

(UMDNJ). Experiment 1 contained seventeen participants (nine female, mean

(M) age 24 years, standard deviation (SD) 4.1); experiment 2 had fifty-two

participants (26 female, M age 22 years, SD 4.7); and experiment 3 had twenty-

32

eight participants (15 female, M age 22 years, SD 5.0). Final analysis in

experiment 1 included 16 participants (9 female), as one person was excluded

due to a scanner malfunction in the middle of the MRI session. In experiment 2,

final analysis consisted of forty-one participants (20 female), as 6 people missed

an excessive amount of trials (greater than 2 SDs from the M; some due to falling

asleep), 3 people were excluded due to excessive motion (movement beyond 5

millimeters or systematic spikes), 1 person was excluded due to a scanner

malfunction, and 1 person was excluded due to failure to learn the task (chance

accuracy, 50%, in the learning phase). Experiment 2 was a between-subjects

design consisting of a feedback group and an observation group: final data

analysis included 21 participants in the feedback group and 20 in the observation

group. The complete data set for experiment 3 consisted of twenty-four

participants (12 female), as two participants were excluded due to malfunctions

with the MRI overheating, one person did not return for the second day (fMRI

session), and one person was excluded due to failure to attend during the day 1

encoding session.

2.1.3 fMRI acquisition: All MRI experiments were conducted at the

UMDNJ Advanced Imaging Center. The MRI machine at this facility is a 3-Tesla

Siemens Allegra, which was used to collect the structural (T1-weighted

MPRAGE: 256 x 256 matrix; FOV = 256 mm; 176 1-mm sagittal slices) and

functional images (single-shot echo EPI sequence; TR = 2000 ms, TE = 25ms;

FOV = 192 cm; flip angle = 80°; matrix = 64 x 64; slice thickness = 3 mm). Forty

contiguous oblique-axial slices (3 x 3 x 3 mm voxels) were acquired parallel to

33

the anterior commissure – posterior commissure line. Functional data

preprocessing and analysis was completed using Brain Voyager QX software

(Versions 1.10; 2.0, and 2.2 Brain Innovation, Maastricht, The Netherlands).

Preprocessing consisted of three dimensional motion correction (six-parameters),

spatial smoothing (4 mm, FWHM), voxel-wise linear detrending, high-pass

filtering of frequencies (3 cycles per time course), and normalizing the data to

Talairach sterotaxic space (Talairach and Tournoux, 1988). In addition, the

mean intensity of the BOLD signal was examined in experiments 2 and 3, but

was not included in the preprocessing of the data. A canonical two gamma

hemodynamic response function was used in all experiments to convolve the

events of interest.

2.2 General Data Analyses

This section provides descriptions and explanations of the common

analyses conducted in experiments 1-3. Study specific information for each

experiment is provided in the data analysis section of the appropriate chapter.

Following fMRI preprocessing, functional data analysis was completed using a

whole brain analysis technique for all experiments, as well as region of interest

(ROI) analyses for experiment 3.

2.2.1 General linear model: In all phases of each experiment, random-

effects general linear models (GLM) were performed. Predictors of interest

(which varied based on the experiment and are described in the relevant

chapters) were included in the model as well six motion parameters which were

34

included as regressors of no interest in order to account for potential motion

related issues. In experiments 1 and 2, data analysis was conducted across the

4 second time period where the stimulus, response, and feedback occurred. The

analysis was performed in this time period in order to most appropriately

compare the feedback and observation trials. Relevant information for the

feedback trials occurred in the last 2 seconds, where feedback was provided,

whereas relevant information for the observation trials was present in the first 2

seconds, where participants observed the cue and association (informative arrow

indicating the cue value). Therefore, to most accurately compare these distinct

trial types, the stimulus/response and feedback periods were combined for the

analyses performed in experiments 1 and 2. In experiment 3, analyses were

conducted over several time points. The first analysis examined BOLD activation

during the stimulus presentation period, where participants viewed the natural

scenes/probabilistic cues and made a choice regarding the probabilistic cue

value. The second analysis examined the BOLD signal during the feedback

period, where participants received feedback contingent on their response in the

cue period, which indicated the outcome of their probabilistic cue value choice

(correct, incorrect, missed trial). The third and final analysis was a region of

interest analysis examining BOLD signals during the stimulus presentation period

in 7 a priori ROIs: the bilateral midbrain (centered on the ventral tegmental area),

bilateral ventral striatum, bilateral hippocampus, and the left caudate nucleus.

Details about the ROIs used are provided in Chapter 5.

35

2.2.2 Granger causality analyses: In addition to the basic analyses

performed, a granger causality analysis (GCA) was conducted in order to

examine functional and effective connectivity in the brain during various phases

of the experiments (details provided in the methods sections of experiments 1

and 2). This analysis is especially informative as the experiments conducted in

this dissertation were designed to examine interactions between the MTL and

BG, which are difficult to probe using traditional neuroimaging analyses alone.

Granger causality is advantageous as it examines functional and effective

connectivity among multiple brain regions during a designated time period. GCA

operates by looking for relationships between time course data from a seed

region (x) with the rest of the brain (y). Using vector autoregression, Geweke

(1982) proposed a measure of linear dependence, Fx,y between two hypothetical

times series of data, x[n] and y[n]. Fx,y is the sum of three components:

Fx,y = Fx→y + Fy→x + Fx⋅y

Fx→y measures the directed influence from x to y; probing if past values of x

improve the current predicted value of y.

Fy→x measures the directed influence from y to x; probing if past values of y

improve the current predicted value of x.

Fx⋅y measures the undirected instantaneous influence of x and y. This measure

adds in the current value of x or y to the model already containing the past values

of x and y.

Therefore, GCA examines both undirected instantaneous influence (Fx⋅y) and

directed (Fx→y and/or Fy→x) influences between a designated seed region (x) and

36

the rest of the brain (see Goebel et al., 2003; Roebroeck et al., 2005 for more

details). For the analyses performed in the experiments in this dissertation,

connectivity was examined across the entire run (first to last volume/TR) and an

autoregressive model order of 1 TR (2 seconds) was used, consistent with the

BrainVoyager plugin and with previous studies (Goebel et al., 2003; Roebroeck

et al., 2005). Therefore Granger causality between distinct brain regions was

conducted by examining 2 seconds into past time course data. Of special

interest in this set of experiments was examining connectivity between the MTL

and BG, with potential involvement of the midbrain.

2.2.3 Prediction error analyses: Reinforcement learning models were

applied to learning trials in experiments 1 and 2. The purpose of applying the

reinforcement learning models was to gain a better understanding of not only

what regions were engaged during learning, but also how mechanistically

participants were solving the tasks. Specifically, it was hypothesized that

dopaminergic influences may play a role in facilitating interactions between the

MTL and BG. In addition to examining BOLD activation in the dopaminergic

midbrain, which is an anatomically small region whose exact location is difficult to

determine using standard neuroimaging techniques, it was theorized that the

application of a reinforcement learning model would aid in examining putative

dopaminergic influences. Dopamine neurons are known to be involved in

feedback-based learning and prediction error encoding and project to both the

striatum and hippocampus (Scatton et al., 1980; Lynd-Balta and Haber, 1994;

Schultz et al., 1997; Pagnoni et al., 2002; McClure et al., 2003; Abler et al.,

37

2006). Therefore, one way of examining putative dopaminergic influences during

learning is to apply a prediction error learning model to the behavioral accuracy

data, with the aim of examining which brain areas track prediction error signals

during learning. If the BG and/or MTL vary parametrically with a PE signal, it is

possible that these regions are receiving similar information via midbrain

dopaminergic neurons.

Regressors for the prediction error models used in experiments 1 and 2

were calculated based on a Q-learning model (Watkins, 1989). The Q learning

model was chosen to model behavioral choices as the literature supports this

type of learning as an effective means of predicting instrumental choice in

probabilistic reward learning tasks (Samejima et al., 2005; Pessiglione et al.,

2006; Burke et al., 2010). A Q learning model operates by estimating the

expected value of choosing action A and action B based on the individual

sequence of choices and outcomes obtained by participants. Q models use

prediction error signals to update action values according to the Rescorla-

Wagner learning rule. In the context of experiments 1 and 2, this choice is

making a button press to indicate whether a probabilistic cue has either a higher

or lower than 5 value (e.g., choosing a high value). The expected value of

making a cue-action choice is termed a Q value and represents the expected

reward obtained by taking a certain action, in this case receiving feedback when

making an individual choice. In order to model behavior, the following equations

were used:

38

1. Qj, i+1 ← Qj, i + λδ

2. δ = r – Qj, i

The model contains the following parameters: 1) initial Q values (Qj), 2)

learning rate, λ, and 3) the softmax function temperature, m (akin to slope). A

maximum likelihood estimation algorithm was used to estimate optimal values for

these three parameters. The cue-action values (Qs) were then updated

according the Rescorla-Wagner learning rule. Specifically, following each trial,

the value of choosing higher than 5 and lower than 5 was updated according to

Equation 1: Qj, i+1 ← Qj, i + λδ. Equation 2, which calculates the prediction error,

δ = r – Qj, i is simply the difference between the expected (Qj,i) and the actual

outcome (r) of a trial. The probability of choosing each action was estimated by

using the softmax rule: for example if choosing high, the softmax function is:

P(Q↑,i) = exp(Q↑i /m)/(exp(Q↑i /m)+exp(Q↓i /m). The variable m is a temperature

parameter which measures variability, such that for low temperatures, the

probability of choosing the action which has the highest expected reward

approaches 1 (steep slope, exploitative behavior), where as for high

temperatures, all actions have almost the same probability (moderate slope,

explorative behavior). In the model, prediction error is represented by δ; r equals

the reward amount, which in the feedback trials was 1 for correct trials or 0 for

incorrect trials; i = trial number and j represents the type of action: participants’

choice of high (↑) or low (↓). A single set of free parameters was used for all

39

participants when generating the prediction error regressors which were later

used in the neuroimaging analyses in experiments 1 and 2.

For experiment 2, the prediction error model described above was

modified for the observation trials, based on the model presented by Burke and

colleagues (2010). Specifically, an action prediction error signal was generated

for the observation trials. In order to do so, it was conceived that in the

observation learning condition, participants could simply view the cue/value

associations passively (although they were instructed not to), and therefore

theoretically think of each cue’s value as being assigned or chosen by the

computer. Therefore, for the participant, the observation trials could be thought

of as similar to observing another agent, in this case the computer, play to learn

the cue/value associations, while the participant observed and subsequently

learned the cue/value associations as well. An action prediction error is defined

as δaction = action(H,L) - p(H, L) where action ∈ (H, L; H: higher than five cue

value; L: lower than five cue value). The probability values, p(H, L) of observing

each cue-action pair were calculated by the Q(↑) and Q(↓) values used in the

softmax function. For example, for trial i, if the model predicted that the

probability of choosing H (higher than 5 cue value) is 0.7 and the participant

observed that choosing H is the correct choice, according the model: δaction =

actioni(H,L) - p(H, L), δaction = 1-0.7 = 0.3. Hence a low surprise or saliency signal

is generated. If the participant observed that in fact L (lower than 5 cue value)

was the correct choice, then δaction = 1-0.3 = 0.7, therefore a greater

surprise/salience signal. Due to the way δaction is constructed, an action

40

prediction error value is always positive (as action always sums to 1), and can be

used as an indicator for how “salient” the computer’s choice was to the

participant for that particular trial.

As a note, a prediction error analysis was not conducted in experiment 3.

The predominant reason for this was that the third experiment was much more

complex in design than experiments 1 and 2 and as a result would be difficult to

model. In experiment 3, during the stimulus presentation period, there was also

a natural scene present in all learning sessions and presumably participants

were trying to 1) ignore this information, 2) make a perceptual decision, or 3)

remember the scene, based on the session type. Therefore, this was not a pure

feedback learning environment as there were multiple cognitive demands

present. Even though prediction errors were most likely being generated in

response to the cue outcomes, since this was a much more complex learning

environment, it was decided not to conduct a prediction error analysis in this

experiment.

2.2.4 Cluster level statistical threshold estimator: The cluster level

statistical threshold estimator is a plugin included in the BrainVoyager analysis

package which provides a correction for multiple comparisons (Forman et al.,

1995; Goebel et al., 2006). This plugin operates by employing Monte Carlo

simulations in order to determine the likelihood of observing clusters of various

sizes in a given map at a set threshold. In order to use the plugin, the statistical

parametric map of interest is first thresholded at the desired level (e.g., p<0.005,

uncorrected). Proceeding the initial thresholding, the plugin estimates the spatial

41

smoothness of the map and performs a whole brain correction using Monte Carlo

simulations which estimate the rate of false positives in the present map at the

cluster level. In all analyses reported, 1,000 iterations (the recommended

number of the plugin) were chosen. After the simulations, the selected map

automatically applies the minimum cluster size threshold which produces the

desired cluster-level false-positive alpha rate (5% was chosen in all analyses).

All remaining active clusters in the corrected map are used in creating a table

that summarizes the number of clusters above the chosen threshold (5%) for

each cluster size. When the analysis is complete, each cluster size is assigned

an alpha value determined by the frequency of its occurrence in the map. This

analysis therefore corrects for multiple cluster tests in the map.

2.2.5 Sequential Bonferroni correction: In all experiments, post-hoc

analyses conducted on the behavior and neuroimaging data consisting of more

than two t-tests within a family of comparisons were corrected for multiple

comparisons with the sequential Bonferroni technique (Holm, 1979; Rice, 1989).

This analysis was developed in 1979 by Holm as a less strict correction for

multiple comparisons, compared to the traditional Bonferroni correction. In this

analysis, the desired alpha level (e.g., 0.05) is divided by the number of t-tests to

be performed, k (e.g., 3). The table-wide alpha level then becomes that value

(e.g., 0.05/3 = 0.0167). The p values of all three t-tests are then ranked in order

from most to least significant (P1 to Pk, or 0.004, 0.02, 0.04 for example). If the

most significant p value (0.004) is less than alpha/k (e.g., 0.0167), then it can be

considered significant at the “table-wide” alpha level and the experimenter may

42

proceed in examining the next test. If the most significant p value is greater than

alpha/k, then all resulting p values are termed insignificant at the “table-wide”

level. In the given example, 0.004 is less than 0.0167, so the experimenter may

examine the next test. The alpha level is then divided by k-1, in this example (3-

1=2). The new alpha level is now 0.05/2 or 0.025 and the next most significant p

value is 0.02, so it is also considered significant at the table-wide level. The

experimenter may proceed on to the last test, using an alpha level of 0.05/(3-2,

or 1). Therefore in this example, all tests are significant at the table-wide level.

2.2.6 Note regarding test phase neuroimaging results: All of the test

phase neuroimaging results are presented in Appendix 1. The test phases were

not primary phases of interest in experiments 1-3 but rather were performed as

secondary analyses in order to ensure that participants successfully acquired cue

contingencies in all learning sessions. These phases were also performed to

examine the engagement of memory regions when recalling previously learned

cue values in an exploratory fashion. Because the experimental sessions of

interest were long (learning and update phases), the test sessions were short;

especially the tests administered in experiments 1 and 2, which contained a

limited number of trials (30 trials in the experiment 1 test – 3 presentations of

each cue; 32 trials in the experiment 2 tests– 4 presentations of each cue). For

these reasons, the results were not of primary interest and are presented in

Appendix 1.  

43

Chapter Three: Experiment 1

3.1 Introduction: Background & Rationale

It is theorized that memory is not a unitary process, but rather consists of

multiple systems which rely on distinct neural substrates (Sherry and Schacter,

1987; Squire, 1992; Squire and Zola, 1996). A significant body of research,

spanning across species, supports the existence of multiple memory systems in

the mammalian brain. A major division is between the declarative memory

system, dependent on the medial temporal lobes, and a nondeclarative memory

system, which engages several brain structures, one of which is the basal

ganglia system. While decades of research have investigated the individual roles

of the aforementioned areas during multiple types of learning, a key remaining

question of interest is examining how the medial temporal lobes and basal

ganglia interact during learning and memory formation. This question has

aroused great debate in the literature, with some evidence supporting

competitive interactions (Poldrack et al., 2001; Poldrack and Packard, 2003;

Foerde et al., 2006; Lee et al., 2008), other research supporting cooperative

interactions (Dagher et al., 2001; Voermans et al., 2004; Cincotta and Seger,

2007; Sadeh et al., 2011) and a third body of evidence supporting parallel

engagement of the medial temporal lobes and basal ganglia system (White and

McDonald, 2002; Albouy et al., 2008; Atallah et al., 2008). For the purposes of

this thesis, McDonald and White’s definition of parallel memory systems is

employed (2002), which suggests that these distinct systems may be online

simultaneously and involved in distinct aspects of learning. The authors posit

44

that information passes independently through each system, such that each

system receives the same information, but specializes in representing different

aspects of the information. Furthermore, it is theorized that these systems may

either interact directly in a cooperative or a competitive manner or may

simultaneously influence behavior in parallel.

While examining the nature of the interactions between the medial

temporal lobes and the basal ganglia, it is important to consider how these

distinct regions are functionally and anatomically connected. Research suggests

that there are direct anatomical projections from the hippocampal formation to

the nucleus accumbens and the caudate nucleus/putamen in the rodent (Jung et

al., 2003). Furthermore, some researchers have proposed models describing

how these regions may interact during learning. One key model of interest,

developed by Lisman and Grace (2005) proposes that the MTL and BG may

interact via anatomical connectivity with the dopaminergic midbrain. In the

model, the hippocampus functions to detect the entrance of novel information

and sends this novelty signal to the ventral tegmental area in the midbrain via

projections through the subiculum, nucleus accumbens (ventral striatum), and

ventral pallidum. The dopaminergic neurons within the VTA fire in response to

this novelty signal, subsequently releasing dopamine into the hippocampus

where it enhances long term potentiation. Importantly, the VTA also sends a

projection back to the nucleus accumbens. Therefore, this model suggests a

functional loop between the hippocampus, nucleus accumbens, and midbrain,

which specializes in novelty detection and the entrance of information into long-

45

term memory. Dopamine is a logical candidate for facilitating interactions

between these regions as it is believed to play a critical role in learning and

memory processes (Wise, 2004; Shohamy and Adcock, 2010). Dopamine is

implicated in reward-related learning, which is typically associated with striatal

function (for review see Schultz, 2002) as well as facilitating long-term memory

formation in humans, which is more traditionally associated with medial temporal

lobe activation (Wittmann et al., 2005; Adcock et al., 2006; Shohamy and

Wagner, 2008; Shohamy and Adcock, 2010). Therefore, anatomical as well as

functional evidence supports a role for dopamine in facilitating interactions

between the basal ganglia and medial temporal lobe.

Given the debate regarding the nature of interactions between multiple

memory systems, further research is required to examine multiple memory

systems and how they interact during learning and memory formation. The

purpose of the first experiment of this dissertation was to investigate in a broad

manner, the relative engagement and interactions between the MTL and the BG

during a simple learning paradigm using a within-subjects fMRI design.

Probabilistic learning was chosen because a major portion of the literature has

employed probabilistic learning tasks to examine both the BG’s and the MTL’s

involvement in learning. It has been suggested to be an effective manner of

engaging both nondeclarative/implicit and declarative/explicit learning

mechanisms (Knowlton et al., 1996; Shohamy et al., 2004b), although this theory

is debated (Lagnado et al., 2006). A novel paradigm was developed for this set

of experiments based on a card guessing task (Delgado et al., 2005), which was

46

previously shown to engage the striatum. It was decided to modify this task in

particular due to the fact that it strongly engaged one of the key systems of

interest (the basal ganglia). Modifications were performed to the task design in

order to try to engage the MTL as well; specifically an observation version of the

task was developed, modified from the task design used by Poldrack and

colleagues (2001). In the experiment, participants were required to learn the

value of easy and hard probabilistic cues either through feedback (trial and error)

or via observation (paired-association). Based on the synthesis of current

research in the field, three main points were considered while formulating a

hypothesis regarding the nature of interactions between the MTL and BG: 1) both

cooperative and competitive interactions between these regions have been

reported in the literature, 2) evolutionary considerations suggest these two

systems play fundamentally different roles in the brain, and 3) White and

McDonald’s theory of parallel engagement among memory systems in the brain

posits that the MTL and BG may operate in parallel or interact directly in a

cooperative or competitive manner. Based on the above evidence from the

literature, it was hypothesized that the MTL and BG will operate in a parallel

manner, potentially exhibiting both cooperative and competitive interactions.

This hypothesis was tested by examining the relative activation within both the

MTL and the BG during feedback and observation learning as well as functional

and effective connectivity between these regions during learning. It was further

hypothesized that such parallel engagement may be mediated by midbrain

dopaminergic connectivity, based on dopamine’s involvement in reward-related

47

learning and long-term memory formation as well as the known neuroanatomical

projections from the midbrain to both the striatum and hippocampus (Shohamy

and Adcock, 2010). Dopamine’s influence in mediating interactions between the

medial temporal lobe and the basal ganglia was indirectly assessed by the

application of a reinforcement learning model postulated to reflect dopaminergic

activity during feedback learning.

3.2 Materials & Methods

3.2.1 Experimental design

The probabilistic learning task used in this study was a modification of a

previously used feedback-based card learning paradigm (Delgado et al., 2005).

In this adapted version of the task, participants were asked to perform a

categorization task that involved determining the numerical value of several

probabilistic visual cues (e.g., circle is higher than the number 5; Figure 3.1).

There were two distinct phases of the task, a learning phase, where participants

acquired the values of the visual cues, followed by a test phase, where

participants were asked to remember the cue associations they learned in the

previous phase. Participants were instructed that each visual cue had been

assigned a numerical value on the scale from 1-9 and could be thought of as

having a value of either higher (6-9) or lower (1-4) than the number 5. They were

instructed that they did not need to learn the exact value of each cue (e.g. circle

is 7), but rather group the shapes into those with a value of higher than five and

those with a value of lower than five.

48

Two independent variables were manipulated in the task. The first was

the type of learning, or how the participants acquired the cue associations. In the

learning phase, there were two distinct ways in which participants learned about

the values of 8 cues. One manner was via feedback (4 cues), where participants

guessed and received feedback regarding the cue’s value, akin to trial-and-error

or nondeclarative learning (Figure 3.1A). The second manner was via

observation (4 cues) where participants observed the cue paired with its value,

more similar to declarative learning. During feedback trials, participants saw a

visual cue and made a button press indicating their belief regarding the value of

the cue (button 1 = higher than 5; button 2 = lower than 5). Visual cue

presentation and response (2 seconds total) were followed by feedback

contingent on the participants’ response (check mark for correct trials, X mark for

incorrect trials, and # symbol for missed trials; 2 seconds). Feedback was

followed by a jittered inter-trial-interval (ITI; 2-14 seconds) to allow for the

hemodynamic response to return to baseline. During observation trials,

participants observed the cue paired with an informative arrow indicating the

value of the cue (upward-facing arrow denotes a higher than 5 value; downward-

facing arrow denotes a lower than 5 value) and made a response indicating their

belief regarding the value of the cue (button 1 = higher than 5; button 2 = lower

than 5; Figure 3.1B). The visual cue and response period (2 seconds) was

followed by a message informing the participants if their response had or had not

been recorded (2 seconds). As long as the participants made a button press

within the 2 second visual cue/response period, they received a message that

49

their answer was recorded, if they failed to respond within 2 seconds for any

reason, they received a message that their answer was not recorded (missed

trial). This message was followed by a jittered ITI (2-14 seconds) before

presentation of the next visual cue. The button response in the observation trials

served as a motor control for the button response made during the feedback

trials, equated the duration of each trial type (4 seconds total), and was an

attempt to equate the difficulty of the trial types. During the learning phase,

participants completed 4 blocks, alternating between feedback and observation

blocks (2 of each type). Each block contained 40 trials, for a total of 160 trials in

the learning phase. Feedback and observation blocks differed in terms of the

cues presented as well as cue color (e.g., feedback cues were pink, observation

cues were blue). Block order and cue color were counterbalanced, and within

each block, cue trial order was randomized.

The second independent variable manipulated in the paradigm was the

probability of the cues (predictive outcome of the numerical value for each cue),

referred to as cue difficulty. Four cues (2 feedback and 2 observation) were 85%

predictive of the outcome (higher or lower than 5) and are referred to as “easy

cues” and the remaining four cues (2 feedback and 2 observation) were 65%

predictive of the outcome and are referred to as “hard cues”. Cue difficulty was

included as a variable as both the BG and MTL are known to be modulated by

the difficulty of material to-be-learned. During the learning phase, participants

were specifically told to optimize their responses to the cues. They were told that

as the value of the cues was probabilistic, for the observation trials, the direction

50

of the arrows would change. For example, if a cue is 85% higher than five it is

thereby 15% lower than five. Participants were instructed to pay careful attention

to the direction of the arrows and to try to determine what the value of each cue

was most of the time. On all trials, they were instructed to press the button

corresponding to the value that cue was associated with for the majority of the

trials in the task. Therefore, on the inconsistent trials, (cue is lower than five)

participants could choose to push the button indicating “higher”, if that is what

they believed the value of the cue to be most of the time. As a result of these

instructions, participants were actively making a choice regarding each cue’s

value and were not passively pushing the button that always matched the

direction of the arrow. Because participants were instructed to optimize their

responding, behavioral accuracy was scored according to participants’ actual

choices. That is if they always pushed the button indicating a higher than five

response, for a cue which was 85% or 65% higher than 5, they would receive a

score of 100% correct; if they followed the arrows exactly they would probability

match and be 85% or 65% correct respectively. As a point of clarification,

participants were not told the exact probabilities (e.g., 85%) but were simply

introduced to the concept of probabilistic information. The test phase was

presented immediately following completion of the learning phase, while the

participants remained inside the scanner (Figure 3.1C). The test phase

contained all eight visual cues presented in the learning phase, in addition to two

novel cues presented for the first time in the test phase. The novel cues were

presented to provide a control for comparing previously learned information with

51

novel information. There were 30 trials total in the test phase, three

presentations of each of the ten cues. Trials consisted of a self-timed cue

presentation/response period followed by a jittered ITI (6-14 seconds) before

onset of the next cue. Critically, feedback (from the feedback trials) and

informative arrows (from the observation trials) were not presented in this phase.

Participants saw only the cue by itself (e.g., circle) and made a button press to

indicate its value, based on what they learned in the learning phase. A second

test phase was administered a couple of days following completion of the MRI

study in a behavioral-only test session completed in the laboratory (M = 2.63

days, SD = 1.63 days).

3.2.2 Data analysis

GLM: A random-effects GLM analysis was conducted in the learning

phase to examine BOLD activity during the 4-second cue presentation +

feedback period, using the following five predictors: learning type (feedback and

observation), cue difficulty (easy and hard), and missed trials. From this GLM

statistical parametric maps (SPM) were generated and thresholded at p<0.005

with a cluster threshold correction of 5 contiguous voxels (135mm3 in 1x1x1mm

units). A repeated-measures ANOVA was then conducted within BrainVoyager

using learning type (feedback and observation) and cue difficulty (easy and hard)

as within-subjects factors. The primary SPM of interest investigated a main

effect of cue difficulty. This analysis was of interest as it provided a non-biased

examination of the relative engagement and pattern of activity within the BG and

the MTL during learning. Functional ROIs were defined based on the resulting

52

map and the BOLD signal (characterized by beta weights or mean parameter

estimates) was extracted in order to examine potential similarities and

differences between feedback and observation learning in post-hoc analyses. In

addition, potential effects of learning changes over time were examined. In order

to model time, the learning phase was divided according to run/block. The BOLD

signal was examined in the functional regions of interest (e.g., caudate nucleus

and hippocampus) for the first learning run/block of each learning type (e.g.,

feedback trials 1-40; observation trials 1-40) and similarly for the second learning

run/block of each learning type (e.g., feedback trials 41-80; observation trials 41-

80). Subsequently, the mean BOLD signal from the first or “early” learning run

was then compared with the mean BOLD signal from the second or “late”

learning run for both learning types in both regions. The second SPM of interest

directly investigated a main effect of learning type. The third SPM of interest

examined potential interactions of cue difficulty and learning type.

During the test phase, BOLD activity was examined during the stimulus

onset and participant response period. A random effects GLM was conducted

with observation (easy and hard), feedback (easy and hard), and novel cues as

predictors, in addition to 6 motion parameters included as regressors of no

interest. As the GLM is unbalanced, a 2x2 (learning session x cue difficulty)

within-subjects ANOVA could not be performed in this phase. Therefore, a

contrast of previously studied (observation and feedback cues collapsed across

difficulty) vs. non studied (novel) cues was performed, thresholded at p <0.005.

The cluster level statistical threshold estimator plugin was used on the resulting

53

SPM. Parameter estimates were extracted from the resulting region and

examined with post-hoc analyses. Any reported regions not withstanding

correction are labeled clearly and should be interpreted with caution (Poldrack et

al., 2008).

GCA: A Granger causality analysis was performed specifically to probe

potential interactions between the hippocampus and striatum during learning.

The functionally defined hippocampus ROI from the main effect of cue difficulty

ANOVA in the learning phase was used as the principal seed region for this

analysis. This ROI was chosen as the primary seed region given the direct

anatomical projections from the hippocampus to the basal ganglia (Kelley and

Domesick, 1982). Functional and effective connectivity maps were generated

separately for the feedback and observation runs. At the individual subject level,

the two feedback blocks were combined to form one feedback map and the two

observation blocks were combined to form one observation map. Time course

values entered into the analysis contained the entire run, from the first to the last

time point (240 volumes per run). At the multi-subject level, each participant’s

feedback map was combined to make one group feedback map and each

participant’s observation map was combined to make one group observation

map. Maps were given a threshold of p<0.0001 with a cluster threshold

correction of 7 continuous voxels for the observation maps and 8 continuous

voxels for the feedback maps (cluster-level false-positive rate of 5%). Standard

second level statistics were performed on the group maps; specifically a t-test

was calculated. Reported results do not contain effective connectivity (Fy→x

54

and/or Fx→y), as no effective connectivity was observed between the regions of

interest (MTL and BG). Therefore the results reported represent functional

connectivity (instantaneous influence with no directionality information; Fx⋅y).

Lastly, a second Granger causality analysis was performed in the same manner

as a control analysis, using the caudate nucleus region obtained from the

learning phase ANOVA main effect of cue difficulty analysis. The resulting maps

were thresholded at p<0.0005 with a cluster threshold correction of 12

continuous voxels (cluster-level false-positive rate of 5%) unless otherwise noted.

PE: The prediction error values generated from the Q learning model

presented in the general methods chapter were then used as a regressor in the

neuroimaging GLM analysis. Eight additional predictors were included in the

GLM: trial event and missed responses, as well as six motion parameters.

Within the GLM, the PE values were coded during the two second feedback

presentation time period for only the feedback learning trials. A SPM which

probed regions of the brain that varied parametrically with the PE values during

feedback learning was generated, thresholded at p<0.005 with a voxel contiguity

of 5 continuous voxels (cluster-level false-positive rate of 5%).

3.3 Results

3.3.1 Behavioral results

Learning phase: Accuracy

The primary analysis probed accuracy differences between the learning

types and levels of cue difficulty. A 2 (learning type: feedback vs. observation) x

55

2 (cue difficulty: easy vs. hard) repeated measures ANOVA was performed. No

main effect of learning type (F(1,15) = 1.47; p > 0.05), a main effect of cue

difficulty, (F(1,15) = 16.04; p< 0.01), and no significant interaction (F(1,15) = 0.85; p

> 0.05) were observed (Figure 3.2A). In order to examine the main effect of cue

difficulty within and across conditions, post-hoc t-tests were performed.

Participants’ accuracy was significantly better on easy compared to hard cues for

both the feedback (t(15) = 2.45; p < 0.05) and observation (t(15) = 2.54; p < 0.025)

trials. There was marginally better performance for easy cues during observation

compared with feedback learning (t(15) = 2.24; p = 0.04; trend after sequential

Bonferroni correction), with no performance differences for hard cues between

learning types (t(15) = 0.23; p > 0.05).

A secondary analysis examined changes in learning accuracy over time in

feedback and observation trials by comparing performance early (first block)

versus late (second block) during the learning phase. A 2 (learning type:

feedback vs. observation) x 2 (cue difficulty: easy vs. hard) x 2 (time: early vs.

late) repeated measures ANOVA indicated a no main effect of learning type

(F(1,15) = 1.38; p > 0.05), a main effect of cue difficulty (F(1,15) = 16.09; p < 0.01), a

main effect of time (F(1,15) = 35.07; p < 0.01), and no significant interactions. The

significant main effect of time was driven by better performance for the hard but

not the easy trials over time.

Test phase: Accuracy

The first analyses examined differences between learning sessions and

levels of cue difficulty in each individual test session (immediate and follow-up).

56

During the individual test phases, no differences between feedback and

observation learning or between easy and hard cues were observed. In the

immediate test phase, a 2 (learning type: observation vs. feedback) x 2 (cue

difficulty: easy vs. hard) repeated measures ANOVA (excluding the novel

information and examining only the previously studied material) revealed no

significant main effect of learning type (F(1,15) = 0.06; p > 0.05), no significant

main effect of cue difficulty (F(1,15) = 0.23; p > 0.05), and no significant interaction

(F(1,15) = 1.92; p > 0.05). These results suggest that participants successfully

learned contingencies independent of learning type and/or level of cue difficulty.

The same 2 x 2 repeated measures ANOVA was performed for the follow up test

phase. No significant main effect of learning type (F(1,15) = 0.51; p > 0.05), no

significant main effect of cue difficulty (F(1,15) = 0.37; p > 0.05), and no significant

interaction (F(1,15) = 0.00, p > 0.05) were observed.

The second analysis probed for differences in accuracy between learning

material over time. In order to test this, one analysis examined differences

across the two test sessions. A 3 (learning material: observation, feedback,

novel) x 2 (time: immediate vs. follow up test session) repeated measures

ANOVA was performed. A main effect of learning material (F(1.90,28.43) = 14.04; p

< 0.01), no main effect of time (F(1,15) = 0.11; p > 0.05), and a marginally

significant interaction (F(1.36,20.36) = 3.33; p = 0.07) were observed (all factors were

Greenhouse-Geisser corrected; Figure 3.2B). The main effect of learning

material indicated greater accuracy for studied (feedback and observation)

compared with non-studied (novel) material. Post-hoc t-tests revealed that

57

participants’ performance in the feedback version of the task marginally declined

in the follow-up test session (t(15) = 2.13; p = 0.05; trend after sequential

Bonferroni correction), but no differences were observed for observation

accuracy over time (t(15) = 1.04; p > 0.05). As anticipated, participants’ accuracy

in the novel condition did not change across time (t(15) = 1.01; p > 0.05).

3.3.2 Neuroimaging results

Learning phase

Main Effect of Cue Difficulty: The primary analysis of interest in the

learning phase was to examine a main effect of cue difficulty, as that was the

significant result in the behavioral data and would provide for a non-biased

examination of the MTL and BG’s activation during learning. A 2 (learning type:

feedback vs. observation) x 2 (cue difficulty: easy vs. hard) within-subjects

ANOVA was performed within BrainVoyager and a main effect of cue difficulty

was examined (Table 3.1). A region of the left caudate nucleus was identified as

processing a main effect of cue difficulty (Figure 3.3), along with an area in the

left hippocampus that did not survive cluster threshold correction. Mean

parameter estimates from these two functionally defined ROIs were then

extracted for further analyses. In the left caudate nucleus (x, y, z = -15, 20, 7;

Figure 3.3A and Figure 3.3B), the pattern of BOLD responses was similar for

both feedback and observation learning, with no differences observed between

type of learning when collapsed across cue difficulty (t(15) = 1.00; p > 0.05). Post-

hoc t-tests indicated a greater BOLD response for easy compared with hard cues

in both types of learning [feedback: (t(15) = 5.02; p < 0.025); observation: (t(15) =

58

3.99; p < 0.05]. In the left hippocampus (x, y, z = -36, -28, -8; Figure 3.3C and

Figure 3.3D) a marginally significant effect was observed when comparing mean

parameter estimates from the feedback and observation sessions (t(15) = 2.01; p

= 0.06), with a marginally greater BOLD response for the observation trials. This

difference was driven mostly by performance for the hard trials (t(15) = 2.40; p =

0.03; trend after sequential Bonferroni correction). Post-hoc t-tests indicated a

greater BOLD response for easy compared with hard cues for the feedback

learning (t(15) = 3.73; p < 0.025) and a marginally greater response for easy than

hard cues in the observation trials: (t(15) = 1.79; p = 0.09).

As a secondary analysis, changes in the BOLD response during feedback

and observation sessions over time were examined by comparing the BOLD

signal early (first block) versus late (second block) during the learning phase. A

2 (learning type: feedback vs. observation) x 2 (cue difficulty: easy vs. hard) x 2

(time: early vs. late) repeated measures ANOVA conducted on the mean BOLD

responses extracted from the caudate nucleus revealed no main effect of

learning type (F(1,15) = 0.82; p > 0.05), a main effect of cue difficulty (F(1,15) =

95.47; p < 0.01), no main effect of time (F(1,15) = 1.97; p > 0.05), and no

significant interactions. A nearly significant increase in mean parameter

estimates over time was observed for the observation hard cues in the caudate

nucleus (t(15) = 2.25; p = 0.04; trend after sequential Bonferroni correction). The

same 2 x 2 x 2 repeated measures ANOVA was conducted on the BOLD

responses extracted from the hippocampus ROI. A nearly significant main effect

of learning type (F(1,15) = 3.36; p = 0.09), a main effect of cue difficulty (F(1,15) =

59

20.69; p < 0.01), a main effect of time (F(1,15) = 5.35; p < 0.05), and a nearly

significant interaction between learning type and time (F(1,15) = 3.76; p < 0.07)

were observed. Post-hoc comparisons indicated a nearly significant increase in

the hippocampus for the feedback hard cues as learning progressed (t(15) = 2.38;

p = 0.03; trend after sequential Bonferroni correction).

Main Effect of Learning Type: The second analysis examined a potential

main effect of learning type, generated from the same learning type x cue

difficulty ANOVA described previously (Table 3.2). Several regions of the basal

ganglia were engaged including the right ventral portion of the head of the

caudate nucleus (x, y, z = 6, 3, 4) and the left ventral caudate nucleus extending

into the globus pallidus (x, y, z = -12, 2, 4). No voxels within the medial temporal

lobe were observed at a threshold of p < 0.005. These results indicate that some

regions of the BG are involved in primarily processing feedback information more

so than observation information, supporting a parallel model of engagement.

Interaction of Cue Difficulty and Learning Type: The last main analysis

from the learning phase ANOVA examined a potential interaction between

learning type and cue difficulty. Results revealed activation in an area of the left

medial prefrontal cortex. Post-hoc t-tests conducted on the mean parameter

estimates extracted from this region indicated greater activity for observation

easy than hard trials (t(15) = 4.63; p < 0.025); but no difficulty differences for the

feedback trials (t(15) = 1.26; p > 0.05). The interaction was driven by a greater

BOLD response to observation easy compared with feedback easy cues (t(15) =

2.72; p < 0.025) and a marginally greater response to feedback than observation

60

hard cues (t(15) = 1.94; p = 0.07). These results indicate that this region of the

prefrontal cortex is involved in processing feedback and observation information,

modified by cue difficulty.

Learning phase: Correlations between striatum and medial temporal lobe

As a first pass at exploring the relationship between the caudate nucleus

and hippocampus’s BOLD responses during learning, a series of Pearson’s

correlations were performed. A significant positive correlation during the

observation learning session was observed between mean parameter estimates

extracted from the hippocampus and the caudate nucleus identified in the main

effect of cue difficulty analysis (r = 0.498, p = 0.05). There was no relationship

between the two regions BOLD responses overall in the feedback session.

However, exploratory analyses revealed a marginally significant positive

correlation between the caudate nucleus and hippocampus during the last block

of feedback learning, specifically for easy cue trials when participants’

expectations were violated by the receipt of incorrect feedback (r = 0.530, p =

0.08).

Granger Causality Analysis

To more thoroughly assess the level of connectivity and potential

interactions between the BG and MTL during learning, a Granger causality

analysis was performed. The left hippocampus from the main effect of cue

difficulty learning phase analysis was used as the primary seed region. The

resulting Granger causality maps highlight instantaneous correlations between

the hippocampus (seed region) and regions of the striatum during both

61

observation and feedback probabilistic learning sessions. In particular,

instantaneous influence between the hippocampus and two regions of the right

caudate nucleus were observed during feedback learning (x, y, z = 14, 18, 13

and x, y, z = 14, 12, 19; not shown), as well as nearly the identical ROIs in the

caudate nucleus (x, y, z = 14, 18, 13 and x, y, z = 14, 10, 18; not shown; Figure

3.4A) and one region of the right ventral putamen in observation learning (x, y, z

= 22, 3, -4; not shown) (Figure 3.4B). A second Granger causality analysis using

the caudate nucleus as the seed region was performed as a control analysis

because the hippocampus seed was taken from an uncorrected region in the

learning session. The results yielded instantaneous influence between the

caudate nucleus and bilateral hippocampal regions in observation learning

sessions (x, y, z = -31, -38, -3 and x, y, z = 32, -29, -12; not shown) corrected to

a cluster level false positive rate of 5%, as well as an area near the right

hippocampus in feedback learning sessions (x, y, z = 29, -8, -15; uncorrected for

multiple comparisons).

Prediction Error Analysis

The last analysis conducted examined a putative influence of midbrain

dopaminergic neurons in learning, which are known to project to both the

striatum and the hippocampus (Scatton et al., 1980; Lynd-Balta and Haber,

1994) and are involved in learning and memory processes (Wise, 2004; Lisman

and Grace, 2005; Shohamy and Adcock, 2010). The prediction error learning

signal is believed to be a correlate of physiological firing of dopamine neurons

during reward-related learning (Schultz, 1997; Schultz et al., 1997; Schultz,

62

2002). Based on the observation of activation of both the striatum and

hippocampus during the learning phase - processing differences in cue difficulty,

a reinforcement learning algorithm was applied to the behavioral data. The

results yielded an area of the left putamen (x, y, z = -30, 2, 4; Figure 3.5A) and

the right hippocampus (x, y, z = 27, -28, -14; Figure 3.5B) whose activation

parametrically varied with the PE signals generated from the model. Thus, the

use of a reinforcement learning model during feedback learning engaged the

striatum, a typically reported region in prediction error encoding (Pagnoni et al.,

2002; McClure et al., 2003; Abler et al., 2006) as well as the hippocampus, a

novel finding in an area which has more often been associated with mismatch

signals (Ploghaus et al., 2000; Chen et al., 2011).

3.4 Discussion

The purpose of this experiment was to investigate the relative

engagement and interactions of multiple memory systems during probabilistic

learning. The nature of how multiple memory systems interact is debated, with

several studies suggesting that the BG and MTL compete during learning

(Poldrack et al., 2001; Foerde et al., 2006; Lee et al., 2008). The data collected

in this experiment, however, are inconsistent with the hypothesis of purely

competitive interactions, and support an alternative hypothesis that these

memory systems operate in parallel during learning (White and McDonald, 2002;

Albouy et al., 2008; Atallah et al., 2008). Using an event-related probabilistic

task and within-subjects measures of learning allowed for a simple and controlled

63

investigation of the engagement and interactions between multiple memory

systems. Behavioral results indicated that as learning progressed, accuracy was

modulated by cue difficulty (easy and hard) across learning types (feedback or

observation). Regions within the BG and MTL, specifically the caudate nucleus

of the striatum and the hippocampus, tracked these behavioral changes as they

were modulated by cue difficulty, displaying similar patterns of activity in

response to both feedback and observation information during learning.

Furthermore, BOLD signals within these regions were positively correlated during

learning, as assessed by simple correlations and a more sophisticated functional

connectivity analysis using both the hippocampus and the caudate nucleus as

reference regions. The functional connectivity observed between these areas

may potentially be explained by modulation of midbrain dopaminergic neurons

during reward-related learning (Lisman and Grace, 2005; Shohamy et al., 2008;

Shohamy and Adcock, 2010) as loci within both the striatum and MTL were found

to vary parametrically with a prediction error signal. The results of the prediction

error analysis further corroborate the hypothesis that these distinct memory

systems may operate in parallel while processing probabilistic information as

both the striatum and hippocampus were actively engaged in learning scenarios

where violations of expectations or mismatches occurred, which may function as

useful updating signals during the learning process.

The results from the present experiment complement the visuomotor and

simple association learning literature, which supports the theory that the basal

ganglia and medial temporal lobes may be engaged simultaneously in learning

64

arbitrary visuomotor associations (Toni et al., 2001; Amso et al., 2005; Law et al.,

2005; Haruno and Kawato, 2006). The current results are consistent in particular

with a recent study by Mattfeld and Stark (2010) in which the interaction of the

BG and MTL was investigated using an arbitrary visuomotor association task.

Several regions of the BG and MTL exhibited increases in BOLD signal as the

strength of memory increased during a trial and error task, suggesting that

regions of the BG and MTL are engaged when learning arbitrary associations.

The observation in the present experiment of the caudate nucleus and

hippocampus demonstrating larger BOLD responses to easy compared to hard

cues complements Mattfeld and Stark’s result. The authors also employed a

functional connectivity analysis, results of which indicated functional connectivity

between both the ventral (nucleus accumbens) and dorsal (caudate nucleus)

striatum with the hippocampus during learning, corroborating the interactive

nature of the hippocampus and striatum during learning and memory processes.

One key distinction between the task employed in the current experiment and

more traditional visuomotor tasks is the inclusion and examination of different

learning types (observation in addition to feedback).

The current experiment’s results enhance recent neuroimaging findings

demonstrating cooperative interactions between the BG and MTL during

category learning (Voermans et al., 2004; Cincotta and Seger, 2007). In the

present experiment, an event-related design allowed for the decoupling of factors

such as cue difficulty and examining changes in learning over time to lend further

support to the hypothesis that parallel processing in the BG and MTL contributes

65

to overall learning during probabilistic paradigms. A significant positive

correlation between the BG (caudate nucleus) and MTL (hippocampus) during

the observation learning session and a marginally significant positive correlation

during the feedback learning session were observed in the present experiment.

Whereas negative correlations have been interpreted as competition between

memory systems (Poldrack et al., 2001), positive correlations may suggest

cooperative, perhaps synergistic interactions, leading to the interpretation that

the systems may communicate with each other in specific contexts to facilitate

probabilistic learning. This interactive nature between the MTL and BG is

illustrated by Granger causality results in the present study which indicated that

loci within the striatum and the hippocampus were correlated at simultaneous

time points during the learning phase when using both the caudate nucleus and

hippocampus as reference regions.

Based on the Lisman and Grace model (2005), it may have been

predicted that ventromedial regions of the striatum would be engaged during the

present paradigm, correlating with hippocampus activation, as opposed to more

dorsal regions. Consequently, it may be surprising that dorsal regions of the

striatum were observed during learning. However, the literature supports a role

for the dorsal striatum during instrumental tasks (O'Doherty et al., 2004),

particularly when learning is action contingent (Tricomi et al., 2004). Moreover,

the dorsal striatum, especially the caudate nucleus, has been shown to be

involved in cognitive tasks involving feedback (Poldrack et al., 2001; Seger and

Cincotta, 2005; Seger, 2008; Tricomi and Fiez, 2008). Considering this

66

evidence, it is logical that the strongest loci of activation within the striatum were

observed in the dorsal rather than ventral striatum in the current cognitive

learning paradigm. Importantly, there is also a direct projection from the

hippocampus to the caudate nucleus in the rodent, albeit a ventral region of the

caudate nucleus (Jung et al., 2003). Lastly, it has been posited that the ventral

striatum may communicate with the dorsal striatum via multiple spiral loops

connecting the striatum with dopaminergic midbrain centers (Heimer et al., 1997;

Haber, 2003). Therefore, it is possible that interactions between the

hippocampus, midbrain DA areas, and more dorsal regions of the striatum exist,

via a ventromedial to dorsolateral movement of information through the

aforementioned spiral loops.

In addition to the similarities in the activation observed in the hippocampus

and regions of the basal ganglia during the task, two differences emerged. First,

regions within the basal ganglia were modulated by a main effect of learning

type, while no voxels were identified in the MTL showing such differentiation.

Several neuroimaging papers have shown that reward processing and feedback

recruit regions of the ventromedial striatum (for review see Delgado, 2007); as

such, it is not surprising that this region was recruited more strongly during the

feedback learning trials. It may have been expected that the MTL would be

selectively modulated by observation trials given previous findings (Poldrack et

al., 2001); however, this result was not observed in the current experiment.

While a null result in neuroimaging studies does not indicate any particular

finding, and the context and experimental design of the present paradigm differ

67

from previous probabilistic learning studies, it is possible that the MTL is recruited

during both feedback and observation learning in the present task – as

suggested by the main effect of difficulty analysis. A second difference that was

observed during learning between the hippocampus and caudate nucleus was

that the hippocampus showed a main effect of time (early x late learning) during

the learning session, whereas responses within the caudate nucleus were not

significantly modulated across time. This effect was driven by changes in activity

during feedback learning (primarily for the hard cues), which increased during

later compared to early stages of learning in the hippocampus. This result may

suggest that the hippocampus is recruited more heavily for difficult feedback

trials later in the learning process.

The observation of both the hippocampus and striatum processing

differences in cue difficulty, but only the ventral striatum exhibiting stronger

engagement during feedback learning is indicative of parallel engagement of

these regions during probabilistic learning. These results suggest that the

striatum plays a dynamic role during probabilistic learning, processing both

learning type as well as cue difficulty. Furthermore, the results suggest that the

striatum may be processing similar or distinct information as the hippocampus –

suggesting they may operate independently and in parallel. Neither purely

cooperative interactions (similar pattern of activity in the BG and MTL in all

learning scenarios) nor purely competitive interactions (opposite patterns of

activity during learning or negative correlations) were observed in the present

paradigm.

68

As a caveat for the present study, it should be noted that the hippocampus

locus obtained from the main effect of cue difficulty analysis was reported at a

threshold of p<0.005, uncorrected for multiple comparisons. Additionally, two

areas in the test phase analysis, a region adjacent to the hippocampus and the

caudate nucleus, were also reported at p<0.005 uncorrected. Although this

information may be useful for studies in the future, regions reported at an

uncorrected threshold should be regarded with caution due to the increased

likelihood of observing a Type I error (Poldrack et al., 2008).

There are several open questions remaining based on the results of

experiment 1. The primary question of interest is addressing in a more direct

manner how the BG and MTL interact during probabilistic learning. One manner

of addressing this question is by employing multiple types of learning as well as

by examining how systems interact during the reversal of probabilistic

contingencies. The present study examined the relative engagement of distinct

regions during different types of learning and probed interactions between these

regions using correlations, functional connectivity analyses, and reinforcement

learning models. However, because all participants completed both learning

types in the same learning session, it is possible that the current results are

biased by the presentation of both learning types (feedback and observation)

within the same learning phase. Therefore, in order to address this concern, it is

necessary to examine the engagement of these discrete regions in observation

and feedback learning in a distinct manner. Experiment 2 attempts to address

these unresolved issues by 1) employing a between-subjects design – such that

69

participants initially learn via only one type of learning and 2) by reversing

contingencies in the middle of the learning session to probe in a novel manner

how the memory systems interact during probabilistic learning.

Chapter Four: Experiment 2

4.1 Introduction: Background & Rationale

The nature of interactions between multiple memory systems is currently

debated in the literature. Evidence supporting cooperative (Dagher et al., 2001;

Voermans et al., 2004; Cincotta and Seger, 2007; Sadeh et al., 2011),

competitive (Poldrack et al., 2001; Foerde et al., 2006; Seger and Cincotta, 2006;

Lee et al., 2008), and parallel (McDonald and White, 1994; White and McDonald,

2002; Albouy et al., 2008; Doeller et al., 2008; Tricomi and Fiez, 2008)

interactions has been documented across species using several different types

of learning paradigms. In experiments specifically involving human participants

probabilistic category learning (Knowlton et al., 1994; Knowlton et al., 1996;

Poldrack et al., 1999; Poldrack et al., 2001; Hopkins et al., 2004; Shohamy et al.,

2004b; Shohamy et al., 2004a; Shohamy et al., 2008), information integration

category learning (Ashby et al., 2002; Cincotta and Seger, 2007), visuomotor

learning (Seger and Cincotta, 2005; Mattfeld and Stark, 2010), spatial navigation

(Voermans et al., 2004), and declarative memory tasks (Tricomi and Fiez, 2008;

Sadeh et al., 2011) have all been employed when investigating this question.

Some studies have examined interactions between memory systems by

employing distinct types of learning, in particular feedback and observation

70

learning (Poldrack et al., 2001; Ashby et al., 2002; Shohamy et al., 2004b;

Cincotta and Seger, 2007; Schmitt-Eliassen et al., 2007; Djonlagic et al., 2009; Li

et al., 2011). Feedback learning is traditionally associated with engagement of

the basal ganglia, in particular the striatum (see Delgado, 2007 for review),

whereas human observation learning (or instructed learning) has been

associated with medial temporal lobe (Poldrack et al., 2001; Li et al., 2011),

striatal (Cincotta and Seger, 2007; Li et al., 2011), as well as dorsolateral and

ventromedial prefrontal cortex activation (Burke et al., 2010; Li et al., 2011).

Results from fMRI experiments that specifically employed feedback and

observation learning have supported both competitive (Poldrack et al., 2001) and

cooperative interactions (Cincotta and Seger, 2007) between the medial temporal

lobes and basal ganglia. Given that the nature of these interactions is debated,

further research is required to probe how multiple memory systems interact

during probabilistic learning. The goal of this experiment was to address this

question by employing multiple types of learning and examining interactions

across two stages of the learning process: initial acquisition and later updating

following a reversal in cue contingencies (e.g., cue value reversal).

Several studies have demonstrated a role for the basal ganglia during

reversal learning (Annett et al., 1989; Schoenbaum and Setlow, 2003; Setlow et

al., 2003; Frank and Claus, 2006). One study of interest examined the neural

correlates of probabilistic reversal learning and demonstrated activation of

frontostriatal regions during the key time point of behavioral reversal (Cools et al.,

2002). In this probabilistic reversal learning task, participants initially acquired

71

cue contingencies via trial and error. After a predetermined number of correct

responses, cue contingencies reversed, unbeknownst to the participants. The

participants were then required to update the new cue associations and change

their responding in order to receive correct feedback. Cools and colleagues

demonstrated that a frontostriatal network was engaged when processing the

critical, final error preceding participants’ reversing their behavioral choices in

order to receive correct feedback. This study illustrates the involvement of the

frontostriatal network during the updating of probabilistic cue contingencies. This

result is relevant as the task employed in the present experiment contained one

reversal of probabilistic cue contingencies during learning. Furthermore, the

striatum is one of the key areas of interest in the studies presented in this

dissertation, and it is therefore pertinent to know that it may be mediating the

reversal of probabilistic cue contingencies.

In addition, the medial temporal lobes have also been implicated in playing

a fundamental role in reversal learning. Both research using non-human animals

and humans has documented the involvement of the hippocampus during

reversal learning. Lesions to the hippocampus in animals impair reversal

learning (Zola and Mahut, 1973; Berger, 1983; Fagan and Olton, 1986; Marston

et al., 1993). In human studies, patients with bilateral damage to the

hippocampus are impaired at reversing previously acquired stimulus-outcome

associations (Myers, 2000; Carrillo et al., 2001; Myers et al., 2006) and tend to

perseverate by choosing previously a correct association. A study recently

confirmed a role for the basal ganglia and the medial temporal lobes during

72

probabilistic reversal learning using a feedback-based task with Parkinson’s

patients and MTL amnesic patients (Shohamy et al., 2009). Shohamy and

colleagues modified the weather prediction task, making a simpler task in which

each cue predicted the correct outcome 80% of the time and each cue pattern

(every trial consisted of a pattern of 3 cue cards on the screen) predicted the

outcome 100% of the time. Results indicated that while both PD and MTL

amnesic patients were able to initially acquire cue contingencies, both patient

groups demonstrated impairments with reversal. PD patients did successfully

reverse cue contingencies, but did so by choosing to respond to a new cue

among the pattern, thereby learning a new cue-outcome association, rather than

updating the old cue-outcome association and thus “opting out” of the reversal.

MTL amnesic patients failed to successfully reverse cue contingencies

completely and demonstrated perseverative behavior to previously correct

associations. The authors concluded that the basal ganglia and medial temporal

lobes therefore both contribute to probabilistic reversal learning, but in unique

ways. The basal ganglia may be required for updating old cue-outcome

associations, whereas the medial temporal lobes may play a broader role in

flexibly modifying cue-outcome associations. These data are in accord with the

parallel model of engagement between multiple memory systems.

Of additional interest, Swainson and colleagues (2000) demonstrated that

probabilistic reversal learning was impaired in mild Parkinson’s patients (PD),

who were receiving dopaminergic medication as a part of disease treatment.

The authors concluded that their results were in agreement with the dopamine

73

“overdosing hypothesis” (Gotham et al., 1988; Frank et al., 2004; Frank, 2005;

Cools, 2006). The overdosing hypothesis suggests that dopaminergic medicine,

while augmenting depleted dopamine levels in the dorsal striatum in the early

stages of PD, overdoses the still intact ventral striatum’s dopamine levels, one of

the consequences of which is washing out critical prediction error signals thus

preventing them from influencing behavior. This elegant study therefore

demonstrated a critical role for dopamine during the acquisition of probabilistic

information. This result is relevant to the present experiment, and to this

dissertation as a whole, as it is hypothesized that dopamine may be facilitating

interactions among multiple memory systems during probabilistic learning.

The task in experiment 2 combines the use of multiple types of learning

(feedback and observation) with probabilistic reversal learning in order to probe

how multiple memory systems interact during a dynamic learning environment. It

may be informative to examine the engagement of memory systems when

learning types evolve and when learning contingencies change, as this may

provide a novel way of examining how distinct neural substrates interact during

the learning process. The purpose of experiment 2 therefore was to examine in

a novel manner, how multiple memory systems interact, by probing these

interactions when learning types evolve. In particular, the experimental task

investigated changes in the engagement and interactions between multiple

memory systems when probabilistic cue contingencies were initially acquired via

one learning type and subsequently updated via the same as well as a distinct

learning type. Specifically, the experiment was a between-subjects, event-

74

related fMRI design involving two groups of participants. While undergoing

functional neuroimaging, one group initially acquired all probabilistic cue

contingencies via feedback learning, and subsequently updated the cue

contingencies via the same learning type, feedback, as well as a new, different

learning type, observation (group 1/feedback group). A second group of

participants initially acquired cue contingencies via observation learning and

subsequently updated contingencies via both the same, observation, and a new,

different learning type, feedback (group 2/observation group). This design

therefore allowed for probing a key question of interest, that being the

investigation of the relative engagement and interactions between memory

systems when updating probabilistic cue information via the same learning type

as initial cue acquisition (feedback-feedback/group1 and observation-

observation/group2) and a different learning type as initial cue acquisition

(feedback to observation/group1 and observation to feedback/group2).

Addressing the question in this manner allowed for the examination of neural

substrates in initial feedback learning versus observation learning, as well as the

investigation of what systems were engaged and potential interactions between

these systems when the learning type changed (that being either feedback-

observation or observation-feedback) versus when the learning type remained

the same (feedback-feedback or observation-observation).

Additionally, by employing this task design, an examination of pure

feedback learning (group 1) as well as pure observation learning (group 2) was

initially examined. This was necessary as the results from experiment 1

75

indicated involvement of both the striatum and hippocampus during feedback and

observation learning. It is possible that this result was observed, however, not

due to the fact that the striatum and hippocampus were both involved in both

learning types, but rather that each region was engaged in one learning type

(e.g., striatum during feedback and hippocampus during observation) and

activation was simply contaminated in the areas due to the presence of both

learning types in the same learning session. Therefore, a between-subjects

design was employed in experiment 2 in order to address this confounding issue

in experiment 1. While between-subjects designs do contain increased

variability, as comparisons are made across subjects, at times such designs are

deemed necessary. Following the initial acquisition period, a test was

administered in both groups to ensure participants had fully acquired the cue

contingencies. After the test, the update phase was administered, in which

participants in both groups 1 and 2 learned about cue contingencies via both

learning types (feedback and observation). Critically, cue contingencies were

reversed in this session, unbeknownst to the participants, requiring them to

update previously correct answers in order to receive correct feedback and

perform well in the task. Lastly, a second test was administered in order to test

whether participants had correctly updated the new cue contingencies.

The following behavioral hypotheses were formulated: in the learning

phase, it was hypothesized that participants would initially acquire cue

contingencies well, irrespective of how contingencies were initially acquired

(feedback/group1 vs. observation/group2 learning). In the update phase, it was

76

hypothesized that participants’ accuracy would initially decrease following

probabilistic cue reversal for both learning types (feedback and observation trials)

in both groups (groups 1 and 2), and subsequently improve over time as

participants updated old cue contingencies. Lastly, two additional hypotheses

were made regarding behavioral accuracy in the update phase: 1) participants

would be more efficient at updating cue associations within the same learning

type (observation-observation or feedback-feedback) or 2) participants would be

more efficient at updating cue associations by employing a new learning type

(observation-feedback or feedback-observation).

The following neuroimaging hypotheses were formulated: based on the

results of experiment 1, it was hypothesized that the basal ganglia and medial

temporal lobes would be engaged during the initial acquisition (learning phase) of

both feedback (group 1) and observation (group 2) information. During the

update (reversal) phase, it was hypothesized that both memory systems would

be engaged and involved in updating cue contingencies in parallel, potentially

exhibiting both cooperative as well as competitive interactions during the

updating process. Interactions between distinct memory systems during

updating were measured by the relative engagement of each region during the

update phase, the application of simple correlation analyses, as well as the

application of more sophisticated functional and effective connectivity analyses.

Lastly, reinforcement learning models were applied to the learning and update

phases in order to examine the role of putative dopaminergic influences during

the initial acquisition of feedback (group 1) and observation (group 2) information,

77

as well as during the updating of cue contingencies via both feedback and

observation learning in both groups.

4.2 Materials & Methods

4.2.1 Experimental design

This experiment was a modification of the task used in experiment 1

(Figure 4.1). In this between-subjects version of the probabilistic learning task,

participants were asked to learn the numerical value of six different visual cues,

which ranged on a numeric scale from 1-9 and could be categorized as either

higher or lower than 5. There were five phases of the task: learning phase

(Figure 4.1A), test phase 1 (Figure 4.1B), update phase (Figure 4.1C), test phase

2 (Figure 4.1D), and a follow-up test (not shown) administered on a subsequent

day in the laboratory. The same two independent variables were manipulated as

those in experiment 1, with some modifications: learning type (feedback and

observation) and cue difficulty (easy and hard, with the addition of a new type

which was deterministic). In the learning phase, participants acquired the value

of six cues in only one manner, via either feedback or observation. Those who

completed the feedback learning session are referred to as group1/feedback

group and those who completed the observation learning session are referred to

as group2/observation group. In the update phase, all participants learned the

new cue values via both feedback and observation trials (half of the cues via

feedback; half of the cues via observation). In terms of cue difficulty, there were

three types of cues: 100% predictive of the outcome (2 cues referred to as

78

deterministic), 87.5% predictive of the outcome (2 cues referred to as easy), and

62.5% predictive of the outcome (2 cues referred to as hard).

The learning phase consisted of 2 blocks of 48 trials each, all of which

were either feedback or observation (feedback shown in Figure 4.1A). Feedback

and observation trial structure and timing in this version of the task were identical

to that used in experiment 1 (2 second stimulus/response phase, 2 second

feedback period, 2-14 second jittered ITI). All cues in this task were blue in color.

As in experiment 1, participants were instructed to optimize their responding, in

both the learning and update phases. Following completion of the learning

phase, participants advanced to the first test session, which contained the 6 cues

learned in the learning phase, as well as two novel cues (Figure 4.1B). There

were 32 trials total in the test phase, four presentations of each of the eight cues,

presented in random order. The cues from the learning phase were blue, and

the novel cues were white in color. Trial structure and timing in this test phase

were identical to that used in the test phase from experiment 1 (self-timed

stimulus/response phase, no feedback or observation information provided, 6-14

second jittered ITI).

Following completion of the first test phase, participants were instructed

on the type of learning they had not experienced in the learning phase (feedback

learning for the observation group; observation learning for the feedback group).

After completion of a short practice session while inside the MRI, they began the

update phase. Participants were informed that they were going to continue

learning about the value of the cues, now via both types of learning. Participants

79

were also instructed that some things may have changed since the first part of

the experiment (the learning phase), but they were not given any specific

information other than the vague indication that something may be different. The

structure of the update phase was very similar to that of the learning phase and

consisted of 2 blocks of 48 trials each (Figure 4.1C). However, there were two

critical manipulations in the update phase. The first was that half of the trials

were feedback (3 cues: 1 deterministic, 1 easy, and 1 hard) and the remaining

half were observation (3 cues: 1 deterministic, 1 easy, and 1 hard). The second

key manipulation was that all of the cue contingencies switched, (e.g., circle was

higher than 5 in the learning phase, but lower than 5 in the update phase);

however the probabilities of the cues remained the same (e.g., circle was 100%

higher than five in the learning phase and 100% lower than five in the update

phase). Trial structure and timing were identical to that used in the learning

phase (2 second stimulus/response phase, 2 second feedback period, 2-14

second jittered ITI). Trial type (feedback or observation) and trial order were

randomized; therefore feedback and observation trials were intermixed during

this phase. Participants were not given any indication of the nature of the

upcoming trial type; they simply determined the trial type based on the cue

presentation/response period. If it was a feedback trial, they saw only the cue, if

it was an observation trial, they saw a cue paired with an arrow. Participants

were instructed that the arrow was an informative clue indicating the value of the

cue (higher or lower than 5). Following completion of the update phase, a

second test phase was administered that was identical to the first test phase (six

80

blue cues shown in the update phase and the same 2 white novel cues that were

shown in the first test phase; Figure 4.1D). All four phases were completed while

the participants remained in the MRI. Lastly, a follow-up behavioral-only test

session was administered a couple of days following completion of the MRI

portion of the experiment in the laboratory (M = 2.24 days, SD = 1.82 days).

4.2.2 Data analysis

Behavioral Data Analysis:

In order to examine the behavioral accuracy data, a series of ANOVAs

were performed. The primary question of interest was to examine accuracy in

each phase of the experimental task (learning phase, test 1, update phase, test

2, follow-up test) as it varied according to learning type (feedback and

observation for the learning and update phases, with the addition of novel for the

test phases) and cue difficulty (deterministic vs. easy vs. hard). It was also a

question of interest to examine behavioral performance across groups, to

determine if accuracy was significantly greater for one group than the other. In

order to do so, group was used as between-subjects factor in all of the analyses

presented in the behavioral results section and learning type and cue difficulty

were used as within-subjects factors. Details regarding specific ANOVAs

performed in each session are presented in the appropriate behavioral results

sections. As a note, one subject’s data from the first test session was lost;

therefore the feedback group test 1 data set consists of 20 subjects.

Neuroimaging Data Analysis:

81

GLM: In the learning phase participants who completed the feedback and

observation learning sessions were analyzed separately in order to examine the

engagement of the basal ganglia and medial temporal lobes during feedback

learning (group 1) distinct from observation learning (group 2). However, the

same analyses were performed on each group’s data; specifically two random-

effects GLMs were conducted to examine BOLD activity during the 4-second cue

presentation + feedback period using the following predictors: learning type

(feedback for the feedback group and observation for the observation group) and

cue difficulty (deterministic, easy, and hard). Missed trials, six motion

parameters, and the mean intensity BOLD signal were included as regressors of

no interest. Mean intensity was included in this experiment only as a regressor

of no interest as an examination of the mean intensity of all participants indicated

fairly substantial deviations from the mean (M = 5 SDs away from the average

intensity signal). Two one-way ANOVAs using cue difficulty as a within-subjects

factor were conducted within BrainVoyager. Resulting statistical parametric

maps were set at a threshold of p<0.005 and a cluster threshold level of 6

contiguous voxels or 136mm3 (corrected to a cluster-level false-positive rate of

5%). Functionally defined regions were identified from the resulting maps (group

1/feedback and group 2/observation) and the BOLD signal was extracted for

further post-hoc analyses.

In the update phase, the comparison of greatest interest was to examine

which regions were engaged when updating information via the same (feedback-

feedback for group 1 and observation-observation for group 2) or a different

82

(feedback-observation for group 1 and observation-feedback for group 2) type of

learning. In order to examine this, all 41 participants were combined into one

analysis. One random-effects GLM was performed to examine BOLD activity

during the 4-second cue presentation + feedback period using the following

predictors: learning type (same and different) and cue difficulty (deterministic,

easy, and hard). In addition, missed trials, six motion parameters, and the mean

intensity signal were included as regressors of no interest. A 2 (learning type:

same vs. different) x 3 (cue difficulty: deterministic vs. easy vs. hard cues) x 2

(group: feedback vs. observation) repeated-measures ANOVA was performed

within BrainVoyager, using learning type and cue difficulty as within-subjects

factors and group as a between-subjects factor. Based on this ANOVA a main

effect of learning type, main effect cue difficulty, main effect of group, as well as

interactions were examined within BrainVoyager. SPMs were generated and

corrected to a cluster-level false-positive rate of 5% which was calculated

individually for each main effect and interaction: 1) main effect of learning type:

p<0.005, cluster threshold = 6 or 137mm3; 2) main effect of cue difficulty:

p<0.005, cluster threshold = 5 or 129mm3; 3) main effect of group: p<0.005,

cluster threshold = 6 or 136mm3; 4) interaction of learning type x group:

p<0.00001, and an assigned cluster threshold of 10 (the cluster threshold plugin

was unable to run as the p value was so significant; therefore a cluster threshold

of 10 was chosen by the experimenter). Functional ROIs were identified and the

mean parameter estimates were extracted for further post-hoc comparisons.

83

For the test phases, BOLD activity was examined across the entire trial

(stimulus onset and participant response). Four random-effects GLMs were

performed, one per test session per group. All GLMs included learning type

(which varied based on the analysis and is described below) and cue difficulty

(deterministic, easy, and hard) as factors: 1) feedback group test 1 - learning

material (feedback and novel) and cue difficulty; 2) observation group test 1 -

learning material (observation and novel) and cue difficulty; 3) feedback group

test 2 - learning material (feedback, observation, and novel) and cue difficulty; 4)

observation group test 2 - learning material (feedback, observation, and novel)

and cue difficulty. SPMs were generated and in every test session a contrast of

studied versus non-studied (novel) cues was performed. Mean parameter

estimates were extracted from resulting functional ROIs and used for further

post-hoc analyses.

GCA: A granger causality analysis was performed in the update phase

using the midbrain as the seed region. The midbrain was chosen as the principal

seed region in order to examine potential functional and effective connectivity

between dopaminergic regions and the BG and MTL (Shohamy and Adcock,

2010). Previously published midbrain regions (Adcock et al., 2006),

encompassing the bilateral ventral tegmental area were used as the seed

regions (x, y, z = -4, -15, -9 and x, y, z = 5, -14, -8). Functional and effective

connectivity maps were generated at the individual subject level for both runs of

the update phase. Time course values over the entire run, from the first to last

time point (290 volumes per run) were entered into the analysis. At the multi-

84

subject level, each participant’s map was combined to make one group map

(which contained both feedback and observation trial information, as the trial

types were intermixed in this phase). For the feedback group, a technical error

prevented two subjects’ data from being included; therefore, the feedback group

consisted of 19 subjects in the GCA analysis. For this group, functional

connectivity maps were given a threshold of p<0.0001 with a cluster threshold

correction of 3 voxels (cluster-level false-positive rate of 5%) and observation

group maps were thresholded at p<0.0005 and a cluster threshold correction of 3

voxels (cluster-level false-positive rate of 5%). Effective connectivity maps were

thresholded at p<0.05 for both groups and corrected to a cluster-level false-

positive rate of 5% (FB group: right VTA cluster threshold = 37 or 998mm3 and

left VTA cluster threshold = 38 or 1005mm3; for the OB group: right VTA cluster

threshold = 39 or 998mm3 and left VTA cluster threshold = 37 or 965mm3).

Standard second level statistics were performed on the group maps; specifically

a t-test was calculated.

PE: Four GLMs were created for the prediction error analyses: 1)

feedback group learning phase; 2) observation group learning phase; 3)

feedback group update phase; 4) observation group update phase. The

prediction error values generated from the Q learning model presented in the

general methods chapter were used as a regressor for the feedback trials in GLM

1 (feedback learning phase). The action prediction error values generated for the

observation trials were used as a regressor for the observation trials in GLM 2

(observation learning phase). In GLM 3 and 4, feedback prediction errors and

85

action prediction errors were generated for the feedback update phase trials and

the observation update phase trials respectively. In GLMs 3 and 4, one

regressor contained the feedback prediction error signal values (feedback trials)

and a separate regressor in the same model contained the action prediction error

signal values (observation trials). GLM 3 contained both of these regressors for

group1/feedback group, while GLM 4 contained both of these regressors for

group2/observation group. In addition, in all four of the GLMs, eight additional

predictors were included: trial event and missed responses, as well as six motion

parameters. Within GLMs 1, 3, and 4, for the feedback trials, the feedback

prediction error values were coded during the two second feedback presentation

period (indicating a correct, incorrect, or missed trial). In GLMs, 2, 3, and 4, for

the observation trials, the action prediction error values were coded during the

two second cue presentation period (where the cue and informative arrow,

indicating the cue’s value, were presented simultaneously). Four SPMs which

probed regions of the brain that varied parametrically with either feedback

prediction error signals or action prediction error signals (depending on the GLM)

were generated, set to a threshold of p<0.005 with an appropriate voxel

contiguity that corrected to a cluster-level false-positive rate of 5% (FB group:

learning phase cluster threshold = 6 or 138 mm3; update phase cluster threshold

= 5 or 136 mm3 for both the feedback and observation trial models; OB group:

learning phase cluster threshold = 5 or 135 mm3; update phase cluster threshold

= 5 or 131 mm3 for both the feedback and observation trial models).

86

4.3 Results

4.3.1 Behavioral results

Learning phase: Accuracy

The analysis of interest in the learning phase was to investigate how

participants’ accuracy varied according to cue difficulty (deterministic vs. easy vs.

hard) across groups (group1/feedback vs. group2/observation). In order to

address this question a 3 (cue difficulty: deterministic vs. easy vs. hard) x 2

(group: feedback vs. observation) repeated measures ANOVA was conducted

using cue difficulty as a within-subjects measure and group as a between-

subjects measure. A main effect of cue difficulty (F(1.730,67.49) = 77.13; p <

0.0001), a main effect of group (F(1,39) = 18.29; p < 0.0001), and a marginally

significant interaction of cue difficulty and group (F(1.730,67.49) = 77.13; p = 0.085)

were observed. Post-hoc t-tests were conducted to probe the main effect of cue

difficulty and main effect of group results. As expected, behavioral accuracy

scaled according to difficulty: deterministic > easy > hard (paired-samples t-tests,

collapsed across groups: deterministic > easy t(40) = 4.26; p < 0.001; easy > hard

t(40) = 7.59; p < 0.001). An independent samples, post-hoc t-test was performed

to examine the main effect of group and indicated that participants in the

observation group (mean accuracy = 85%) performed significantly better than

participants in the feedback group (mean accuracy = 75%; t(27.12) = 4.36; p <

0.001). In order to examine the marginally significant interaction of cue difficulty

by group, independent samples t-tests were performed and indicated that

participants in the observation group performed significantly better than

87

participants in the feedback group for both deterministic and easy cues (t(28.61) =

4.89; p < 0.001 and t(26.87) = 4.20; p < 0.001, respectively), with no performance

difference exhibited for the hard cues (t(39) = 0.94; p > 0.05; Figure 4.2A and B).

This result suggests that the two learning types are perhaps best equated in the

hardest cue condition (62.5% probability).

Test 1: Accuracy

In order to confirm that participants acquired all cue contingencies

successfully in the learning phase, a test phase was administered which

contained the probabilistic cues that were acquired in the learning phase, now

presented without any cue value information (e.g., no feedback in the trials for

group1/feedback group; no observational cue information in the trials for

group2/observation group). Additionally, novel information was presented in the

test phase as a control in order to compare accuracy for previously learned with

novel information presented for the first time in the test session. Within each

group, a paired-samples t-test was performed to compare accuracy for previously

learned information with novel information (collapsed across levels of cue

difficulty). In the feedback group a t-test of feedback versus novel information

was performed and for the observation group, a t-test of observation versus

novel information was conducted. Participants in both groups performed

significantly better on previously learned information compared to novel material

(FB group: feedback > novel t(19) = 3.94; p < 0.001; OB group: observation >

novel t(19) = 4.95; p < 0.001).

88

In addition to examining differences within each group comparing

previously studied with novel information, potential differences in the test phase

among cue difficulty were also investigated. This question was examined across

groups to determine if participants in one group were more accurate than the

other group in the test phase. A 3 (cue difficulty: deterministic vs. easy vs. hard)

x 2 (group: feedback vs. observation) repeated measures ANOVA was

conducted using cue difficulty as a within-subjects measure and group as a

between-subjects measure. A main effect of cue difficulty (F(1.609,61.15) = 14.60; p

< 0.001), no main effect of group (F(1,38) = 1.65; p > 0.05), and no significant

interaction between cue difficulty and group (F(1.609,61.15) = 1.40; p > 0.05) were

observed. Post-hoc paired-samples t-tests were performed to explore the main

effect of cue difficulty result, collapsed across group. The results indicated that

accuracy nearly scaled according to difficulty with marginally better performance

for deterministic compared with easy cues (t(39) = 1.80; p = 0.08), and greater

accuracy for both deterministic and easy cues compared with hard cues (t(39) =

5.16; p < 0.001 and t(39) = 3.20; p < 0.005 respectively). To summarize, in the

first test phase, participants’ accuracy nearly scaled according to cue difficulty,

with no accuracy differences observed across the feedback and observation

groups, indicating that both groups acquired initial cue contingencies well.

Update phase: Accuracy

Two analyses were performed examining participants’ behavioral

accuracy during the update (reversal) phase. The first probed potential

differences across different types of learning (feedback vs. observation trials) as

89

well as the various levels of cue difficulty (deterministic vs. easy vs. hard) across

groups (group1/feedback vs. group2/observation). Therefore, a 2 (learning type:

feedback vs. observation) x 3 (cue difficulty: deterministic vs. easy vs. hard) x 2

(group: feedback vs. observation) repeated-measures ANOVA, using learning

type and cue difficulty as within-subjects factors and group as a between-

subjects factor was conducted. A main effect of learning type (F(1,39) = 5.19; p <

0.05), a main effect of cue difficulty (F(2,78) = 31.81; p < 0.001), no main effect of

group (F(1,39) = 0.226; p > 0.05), and no significant interactions (all p values >

0.05) were observed. A post-hoc paired-samples t-test was performed to

examine the main effect of learning type (collapsed across group), with results

indicating greater accuracy for observation compared with feedback cues (t(40) =

2.31; p < 0.05). The main effect of cue difficulty was caused by significantly

greater accuracy for deterministic and easy compared with hard cues (paired-

samples t-tests: t(40) = 6.93; p < 0.001 and t(40) = 5.73; p < 0.001 respectively),

with no significant difference observed between deterministic and easy cue

performance (t(40) = 1.35; p > 0.05). To summarize, participants were more

accurate for observation than feedback cues in the update phase, as occurred in

the learning phase, with accuracy modulated by difficulty such that participants

performed better on the “easier cues” (deterministic and easy cues) compared

with the hard cues.

Because there was a reversal in cue contingencies in the beginning of the

update phase, a second analysis examined changes in accuracy at multiple time

points throughout this phase. It was expected that participants’ performance

90

would reflect this reversal by exhibiting initial poor accuracy, followed by

increases in performance as participants updated previously learned

contingencies over time. In order to do this, trials were binned into 12-trial

increments and participants’ accuracy for those 12 trials was averaged. Mean

accuracy was examined at trials: 1-12 (time point 1), 13-24 (time point 2), 25-36

(time point 3), 37-48 (time point 4) for block 1 and the same time points: 1-12, 13-

24, 25-36, 37-48 for block 2 (which can also be thought of as trials 49-60, 61-72,

73-84, and 85-96 of the task).

To examine behavioral accuracy changes across the aforementioned time

points, a 2 (learning type: feedback vs. observation trials) x 4 [time points: 1

(trials 1-12) vs. 2 (13-24) vs. 3 (25-36) vs. 4 (37-48)] x 2 (group: group1/feedback

vs. group2/observation) x 2 (block: early vs. late) repeated-measures ANOVA

was performed using learning type, time point, and block as within-subjects

factors and group as a between-subjects factor. Results revealed a significant

main effect of learning type (F(1,39) = 8.74; p < 0.01), a main effect of time point

(F(3,117) = 27.70; p < 0.001), a main effect of block (F(1,39) = 33.70; p < 0.001), and

no main effect of group (F(1,39) = 0.35; p > 0.05; Figure 4.2C and D). Significant

interactions included an interaction of learning type by time point (F(3,117) = 7.66; p

< 0.001), learning type by block (F(1,39) = 13.21; p < 0.005), time point by block

(F(3,117) = 8.91; p < 0.001), and a three way interaction of learning type by time

point by block (F(3,117) = 6.49; p < 0.001), with no other significant interactions

observed (all p values > 0.05). Post-hoc t-tests were completed to examine the

significant main effects and interactions. The main effect of learning type

91

indicated greater accuracy for observation compared with feedback trials (t(40) =

2.31; p < 0.05) as previously reported in the first ANOVA performed in the update

phase. The main effect of time point revealed that as expected, participants’

accuracy increased across trial time points, time points 4 (81%) > 3 (76%) > 2

(72%) > 1 (67%) [time point 4 > time point 3 (t(40) = 2.59; p < 0.05); time point 3 >

time point 2 (t(40) = 2.67; p < 0.05); time point 2 > time point 1 (t(40) = 2.67; p <

0.05)]. As expected, the main effect of block was driven by greater accuracy in

the second (80%) compared with the first block (68%; t(40) = 5.96; p < 0.001).

The interaction of learning type by time point was driven by an increase in

accuracy across time points for the feedback trials [time points 4 (81%) > 3 (73%)

(t(40) = 3.04; p < 0.005); time point 3 (73%) marginally > 2 (68%) (t(40) = 1.80; p =

0.08); time point 2 (68%) > 1 (59%) (t(40) = 3.28; p < 0.005)], but not for the

observation trials (all p values > 0.05). The significant interaction of learning type

by block was caused by significantly greater accuracy for observation (74%) than

feedback (62%) cues in the first block (t(40) = 3.37; p < 0.005) with no difference

between learning types observed in the second block (FB = 79%; OB = 81%; t(40)

= 0.57; p > 0.05). This was most likely due to the fact that participants

significantly increased their accuracy for the feedback trials throughout the

course of the first block in the update phase (as seen in the learning type x time

point x block interaction). The significant interaction of time point by block was

driven by increases across time points in block 1 [time point 4 (79%) marginally

greater than 3 (73%) (t(40) = 1.78; p = 0.08); time point 3 (73%) > 2 (66%) (t(40) =

2.80; p < 0.01); time point 2 (66%) > 1 (55%) (t(40) = 3.26; p < 0.005)], but no

92

significant increases in accuracy across time points in block 2 [only marginally

greater accuracy for time point 4 (84%) > (79%) (t(40) = 1.85; p = 0.07), all other p

values > 0.05], suggesting that participants’ accuracy was stabilized in the

second block of the update phase, demonstrating that participants updated cue

contingencies successfully by the end of the first block. Lastly, the three way

interaction of learning type by time point by block was driven by a significant

increase in only feedback trial accuracy in the first block only [time point 4 (79%)

> 3 (69%) (t(40) = 2.38; p < 0.05); time point 3 (69%) > 2 (59%) (t(40) = 2.63; p <

0.05); time point 2 (59%) > 1 (38%) (t(40) = 4.43; p < 0.001)], with no significant

changes in the second block for feedback trials (all p values > 0.05), and no

significant changes in either block for the observation trials (all p values > 0.05;

Figure 4.2 C and D). To summarize, these results suggest that participants in

both groups were initially worse at specifically feedback cues following

probabilistic cue value reversal, but rapidly updated their responses such that by

the end of the first update block, there was no significant difference between

feedback and observation cue performance. Interestingly, neither a significant

main effect of group nor any significant interactions with group were observed,

suggesting that both groups updated cue contingencies in a similar fashion.

Test 2: Accuracy

The second test phase was administered to determine how well

participants learned the new cue contingencies, after the reversal in the update

phase. Critically, as in all test sessions, no informative information about the cue

was provided (no feedback for the feedback trials; no observational information

93

for the observation trials). In this test, feedback, observation, and novel trials

were administered for both group 1 and group 2. The first question of interest

examined if participants in both groups performed better on information learned

in the update phase compared with novel information and compared accuracy

across the two groups to determine if one group performed better than the other

in this test session. To address this question, a 3 (learning material: feedback

vs. observation vs. novel) x 2 (group: feedback vs. observation) repeated-

measures ANOVA was conducted using learning material as a within-subjects

factor and group as a between-subjects factor. A main effect of learning material

(F(1.67,65.10) = 26.28; p < 0.001), no main effect of group (F(1,39) = 0.003; p > 0.05),

and no significant interaction between learning material and group (F(1.67,65.10) =

0.42; p > 0.05) were observed. Post-hoc paired-samples t-tests were performed

to examine the main effect of learning material, collapsed across group. As

expected, results indicated no accuracy differences between previously learned

material in the update phase (feedback vs. observation trials; t(40) = 1.36; p >

0.05), and greater accuracy for both feedback and observation cues compared to

novel cues (t(40) = 6.28; p < 0.001 and t(40) = 5.13; p < 0.001, respectively).

The second analysis probed for potential differences between levels of

cue difficulty and learning types, across groups, excluding novel information. In

order to do so, a 2 (learning type: feedback vs. observation) x 3 (cue difficulty:

deterministic vs. easy vs. hard) x 2 (group: group1/feedback vs.

group2/observation) repeated-measures ANOVA was conducted using learning

type and cue difficulty as within-subjects factors and group as a between-

94

subjects factor. No significant results were observed (no main effect of learning

type (F(1,39) = 1.80; p > 0.05); no main effect of cue difficulty (F(1.55,60.57) = 1.31; p

> 0.05); no main effect of group (F(1,39) = 0.007; p > 0.05); and no significant

interactions - all p values greater than 0.05). These results suggest that

irrespective of group or learning type, participants updated cue contingencies

well.

Follow-up Test: Accuracy

A final test was conducted behaviorally in the laboratory to examine

changes in accuracy approximately 2 days after participation in the fMRI portion

of the experiment. The same two analyses were performed to examine potential

performance differences in this phase as were performed in the second test

phase administered inside the MRI. First, to examine potential differences within

learning material across groups, a 3 (learning material: feedback vs. observation

vs. novel) x 2 (group: group1/feedback vs. group2/observation) repeated-

measures ANOVA was conducted using learning material as a within-subjects

factor and group as a between-subjects factor. A main effect of learning material

(F(1.67,64.97) = 15.33; p < 0.001), no main effect of group (F(1,39) = 0.219; p > 0.05),

and no significant interaction between learning material and group (F(1.67,64.97) =

1.16; p > 0.05) were observed. Post-hoc paired-samples t-tests probing the main

effect of learning material (collapsed across group) indicated no difference

between previously learned material (feedback vs. observation cues; t(40) = 1.00;

p > 0.05) and significantly greater accuracy for both feedback and observation

cues compared to novel cues (t(40) = 4.96; p < 0.001 and t(40) = 6.51; p < 0.001,

95

respectively) as was expected based on the results from the previous test

phases.

Second, to probe for potential cue difficulty differences across groups in

this follow-up test phase, a 2 (learning type: feedback vs. observation) x 3 (cue

difficulty: deterministic vs. easy vs. hard) x 2 (group: group1/feedback vs.

group2/observation) repeated-measures ANOVA was conducted using learning

type and cue difficulty as within-subjects factors and group as a between-

subjects factor. As anticipated, based on the results from the second test

administered on day 1, there were no significant findings. Specifically, no main

effect of learning type (F(1,39) = 1.34; p > 0.05); no main effect of cue difficulty

(F(2,78) = 0.376; p > 0.05); no main effect of group (F(1,39) = 1.05; p > 0.05); and no

significant interactions (all p values greater than 0.05) were observed. These

results therefore confirm the results of the second test administered on day 1,

suggesting that participants updated cue contingencies well, irrespective of how

they initially acquired the cue associations (the group they were assigned to:

group1/feedback vs. group2/observation), the trial learning type, or the difficulty

of the probabilistic cues.

4.3.2 Neuroimaging results

Learning phase

Main Effect of Cue Difficulty: One unresolved question from the results of

experiment 1 was to determine if the MTL and BG are engaged in pure feedback

learning as well as pure observation learning (nonbiased by both learning types

presented in the same learning session). In order to examine the engagement of

96

these areas during distinct types of learning, the feedback groups’ and

observation groups’ neuroimaging data were analyzed separately. Therefore,

two independent within-subjects, repeated-measures ANOVAs investigating a

main effect of cue difficulty were performed within BrainVoyager: one for the

feedback learning participants (group 1) and a separate ANOVA for the

observation learning participants (group 2). Results indicated that a network of

similar regions was engaged in both the feedback learning group (Table 4.1) and

observation learning group (Table 4.2), including loci within the striatum, medial

temporal lobe, insula, cingulate cortex, and prefrontal areas.

Feedback learning group: A region of the right caudate nucleus (x, y, z =

5, 1, 12; Figure 4.3A) and left medial temporal lobe, encompassing the amygdala

and hippocampus (x, y, z = -22, -8, -15; Figure 4.3C) were engaged during

feedback learning. Mean parameter estimates were extracted from these

regions of interest for further analysis. Activity within the caudate nucleus was

greatest in response to hard cues and decreased according to cue difficulty (hard

> easy > deterministic) (hard cues marginally greater than easy cues, t(20) = 2.90;

p = 0.07; easy > deterministic t(20) = 4.30; p < 0.0005; Figure 4.3B), while the left

MTL showed the opposite pattern (deterministic > easy > hard) (deterministic

marginally greater than easy cues, t(20) = 1.96; p = 0.06; easy > hard t(20) = 3.10;

p < 0.01; Figure 4.3D).

Observation learning group: Within the observation group, a region of the

right caudate nucleus (x, y, z = 5, -2, 15), the left caudate nucleus extending into

the left putamen (x, y, z = -13, 1, 9) (Figure 4.4 A), and left hippocampus (x, y, z

97

= -25, -20, -12; Figure 4.4C) were engaged during learning. Mean parameter

estimates were extracted from these regions for further examination. A similar

pattern of activity appeared within the right caudate nucleus, left caudate nucleus

extending into the putamen, and the hippocampus as was observed in the

feedback learning group. Specifically in the right caudate nucleus, hard cues

elicited stronger activation compared with easy and deterministic cues (t(19) =

4.77; p < 0.0005 and t(19) = 4.26; p < 0.0005, respectively; Figure 4.4B), with a

similar pattern observed in the left caudate/putamen region (hard > easy and

deterministic; t(19) = 4.11; p < 0.001 and t(19) = 4.55; p < 0.0005, respectively; not

depicted in the figure). The left hippocampus showed the opposite pattern with

deterministic and easy cues eliciting stronger activation than hard cues (t(19) =

3.62; p < 0.005 and t(19) = 4.28; p < 0.0005, respectively; Figure 4.4D).

These results therefore are a partial replication of the results observed in

experiment 1. Loci within both the caudate nucleus and hippocampus were

involved in feedback learning, as well as observation learning and were

modulated by cue difficulty. The hippocampus exhibited the same pattern of

results as reported in experiment 1, however the caudate nucleus, was engaged

differentially in the learning task employed in experiment 1 versus the task

employed in experiment 2; an interpretation of this differentiation is provided in

the discussion section.

Learning phase: Correlations between striatum and medial temporal lobe

As a rudimentary manner of examining interactions between the striatum

and medial temporal lobe during learning, a series of Pearson’s correlations were

98

performed on mean parameter estimates extracted from the striatal and medial

temporal lobe regions in the feedback and observation learning groups.

Specifically, the areas functionally defined by the main effect of cue difficulty

analyses were used in the analyses.

Feedback learning group: In the feedback group, no significant positive or

negative correlations were observed between the mean BOLD signal extracted

from the right caudate nucleus (x, y, z = 5, 1, 12) and left medial temporal lobe (x,

y, z = -22, -8, -15).

Observation learning group: In the observation group, a significant

positive correlation was observed between the right caudate nucleus (x, y, z = 5,

-2, 15) and the left MTL (x, y, z = -25, -20, -12) for observation deterministic cues

(r = 0.467, p = 0.04) with a nearly significant positive correlation observed

between these same regions for easy cues (r = 0.434, p = 0.056). No other

significant positive or negative correlations were observed.

Update phase

The key question of interest in the update phases was to examine what

brain regions were engaged during the updating of probabilistic learning, when

learning types remained the same (feedback-feedback and observation-

observation) versus when learning types evolved (feedback-observation and

observation-feedback). It was hypothesized that this novel manner of examining

how multiple memory systems interact during a dynamic learning environment

may provide new insight to the debate regarding the nature of interactions among

memory regions. In order to address this question of interest, one analysis was

99

performed which included all forty-one participants’ data – combining groups 1

(feedback) and 2 (observation). A 2 (learning type: same vs. different) x 3 (cue

difficulty: deterministic vs. easy vs. hard) x 2 (group: group1/feedback vs.

group2/observation) repeated-measures ANOVA was performed within

BrainVoyager using learning type and cue difficulty as within-subjects measures

and group as a between-subjects measure. The results presented below

highlight the most relevant findings.

Main Effect of Cue Difficulty: A similar network of regions engaged in

processing differences in cue difficulty was observed during the update phase as

was engaged during the learning phases, including loci within the striatum, MTL,

insula, cingulate cortex, as well as several frontal and parietal regions (Table

4.5). Mean parameter estimates were extracted from key areas of interest,

specifically two regions bordering the right putamen/insula (x, y, z = 26, 13, 3 and

x, y, z = 29, -14, 12; Figure 4.5A and C) and bilateral parahippocampus (x, y, z =

20, -20, -18, not shown in the figure, and x, y, z = -22, -14, -21; Figure 4.5E).

Interestingly, the mean BOLD signal in the more anterior region of the lateral

putamen/insula (x, y, z = 26, 13, 3) scaled according to difficulty with the greatest

response elicited for hard cues (irrespective of learning type): hard > easy >

deterministic (hard > easy t(40) = 3.12; p < 0.005; easy > deterministic t(40) = 2.13;

p < 0.05; Figure 4.5B). However, the more lateral and posterior region of the

putamen/insula identified (x, y, z = 29, -14, 12) showed the opposing pattern of

activity, with the greatest response elicited to deterministic cues (irrespective of

learning type): deterministic > easy > hard (deterministic > easy t(40) = 2.28; p <

100

0.05; easy > hard t(40) = 2.39; p < 0.05; Figure 4.5D). Furthermore, within the

parahippocampus , the BOLD signal also scaled according to difficulty with both

regions showing the greatest response to deterministic cues: deterministic > easy

> hard (irrespective of learning type; right parahippocampus: deterministic >

easy: t(40) = 2.30; p < 0.05; easy > hard: t(40) = 2.72; p < 0.01; left

parahippocampus: deterministic marginally > easy: t(40) = 1.85; p = 0.07; easy >

hard: t(40) = 2.67; p < 0.05; Figure 4.5F). This result of both similar and opposing

patterns of BOLD responses within regions of the putamen and

parahippocampus when updating cue contingencies following a reversal in cue

values supports parallel engagement of these regions during the updating

process.

Main Effect of Learning Type: Brain regions which processed differences

in learning type, that being variations in BOLD responses to the same versus a

different learning type as was employed in the learning phase, were also

investigated. A group of areas involved in processing differences in learning type

included a locus within the cingulate cortex (x, y, z = -10, -5, 36), a locus within

the insula (x, y, z = 41, 7, 6), as well as several parietal and temporal regions

(Table 4.6). Interestingly, no areas within the medial temporal lobes or basal

ganglia exhibited a main effect of learning type, indicating that these regions did

not differentiate between whether the learning type remained the same or

changed compared with initial cue acquisition.

Main Effect of Group: Areas processing differential activity for participants

in the feedback group versus those in the observation group were also

101

examined. The main effect of group analysis revealed activation within two areas

which survived correction, the precentral gyrus and the thalamus (Table 4.7).

Interestingly, no engagement of the basal ganglia, MTL, or midbrain was

observed in processing a main effect of group. While null findings in

neuroimaging studies are difficulty to interpret, and must be done with caution,

one possible interpretation is that this null result may be indicative of a parallel

mode of operations between multiple memory regions, as no key region of

interest was engaged in processing greater activation to the feedback versus the

observation group, or vice versa.

Interaction of Learning Type and Group: Several regions exhibited an

interaction of learning type and group, including loci within the striatum and

midbrain, as well as the insula, and cingulate cortex. Specifically the bilateral

caudate nucleus (right: x, y, z = 8, 3, 11; and left x, y, z = -8, 3, 8) as well as the

bilateral midbrain, centered on the VTA (right: x, y, z = 7, -15, -9; and left: x, y, z

= -4, -15, -9) exhibited an interaction of learning type by group. Mean parameter

estimates were extracted from these regions of interest for further examination.

In all regions examined, the interaction was driven by a larger response to same

than different learning trials for the feedback group (e.g., feedback trials), and a

larger response to different than same learning trials for the observation group

(e.g., feedback trials) [right caudate: FB group: same > different t(20) = 6.85; p <

0.0001; OB group: different > same: t(19) = 8.34; p < 0.0001; left caudate: FB

group: same > different t(20) = 8.77; p < 0.0001; OB group: different > same: t(19) =

5.14; p < 0.0001; right midbrain: FB group: same > different t(20) = 8.28; p <

102

0.0001; OB group: different > same: t(19) = 4.95; p < 0.0001; left midbrain: FB

group: same > different t(20) = 7.90; p < 0.0001; OB group: different > same: t(19) =

5.92; p < 0.0001]. Therefore, within both the feedback and observation groups, a

greater response to feedback trials was observed in loci within the caudate

nucleus and midbrain. This result is consistent with the literature suggesting a

role for the striatum and midbrain in processing feedback information (Packard

and Knowlton, 2002; Wise, 2004; Delgado, 2007).

Update phase: Correlations between striatum and medial temporal lobe

Correlations were performed on mean parameter estimates extracted from

striatal and medial temporal lobe regions in the update phase, functionally

defined by the main effect of cue difficulty analysis. Only correlations across

regions, which were matched according to cue difficulty (deterministic-

deterministic, easy-easy, hard-hard) were examined. The only significant across-

region correlation observed was between the right parahippocampus (x, y, z =

20, -20, -18) and right putamen/insula (x, y, z = 29, -14, 12) for deterministic cues

in the same learning condition (r = 0.308, p = 0.050).

Granger Causality Analysis

Granger causality analyses were conducted for the feedback and

observation groups separately during the update phase. The purpose of this

analysis was to examine functional and effective connectivity during the dynamic

portion of the task when participants were updating previous cue contingencies

via both feedback and observation trials. The midbrain (centered on the VTA)

was chosen as the seen region given its anatomical projections to both the

103

striatum and the hippocampus and its critical role in reward-related learning and

memory processes (Figure 4.6A).

1) Functional connectivity:

Feedback group: Within the feedback group, using the right VTA as the

seed region resulted in functional connectivity with the bilateral caudate nucleus

extending into the putamen (x, y, z = 6, 4, 6 and x, y, z = -17, 1, 5) as well as the

right hippocampus (x, y, z = 24, -20, -6). Functional connectivity maps using the

left VTA seed revealed very similar instantaneous correlations with the bilateral

caudate nucleus extending into the putamen/globus pallidus (x, y, z = 8, 1, 12

and x, y, z = -12, -2, 4) and the bilateral hippocampus (x, y, z = 24, -22, -8 and x,

y, z = -32, -15, -13).

Observation group: Functional connectivity in the observation group using

the right and left VTA seeds occurred with similar regions of the basal ganglia,

namely loci within the bilateral globus pallidus. Two loci of the globus pallidus,

(x, y, z = 8, 1, -1 and x, y, z = -15, 1, -2) correlated with the right VTA, while two

similar areas of the globus pallidus (x, y, z = -8, 1, -3 and x, y, z = 18, -2, -6)

correlated with the left VTA. In addition, the right VTA exhibited functional

connectivity with the bilateral parahippocampus (x, y, z = 16, -23, -12 and x, y, z

= -19, -28, -9) while the left VTA exhibited connectivity with the bilateral posterior

entorhinal cortex (x, y, z = 17, -23, -9 and x, y, z = -20, -25, -9).

To summarize the functional connectivity results, instantaneous

correlations were observed between the bilateral VTA, the caudate nucleus

extending into the globus pallidus, and the hippocampus in the feedback group

104

while somewhat overlapping regions in the globus pallidus and medial temporal

lobes were functionally connected with the bilateral VTA in the observation

group. Therefore, similar functional connectivity in known reward-related and

memory regions (midbrain, basal ganglia, and medial temporal lobes) was

observed across groups, suggesting interactions between these areas during the

updating of previously learned cue contingencies.

2) Effective connectivity:

Feedback group: Maps using the left VTA as a seed region revealed

directed connectivity originating from the bilateral putamen (x, y, z = 15, 2, 5 and

x, y, z = -19, 0, 5) to the left VTA (Figure 4.6B). Using the right VTA seed

revealed directed influences from the right VTA to the right hippocampus (x, y, z

= 26, -20, -7; Figure 4.6C) and right parahippocampus (x, y, z = 16, -31, -12; not

shown).

Observation group: Effective connectivity using the left VTA as a

reference region indicated no regions of interest surviving cluster threshold

correction. Using the right VTA as a reference region revealed directed

influences from the right VTA going to the left hippocampus (x, y, z = -30, -21, -

9), the left parahippocampal gyrus (x, y, z = -25, -29, -9) (Figure 4.6D), as well as

the posterior cingulate and thalamus.

In sum, a unilateral flow of information was observed in which directed

influences from the putamen were sent to the left VTA, and directed influences

from the right VTA were sent to the hippocampus and parahippocampus in the

feedback group. The observation group exhibited a segment of this arc, that

105

from the right VTA to the left hippocampus and parahippocampus [a region of the

left putamen (x, y, z = -24, -3, 6) did send directed influences to the left VTA, but

did not survive cluster threshold correction].

Prediction Error Analysis

Learning phase:

Feedback learning group: A traditional Q learning model was used to

estimate prediction error values for the feedback trials in group 1 (feedback

group). Regions that positively correlated with this prediction error signal

included three loci within the basal ganglia, as well as an area within the anterior

cingulate (x, y, z = -1, 34, -6) and the inferior frontal gyrus (x, y, z = -49, 25, 15).

Within the basal ganglia, the ventral striatum (nucleus accumbens: x, y, z = 5, 4,

-3), the internal segment of the globus pallidus (x, y, z = -13, 1, -3) (Figure 4.7A),

and the head of the caudate nucleus (x, y, z = 14, 22, 9; not shown) all positively

correlated with the prediction error signal.

Observation learning group: An action prediction error, which is a non-

signed measure of participants’ surprise at viewing a given cue-value association

(saliency signal), was calculated for the observation learning trials for group 2

(observation group). Areas of interest which positively correlated with this action

prediction error signal included the medial frontal gyrus (x, y, z = 5, -2, 48), the

midbrain (x, y, z = 2, -26, -6), and the bilateral insula (x, y, z = 29, 16, 0 and x, y,

z = -28, 16, 6) (not depicted in the figure).

To summarize the learning phase prediction error results, areas of the

basal ganglia were involved in encoding feedback prediction errors, including the

106

ventral striatum, globus pallidus, and caudate nucleus, consistent with previously

reported results in the literature (Ashby et al., 2002; O'Doherty et al., 2004;

Tanaka et al., 2004; Pessiglione et al., 2006). For the observation trials, the

action prediction error signal engaged distinct regions including the medial frontal

gyrus, the midbrain, and the insula, which did not replicate previous findings in

the literature (Burke et al., 2010).

Update phase: Both feedback prediction errors (for the feedback trials) and

action prediction errors (for the observation trials) were examined in the update

phase, for both group 1 (feedback) and group 2 (observation).

Feedback group: Areas of interest which positively correlated with the

prediction error signal for the feedback trials included the right ventral caudate

nucleus (x, y, z = 2, 10, 0) and the external component of the left globus pallidus

(x, y, z = -19, -2, -6), regions fairly similar to those engaged during the learning

phase while encoding the prediction error signal for this group (Figure 4.7B). For

the observation trials, an area of interest that positively correlated with the action

prediction error signal included the superior frontal gyrus (x, y, z = -22, 13, 48) a

region distinct from the medial frontal gyrus which encoded this signal in the

observation group leaning phase (not depicted in the figure).

Observation group: Loci within the ventral striatum (bilateral nucleus

accumbens: x, y, z = 5, 4, -3 and x, y, z = -10, 4, -3; Figure 4.7C), the medial

temporal lobe (encompassing the hippocampus/amygdala: x, y, z = -25, -5, -21;

Figure 4.7D), and the medial frontal gyrus (x, y, z = -1, 22, -15; not shown)

positively correlated with the prediction error signal for the feedback trials. For

107

the observation trials, the cingulate gyrus (x, y, z = 11, -29, 36) was the only area

that positively correlated with the action prediction error signal (not shown in the

figure).

To summarize, in the update phase, regions of the basal ganglia

correlated with feedback prediction errors in both groups, with the observation

group additionally showing correlations with the medial temporal lobe and medial

frontal gyrus. The action prediction error signal correlated with distinct regions in

the feedback (superior frontal gyrus) and observation groups (cingulate gyrus).

4.3.3 Exploratory analyses

Two additional analyses were performed for exploratory purposes in order

to examine potential effects of cue accuracy and cue consistency on the BOLD

responses in the BG and MTL, as well as to probe for potential differences in

these regions for participants who learned to optimize their responses versus

those who did not. The first analysis investigated potential differences in the BG

and MTL when accounting for cue accuracy (correct versus incorrect) in the

feedback trials and cue consistency (consistent versus inconsistent) in the

observation trials. This analysis was performed as the BG and MTL are known

to be modulated by both accuracy and consistency during learning (Delgado et

al., 2000; Delgado, 2007; Chen et al., 2011; Li et al., 2011). For each region

described below, BOLD responses were extracted from the previously defined

functional regions of interest processing a main effect of cue difficulty (FB group:

right caudate nucleus and left MTL). A within-subjects ANOVA was performed

examining potential differences in cue difficulty and cue accuracy for the

108

feedback trials in each ROI. To summarize the results briefly, no significant main

effects of cue accuracy or significant interactions were observed (right caudate

nucleus: main effect of cue difficulty F(2,38) = 11.69; p < 0.001; marginally

significant main effect of cue accuracy F(1,19) = 3.19; p = 0.09; no significant

interaction F(2,38) = 0.29; p > 0.05; left MTL: main effect of cue difficulty F(2,38) =

9.10; p < 0.005; no main effect of cue accuracy F(1,19) = 0.55; p > 0.05; no

significant interaction F(2,38) = 0.43; p > 0.05). However, for exploratory purposes,

Student’s t-tests were performed in order to examine potential differences across

levels of difficulty according to accuracy (corrected for multiple comparisons) as it

was desired to examine if the responses in the BG and MTL were driven by

responses to correct rather than incorrect trials (e.g., easy correct > hard correct,

but no difference for incorrect trials). Note one person was excluded from this

analysis as he/she experienced no incorrect trials in the deterministic condition.

In the learning phase, for the feedback group, the right caudate nucleus

and the left MTL exhibited differences in cue difficulty for both correct and

incorrect trials. In the right caudate nucleus, BOLD response for hard correct

trials was greater than deterministic correct trials (t(19) = 2.89; p < 0.01) while a

marginally greater response for hard than easy correct trials (t(19) = 1.88; p =

0.08) was observed. The same pattern was observed for incorrect trials (hard >

deterministic t(19) = 3.06; p < 0.01 and marginally greater for hard than easy t(19) =

2.26; p = 0.04; trend after sequential Bonferroni correction), suggesting that the

pattern of activity in the caudate was not driven by one type of feedback alone.

The left MTL however, exhibited greater responses for deterministic than hard

109

correct trials (t(19) = 3.17; p < 0.005) and marginally greater responses for easy

than hard correct trials (t(19) = 1.63; p = 0.12). Marginally greater BOLD signals

were observed for deterministic than hard incorrect trials (t(19) = 2.26; p = 0.04;

trend after sequential Bonferroni correction) and significantly greater for easy

than hard incorrect trials (t(19) = 2.65; p < 0.05). This pattern of results suggests

that both the caudate nucleus and MTL were not primarily modulated by correct

versus incorrect trials. Furthermore, accounting for accuracy did not change the

pattern of results observed in these areas; that is, in the striatum the greatest

BOLD response was elicited in response to hard cues, while in the MTL the

largest BOLD response occurred for deterministic cues.

A similar analysis was performed in the update phase, including both the

feedback and observation group as these groups were combined in this phase.

Note five people were excluded from this analysis as they experienced no

incorrect trials for at least one condition. Four repeated-measures ANOVAs were

performed using cue difficulty and feedback cue accuracy as within-subjects

measures and group as a between-subjects measure (feedback group and

observation group; one ANOVA per ROI). To summarize the results briefly, no

significant main effect of cue accuracy was observed (anterior putamen/insula:

F(1,34) = 0.58; p > 0.05; posterior putamen/insula F(1,34) = 1.79; p > 0.05; right

parahippocampus: F(1,34) = 0.89; p > 0.05; left parahippocampus: F(1,34) = 2.47; p

> 0.05). The anterior putamen/insula exhibited a significant three way interaction

of cue difficulty, cue accuracy, and group (F(2,68) = 3.60; p < 0.05) and the

posterior putamen/insula exhibited a significant interaction of cue difficulty by cue

110

accuracy (F(1.66,56.40) = 3.59; p < 0.05). For exploratory purposes, Student’s t-

tests were performed in order to examine potential differences across levels of

difficulty according to accuracy (corrected for multiple comparisons).

Interestingly, differences across levels of cue difficulty in these ROIs occurred in

correct, but not incorrect trials. The anterior putamen exhibited greater

responses for hard than deterministic (t(35) = 3.39; p < 0.0005) and marginally

greater responses for hard than easy correct trials (t(35) = 1.89; p = 0.07) and

easy than deterministic correct trials (t(35) = 2.02; p = 0.05). The more posterior

region of the putamen exhibited the opposing pattern of activity, with greater

responses for deterministic compared with easy (t(35) = 3.04; p < 0.005) and hard

(t(35) = 4.29; p < 0.001) correct trials. In the right parahippocampus marginally

greater responses were observed for deterministic compared with hard correct

trials (t(35) = 1.82; p = 0.07). In the left parahippocampus, greater BOLD

responses were elicited for both deterministic and easy compared hard correct

trials (t(35) = 3.41; p < 0.005 and t(35) = 2.89; p < 0.01 respectively). No

differences were observed across levels of cue difficulty for the incorrect trials in

any ROI (all p values > 0.05). Therefore, in the update phase, the putamen and

parahippocampus exhibited differences in BOLD responses across levels of

difficulty in correct but not incorrect trials. However, accounting for cue accuracy

did not change the pattern of results observed in these regions.

Lastly, the pattern of the putamen and parahippocampal ROIs in the

update phase observation trials was examined, when accounting for cue

consistency. Note one person was excluded from this analysis as he/she

111

experienced no easy inconsistent trials (due to missed trials). Four repeated-

measures ANOVAs were performed using cue difficulty and cue consistency as

within-subjects measures and group as a between-subjects measure (one

ANOVA per ROI). To summarize briefly, the putamen ROIs exhibited no effects

of cue consistency (anterior putamen/insula: F(1,38) = 1.17; p > 0.05; posterior

putamen/insula F(1,38) = 0.29; p > 0.05), while the parahippocampal ROIs were

modulated by cue consistency (right parahippocampus: F(1,38) = 11.32; p < 0.005

and a marginally significant interaction of cue difficulty and cue consistency F(1,38)

= 3.96; p = 0.05; left parahippocampus: F(1,38) = 5.13; p < 0.05). The right

parahippocampus indicated a main effect of cue consistency, with greater BOLD

responses observed for inconsistent (e.g., circle is lower than 5) than consistent

trials (e.g., circle is higher than 5 – value of circle is 87.5% higher than 5; t(39) =

2.75; p < 0.01) while the left parahippocampus exhibited a marginally greater

response for inconsistent trials (t(39) = 1.80; p = 0.08). In both regions, this effect

was driven by differences in easy, but not hard, cue consistency (right

parahippocampus: easy inconsistent > consistent; t(39) = 3.27; p < 0.005; left

parahippocampus: easy inconsistent > consistent; t(39) = 2.80; p < 0.01).

The second exploratory analysis probed for potential differences between

participants who learned to optimize their responses, termed optimizers, versus

those who did not optimize their responses, termed non-optimizers, during the

learning and update phases. It was theorized that the responses in the MTL and

BG may be modulated by the ability to optimize during the task (e.g., optimizers

may exhibit greater MTL activation but reduced BG recruitment compared with

112

non-optimizers). In the learning phase, behavioral accuracy was examined in the

last 24 trials of the second block of learning (trials 72-96). Participants who were

100% correct for either all the easy cues or all the easy and hard cues presented

in the last 24 trials were termed optimizers, whereas participants who were less

than 100% correct were categorized as non-optimizers. In the feedback group, 9

participants were categorized as optimizers while the remaining 12 were

categorized as non-optimizers. In the observation group, 8 participants were

categorized as optimizers while the remaining 12 were categorized as non-

optimizers. Two repeated-measures ANOVAs using cue difficulty (deterministic,

easy, and hard) as a within-subjects factor and optimizer group (optimizer and

non-optimizer) as a between-subjects factor were performed (one for the FB and

one for the OB group) in order to examine potential differences in accuracy

throughout learning for participants who optimized their responding at the end of

the learning phase versus those who did not.

In the feedback group, results indicated a main effect of cue difficulty

(F(2,38) = 25.95; p < 0.001), a main effect of optimizer group (F(1,19) = 15.38; p <

0.005), and a significant interaction (F(2,38) = 4.06; p < 0.05). Optimizers’

accuracy (82%) in the entire learning phase was significantly better than non-

optimizers (70%; t(19) = 3.92; p < 0.005). This was driven by significantly greater

accuracy for easy cues (t(16.99) = 7.14; p < 0.001) and marginally greater accuracy

for deterministic cues (t(19) = 1.83; p = 0.08) for optimizers compared with non-

optimizers. For the observation group, results indicated a main effect of cue

113

difficulty (F(1.39,25.05) = 66.07; p < 0.001), no main effect of optimizer group (F(1,18)

= 0.38; p > 0.05), and no significant interaction (F(1.39,25.05) = 0.67; p > 0.05).

The next analyses examined whether the BOLD responses in the

functionally defined ROIs in the learning phase were modulated by optimizer

group. Specifically, the regions of the caudate nucleus and MTL which exhibited

a main effect of cue difficulty were examined. A repeated-measures ANOVA was

performed on the mean BOLD responses extracted from each ROI using cue

difficulty as within-subjects factor and optimizer group as a between-subjects

factor. In the FB group, the right caudate nucleus exhibited a main effect of cue

difficulty (F(2,38) = 14.71; p < 0.001), no main effect of optimizer group (F(1,19) =

1.86; p > 0.05), and a significant interaction of cue difficulty by optimizer group

(F(2,38) = 7.09; p < 0.005). The interaction was driven by marginally greater BOLD

responses for easy cues in the non-optimizers than the optimizers (t(16.19) = 2.37;

p = 0.03; trend after sequential Bonferroni correction). This result is consistent

with the critical role of the striatum in acquiring cue contingencies and then

decreasing activation once contingencies are acquired (Tricomi et al., 2004;

Delgado et al., 2005). The left MTL however exhibited a main effect of cue

difficulty (F(2,38) = 11.98; p < 0.001), but no main effect of optimizer group (F(1,19) =

0.97; p > 0.05), and no significant interaction (F(2,38) = 0.22; p > 0.05).

The same ANOVA was performed on the regions engaged in learning in

the observation group. The right and left caudate nucleus and left MTL exhibited

main effects of cue difficulty (right caudate nucleus: F(2,36) = 13.38; p < 0.001; left

caudate nucleus: F(2,36) = 12.88; p < 0.001; left MTL: F(2,36) = 8.83; p < 0.005), but

114

not optimizer group (right caudate nucleus: F(1,18) = 0.11; p > 0.05; left caudate

nucleus: F(1,18) = 1.35; p > 0.05; left MTL: F(1,18) = 0.49; p > 0.05), and no

significant interaction (right caudate nucleus: F(2,36) = 2.46; p > 0.05; left caudate

nucleus: F(2,36) = 0.00; p > 0.05; left MTL: F(2,36) = 3.20; p = 0.05). To summarize

therefore, only the right caudate nucleus in the feedback group was modulated

by optimizer group, no other region’s BOLD responses were modulated by

whether or not participants optimized their responding at the end of the learning

phase.

In the update phase, in order to classify optimizer versus non-optimizers

different criteria were used than the criterion used in the learning phase. In this

phase, behavioral accuracy was examined in the second block of the update

phase as a whole, rather than examining only the last 24 trials. This was done

because when examining the last 24 trials of the second block, nearly every

participant optimized their responses and since participants were relearning cue

contingencies, they had more experience with the task. Therefore, any

participant who was 100% correct for the easy cues was categorized as an easy

cue optimizer, any participant who was 100% correct for the easy and hard cues

was categorized as an easy/hard cue optimizer, and those who did not optimize

for either easy or easy and hard cues were termed non-optimizers. Using this

criterion, 15 participants were easy cue optimizers, 13 participants were

easy/hard cue optimizers, and the remaining 13 participants were non-

optimizers. A repeated-measures ANOVA was performed on the behavioral data

using learning type (same or different) and cue difficulty (deterministic, easy, and

115

hard) as within-subjects measures and learning phase group (feedback or

observation) and optimization group (easy cue optimizers, easy/hard cue

optimizers, and non-optimizers) as between-subjects factors. Results indicated

no main effect of learning type (F(1,35) = 0.08; p > 0.05), a main effect of cue

difficulty (F(2,70) = 33.73; p < 0.001), no main effect of learning phase group (F(1,35)

= 0.08; p > 0.05), and a main effect of optimization group (F(2,35) = 7.05; p <

0.005). Significant interactions included learning type by learning phase group

(F(1,35) = 6.05; p < 0.05), and a three way interaction of cue difficulty, learning

phase group, and optimization group (F(4,70) = 3.07; p < 0.05). Of interest was the

observation that accuracy overall was modulated by optimization group with the

easy/hard cue optimizers (83%) significantly more accurate than easy cue

optimizers (73%;t(21.04) = 3.09; p < 0.01) and non-optimizers (67%; t(24) = 4.10; p <

0.001).

Lastly, the ROIs functionally defined by the main effect of cue difficulty

analysis were examined to probe for potential differences between optimizers

and non-optimizers in the update phase. The same ANOVA performed on the

behavioral data was conducted on the neuroimaging data, one ANOVA per ROI.

To summarize the findings briefly, no region exhibited a main effect of optimizer

group [anterior putamen: (F(2,35) = 0.35; p > 0.05); posterior putamen: (F(2,35) =

1.96; p > 0.05); right parahippocampus: (F(2,35) = 1.16; p > 0.05); left

parahippocampus: (F(2,35) = 0.41; p > 0.05)], nor any significant interactions with

optimizer group (all p values > 0.05). Therefore, the BOLD responses during the

update phase in the putamen and parahippocampus were not significantly

116

different for participants who optimized their responses in this phase versus

those who did not.

4.4 Discussion

The purpose of this experiment was to examine in a novel manner how

the medial temporal lobe and basal ganglia interact during probabilistic learning,

by employing multiple types of learning (feedback and observation trials) and a

reversal of probabilistic cue contingencies. Behavioral results indicated that

participants in the observation group initially acquired cue contingencies better

than participants in the feedback group. In the update phase, interestingly, all

participants, irrespective of group, were initially more efficient at updating old cue

contingencies with new cue values via observation learning. Over the course of

the update phase, however, updating via feedback improved for both groups,

such that by the end of the update phase and in the test phases, no learning type

differences were observed. Neuroimaging results revealed the engagement of

the MTL and BG during both feedback (group 1) and observation (group 2)

learning, supporting the hypothesis that these regions are engaged

simultaneously and may operate in parallel during probabilistic learning. In

further support of this hypothesis, while both of these regions were engaged

during learning, they exhibited differential patterns of activity, with the basal

ganglia exhibiting greatest BOLD responses to hard cues and the medial

temporal lobe exhibiting greatest BOLD responses to deterministic cues. Results

from the update phase corroborate this finding as loci within the basal ganglia

117

and medial temporal lobe were engaged in the update phase, processing

differences in cue difficulty. In the update phase, interestingly, distinct loci within

the basal ganglia exhibited both differential (BOLD signal greatest for hard cues)

as well as similar (BOLD responses greatest for deterministic cues) patterns of

activity as those observed in the medial temporal lobe (BOLD signal greatest for

deterministic cues). The application of functional and effective connectivity

allowed for a more sophisticated examination of interactions among these

regions during the reversal of probabilistic cue contingencies. Results revealed

both functional (instantaneous correlations) and effective (directed influences)

connectivity among key reward-related and memory structures, specifically the

midbrain, loci within the basal ganglia (caudate nucleus, putamen, and globus

pallidus), and loci within the medial temporal lobe (hippocampus and

parahippocampus). Lastly, the application of reinforcement learning models

allowed for the examination of putative dopaminergic influences during both initial

acquisition as well as updating of cue contingencies. Feedback trial prediction

error signals were encoded within regions of the basal ganglia (ventral striatum,

caudate nucleus, and globus pallidus) during both learning and updating of

probabilistic cues, replicating findings previously reported in the literature

(O'Doherty et al., 2004; Tanaka et al., 2004; Pessiglione et al., 2006). An action

prediction error, representing an unsigned surprise signal generated during

participants’ viewing cue/association pairs, exhibited recruitment of several

distinct regions, including the superior frontal gyrus, medial frontal gyrus,

midbrain, as well as the cingulate gyrus. Taken as a whole, these results

118

implicate a critical role for the basal ganglia and medial temporal lobes during

initial acquisition of feedback and observation information (Poldrack et al., 1999;

Poldrack et al., 2001; Foerde et al., 2006; Cincotta and Seger, 2007) as well as

reversal learning in accord with the literature (Annett et al., 1989; Myers, 2000;

Carrillo et al., 2001; Cools et al., 2002; Schoenbaum and Setlow, 2003; Setlow et

al., 2003; Frank and Claus, 2006; Myers et al., 2006; Shohamy et al., 2009) and

in support of the parallel model of engagement of these distinct regions.

It was hypothesized that participants would initially acquire cue

contingencies well, irrespective of how cue values were initially acquired

(feedback vs. observation learning). However, behavioral results indicated that

participants in the observation group acquired cue contingencies in the learning

phase better than feedback group participants, in accord with a recent study

which demonstrated greater behavioral accuracy for instructed compared with

feedback learning (Li et al., 2011). However, this result was not hypothesized,

based on the results of experiment 1, and suggests that observation learning

may have be easier for participants than learning via feedback in this specific

learning environment, which included deterministic as well as probabilistic cues.

The presence of deterministic cues may have helped improve observation

learning, while not facilitating feedback learning to the same extent. In support of

this theory, accuracy differences were greatest across learning types for

deterministic cues while accuracy was most equated across learning types for

the hard probabilistic cues. It was further hypothesized that participants’

accuracy would initially decrease in the update phase following probabilistic cue

119

reversal for both learning types in both groups, and subsequently improve over

time as participants updated old cue contingencies. This hypothesis was partially

supported, with a significant improvement in feedback trial accuracy within the

first block of the update phase, but no significant increase in observation trial

performance over time (as participants were already performing quite well, mean

= 74% in block 1). Lastly, two hypotheses were made regarding behavioral

accuracy in the update phase: 1) participants may have been more efficient at

updating cue values within the same learning type (observation-observation or

feedback-feedback) as they would be accustomed to this manner of learning and

perhaps could readily employ neural structures already involved in initial learning

to update contingencies. 2) Alternatively, participants may have exhibited more

efficient updating of cue values when employing a new learning type

(observation-feedback or feedback-observation) as perhaps engaging a new

learning mechanism would facilitate faster updating (particularly for feedback

learning, which can be habit-driven and at times impervious to changes in

stimulus-outcome associations – for review see (Packard and Knowlton, 2002;

Yin and Knowlton, 2006). Neither of these two hypotheses were supported,

however, as interestingly, all participants, irrespective of group, initially were

more efficient at updating cue contingencies via observation learning. This result

suggests that employing observation learning may be a faster and more effective

manner of updating information early in learning. This observation learning type

advantage was not permanent however, as feedback trial accuracy steadily

improved throughout the first update block, resulting in no learning type

120

differences in the second update block. Therefore, while updating cue

contingencies via observation information was initially beneficial, both learning

types were equally effective in the long term, consistent with the behavioral

results of experiment 1 and reports in the literature of equivalent accuracy for

information acquired via feedback and observation (Poldrack et al., 2001;

Shohamy et al., 2004b).

The pattern of activity reported within the striatal regions in the learning

phases of the current experiment, showing greater responses to hard compared

with easy and deterministic cues, was not expected based on the results of

experiment 1. However, such a finding is not uncommon in the literature.

Delgado and colleagues (2005) suggest that the striatum is involved in the initial

learning of cue contingencies and decreases activation once these contingencies

have been acquired. In a probabilistic card guessing task, from which the

experiments in this dissertation were modified, participants were required to learn

the value of several cues: two cues were deterministic (100%), two cues were

probabilistic (67%), and one cue was random (50%). The authors discovered

that the striatum was initially engaged for all three cue types relatively equally,

early in the task when participants were initially learning the cue outcomes and

therefore feedback was meaningful (e.g., feedback indicated a correct choice for

the 100% and 67% cues). However, as time progressed and participants

acquired the cue contingencies, the striatum was differentially engaged

depending on the cue probability, exhibiting decreased activation in the feedback

period of the trial for the 100% cues and greater BOLD signal responses during

121

the feedback period for the unlearnable cue (50%). The authors theorized that

the striatum plays a pivotal role in learning cue associations, and becomes less

engaged once these associations have been successfully acquired. The cue

probabilities employed in the current task were 100%, 87.5%, and 62.5%. It is

possible that the addition of the deterministic cue changed the learning

environment, effectively making the hard cues seem harder. Therefore, the

striatum was most engaged in learning the value of the 62.5% cue. An

environment where everything is probabilistic (task employed in experiment 1)

compared with an environment where only some things are probabilistic [tasks

employed in experiment 2 and (Delgado et al., 2005)] is most likely a very

different experience for the participants. In the task employed in experiment 1,

there were only two probabilities, 85% and 65% which most likely made for a

simpler learning environment. Greater striatal BOLD activation in response to

the 85% cue in experiment 1 was most likely driven by more correct feedback for

the easy compared with hard cues.

The learning phase results support the hypothesis that the basal ganglia

and medial temporal lobes operate in parallel during learning, by providing

evidence of parallel engagement, as well as competitive and cooperative

interactions between these regions. The fact that these two distinct regions were

engaged simultaneously, suggests that they may be online and involved in

learning at the same time. This ability to be engaged during the same time

period indicates parallel processing of these areas during learning. Evidence of

cooperative interactions was observed from positive correlations exhibited

122

between the hippocampus and caudate nucleus in the observation group for the

deterministic and easy cues. It is thought provoking that despite the fact that

these two regions displayed opposing patterns of activity the BOLD signal was

still positively correlated between the areas for only the observation group.

Perhaps acquiring the information in an observational manner, possibly by

employing a declarative-like strategy, modified the interactions of these regions

despite their opposing pattern of BOLD signals. This result may also have been

driven by individual variations in mean BOLD responses in the caudate nucleus

and hippocampus. This is an interesting result which merits further exploration in

order to fully understand its significance. Lastly, the finding that the pattern of

activity within the hippocampus and caudate nucleus/putamen was opposing,

suggests that the striatum and hippocampus may encode distinct information in

the learning phase of the current task. Opposing patterns of activity and

differential activation within the hippocampus and striatum may be interpreted as

competitive interactions, as has been previously done in the literature (Poldrack

et al., 1999; Poldrack et al., 2001; Foerde et al., 2006; Seger and Cincotta,

2006). Furthermore, this finding partially replicates Poldrack and colleagues

2001 study, which reported activation of the striatum and deactivation of the MTL

during feedback learning. However, the authors also observed greater activation

in the MTL during an observation version of their task (WPT) compared with the

feedback version; whereas in the present study BOLD responses below baseline

occurred for both feedback and observation trials. Mean BOLD signal below

baseline in the MTL has been reported in several probabilistic learning tasks

123

(Poldrack et al., 1999; Poldrack et al., 2001; Aron et al., 2004; Foerde et al.,

2006). It has been suggested that “rest” in neuroimaging experiments may

engage the MTL, and therefore BOLD responses below baseline during task

periods may not be indicative of active suppression of this region during learning

(Foerde et al., 2006). Additionally, a seminal study by Stark and Squire

demonstrated that BOLD responses within the MTL may appear as either

activation or deactivation in the same task, dependent solely on the rest/baseline

condition with which MTL BOLD responses are compared (Stark and Squire,

2001).

Results from the update phase also support parallel engagement of the

basal ganglia and medial temporal lobes. Compared with the learning phase,

distinct regions within the basal ganglia and MTL were engaged in the update

phase, processing differences in cue difficulty, specifically loci within the left

putamen and bilateral parahippocampal gyri. The right and left

parahippocampus displayed the same pattern of activation as the hippocampus

region in the learning phase, while only one putamen area exhibited greater

responses to hard than easy and deterministic cues. The more posterior

putamen area displayed the same pattern of activity as seen within the MTL; that

is greater responses to deterministic and easy compared with hard cues. This

result also supports parallel engagement between the MTL and basal ganglia, as

different striatal areas processed probabilistic information in opposing manners,

making obvious the fact that the dorsal striatum plays several roles during the

learning process. There are multiple loops within the striatum (Haber, 2003),

124

which connect to distinct sections of both the midbrain and cortex. It is possible

that discrete regions of the putamen, one being more posterior in the putamen,

and one located more anterior and medially, may be engaged in distinct

functional loops and therefore convey different information to both the midbrain

and the cortex. The observation of activation bordering the putamen/insula

makes interpreting these results slightly more challenging however, as this result

could be driven more by insula than putamen BOLD responses. This caveat

should be taken into consideration as it is possible that more medial regions of

the striatum may display a different pattern of results; however give the learning

phase results this seems unlikely.

The null findings within the main effect of learning type and main effect of

group analyses in the update phase may possibly be interpreted as evidence of

parallel engagement of the MTL and basal ganglia. Neither the striatum nor any

regions within the MTL were more strongly engaged in updating information that

was the same (feedback-feedback or observation-observation) or different

(feedback-observation or observation-feedback) as the manner in which cue-

value information was initially acquired when both groups were combined into

one analysis. However, regions of the cingulate and insular cortex were

engaged and exhibited greater BOLD responses to the same than a different

learning type. Neither were loci within either the MTL or basal ganglia more

strongly engaged for either the feedback or the observation group in the update

phase. It could have been theorized, that if the striatum was selectively engaged

in feedback learning, it may have been recruited in initial feedback acquisition

125

(group 1), and as a result of this strong early activation, subsequently online

during the reversal phase to a greater degree in group 1 compared with group 2

participants. Likewise, it could have been theorized that if the MTL was strongly

recruited during initial observation learning (group 2) it may have been online and

more strongly engaged throughout the reversal phase for group 2 compared with

group 1. This result was not observed; however, null findings in neuroimaging

studies are difficult to interpret, and not necessarily indicative of any finding per

se. This null result therefore should be interpreted with caution and may be an

area of interest for further pursuit in future studies.

The interaction of learning type by group observed in the update phase

revealed that regions of the striatum and midbrain were more strongly involved in

processing feedback compared with observation trials. It is not surprising that

these areas were more strongly engaged during feedback trials, which contained

outcome relevant information, as compared with observational trials, which did

not carry outcome relevant information. This result is consistent with a large

body of research implicating both the striatum and the midbrain in feedback

processing and outcome monitoring (Delgado et al., 2000; Knutson et al., 2001;

Delgado et al., 2003; Aron et al., 2004; Tricomi et al., 2004; Marco-Pallares et al.,

2007; Carter et al., 2009).

The application of reinforcement learning models to the behavioral data

provided the opportunity to probe how neural substrates are engaged during

initial acquisition and updating of probabilistic information. Furthermore, it

allowed for the investigation of putative dopaminergic influences during learning

126

and reversal of cue contingencies. The results implicated the traditional network

of regions commonly reported to encode reward prediction errors, namely the

nucleus accumbens, the globus pallidus, and the caudate nucleus (O'Doherty et

al., 2004; Tanaka et al., 2004; Pessiglione et al., 2006). Interestingly, for the

observation learners, the medial temporal lobe (hippocampus/amygdala) and

medial frontal gyrus were also engaged in processing feedback trial prediction

errors in the update phase. One possible explanation for this result is that

participants in the observation group employed a more declarative-like learning

strategy, and thus engaged the medial temporal lobe, which then tracked

outcomes that violated participants’ expectations when updating old cue

contingencies. This finding is similar to the observation of the hippocampus

tracking feedback trial prediction error signals in experiment 1. Furthermore, a

recent study (Chen et al., 2011) observed mismatch signals in the CA1 region of

the hippocampus and the perirhinal cortex during associative retrieval when

previously viewed face-house pairs where mismatched. Additionally, the

amygdala has previously been reported in processing prediction error signals

(Bray and O'Doherty, 2007) and integrating prediction error signals with other

learning signals (Belova et al., 2007); therefore this result is not completely

unexpected. However, it is interesting that the MTL was engaged in encoding

prediction errors for the feedback trials, only within the observation group.

Results from the action prediction error analyses, which measured the

level of surprise or saliency when participants viewed cue/value associations

during the observation trials, revealed the engagement of two prefrontal areas

127

including the medial frontal gyrus and superior frontal gyrus as well as the

midbrain, insula, and cingulate gyrus. This network of regions, in particular the

midbrain, insula, and cingulate gyrus, was commonly activated in both the

learning and update phases of the task. Therefore it is reasonable that this

network also positively correlated with a saliency signal during observation trials.

It may have been expected that the dorsal lateral prefrontal cortex (DLPFC)

would be engaged during encoding of the action prediction error signal in the

learning and update phases, based on the findings of Burke and colleagues

(2010). The authors developed an observational-based action prediction error

model in order to examine the neural correlates of observational learning. Burke

and colleagues reported that a signal correlating with this model was encoded in

the DLPFC. The task employed in this experiment, however, is dramatically

different from the experiment utilized by Burke et al., which may explain the

discrepancy in the results. In the current experiment, participants, although

observing the cue and outcome on the screen, were instructed to be active

participants in the task. They were informed to optimize their responding and

told that the amount of money they would be paid depended on their

performance. In contrast, Burke and colleagues employed both an active and a

passive (observer) condition for the participants. In the observer condition,

participants were told they were viewing the actual choice and outcome of

another human player’s choice. On the observer trials, the participants were not

required to make choices, but rather simply observe the other player’s choices

and outcomes. In this learning environment, perhaps a more realistic

128

observational learning situation, action prediction errors were encoded in the

DLPFC. Burke et al.’s task therefore is fundamentally different from the task

employed in the current experiment, and as such the result of distinct areas of

activation involved in encoding the action prediction error signal employed in this

experiment may not be so surprising.

Granger causality analyses using the ventral tegmental area as a

reference region support the theory that the MTL and BG operate in parallel

during probabilistic learning. Functional connectivity results revealed a relatively

similar network of regions was engaged when updating cue contingencies,

including the caudate nucleus, putamen, globus pallidus, hippocampus, and

parahippocampus, across the observation and feedback groups. Some

distinctions arose however, with the feedback group correlating primarily with the

caudate nucleus, putamen and hippocampus, while the observation group

exhibited instantaneous correlations with the globus pallidus and

parahippocampus. The functional connectivity results implicate that key reward

learning and memory structures are correlated during probabilistic reversal

learning. This finding is in accord with a broad literature linking the involvement

of the BG and MTL during reversal learning and enhances previous results by

suggesting that these regions may not act independently during the reversal

process, but rather are functionally connected (Annett et al., 1989; Myers, 2000;

Carrillo et al., 2001; Cools et al., 2002; Frank and Claus, 2006; Myers et al.,

2006; Shohamy et al., 2009). Furthermore, directed connectivity results revealed

that the putamen influenced the VTA, and the VTA influenced the hippocampus

129

and parahippocampus during probabilistic reversal. This result supports the

theory that the MTL and BG may not interact directly, but rather may

communicate via midbrain dopaminergic centers (Poldrack and Rodriguez, 2004;

Shohamy and Adcock, 2010). Regions of the putamen do exhibit reciprocal

anatomical projections with the VTA/SN (Haber and Knutson, 2010), and the

VTA does send dopaminergic projections to the hippocampus (Haber and

Knutson, 2010). Therefore the effective connectivity results observed during the

updating of probabilistic cue contingencies in the present data set are

anatomically possible. Subsequent studies exploring the potential flow of

information from the putamen to VTA and from the VTA to the hippocampus

during probabilistic reversal learning may be an area of future research.

Lastly, the results from the learning phase provided an important

clarification regarding the results from experiment 1. In the learning phase of the

task employed in experiment 1, both a region of the caudate nucleus and the

hippocampus were engaged in processing differences in cue difficulty. However,

in that experiment, both feedback and observation learning trials were completed

in the same phase and therefore the results may have been skewed by the

presence of both trial types within the same phase. As a consequence, it was

important to examine the engagement of the hippocampus and striatum in

feedback and observation learning independently. The learning phase of

experiment 2 provided such a distinct examination as one group of participants

acquired cue contingencies via feedback and a separate group acquired cue

contingencies via observation. Importantly, in both the feedback group and the

130

observation group, regions of the MTL, specifically the hippocampus extending

into the amygdala, and the striatum, including the caudate nucleus and the

putamen, were engaged in processing cue difficulty. Perhaps equally interesting

was the result that in this experiment the striatal and hippocampal regions

displayed opposing patterns of activity. Both regions were involved in processing

cue difficulty, but whereas the MTL (activity encompassing the

hippocampal/amygdala regions) showed greatest activity to deterministic cues

and decreased in activation according to difficulty, the striatal regions displayed

greatest activity to hard cues and decreased with respect to difficulty. This

finding therefore partially replicates the result of experiment 1, where the

hippocampus and the caudate nucleus displayed stronger activation to easy

compared with hard cues. It is theorized that this hippocampal pattern of activity

(easy > hard) represents a learning signal, as the hippocampus is known to be

involved in encoding and recalling information. Several studies have shown

greater hippocampal activation is elicited for information that is subsequently

remembered, compared with information that is subsequently forgotten (Brewer

et al., 1998; Wagner et al., 1998; Law et al., 2005; Adcock et al., 2006; Shrager

et al., 2008). While follow up tests did not support this effect in this study, it is

possible that if a test had been administered one week following completion of

the task, participants would have better remembered the easier cues.

To summarize, results from experiment 2 suggest that regions within MTL

and BG are involved in the initial acquisition and updating of both feedback and

observation probabilistic information. Furthermore, prediction error and Granger

131

causality analyses lend support to the theory that these distinct memory systems

operate in parallel during learning and may interact via midbrain dopaminergic

regions. However, it is still unknown what specific learning contexts drive

memory systems to operate in parallel or interact directly via competition or

cooperation. Experiment 3 was designed to address this question directly.

Chapter Five: Experiment 3

5.1 Introduction: Background & Rationale

One manner of investigating the nature of interactions between multiple

memory systems in non-human animals is to lesion, or temporarily knockout, one

brain region and observe the effect on behavior. In humans, this question has

been addressed by studying multiple memory systems in patients who have

abnormal functioning of one system (e.g., Parkinson’s disease or MTL amnesia).

Examining this question in healthy human participants, however, is challenging.

One manner of doing so is to employ behavioral interference tasks, whereby one

region of interest (e.g., the MTL) is recruited during learning in order to observe

the subsequent effect on behavior and memory system interactions. The goal of

this experiment was to engage the MTL during learning of a secondary task in

order to examine interactions between multiple memory systems. It was

theorized, given the rich animal lesion and human patient literature that new

insights would be gained into the nature of how multiple memory systems interact

during behavioral interference. A specific hypothesis, based on the results of

experiments 1 and 2, and described in detail below, was formulated in which

132

memory systems would cooperate when information to-be-learned was

challenging and operate more independently when information to-be-learned was

easy, in accordance with the parallel processing model of multiple memory

systems.

The results of animal lesion studies in rodents have demonstrated a clear

double dissociation between the function of the basal ganglia and the medial

temporal lobe in learning, with the MTL supporting spatial learning and the BG

supporting stimulus-response instrumental learning (Packard et al., 1989;

Packard and McGaugh, 1992, 1996). One relatively recent exemplary study

demonstrated bidirectional competitive interactions between these distinct

regions, as knocking out one region, the dorsal striatum, actually improved

hippocampal dependent learning, and likewise, knocking out the dorsal

hippocampus, improved striatal dependent learning in mice (Lee et al., 2008). In

addition, the application of neurotransmitters, in particular glutamate and

dopamine has been shown to influence the expression of hippocampal and

striatal-based learning (Packard and White, 1991; Packard, 1999). In a plus

maze task, rodents were always placed in the same start location in the maze

(south arm) and required to navigate to one arm which was baited with a food

reward (west arm) (Packard, 1999). Rodents naturally solve this task in one of

two ways, and a probe trial is administered in order to determine how each

individual rodent solves the task. On the probe trial, rodents are placed in a new

starting location in the maze (north arm). Rodents that navigate to the correct

arm (west arm) are labeled place learners, as they remembered the correct

133

location of the food reward. Rodents which make the previously correct motor

response (turn left) to navigate to the previously correct location (west arm), will

now end up in the opposite arm (east arm), and are termed response learners

because they learned the correct motor response in order to navigate to the food

location. However, when rodents are over-trained on this task, all rodents

typically engage in response learning, suggesting a shift in the engagement of

memory systems over time. Interestingly, intrahippocampal injections of the

neurotransmitter glutamate bias rodents towards place learning both early and

late in the training process, indicating a blockade by the hippocampus in shifting

towards response learning. Likewise, intracaudate injections of glutamate

produce response learning both early and late in the task, suggesting

accelerated response learning.

In a related study using an eight-arm radial maze task, the affects of

intracerebral injections of dopamine agonists in both the hippocampus and the

caudate nucleus were examined in rodents (Packard and White, 1991). In this

version of the win-shift radial maze task, animals obtained food from 4 randomly

selected arms, then a delay period was administered, after which rodents were

returned to the maze and the other 4 arms, which previously did not contain food,

now held the food reward. In the win-stay version of this task, animals learned to

approach 4 randomly lit arms, twice within a trial, in order to obtain food. Post-

training intrahippocampal injections of three types of dopamine agonists (indirect,

D1, and D2) improved retention on the win-shift version of the task, compared

with control rodents (received saline injections), while intracaudate injections of

134

these dopamine agonists had no affect on the win-shift version. Likewise,

intracaudate injections of all three dopamine agonists improved acquisition of the

win-stay version of the task, compared with control rodents, while

intrahippocampal injections had no effect on this task version. This important

animal work demonstrates two key points: one, both the hippocampus and

striatum are involved in spatial learning tasks, are online simultaneously, and

contribute to solving these tasks in unique ways; two, how these memory

systems are engaged and the relative degree of their engagement can be

manipulated by the administration of neurotransmitters to bias the engagement of

one memory system over the other, thereby affecting how learning tasks place.

In studies involving human participants, the question of the medial

temporal lobes’ and basal ganglia’s contribution to learning and how these

multiple memory systems interact has been addressed in a multitude of manners

including behavioral and functional neuroimaging experiments using healthy

human participants (Poldrack et al., 1999; Poldrack et al., 2001; Seger and

Cincotta, 2005; Cincotta and Seger, 2007; Shohamy and Wagner, 2008; Mattfeld

and Stark, 2010), patient populations (Knowlton et al., 1994; Knowlton et al.,

1996; Dagher et al., 2001; Myers et al., 2003; Hopkins et al., 2004; Voermans et

al., 2004; Shohamy et al., 2005; Shohamy et al., 2006; Schmitt-Eliassen et al.,

2007) as well as by employing tasks involving cognitive interference (Foerde et

al., 2006; Brown and Robertson, 2007; Foerde et al., 2007). Studies examining

probabilistic learning in human patient populations have provided evidence that

the medial temporal lobes play a crucial role in the flexible transfer of information,

135

while the basal ganglia is involved in the initial acquisition of stimulus

associations (Myers et al., 2003) as well as the formulation of sequences of

stimulus outcome associations (Nagy et al., 2007).

One manner of examining the involvement of distinct neural substrates in

healthy human participants is to employ tasks of cognitive interference in order to

engage one neural substrate and observe the outcome on behavior, akin to a

behavioral “knockout” of one region of interest. One study examined the

interaction of multiple memory systems using fMRI of healthy human participants

who completed a feedback-based probabilistic learning task while simultaneously

engaging in a second cognitive demanding task (tone counting task) (Foerde et

al., 2006). Specifically, participants completed a feedback-based version of the

weather prediction task, where they were required to learn the associations of

various cues via feedback. The task consisted of two conditions: one condition

was a control condition in which tones were played in the background and

participants were instructed to ignore the tones (single-task); the second

condition required participants to simultaneously attend to and count the number

of high tones (high and low pitch tones were played; dual-task). The authors

hypothesized that the engagement of the MTL and BG could be modulated by

distraction, such that the addition of a cognitively demanding task (tone counting)

would occupy working memory, thereby decreasing declarative memory

encoding and biasing the engagement of the basal ganglia system. Results

revealed that behavioral accuracy was only moderately diminished in the dual-

task compared with the single-task condition. In a subsequent probe session (no

136

feedback administered, no tones played, cues learned under single and dual-task

conditions were intermixed), no accuracy differences were observed between

cues learned under single versus dual-task conditions. Furthermore, in the probe

session, BOLD signal within the medial temporal lobe correlated with behavioral

accuracy for associations made under the single-task condition only, while BOLD

signal within a region of the striatum, specifically the putamen, correlated with

behavioral accuracy for those associations made under the dual-task condition

only. Lastly, BOLD activity in the MTL during the probe session positively

correlated with declarative cue knowledge for cues learned under the single-task

condition only. The authors concluded that this probabilistic classification task

can be successfully learned via the MTL or the BG, and that the engagement of

these regions depends on the task demands – such that the presence of a

distracting working memory task biased the engagement of the BG during

learning, while learning under “normal” conditions engaged the MTL.

Importantly, while the MTL and the striatum were both able to support successful

performance on the weather prediction task, the nature of the knowledge that

was acquired differed, with only the MTL supporting declarative knowledge about

learned cue associations.

A second, unrelated behavioral study examined interactions between

declarative and procedural memory during the consolidation process, which

consisted of either a period of participants staying awake versus a period of

sleep (Brown and Robertson, 2007). This study consisted of two experiments: in

experiment 1 the influence of declarative learning (word list learning) was

137

measured on procedural consolidation (serial reaction time task/SRTT); in

experiment 2 the influence of procedural learning was measured on declarative

consolidation (using the same tasks). In experiment 1 participants first

performed the SRTT, then performed the word list learning task, and after a 12

hour period of either wakefulness or sleep, were retested on the SRTT.

Interestingly, the authors observed that following a period of wakefulness,

declarative learning interfered with subsequent improvements on the procedural

skill (SRTT). Furthermore, a negative correlation was observed between

subsequent skill performance and declarative list learning, such that the more

words participants learned, the worse they performed on the procedural skill.

However, following a period of sleep, declarative learning had no interference on

skill performance. The authors concluded that during periods of wakefulness,

declarative learning interfered with “off-line” processing, or the consolidation, of

the procedural skill. The mirror result was observed when declarative learning

was followed by procedural skill learning. Procedural skill learning disrupted off-

line processing of declarative information following a period of wakefulness, and

a negative correlation was observed such that greater motor skill was associated

with worse word recall. Additionally, no interfering effect of procedural skill

learning was observed on declarative learning following a period of sleep. The

authors proposed several models by which multiple memory systems may

interact, 1) potentially exhibiting interconnected or overlapping neural circuits

during the awake state thereby interfering with each other’s consolidation

process during wakefulness, while 2) exhibiting functional uncoupling or the

138

engagement of independent systems during the sleep state such that

consolidation processes among distinct brain regions are independent during

sleep.

To summarize, new insights into how multiple memory systems interact

during learning can be achieved by employing tasks of cognitive interference,

such as the studies employed by Foerde and colleagues (2006) as well as Brown

and Robertson (2007). Foerde et al. demonstrated that the relative engagement

of multiple memory systems can be biased by task demands, while Brown and

Robertson demonstrated that the exact nature of interactions between multiple

memory systems is dynamic, still yet to be completely characterized, and may

change depending on whether consolidation of declarative and procedural

information occurs during an awake versus a sleep state.

One remaining open question is characterizing under which specific

learning scenarios memory systems may cooperate versus compete. The

purpose of the present experiment was to address this question by employing a

cognitive interfering task, in order to engage the medial temporal lobe and

examine the effects of such a behavioral “knockout” of the MTL on a feedback-

based probabilistic learning task. The aim of employing such a design was to

gain new insights into the relative engagement and interactions between the MTL

and BG during feedback probabilistic learning by probing one learning scenario

in which multiple memory systems may cooperate.

The following hypothesis was formulated based on the results of

experiments 1 and 2: the MTL and BG may cooperate during

139

challenging/inconsistent learning scenarios and may operate independently

during easy/consistent learning scenarios. Results from experiment 1 suggested

that the BG and MTL may cooperate when learning is inconsistent as the

following results were observed: 1) positive correlations between the

hippocampus and caudate nucleus’s BOLD signal during probabilistic

observation learning; 2) a positive correlation between the hippocampus and

caudate nucleus’s BOLD signal for feedback easy trials, late in learning when

participants received incorrect feedback (prediction error-like situation); 3)

marginally greater hippocampal BOLD signal for feedback hard cues, later in

learning and marginally greater caudate nucleus BOLD signal for observation

hard cues, later in learning; and 4) encoding of a feedback prediction error signal

within both the striatum and hippocampus. Furthermore, results from experiment

2 corroborate the theory that these memory systems have the ability to both

compete and cooperate during learning, as mutual engagement of these regions

was observed during the learning and update phases, at times exhibiting similar

patterns of activation (potential cooperation) and at other times exhibiting

opposing patterns of activity (potential competition). Positive correlations were

observed in both the learning and update phases across the striatum and MTL

for deterministic cues and a feedback prediction error signal was encoded within

loci in the striatum and the MTL (hippocampus/amygdala). Additionally, it has

been theorized that depending on the learning scenario, memory systems may

operate in parallel or may interact directly via competition or cooperation (White

and McDonald, 2002). However, the exact learning scenarios when cooperation

140

or competition occur remain uncharacterized. Therefore, it was hypothesized,

based on the aforementioned results and theoretical work that the BG and MTL

operate in parallel and may interact directly via cooperation when the learning

scenario is particularly challenging/inconsistent. The rationale for this hypothesis

is that in inconsistent and therefore challenging learning scenarios, when cue

value contingencies are low, e.g., 70% predictive of the value, it is possible that

neither the MTL nor the BG system will learn the cue values well. Therefore, if

both systems independently are not able to acquire the values, it is theorized that

they may cooperate, by communicating and sharing information each system has

individually acquired in order to learn the cue values in a synergistic manner.

However, when the contingencies are consistent, and therefore less challenging

to learn, e.g., 90%, one system alone may successfully acquire the cue values or

both systems independently may learn the contingencies and therefore the

systems do not need to cooperate.

In order to test this hypothesis, a within-subjects fMRI study was

conducted using a feedback probabilistic learning task, which contained both

easy (90%) and hard (70%) to learn information. The hypothesis was that the

MTL and BG may cooperate during acquisition of difficult (70% cues), but not

easy (90% cues) information. Furthermore, in order to demonstrate that both the

MTL and BG are required to learn challenging/inconsistent information, multiple

learning sessions were utilized, one of which was designed to engage the MTL in

an unrelated task (behavioral “knockout” of the MTL). This was a two-day

experiment, in which participants came to the laboratory one day and performed

141

a declarative memory scene encoding task and returned the following day to

complete a feedback probabilistic learning task while undergoing functional

neuroimaging. Specifically probabilistic cue learning occurred via feedback in

three distinct sessions. In each session, a probabilistic cue was presented on

the screen along with a natural scene (landscape image). The three sessions

were: 1) a feedback control session, in which participants were required to learn

only the value of probabilistic cues and ignored the natural scenes. This session

provided for a baseline examination of multiple memory systems during feedback

learning; 2) a perceptual decision session, in which participants were required to

learn the value of the probabilistic cues as well as attend to the natural scenes

and make a perceptual decision about each scene. This session was employed

as a control for the cognitive demands elicited in the declarative memory

interference condition (dual-tasking); 3) a declarative memory interference

condition, in which participants were required to learn the value of the

probabilistic cues while simultaneously engaging in a declarative memory

recognition task (determining whether each natural scene was old or new). Of

key importance was the declarative memory interference condition, which was

designed to behaviorally sequester or “knock out” the medial temporal lobe by

engaging it in the recognition of previously viewed scenes. This task design

therefore allowed for an examination of the relative engagement of the MTL and

BG and interactions between these two regions when the MTL is available to

assist in learning (control conditions – feedback control and perceptual decision

session) and when it may not be available to assist in learning (experimental

142

condition – declarative memory interference condition).

5.2 Materials & Methods

5.2.1 Experimental design

This study involved two days of participation. On day one, participants

completed a behavioral declarative memory scene encoding task in the

laboratory. Participants viewed 40 natural landscape scenes on a computer and

were asked to determine whether or not each scene contained water (sixty

percent of the scenes contained water, e.g., a lake, stream, waterfall; each scene

was presented one time; Figure 5.1A). Every scene was a unique landscape

image, borrowed from a database from the Center for Cognitive Neuroscience at

Duke University. The purpose of the water detection question was to ensure that

participants paid attention to the scenes during the encoding session.

Participants were told that they should try their best to learn the scenes as they

would be tested on them the following day and their payment on the subsequent

day would be determined based on how well they correctly discriminated

between old scenes (shown on day 1) and new scenes (to be shown on day 2).

During the encoding session, each scene was displayed for 4 seconds during

which time the participants studied the scenes and looked for water. This was

followed by a 2-4 second jittered inter-stimulus-interval (ISI), followed by a 4

second period in which participants made a choice regarding the presence or

absence of water in the scene they had just viewed. Participants were instructed

to press 1 if there was definitely water in the scene, 2 if there was probably water

143

in the scene, 3 if there was probably no water in the scene, and 4 if there was

definitely no water in the scene. They were instructed to press 2 or 3 if they had

trouble remembering the scene or if it was ambiguous as to whether the scene

contained water. A blue box highlighted participants’ choice and no penalty was

assigned if the participant missed a trial. The water response period was

followed by a 4-10 second jittered ITI before the onset of the next scene.

Participants were paid $10 for completion of the scene encoding session on day

1.

On the second day of the experiment, participants completed the MRI

portion of the study, which consisted of a learning and memory task that was a

modification of the tasks employed in experiments 1 and 2. Participants were

required to learn the numerical value (range 1-9) of six different visual cues. The

value of each cue was either higher or lower than the number five. There were

two phases in the experiment, a learning phase and a test phase. Two

independent variables were manipulated in the learning phase: the learning

session type and the difficulty of the probabilistic cues. There were three distinct

learning sessions, all of which contained feedback learning: the feedback control

session (Figure 5.1B), the perception decision session (Figure 5.1C), and the

declarative memory interference session (Figure 5.1D). The second independent

variable was the probabilistic value of the cues (predictive outcome of the

numerical value of each cue): 3 cues were 90% predictive of their value, termed

“easy cues” and 3 cues were 70% predictive of their value termed “hard cues”.

Participants were instructed on all three session types and performed a short

144

practice version with different probabilistic cues and different natural scenes prior

to beginning the task inside the MRI. The practice session natural scenes were

drawn from the Duke University scene database as well as the McGill University

Calibrated Colour Image Database (Olmos and Kingdom, 2004).

In the learning phase participants’ primary goals were to learn the value of

six cues and correctly identify old versus new natural scenes. The learning

phase consisted of six blocks (two presentations of each of the three session

types) of 20 trials each for a total of 120 trials. In all sessions, participants were

required to learn the numerical value of two cues (one 90% cue and one 70%

cue per session). Each session contained 10 presentations of each of the two

cues; all trials were feedback based. The trial format in every session consisted

of a 4 second natural scene/cue presentation period, followed by a 2-4 second

jittered ISI, proceeded by feedback presentation (contingent on participant’s

response for the cue value; check mark for correct, X for incorrect, and # for

missed trials), followed by a response screen which varied by session type

(details provided below), and lastly a jittered ITI (4-10 seconds) before onset of

the next stimulus. In all learning sessions, the stimulus presentation period

consisted of a natural scene displayed at the top of the screen and a probabilistic

cue presented at the bottom of the screen. Depending on the session type, the

natural scene was either new (presented for the first time) or old (previously

shown on the day prior during the encoding session). Each natural scene

presented in the experiment was shown only one time; therefore the scenes did

not overlap between sessions, and participants were made explicitly aware of

145

this fact. Participants were instructed to make a button press in all sessions

during the stimulus presentation period regarding the value of the probabilistic

cue (button 1= higher than 5; button 2 = lower than 5). Similar to the feedback

trials in experiments 1 and 2, participants were instructed to optimize their

responses. Instructions for the natural scenes varied by session type as

described below.

In the feedback control session, participants were instructed to focus on

learning the value of the probabilistic cues and were told to ignore the natural

scenes, which were all new (no old scenes were presented from the day prior)

and unimportant (Figure 5.1B). Participants were informed that the natural

scenes in the feedback control session would never be shown again and that

they would not be tested on these scenes in the future. Participants viewed the

natural scene/cue and made a button press indicating the value of the

probabilistic cue during the 4 second stimulus presentation/response period.

After the ISI, participants received feedback contingent on their response.

Feedback presentation was proceeded by a response screen, where participants

made a button press (button 1, button 2, button 3, button 4) which they were

informed was not meaningful and had no associated value. They were told they

could know why the button press was necessary in this session at the end of the

experiment if they desired (it served as a motor control). The purpose of this

session was to provide a baseline level of feedback learning in the task.

In the perception decision session, participants were instructed to pay

attention to the natural scenes and to detect water in them while simultaneously

146

learning the value of the probabilistic cues (Figure 5.1C). They were informed

that in this session all of the scenes were new. They were also told that they did

not need to learn the scenes, as they would not be tested on them, but were

simply required to detect water in each scene. The purpose of this session was

to serve as a cognitive-demand control for the declarative memory interference

session. While looking for water may not be as mentally taxing as trying to

remember a scene, it does require some attentional processing and dual-tasking.

In the perception decision session participants viewed the natural scene/cue and

made a button press indicating the value of the probabilistic cue during the

stimulus presentation period. Following the ISI and feedback presentation, the

response screen appeared and participants were instructed to make a response

regarding the presence or absence of water in the natural scene they had viewed

in the stimulus presentation period. This response screen was identical to that

presented during the day 1 encoding session. They were instructed to press 1 if

the scene definitely contained water, 2 if it probably contained water, 3 if it

probably contained no water, and 4 if it definitely contained no water.

In the declarative memory interference session, participants were required

to determine whether the natural scene presented in each trial was old or new

while simultaneously learning the value of the probabilistic cues (Figure 5.1D).

During the stimulus presentation period they viewed a natural scene and

probabilistic cue and were instructed to try to remember if the scene was old or

determine if it was new while learning the value of the probabilistic cue. As in all

other sessions, they were informed to make a button press indicating the value of

147

the probabilistic cue in the stimulus presentation period. Following the ISI and

feedback presentation period, the response screen appeared. In this session

participants were told to press 1 if they thought the scene was definitely old, 2 if it

was probably old, 3 if it was probably new, and 4 if it was definitely new.

Additionally, participants were instructed to press 1 only if they had a strong

memory of seeing the scene on the day prior (I remember) and 2 if they

recognized the scene to be old, but did not have a strong memory of seeing it the

day before (I know). They were explicitly informed that half of the scenes would

be old and half would be new in this session and that this was the only session in

which they would see natural scenes that were shown on the day prior (old

scenes). They were reminded that the total amount of money they would be paid

would be determined by how well they correctly identified old versus new natural

scenes and how well they learned the value of the feedback cues in all three of

the learning sessions.

Participants completed the sessions in one of two orders: order 1) block 1:

feedback control, block 2: declarative memory interference, block 3: perception

decision, block 4: feedback control, block 5: declarative memory interference,

block 6: perception decision; order 2) block 1: feedback control, block 2:

perception decision, block 3: declarative memory interference, block 4: feedback

control, block 5: perception decision, block 6: declarative memory interference.

Half of the participants completed one session order and the other half

completed the other session order. Trial order in each block was pseudo-

randomized to ensure that the trial types were balanced throughout each block

148

(e.g., there were an equal number of old and new scenes paired with easy and

hard cues in trials 1-10 and 11-20). Different trial orders were created for the two

blocks of each session type (e.g., trial order in feedback control block 1 differed

from trial order in feedback control block 2); all participants completed the

identical trial order.

Following completion of the learning sessions, participants completed the

test session (not depicted in the figure). The test phase contained the

probabilistic cues learned in all three sessions, as well as two novel cues

presented for the first time. The cues from the learning phase were blue, and the

novel cues were white in color. Critically, in the test session, no feedback was

provided and no natural scenes were presented. Participants were simply

required to indicate the value of the probabilistic cues they had learned in the

learning sessions. The test session contained 80 trials total, 10 presentations of

each cue, presented in random order. Trial structure and timing in this test phase

were identical to that used in the test phases in experiments 1 and 2 (self-timed

stimulus/response phase followed by a 6-14 second jittered ITI before onset of

the next cue).

5.2.2 Data analysis

Behavioral Data Analysis:

D Prime: In order to examine participants’ ability to correctly identify

scenes which contained water versus those that did not in the day 1 and day 2

sessions as well as scenes which were old versus new in the day 2 task, D prime

(d’) analyses were performed (Macmillan and Creelman, 1991). D prime

149

analyses are informative as they account for a participant’s response bias (e.g.,

more likely to indicate water versus no water; or more likely to choose old versus

new) when calculating a participant’s ability to correctly identify a stimulus

(sensitivity).

Neuroimaging Data Analysis:

GLM: The primary analysis of interest was to examine activation within

the brain during the three distinct learning sessions (feedback control, perceptual

decision, and declarative memory interference). Comparing brain activation

across learning sessions was most desired as the key question of interest was

examining how multiple memory systems were engaged during pure feedback

learning compared with feedback learning concurrent with declarative memory

interference. Therefore, three random-effects GLM analyses were conducted in

the learning phase; one GLM per learning session. Each GLM contained the

following two predictors: session type (which varied by session: feedback control,

perception decision, and declarative memory interference) and cue difficulty

(easy and hard). Missed trials and six motion parameters were also included as

regressors of no interest. In order to gain the most complete picture of the

engagement of brain regions during learning and memory processes, the mean

BOLD signal was examined during two periods: 1) the stimulus presentation

period and 2) the feedback presentation period. The stimulus presentation

period of the trial occurred when the probabilistic cue and natural scene were

presented on the screen. This time period was examined in order to probe for

differences in cue difficulty and memory processing within the brain, with the

150

primary focus on reward-related learning and memory structures (e.g., striatum,

medial temporal lobe, and midbrain). It was during this time period that

participants were making a choice regarding the value of the probabilistic cue

and presumably either ignoring or attending to the natural scene depending on

the session (ignore the scene in the feedback control session; look for water in

the perceptual decision session; or try to remember the scene in the declarative

memory interference session). During the feedback presentation period,

participants received feedback regarding their choice of the probabilistic cue

value (correct, incorrect, missed trial).

In the GLM created for the feedback control session, a contrast of easy vs.

hard cues was performed to examine activity within the brain during both the

stimulus presentation as well as the feedback presentation periods. In the

perception decision GLM, a 2 (water: water vs. no water) x 2 (cue difficulty: easy

vs. hard) within-subjects ANOVA was performed using water (water-scene and

no water-scene) and cue difficulty (easy and hard) as within-subjects factors

when examining activity during both the stimulus presentation as well as the

feedback presentation periods. The primary SPM of interest investigated a main

effect of cue difficulty. The water condition was included as a factor because

participants attended to the presence or absence of water in the scenes during

the day 1 encoding session, and discussions with participants after the

experiment indicated that some participants used the presence or absence of

water in the natural scenes as a memory cue (way to learn and subsequently

remember the scenes). In the declarative memory GLM, a 2 (scene: old vs. new)

151

x 2 (cue difficulty: easy vs. hard) within-subjects ANOVA was performed using

memory (old scenes and new scenes) and cue difficulty (easy and hard) as

within-subjects factors when examining activity during both the stimulus

presentation as well as the feedback presentation periods. SPMs of interest

investigated a main effect of memory, a main effect of cue difficulty, and an

interaction of memory and cue difficulty. SPMs were thresholded at p<0.005 with

an appropriate cluster threshold correction, correcting to a cluster-level false-

positive rate of 5% when possible. As noted in the results section, these

analyses did not produce activation within any a priori regions of interest

(striatum, MTL, midbrain) at cluster threshold corrected levels in the control

conditions. Therefore, the hypothesis could not be tested and results from these

SPMs are not reported.

For the test phase, BOLD activity was examined across the entire trial

(stimulus onset and participant response). A random-effects GLM was

performed using the following predictors: learning material (feedback control,

perception decision, declarative memory interference, and novel) and cue

difficulty (easy and hard). SPMs were generated from this GLM with the contrast

of interest as previously studied (feedback control, perception decision,

declarative memory interference) versus non-studied (novel) cues. Resulting

SPMs were set to a threshold of p<0.005 and a cluster threshold level of 5

contiguous voxels or 132mm3. As a note, three participants’ data was excluded

from the test phase neuroimaging analysis due to technical errors (e.g., Eprime

quitting during the scanning session).

152

Region of Interest Analyses: Independent regions of interest were drawn

in order to examine activity within a priori brain regions in an unbiased manner.

Specifically, the following seven regions were examined: bilateral hippocampus

(x, y, z = -20, -11, -17 and x, y, z = 21, -12, -18); bilateral ventral striatum (x, y, z

= -11, 4, 0 and x, y, z = 10, 3, -6); bilateral midbrain, centered on the ventral

tegmental area (x, y, z = -4, -15, -9 and x, y, z = 5, -14, -8); and the left caudate

nucleus (x, y, z = -15, 20, 7). All but one of the ROIs (left caudate nucleus) were

drawn from a previously published paper (Adcock et al., 2006). The bilateral

hippocampus, bilateral ventral striatum, and bilateral midbrain (VTA) were

chosen from this study in particular as the experiment employed in Adcock and

colleagues examined the effects of reward-related motivation on memory. It was

theorized that the areas reported by the authors would be relevant to examine in

the current experiment, which investigated the effect of declarative memory

interference on reward-related learning. The left caudate nucleus ROI was taken

from experiment 1 of this dissertation. The caudate nucleus region was

investigated due to the observation that loci within the caudate nucleus were

engaged in both experiments 1 and 2 of this dissertation, suggesting this area

plays a role in feedback probabilistic learning. Mean parameter estimates were

extracted from the above regions in all three learning sessions (feedback control,

perception decision, and declarative memory interference) and mean BOLD

responses were examined during the stimulus presentation period.

153

5.3 Results

5.3.1 Behavioral Results

Day 1 scene encoding

Results from the d’ analysis revealed that participants’ hit rate for

accurately identifying scenes with water (0.97; responding water to a water-

scene) was significantly greater than the false alarm rate (0.16; responding water

to a no-water-scene; t(23) = 48.12; p < 1.35 x 10-24), with a bias towards labeling

scenes as containing water (d’=2.95; C=-0.41; Figure 5.2A). This result suggests

that participants discriminated very well between scenes that contained water

versus those that did not during the day 1 encoding session (97% accurate).

Day 2 scene water and memory discrimination

Participants’ ability to correctly discriminate between new scenes that

contained water versus those that did not (perception decision session) as well

as old and new scenes (declarative memory interference session) was

examined. In the perception decision session, participants’ hit rate (0.93) was

significantly greater than the false alarm rate (0.11; t(23) = 32.97; p < 7.24 x 10-21)

with a bias towards labeling the scenes as containing water (d’= 2.98; C=-0.14).

Participants’ ability to accurately detect water in the natural scenes on day 1 was

compared with their ability to accurately detect water in the natural scenes on

day 2. Results revealed that participants had a significantly higher hit rate during

the day 1 encoding task (97% > 93%; t(23) = 2.91; p < 0.01), with a significantly

greater false alarm rate observed in the day 1 encoding task as well (16% > 11%;

154

t(23) = 2.20; p < 0.05) suggesting that on a whole, participants performed better

during the day 1 encoding session.

In the declarative memory interference session, participants’ hit rate (0.61;

responding old to an old scene) was significantly greater than the false alarm rate

(0.24; responding old to a new scene; t(23) = 9.58; p < 1.76 x 10-9) with a bias

towards labeling scenes as new (d’=1.11; C=0.25), suggesting that participants

were able to successfully discriminate between old and new scenes.

Furthermore, participants’ hit rate was greater than chance (61% > 50%; t(23) =

3.12; p < 0.005). In addition, participants’ hit and false alarm rates were

compared across session types (perception decision vs. declarative memory

interference). Results revealed a significantly higher hit rate in the perception

decision session compared with the declarative memory interference session

(93% > 61%; t(23) = 9.39; p < 2.48 x 10-9) and a lower false alarm rate in the

perception decision session compared with the declarative memory interference

session (24% > 11%; t(23) = 3.12; p < 0.005; Figure 5.2B), suggesting that, as

expected, identifying scenes which contained water was an easier task than

discriminating between old versus new scenes.

Day 2 probabilistic cue accuracy

Behavioral accuracy was examined in all three cue learning sessions:

feedback control, perception decision, and declarative memory interference

(Figure 5.2C). One question of interest was to examine differences within each

session type as it varied according to cue difficulty. A second question of interest

was to examine behavioral accuracy across sessions in order to determine if

155

accuracy was greater for one session compared with another (e.g., feedback

control > declarative memory). Lastly, as this was a demanding task, it was

hypothesized that participants’ accuracy would improve over time (early/first

block vs. late/second block of learning). Therefore, the first block (trials 1-20) of

each session was used as the early time factor, while the second block (trials 21-

40) of each session was used as the late time factor. In order to examine these

questions, a 3 (learning session: feedback control vs. perception decision vs.

declarative memory interference) x 2 (cue difficulty: easy vs. hard) x 2 (time:

early vs. late) within-subjects, repeated-measures ANOVA was performed.

Results indicated a main effect of session type (F(1.65,37.89) = 16.60; p < 0.001), a

main effect of cue difficulty (F(1,23) = 9.56; p < 0.01), a marginally significant effect

of time (F(1,23) = 3.09; p = 0.09), a marginally significant interaction of cue

difficulty by time (F(1,23) = 3.89; p = 0.06), and a significant three way interaction

(session type x cue difficulty x time) (F(2,46) = 3.40; p < 0.05). Post-hoc t-tests

were performed in order to examine the significant main effects and interactions.

When examining the main effect of session type, results revealed better

performance for cues learned in the feedback control (83%) and perception

decision session (78%) compared with cues in the declarative memory

interference session (63%) (t(23) = 5.76; p < 0.0001 and t(23) = 3.54; p < 0.005,

respectively), with no accuracy differences observed between the feedback and

perception decision control sessions (t(23) = 1.60; p > 0.05). A post-hoc t-test was

also performed in order to examine the main effect of cue difficulty result, and as

expected performance was significantly greater for easy (80%) compared with

156

hard (70%) cues (t(23) = 3.13; p < 0.005). In addition, a post-hoc t-test was

performed to examine the marginally significant main effect of time, which as was

theorized, was caused by nearly better performance in the second block (76%)

compared with the first block (73%; t(23) = 1.71; p = 0.1). The marginally

significant interaction of cue difficulty by time was driven by significantly greater

performance for easy (80%) than hard (67%) cues in the first block (t(23) = 3.52; p

< 0.005), but only marginally better accuracy for easy (79%) compared with hard

(73%) cues in the second block (t(23) = 1.76; p = 0.09). This result was most

likely driven by the increase in accuracy for the hard cues over time (t(23) = 2.25;

p < 0.05), but not the easy cues (t(23) = 0.27; p > 0.05). Post-hoc t-tests were

also performed to examine the significant three way interaction (session type x

cue difficulty x time). This interaction was driven by significant changes in

performance from the first to the second learning block in the feedback control

learning session only (81% to 85%), with no changes observed over time in

either the perception decision (77% to 79%) or the declarative memory

interference sessions (61% to 65%; all p values > 0.05). In the feedback control

session, performance for hard cues increased over time (t(23) = 3.31; p < 0.005)

while performance for easy cues was marginally worse in the second compared

with the first block (t(23) = 2.14; p = 0.04; trend after sequential Bonferroni

correction).

Correlations between natural scene sensitivity (d’) and probabilistic cue accuracy

Two correlations were performed in order to examine potential

relationships between participants’ ability to perform the probabilistic cue learning

157

task concurrently with the scene discrimination tasks (both the perception

decision detection task and declarative memory interference task). Specifically,

one correlation was conducted to probe a potential relationship between

participants’ ability to differentiate between water/no-water scenes while learning

the probabilistic cue values in the perception decision session. Participants’ hit

rate, false alarm rate, and probabilistic cue accuracy for easy and hard cues

were entered into a Pearson’s correlation analysis. Results revealed a significant

positive correlation between participants’ hit rate and easy cue accuracy

(r=0.510, p = 0.01), but not hard cue accuracy (r=0.275, p = 0.19). A second

correlation was conducted to examine a potential relationship between

participants’ ability to discriminate between old and new scenes while learning

probabilistic cue values. No significant relationships were observed between

participants hit rate, false alarm rate, and probabilistic easy and hard cue

accuracy (all p values > 0.05).

Day 2 test phase

A test phase was performed while participants remained inside the MRI to

probe for behavioral performance differences on probabilistic cues (easy and

hard) that were acquired across the three session types. The first analysis

probed for potential differences in session type, irrespective of cue difficulty, by

conducting a repeated-measures, within-subjects ANOVA using session type as

a within-subjects factor (feedback control, perception decision, declarative

memory interference, and novel). A significant main effect of session type was

observed (F(2.11,48.62) = 13.21; p < 0.001). Post-hoc t-tests indicated significantly

158

greater performance for cues presented in all sessions compared with novel cues

(feedback control vs. novel: t(23) = 5.18; p < 0.0005; perception decision vs. novel:

t(23) = 4.54; p < 0.0005; declarative memory interference vs. novel: t(23) = 3.19; p

< 0.005). Marginally significant better accuracy was observed for cues previously

presented in the feedback control session (81%) compared with the declarative

memory interference session (64%; t(23) = 2.19; p = 0.04; trend after sequential

Bonferroni correction). No other significant differences were observed between

cues previously presented in the various learning sessions (all p values >0.05).

The second analysis examined differences across session types and

levels of cue difficulty, excluding novel cues. In order do so, a 3 (learning

session: feedback control vs. perception decision vs. declarative memory

interference) x 2 (cue difficulty: easy vs. hard) repeated measures, within-

subjects ANOVA was performed. Results revealed a main effect of session type

(F(1.21,27.87) = 3.88; p < 0.05), no main effect of cue difficulty (F(1,23) = 0.07; p >

0.05), and no interaction of cue difficulty and session type (F(2,46) = 0.60; p >

0.05). Post-hoc t-tests indicated marginally greater accuracy for the feedback

control compared with the declarative memory cues, collapsed across difficulty

(t(23) = 2.19; p = 0.04; trend after sequential Bonferroni correction).

5.3.2 Neuroimaging results

Learning phase analyses

Region of Interest Analyses: In order to examine the engagement of

learning and memory regions in an unbiased manner during the learning

sessions, and to try to test the hypothesis that the MTL and BG correlate during

159

inconsistent learning conditions, independent regions of interest were drawn. It

was hypothesized that regions of the MTL and BG would correlate during the

control sessions (feedback and perception decision) most strongly for the

inconsistent learning condition (hard cues) as the MTL may be online and

available to assist in probabilistic cue learning. It was further hypothesized that

these regions would not correlate during the declarative memory interference

condition as the MTL would be engaged in the declarative memory task rather

than the probabilistic cue learning task. Specifically activity within the bilateral

hippocampus, bilateral ventral striatum, bilateral midbrain, and left caudate

nucleus was examined. Two sets of analyses were performed. The first analysis

examined activity within each ROI during all learning sessions independently. In

the feedback control session, this analysis therefore examined differences in

BOLD signal according to levels of cue difficulty; in the perception decision

session, this analysis examined differences in BOLD signal across levels of cue

difficulty and the presence or absence of water in the natural scenes; and in the

declarative memory session, this analysis examined differences in BOLD signal

across levels of cue difficulty and the declarative memory scene condition (old or

new scene).

The second analysis examined activity within each ROI across different

session types (feedback control vs. perception decision vs. declarative memory

interference) and levels of cue difficulty (easy vs. hard), collapsed across the

memory scene condition (water/no-water or old/new). Results are presented

below for the first analysis (activity in each learning session independently)

160

followed by the results from the second analysis (comparison of activity within

each ROI across learning sessions).

Feedback control session: In the feedback control session, paired-

samples t-tests were performed in each ROI to probe for differences in the BOLD

signal response to easy and hard probabilistic cues. The right ventral striatum

exhibited significantly greater responses to hard than easy cues (t(23) = 2.79; p <

0.05), while marginally greater responses for hard compared with easy cues was

observed in the left ventral striatum (t(23) = 1.82; p = 0.08). No other regions of

interest displayed any modulation in activation by cue difficulty.

Perception decision session: In the perception decision session, paired-

samples t-tests were performed in each ROI to explore potential differences in

the mean BOLD signal in response to easy versus hard cues; however no

significant differences were observed in any region of interest in this session.

Declarative memory interference session: In the declarative memory

session, modulation of the mean BOLD signal in response to levels of cue

difficulty as well as old versus new declarative memory scenes was examined.

The right midbrain exhibited greater BOLD responses to hard compared with

easy cues (t(23) = 2.18; p < 0.05) [a 2 (memory scene: old vs. new) x 2 (cue

difficulty: easy vs. hard) within-subjects, repeated-measures ANOVA was

conducted, results indicated no main effect of memory scene (F(1,23) = 0.87; p >

0.05); a main effect of cue difficulty (F(1,23) = 4.76; p < 0.05); and no interaction of

memory scene by cue difficulty (F(1,23) = 0.05; p > 0.05)]. No other significant

effects were observed.

161

Session type comparison: A comparison of the mean BOLD signal within

each ROI across learning sessions (feedback control vs. perception decision vs.

declarative memory interference) was also performed. In order to do so, seven

separate 3 (session type: feedback control vs. perception decision vs. declarative

memory interference) x 2 (cue difficulty: easy vs. hard) repeated-measures,

within-subjects, ANOVAs were conducted (one per ROI) using session type and

cue difficulty as within-subjects factors. Interestingly, every ROI except the left

caudate nucleus exhibited a significant main effect of session type and no

significant main effect of cue difficulty; only the right ventral striatum displayed an

interaction of session type by cue difficulty.

For each ROI, the main effect of session type was driven by greater BOLD

responses in the declarative memory interference session compared with both

the perception and feedback control sessions (except for the caudate nucleus,

which exhibited no main effect of session type; representative results are shown

in Figure 5.3 – beta plots are exhibited for the left ROIs as the pattern of activity

was similar across bilateral ROIs). The right hippocampus mean BOLD signal

scaled according to session type with declarative memory interference >

perception decision > feedback control. All other regions exhibited no differences

between the mean BOLD signal elicited in response to feedback versus the

perception decision control sessions. These results suggest that the

hippocampus, ventral striatum, and ventral tegmental area are all more strongly

engaged during the memory session than the control sessions, but interestingly

162

are not modulated by probabilistic cue difficulty. The results of this analysis are

presented below according to ROI:

1) Right hippocampus: main effect of session type (F(2,46) = 13.09; p <

0.001); no main effect of cue difficulty (F(1,23) = 0.59; p > 0.05); no

interaction of session type and cue difficulty (F(2,46) = 1.16; p > 0.05).

Declarative memory interference session > perception decision (t(23) =

2.26; p < 0.05); perception decision > and feedback control session

(t(23) = 3.00; p < 0.01).

2) Left hippocampus: main effect of session type (F(1.59,36.45) = 8.37; p <

0.005); no main effect of cue difficulty (F(1,23) = 0.07; p > 0.05); no

interaction of session type and cue difficulty (F(2,46) = 0.73; p > 0.05).

Declarative memory interference session > perception decision and

feedback control session (t(23) = 3.52; p < 0.005 and t(23) = 4.48; p <

0.0005, respectively), with no differences observed between BOLD

response to the control conditions (t(23) = 0.95; p > 0.05; Figure 5.3A).

3) Right ventral striatum: main effect of session type (F(2,46) = 3.96; p <

0.05); no main effect of cue difficulty (F(1,23) = 2.06; p > 0.05); an

interaction of session type and cue difficulty (F(2,46) = 5.06; p < 0.05).

Declarative memory interference session marginally > perception

decision (t(23) = 2.32; p = 0.03; trend after sequential Bonferroni

correction); declarative memory interference session > feedback

control session (t(23) = 2.63; p < 0.05, respectively), with no differences

163

observed between BOLD response to the control conditions (t(23) =

0.72; p > 0.05).

4) Left ventral striatum: main effect of session type (F(2,46) = 9.01; p <

0.001); no main effect of cue difficulty (F(1,23) = 2.22; p > 0.05); no

interaction of session type and cue difficulty (F(2,46) = 1.22; p > 0.05).

Declarative memory interference session > perception decision and

feedback control session (t(23) = 4.00; p < 0.001 and t(23) = 4.03; p <

0.001, respectively), with no differences observed between BOLD

response to the control conditions (t(23) = 0.24; p > 0.05; Figure 5.3B).

5) Right midbrain: main effect of session type (F(2,46) = 9.73; p < 0.001);

no main effect of cue difficulty (F(1,23) = 1.10; p > 0.05); no interaction of

session type and cue difficulty (F(1.57,36.01) = 1.29; p > 0.05). Declarative

memory interference session > perception decision and feedback

control session (t(23) = 3.12; p < 0.005 and t(23) = 4.47; p < 0.0005,

respectively), with no differences observed between BOLD response to

the control conditions (t(23) = 1.27; p > 0.05).

6) Left midbrain: main effect of session type (F(2,46) = 9.37; p < 0.001); no

main effect of cue difficulty (F(1,23) = 1.96; p > 0.05); no interaction of

session type and cue difficulty (F(1.55,35.74) = 0.42; p > 0.05). Declarative

memory interference session > perception decision and feedback

control session (t(23) = 2.67; p < 0.05 and t(23) = 4.73; p < 0.0005,

respectively), with no differences observed between BOLD response to

the control conditions (t(23) = 1.49; p > 0.05; Figure 5.3C).

164

7) Left caudate nucleus: no main effect of session type (F(2,46) = 1.13; p >

0.05); no main effect of cue difficulty (F(1,23) = 0.13; p > 0.05); no

interaction of session type and cue difficulty (F(2,46) = 1.00; p > 0.05;

Figure 5.3D).

Independent ROI Correlation Analyses: In order to test the hypothesis

that the mean BOLD signal within the striatum and hippocampus would be

positively correlated during the control conditions, specifically for the inconsistent

learning situation (hard cues), with potential correlations observed between the

midbrain, a series of Pearson’s correlations were performed. Specifically three

correlations, one for each learning session, were conducted. An abundance of

correlations were observed across regions. Therefore, for simplicity of

presentation, a summary of the correlations observed is described below and the

details regarding the correlations (which areas correlated with each other and at

what statistical value) are presented in Appendix 2:

Feedback control session: five positive correlations were observed

across anatomical regions, four occurred in the hard cue condition and one

occurred in the easy cue condition. In the hard cue condition, the hippocampus

correlated with the ventral striatum and caudate nucleus, while the ventral

striatum additionally correlated with the midbrain. In the easy cue condition, the

midbrain correlated with the ventral striatum.

Perception decision session: seventeen positive correlations were

observed across anatomical regions, five occurred in the hard cue condition and

165

twelve occurred in the easy cue condition. In the hard cue condition the midbrain

correlated with the hippocampus and ventral striatum. In the easy cue condition,

the hippocampus correlated with the ventral striatum, the caudate nucleus, and

midbrain; the midbrain additionally correlated with both the ventral striatum and

the caudate nucleus.

Declarative memory interference session: fourteen positive correlations

were observed across anatomical regions, seven in the hard cue condition and

seven in the easy cue condition. In the hard cue condition, the hippocampus

correlated with the ventral striatum, caudate nucleus, and midbrain, while the

ventral striatum correlated with the midbrain. In the easy cue condition, the

hippocampus correlated with the ventral striatum and midbrain, and the ventral

striatum correlated with the midbrain.

To summarize, the correlation results indicate that while there were

significant positive correlations in the control conditions between the striatum,

hippocampus, and midbrain for the hard cues, positive correlations were also

observed in the declarative memory interference condition and in all conditions

for the easy cues. The observation of positive correlations in all learning

sessions for both easy and hard cues indicates that these regions did not

correlate exclusively when the learning condition was inconsistent (hard cue

condition). However, it is thought provoking that several more correlations were

observed across anatomical regions in the more challenging conditions

(perception decision and declarative memory interference) compared with the

166

feedback control condition. A possible interpretation of this result is presented in

the discussion.

Stimulus Presentation & Feedback Presentation Period Whole Brain

Analyses: As described in the methods section, the stimulus presentation period

of the trial, when the probabilistic cue and natural scene were presented on the

screen, was examined in order to probe areas of the brain that were involved in

processing differences in cue difficulty and memory information (old and new

scenes). Additionally, the feedback period of the trial was examined in order to

probe for differences in the brain while processing feedback regarding the

probabilistic cues. However, no BOLD activation was observed during either of

the control conditions (feedback or perception decision) in any area of interest

(MTL, BG, midbrain) at a threshold of p<0.005, uncorrected during the stimulus

or the feedback presentation periods. Therefore, this hypothesis could not be

tested by examining functional activation in these regions during either the

stimulus presentation or the feedback presentation period and as a

consequence, no results are reported during these time periods.

5.4 Discussion

The purpose of the current experiment was to investigate interactions

between memory systems during feedback-based probabilistic learning

concurrent with declarative memory interference. Behavioral sensitivity (d’) as

well as accuracy measures revealed that participants successfully encoded

natural scenes (day 1) and subsequently remembered them the following day

167

while performing a feedback-based probabilistic cue learning task. Participants’

accuracy during the feedback-based learning task was significantly modulated by

the presence of the concurrent declarative memory task, such that performance

during the control sessions was greater than the declarative memory interference

session. In addition, behavioral differences in levels of cue difficulty (easy cue

accuracy > hard cue accuracy) were only observed in the feedback control

condition, suggesting that session type had a stronger influence in modulating

behavior. Neuroimaging results revealed that within independent regions of

interest, BOLD activity was also modulated by session type, perhaps tracking

these behavioral measures. Likewise, only one region, namely the ventral

striatum exhibited differences in BOLD responses according to cue difficulty in

the feedback control condition. No ROIs were modulated by cue difficulty in the

perception decision session, and in the declarative memory interference session

only the right midbrain’s BOLD signal was modulated by cue difficulty. The right

ventral striatum and right midbrain both displayed greater BOLD responses to

hard compared with easy cues, replicating a similar result observed in

experiment 2. Lastly, a series of Pearson’s correlations revealed widespread

positive correlations across ROIs in all learning sessions, with the most

correlations observed in the dual-tasking sessions. Interestingly, no negative

correlations were observed across these independently defined regions. The

significance of these results is postulated below.

Behavioral results indicated that participants successfully attended to the

presentation of natural scenes during the day 1 encoding session, as

168

demonstrated by the high hit rate (97%) on a perceptual detection task during

encoding. Furthermore, participants performed the perception decision task

(water detection) well, while simultaneously performing feedback-based

probabilistic cue learning on day 2. Behavioral sensitivity (d’) was significantly

worse on the water detection task on the second day, when learning the

probabilistic cues; however, a hit rate of 93% is still indicative of participants’

ability to successfully detect water in the new natural scenes despite having to

dual-task. Participants’ sensitivity at detecting old versus new natural scenes in

the declarative memory interference session was significantly worse compared

with the perception decision detection task (hit rate: 61% versus 93%), indicating

that the tasks were not equated in their difficulty. This result complicates

interpreting the neuroimaging results, as described below. However, participants

did discriminate between old versus new scenes at above chance levels,

suggesting that the task was not impossible to complete.

An examination of behavioral probabilistic cue accuracy revealed that, as

expected, participants were better at learning cue values in the feedback control

condition (83%) compared with the declarative memory interference condition

(63%). However, it was hypothesized that accuracy would scale according to

session type, such that feedback control > perception decision > declarative

memory interference. Contrary to this hypothesis however, no significant

difference was observed between participants’ cue accuracy in the feedback

control compared with the perception decision session (83% ≅ 78%; p = 0.12). It

appears that this form of dual-tasking did not significantly affect participants’

169

ability to learn the probabilistic cues. In fact, a correlation analysis conducted on

participants’ hit rate and probabilistic cue accuracy revealed a significant positive

relationship for probabilistic easy cues only, in the perception decision session.

No such positive relationship was observed for the hard cues. This result

indicates that participants’ ability to accurately perform these distinct tasks

simultaneously was related. A similar correlation performed for the declarative

memory interference session produced no significant relationships between

participants’ hit rate, false alarm rate, or cue accuracy, for neither easy nor hard

cues. This result may suggest that perhaps participants solved these distinct

tasks in different ways and that their ability to perform each task: old/new scene

discrimination versus probabilistic cue learning was unrelated.

The main effect of session type observed in the bilateral hippocampus,

ventral striatum, and midbrain ROIs is an interesting and unexpected finding.

These reward-related learning and memory regions displayed a global increase

in the mean BOLD signal in the declarative memory interference session

compared with the control sessions. Two theories were formulated to interpret

this finding. The first theory is that participants’ motivation to gain increased

monetary payment modulated the BOLD signal in key learning and memory

ROIs. It is possible that participants were motivated to perform well, especially in

the declarative memory interference condition, and as a result of this motivation,

ramping up of activation in these areas occurred during this session. In support

of this theory, Adcock and colleagues (2006) demonstrated modulated BOLD

signal in the exact regions examined in this experiment (ROIs, except for the

170

caudate nucleus, were drawn from this paper), in a declarative memory encoding

task. BOLD signal within these areas varied depending on the reward value

associated with to-be-learned-stimuli. In Adcock and colleagues study,

participants performed a declarative memory encoding task (viewed natural

scenes) while undergoing functional neuroimaging. Critically, the presentation of

each scene was preceded by a monetary value, either $5 or 10 cents which

indicated to the participant how much they would be paid if they subsequently

remembered the scene correctly on a follow up test. Interestingly, only high-

reward scenes ($5) that were subsequently remembered on a behavioral test

administered the next day elicited greater BOLD signal in the hippocampus,

ventral striatum, and midbrain. High-reward scenes that were subsequently

forgotten as well as low-reward scenes (that were both remembered and

forgotten) did not elicit increased engagement of these learning and memory

regions preceding encoding. Furthermore, greater correlation between the

hippocampus and the right midbrain (VTA) for high-reward compared with low-

reward scenes, and for high-reward scenes that were subsequently remembered

was observed. This data provides strong evidence that reward-related

motivation influences learning by specifically predicting the information which is

subsequently remembered. The authors concluded that their results are in

accord with the hypothesis that reward-motivated dopamine release in the

hippocampus prior to learning promotes memory formation (Lisman and Grace,

2005; Shohamy and Adcock, 2010).

171

In accord with Adcock et al.’s result, Wittman and colleagues (2005)

demonstrated that pictures which predicted monetary reward versus those that

did not were associated with greater BOLD activation in the substantia nigra.

Furthermore, on a test administered three weeks later, participants were more

accurate at remembering the pictures which predicted reward versus those that

did not. Additionally midbrain and hippocampal BOLD activation was greater for

subsequently recognized reward-predicting pictures versus those that were

forgotten. Therefore, this study also highlights a critical relationship between

reward-related motivation and learning and memory. An additional line of

research examining genetic polymorphisms in dopamine clearance pathways

among human individuals supports the hypothesis that dopamine is involved in

human memory formation (Meyer-Lindenberg et al., 2005; Schott et al., 2006). A

relatively recent study demonstrated differential degrees of engagement of

reward-related memory structures depending on individual polymorphisms in

dopamine clearance pathways (Schott et al., 2006). In this experiment,

polymorphisms in the dopamine transporter (DAT1) gene affected levels of

midbrain activation in an episodic encoding task. Additionally, catechol-O-methyl

transferase (COMT) polymorphisms modulated activity in the right prefrontal

cortex, such that participants with low COMT activity displayed stronger coupling

between the hippocampus and PFC during the encoding task. The authors

concluded that individual genetic variations in dopamine clearance pathways in

the brain effect both prefrontal cortex and midbrain BOLD activation in an

172

episodic encoding task, implicating a strong role for dopamine in human memory

formation.

Therefore based on the literature, it is possible that the engagement of

reward-related learning and memory structures was modulated by participants’

motivation to earn monetary compensation. Perhaps participants were highly

motivated to perform well in especially the declarative memory interference

condition as they were aware that their payment would be determined based on

how well they learned the probabilistic cues in all conditions and how accurately

they discriminated between old and new natural scenes. Unfortunately, no

questionnaire addressed this theory directly by asking participants how motivated

they were to perform in each session individually and is therefore a limitation of

this study. The potential role of dopamine, in particular, in influencing this global

BOLD signal increase in reward-related learning structures is only moderately

supported by the observation that a region of the dorsal striatum, namely the

caudate nucleus, did not display such a global increase in BOLD responses

during the declarative memory interference session. Given that midbrain

projections do not reach the dorsal components of the caudate nucleus (Devan

and White, 1999; Haber and Knutson, 2010), it is possible that a lack of

dopamine modulation in this area resulted in no differentiation of the BOLD signal

by learning session. However, this theory is impossible to test in the current

paradigm and therefore should be regarded with caution, especially given that

null findings in neuroimaging studies are not indicative of any particular finding.

This theory may represent an area of future studies, namely exploring differential

173

BOLD activation within regions of the brain that receive direct dopamine

modulation versus those that do not in reward-related learning and memory

tasks.

The second possible explanation for the observed increased BOLD

activation during the declarative memory interference session is that it was the

most cognitively demanding session. Because it was the hardest condition, as

demonstrated by behavioral sensitivity and accuracy measures, this session may

have recruited the greatest neural activation, eliciting the largest metabolic

demand, and causing the most substantial increase in BOLD responses in

reward-related learning structures. This explanation is in accord with one theory

regarding the nature of the BOLD signal (Heeger and Ress, 2002). Additionally,

the literature supports the notion that some brain regions do correlate with

increased task demand, in particular several prefrontal regions, including the

posterior dorsolateral prefrontal cortex BA44/46 (see Buckner and Wheeler, 2001

for review). However, such an increase was not observed in the caudate

nucleus, which may be an indication that this increased BOLD signal was not

uniform throughout the brain in the current paradigm. Given that the perceptual

decision and declarative memory interference sessions were not equated in their

difficulty makes it challenging to disentangle these two interpretations of the

neuroimaging results, and is a limitation of this study. If the two tasks had been

equated in difficulty, it would have strengthened the argument that learning and

memory regions are ramping up activation due to reward-related motivational

influences.

174

The results from the independent ROI analyses therefore, did not support

the hypothesis that the MTL and BG cooperate, as measured by simple

correlations, during inconsistent learning conditions (hard probabilistic cue

condition) and operate independently during more consistent learning conditions

(easy probabilistic cue condition). Examining the mean BOLD signal extracted

from these a priori regions of interest revealed that rather than tracking

differences in cue difficulty in individual sessions, all ROIs, with the exception of

the caudate nucleus, exhibited increased BOLD signal in the declarative memory

interference session compared with both the control sessions (perception

decision and feedback learning). Furthermore, an abundance of correlations

were observed across the ROIs for both the easy and hard cues in all learning

sessions. Therefore, while these data do not support the exclusivity of the

hypothesis generated (correlations for only inconsistent learning condition), the

data do not completely refute it either, as there were positive correlations across

regions for hard cues in the control sessions. Additionally, it is possible that in

the present task design, the declarative memory interference condition itself

functioned as the challenging learning condition, rather than the hard probabilistic

cue condition. Evidence supporting this theory comes from a post-experimental

questionnaire in which participants were asked to indicate which session was the

most difficult for them. The majority of participants indicated that the declarative

memory interference session was the hardest (14 participants), although others

indicated that the perceptual decision session was the most challenging (9

participants; 1 person stated that the feedback control session was the most

175

difficult). If in fact the declarative memory interference session was the most

difficult, then the data do somewhat support the theory that the MTL and BG

cooperate when the learning environment is challenging, as demonstrated by

worse behavioral performance in the declarative memory interference session as

well as an increased number of correlations in this session compared with the

feedback control condition. However, the lack of behavioral interference and

large number of positive correlations across regions in the perception decision

session make this theory weak.

To summarize, these results suggest that distinct learning and memory

regions are more globally engaged when simultaneously learning the value of

probabilistic cues via feedback and performing a declarative memory recognition

task than when performing less cognitively demanding tasks (control conditions).

Although the data in this experiment did not support the proposed hypothesis,

employing a behavioral “knockout” of the MTL did provide new insights into the

engagement and interactions of multiple memory systems (modulation of

behavior and BOLD activation across multiple memory regions, as well as the

number of positive correlations across regions, by learning session type) and

may be an area of further research in the future. In particular it is still unknown

under what conditions multiple memory systems compete versus cooperate

during learning.

176

Chapter Six: General Discussion

6.1 Purpose & Summary of Experiments

The purpose of the experiments performed in this dissertation was to

examine the engagement and interactions between multiple human memory

systems during probabilistic learning. Specifically the engagement and

interactions between the medial temporal lobe, with focus on the hippocampus,

and the basal ganglia, with emphasis on the striatum was examined during

feedback and observation probabilistic learning. The tasks employed

investigated this question in young healthy human participants using both within

and between-subjects, event-related designs which allowed for the examination

of BOLD activity within regions to discrete events across the dynamic learning

process. Multiple types of learning as well as variations in cue difficulty were

employed in order to probe how distinct brain substrates, namely the MTL and

BG, were modulated in multiple learning environments. The key hypothesis for

the experiments performed in this dissertation was that multiple memory

systems, specifically the MTL and BG, operate in parallel – they are online

simultaneously, engaged in learning and may operate independently by encoding

unique information or may compete or cooperate directly. Furthermore, it was

hypothesized that interactions between these anatomical regions may be

facilitated by dopaminergic midbrain activity, as investigated by the application of

reinforcement learning models which are postulated to represent a dopaminergic

learning signal in experiments 1 and 2.

177

Nature of representations in the MTL and BG: It is evident that the BG

and the MTL are involved in probabilistic learning as demonstrated by the results

in the current studies as well as an abundance of evidence in the literature.

However, important questions to consider include what is the nature of

representations in each of these regions? Is it the same or distinct? While these

are difficult questions to illuminate completely in the experiments utilized in this

dissertation, one may theorize about the representations encoded in the MTL

and BG during the feedback and observation tasks in the present experiments.

As the MTL is often implicated in encoding stimulus-stimulus associations and

episodic information (Gluck and Myers, 1993; Bunsey and Eichenbaum, 1995;

Eichenbaum and Bunsey, 1995) as well as recently implicated in tracking reward-

related information (Li et al., 2011), it is theorized that for the observation trials in

experiments 1 and 2, the MTL encoded the stimulus-stimulus associations of the

cue (stimulus 1) and the informative arrow (stimulus 2). Over the course of

learning, as these two stimuli are repeatedly paired (e.g., circle and upward-

facing arrow) the MTL may have encoded the value for the circle as higher than

5, binding these two associations together. For the feedback trials, since the trial

structure is distinct, it is possible that the MTL encoded either the episodic

experiences of incrementally learning the cue-outcome associations over time

(participant remembers previous episodes of receiving correct feedback for a

cue, which then informs the current cue-value choice) or that the hippocampus

was tracking the reward value of the probabilistic cues (differentiates between

correct and incorrect feedback; consistent with Li and colleagues 2011). The

178

basal ganglia however, may have encoded distinct representations during both

feedback and observation learning trials than the MTL. For the feedback trials,

one hypothesis is that the basal ganglia, in particular the striatum, was involved

in feedback learning tracking goal outcomes and updating cue contingencies via

error correction, consistent with the literature. During observation learning, the

striatum may have been involved in action selection as well as encoding cue-

arrow associations, in accord with Seger’s theory of the role of the BG during

category learning (2008).

One manner of testing the above hypotheses and examining whether or

not the MTL and BG were encoding similar or distinct information may be to

examine their ability to flexibly use previously acquired cue-outcome associations

in a novel manner. It has been suggested that the hippocampus is involved in

flexibly using knowledge in novel scenarios whereas the BG is not (Myers et al.,

2003). Therefore, a probe test could have been administered for both trial types

in order to determine in if a new context, where participants would need to

flexibly use previously acquired cue-value information in a novel manner, the

hippocampus would be engaged in the flexible usage of the knowledge, while the

BG would not. An alternative idea would be to track the BOLD responses over

the learning phase in a trial by trial basis. BOLD responses in the striatum may

initially have been driven by responses to the feedback period, but as learning

progressed it may have shifted to the cue period. Examining these phases (cue

versus feedback) distinctly could have allowed for decoupling of such activation.

However, the task was not designed to address this question and there was no

179

inter-stimulus-interval placed between the cue and feedback periods. As a

consequence, examining the cue and feedback periods separately would be

challenging and specifically the results obtained by examining the feedback

period may have been skewed by the BOLD responses in the cue period.

Some evidence supporting the theory that the MTL may have been

encoding stimulus-stimulus associations was observed in the BOLD responses to

consistent versus inconsistent information. In both experiments 1 and 2, the

striatum (caudate nucleus and putamen) was not modulated by cue consistency,

whereas the MTL (hippocampus and parahippocampus) was modulated by the

consistency of the observation cues. In both studies, the MTL exhibited greater

BOLD responses (sometimes manifested as the least negative BOLD responses)

to inconsistent compared with consistent cue values, perhaps encoding a

mismatch signal between the expected (e.g., circle paired with upward-facing

arrow) and actual (e.g., circle paired with downward-facing arrow) cue-values

observed during learning. It is interesting that only the MTL regions were

modulated by cue consistency, and not the striatum, suggesting that perhaps

these areas may have encoded distinct representations of cue values, and that

the MTL was involved in binding stimulus-stimulus associations and encoding

mismatch signals, consistent with the literature (Gluck and Myers, 1993; Bunsey

and Eichenbaum, 1995; Eichenbaum and Bunsey, 1995; Ploghaus et al., 2000;

Chen et al., 2011).

The main findings from experiments 1-3 support the hypothesis that the

MTL and BG are engaged simultaneously during feedback (experiments 1-3) and

180

observation (experiments 1 and 2) probabilistic learning and operate in parallel,

by at times exhibiting parallel processing, and at times interacting directly via

cooperation or competition. Evidence supporting each manner of interaction is

summarized below:

Evidence of parallel engagement: Parallel processing for the purpose of

this dissertation is defined as the engagement of multiple memory regions in the

same learning session, while displaying unique patterns of BOLD activation.

This result suggests that both areas have the capability of being engaged

simultaneously during the learning process, but are encoding unique information,

in accord with White and McDonald’s theory (2002). The examination of the

brain substrates involved in feedback versus observation learning revealed areas

of the ventral striatum which exhibited greater BOLD signal to feedback learning

trials than observation learning trials in experiment 1 (as shown by a main effect

of learning type in the learning phase). A similar effect was observed in

experiment 2, in which regions of the ventral striatum and midbrain exhibited

stronger engagement during feedback than observation learning trials (as shown

by an interaction of learning type by group in the update phase). Interestingly,

only regions within the basal ganglia were modified by a main effect of learning

type, while no areas were identified in the MTL exhibiting such a differentiation.

Several neuroimaging papers have shown that feedback and reward processing

engages the ventral striatum (for review see Delgado, 2007), thus, it is not

surprising that this area was recruited more strongly during feedback learning,

which contained outcome-relevant information. It may have been expected that

181

the MTL would be selectively modulated during observation learning, as

previously reported (Poldrack et al., 2001); however this effect was not observed.

While a null result in neuroimaging is not indicative of any particular finding and

the details of the paradigms employed in this dissertation differ from previous

probabilistic learning studies, it is possible that the MTL BOLD signals observed

in experiments 1 and 2 indicated mutual activation during both feedback and

observation learning – as suggested by the main effect of cue difficulty analyses.

Therefore, this result suggests that at times, memory systems may operate

independently and encode unique information.

In further support of parallel engagement during learning, Granger

causality analyses, which allowed for an investigation of both functional and

effective connectivity across the brain during feedback and observation learning

distinctly (experiment 1) and concurrently (experiment 2), revealed recruitment of

a common network of neural substrates during learning. Results from both

experiments supported the engagement of a similar network of regions, including

the striatum, medial temporal lobe, and midbrain. In experiment 1, using the

hippocampus as the seed region revealed functional connectivity with the

caudate nucleus, while using the caudate nucleus as a seed region revealed

connectivity with the hippocampus. In experiment 2, using the midbrain as a

reference region elicited functional connectivity with a larger network of regions

including the caudate nucleus/putamen, globus pallidus, hippocampus, and

parahippocampus. Furthermore, effective connectivity results implicated a

potential unilateral flow of information whereby the putamen sent directed

182

influences to the midbrain (VTA) and the midbrain sent directed influences to the

hippocampus and parahippocampus. This observation that key learning and

memory regions were functionally and effectively connected during probabilistic

learning is strengthened by the fact that three reference regions employed in the

analyses: the caudate nucleus, the hippocampus, and midbrain, all produced

predominately converging results. The theoretical implications of these results

suggest that rather than the MTL, BG, and midbrain functioning independently

during learning, instead dynamic interactions may be occurring between these

areas such that they influence each other during both initial acquisition and

reversal of probabilistic information. Effective connectivity results suggest that

this interaction may be occurring via connectivity with midbrain dopaminergic

areas.

Evidence of cooperative interactions: Cooperative interactions for the

purpose of this dissertation are defined as the exhibition of similar patterns of

BOLD signal in both the MTL and BG during learning as well as positive

correlations across these distinct regions. In experiment 1, a region of the

caudate nucleus and hippocampus both displayed greater BOLD signal to easy

compared with hard probabilistic cues for feedback and observation trials. In

experiment 2, one region of the basal ganglia, the putamen, as well as multiple

areas of the MTL, including the hippocampus and parahippocampus, exhibited

greater BOLD signal in response to both deterministic and easy compared with

hard cues. This result is in agreement with studies in the literature which have

also documented similar patterns of activity in the MTL and BG during simple

183

visuomotor learning tasks (Toni et al., 2001; Amso et al., 2005; Law et al., 2005;

Haruno and Kawato, 2006) as well as information integration category learning

tasks (Cincotta and Seger, 2007). Furthermore, multiple positive correlations

across distinct regions (MTL, BG, and midbrain) were observed in all three

experiments. In fact negative correlations, which have previously been used as

evidence of competition (Poldrack et al., 2001; Seger and Cincotta, 2006), were

never observed across these distinct learning and memory structures, even when

the BOLD signal displayed opposite patterns of activity across regions

(experiment 2) as well as when the BOLD signal was investigated using

independent regions of interest (experiment 3). Lastly, both loci within the MTL

and BG were involved in encoding a feedback-based prediction error signal in

experiments 1 and 2. This result suggests three things, one that a feedback

learning signal is encoded in the MTL, in addition to the basal ganglia, which has

previously been shown in the literature (O'Doherty et al., 2004; Tanaka et al.,

2004; Pessiglione et al., 2006); second that distinct memory regions may at times

encode the same learning signal; and third, that dopamine may be facilitating

interactions among discrete regions during learning in accord with the effective

connectivity results observed in this dissertation as well as the theory that

dopamine modulates reward-related learning (Lisman and Grace, 2005;

Shohamy and Adcock, 2010).

Evidence of competitive interactions: Competitive interactions in this

thesis are defined as the exhibition of opposing patterns of BOLD signal in both

the MTL and BG during learning and negative correlations across distinct

184

regions. In experiment 2, multiple regions of the basal ganglia, including the

ventral striatum, caudate nucleus, and putamen, as well as multiple areas of the

MTL, including the hippocampus and parahippocampus, exhibited opposing

patterns of activation in response to varying levels of probabilistic cue difficulty.

Specifically, the MTL displayed greater responses to easy than hard cues,

whereas the loci within the basal ganglia exhibited greater activation in response

to hard than easy cues. Opposing patterns of BOLD signal across distinct brain

regions has been interpreted as competitive interactions in the literature

(Poldrack et al., 2001; Foerde et al., 2006).

6.2 Limitations

Perhaps the largest limitation of this dissertation is that it is very difficult to

examine interactions between multiple memory systems using functional

magnetic resonance imaging. While this question is an active area of research in

the functional neuroimaging field, it is one that contains rather significant

challenges. The primary concern is that using BOLD as a dependent measure, it

is difficult to examine if distinct regions are actually functionally interacting.

Simply because the pattern of BOLD responses observed in discrete regions is

similar or distinct, does not necessarily indicate that the regions are interacting.

Therefore, the definitions of cooperative and competitive interactions during

learning used in this dissertation, while consistent with the literature, should be

considered with some amount of caution. Specifically because of this limitation,

more sophisticated analyses were employed including a Granger causality

185

analysis as well as a prediction error analysis in order to explore in a more in

depth manner how distinct memory systems interact during learning.

The major differences between the feedback and observation versions of

tasks employed in this dissertation were outlined in the materials and methods

sections of experiments 1 and 2. Despite their differences, however, the two

learning sessions share the common goal of learning the value of probabilistic

cues. As such, participants may have engaged in a variety of cognitive

strategies in order to successfully complete the task. For example, as learning

progressed over time in the feedback session it is possible that during the

stimulus presentation period participants may have employed a more declarative-

based cued-recall strategy. Participants may have also engaged in verbal

rehearsal strategies, irrespective of the task version, during the learning phase.

Research investigating how participants solve a distinct probabilistic learning

task, the WPT, may provide insight into possible declarative and nondeclarative

components of category learning in addition to exploring the declarative

knowledge that participants posses during these types of learning tasks (Gluck et

al., 2002; Meeter et al., 2006). Newell and colleagues (2007) recently reported

that participants had comparable declarative knowledge on a feedback and

observation version of the WPT. As a result, the authors suggested that the

feedback version of the WPT may not be an exclusively nondeclarative task.

Corroborating this theory, Meeter and colleagues (2008) have suggested that

participants may solve the WPT via several distinct strategies including

engagement of rule-learning, incremental learning (both of which are thought to

186

engage the BG), memorization techniques (MTL dependent), or some

combination of these three strategies. Furthermore, Shohamy and colleagues

(2004b) suggest that in order to solve probabilistic categorization tasks

participants most likely recruit multiple parallel learning systems. Therefore, it is

quite possible that both the observation and feedback versions of the tasks

employed in experiments 1 and 2 contain some declarative and nondeclarative

components. As a result, the use of the terms declarative and nondeclarative in

this context are meant as a reference, and are not meant to indicate the sole

manner in which participants may solve the tasks.

The possibility that the tasks used in experiments 1 and 2 contain

elements of declarative and nondeclarative learning may have contributed to the

observation that the hippocampus and striatum were involved in both observation

and feedback learning – primarily modified by cue difficulty rather than learning

type. The involvement of multiple cognitive operations (e.g., cued recall or

rehearsal strategies) therefore, may have facilitated the mutual engagement of

these regions. One limitation of the studies employed in this dissertation is that

this possibility cannot be eliminated definitively. Future studies may be better

able to parse out the distinct contributions that multiple cognitive processes may

have on these tasks and the subsequent BOLD signals observed in the MTL and

BG.

In addition, some limitations of the prediction error analyses employed in

experiments 1 and 2 exist. Namely, it is difficult to assess at which time point in

the trial (given the task designs – specifically no jittered ISI between the cue and

187

feedback presentation periods) the individual correlations with the PE signals

occurred within the MTL and BG. Further, it is possible that the trial structure

employed did not allow separation of the prediction error signal from an

uncertainty signal (between cue and outcome) that has been shown to be linked

with local field potentials in the anterior hippocampus (Vanni-Mercier et al.,

2009). However, PE signals carry valence information (i.e., an unexpected

negative outcome produces a negative PE signal), while a saliency, novelty, or

uncertainty signal may be positive irrespective of the valence of the outcome

(more akin to the action prediction error used for the observation trials in

experiment 2). The finding of feedback trial PE signals in the MTL is not

commonly reported in the literature and may point to an area of future research

investigating the nature of these signals in the MTL and potential interactions

with the striatum that may underlie parallel processing in these distinct regions

during probabilistic learning.

6.3 Conclusions & Future Directions

To conclude, the set of experiments performed in this dissertation support

the model of parallel processing of the medial temporal lobe and basal ganglia

during probabilistic learning. However, several questions remain unaddressed.

For example, it is unknown under what circumstances memory systems

cooperate versus compete. Experiment 3 of this dissertation was designed to

address this question, but the results were somewhat inconclusive. A second

open question is probing how exactly dopamine modulates motivated learning in

188

the hippocampus and other learning structures (for review see Shohamy and

Adcock, 2010). Lastly, other neurotransmitters are involved in learning and

memory functions, in addition to dopamine, for example acetylcholine (Hasselmo,

2006). The role that multiple neurotransmitters play during human probabilistic

learning also remains to be fully characterized. These are key questions which

will help to illustrate how two major learning and memory systems interact, which

has implications not only for healthy humans, but may also impact our

understanding and treatment of several neuropsychiatric disorders that target

these structures, including Parkinson’s disease, schizophrenia, and MTL

amnesia.

189

References

Abler B, Walter H, Erk S, Kammerer H, Spitzer M (2006) Prediction error as a

linear function of reward probability is coded in human nucleus accumbens. Neuroimage 31:790-795.

Adcock RA, Thangavel A, Whitfield-Gabrieli S, Knutson B, Gabrieli JD (2006) Reward-motivated learning: mesolimbic activation precedes memory formation. Neuron 50:507-517.

Albouy G, Sterpenich V, Balteau E, Vandewalle G, Desseilles M, Dang-Vu T, Darsaud A, Ruby P, Luppi PH, Degueldre C, Peigneux P, Luxen A, Maquet P (2008) Both the hippocampus and striatum are involved in consolidation of motor sequence memory. Neuron 58:261-272.

Amaral DG (1993) Emerging principles of intrinsic hippocampal organization. Curr Opin Neurobiol 3:225-229.

Amso D, Davidson MC, Johnson SP, Glover G, Casey BJ (2005) Contributions of the hippocampus and the striatum to simple association and frequency-based learning. Neuroimage 27:291-298.

Andersen P, Bliss, T.V.P., Skrede, K.K. (1971) Lamellar organization of hippocampal excitatory pathways. Exp Brain Res 13:222-238.

Annett LE, McGregor A, Robbins TW (1989) The effects of ibotenic acid lesions of the nucleus accumbens on spatial learning and extinction in the rat. Behav Brain Res 31:231-242.

Aron AR, Shohamy D, Clark J, Myers C, Gluck MA, Poldrack RA (2004) Human midbrain sensitivity to cognitive feedback and uncertainty during classification learning. J Neurophysiol 92:1144-1152.

Ashby FG, Spiering BJ (2004) The Neurobiology of Category Learning. Behavioral and Cognitive Neuroscience Reviews.3:pp.

190

Ashby FG, Maddox WT, Bohil CJ (2002) Observational versus feedback training in rule-based and information-integration category learning. Mem Cognit 30:666-677.

Atallah HE, Rudy JW, O'Reilly RC (2008) The role of the dorsal striatum and dorsal hippocampus in probabilistic and deterministic odor discrimination tasks. Learn Mem 15:294-298.

Atallah HE, Lopez-Paniagua D, Rudy JW, O'Reilly RC (2007) Separate neural substrates for skill learning and performance in the ventral and dorsal striatum. Nat Neurosci 10:126-131.

Ballard IC, Murty VP, Carter RM, MacInnes JJ, Huettel SA, Adcock RA (2011) Dorsolateral prefrontal cortex drives mesolimbic dopaminergic regions to initiate motivated behavior. J Neurosci 31:10340-10346.

Balleine BW, Delgado MR, Hikosaka O (2007) The role of the dorsal striatum in reward and decision-making. J Neurosci 27:8161-8165.

Belova MA, Paton JJ, Morrison SE, Salzman CD (2007) Expectation modulates neural responses to pleasant and aversive stimuli in primate amygdala. Neuron 55:970-984.

Beninger RJ, Wasserman J, Zanibbi K, Charbonneau D, Mangels J, Beninger BV (2003) Typical and atypical antipsychotic medications differentially affect two nondeclarative memory tasks in schizophrenic patients: A double dissociation. Schizophrenia Research.61:pp.

Berger TW, and Orr, W.B. (1983) Hippocampectomy selectively disrupts discrimination reversal conditioning of the rabbit nictitating membrane response. Behavioral Brain Research 8:49-68.

Bliss TV, Collingridge GL (1993) A synaptic model of memory: Long-term potentiation in the hippocampus. Jan 1993. Nature.361:pp.

Bolam JP, Hanley JJ, Booth PA, Bevan MD (2000) Synaptic organisation of the basal ganglia. J Anat 196 ( Pt 4):527-542.

191

Bray S, O'Doherty J (2007) Neural coding of reward-prediction error signals during classical conditioning with attractive faces. J Neurophysiol 97:3036-3045.

Brewer JB, Zhao Z, Desmond JE, Glover GH, Gabrieli JD (1998) Making memories: brain activity that predicts how well visual experience will be remembered. Science 281:1185-1187.

Brog JS, Salyapongse A, Deutch AY, Zahm DS (1993) The patterns of afferent innervation of the core and shell in the "accumbens" part of the rat ventral striatum: immunohistochemical detection of retrogradely transported fluoro-gold. J Comp Neurol 338:255-278.

Brown RM, Robertson EM (2007) Off-line processing: Reciprocal interactions between declarative and procedural memories. Journal of Neuroscience 27:10468-10475.

Buckner RL, Wheeler ME (2001) The cognitive neuroscience of remembering. Nat Rev Neurosci 2:624-634.

Bunsey M, Eichenbaum H (1995) Selective damage to the hippocampal region blocks long-term retention of a natural and nonspatial stimulus-stimulus association. Hippocampus 5:546-556.

Burke CJ, Tobler PN, Baddeley M, Schultz W (2010) Neural mechanisms of observational learning. Proc Natl Acad Sci U S A 107:14431-14436.

Burwell RD, Witter MP, Amaral DG (1995) Perirhinal and postrhinal cortices of the rat: a review of the neuroanatomical literature and comparison with findings from the monkey brain. Hippocampus 5:390-408.

Carrillo MC, Gabrieli JD, Hopkins RO, McGlinchey-Berroth R, Fortier CB, Kesner RP, Disterhoft JF (2001) Spared discrimination and impaired reversal eyeblink conditioning in patients with temporal lobe amnesia. Behav Neurosci 115:1171-1179.

Carter RM, Macinnes JJ, Huettel SA, Adcock RA (2009) Activation in the VTA and nucleus accumbens increases in anticipation of both gains and losses. Front Behav Neurosci 3:21.

192

Chen J, Olsen RK, Preston AR, Glover GH, Wagner AD (2011) Associative retrieval processes in the human medial temporal lobe: Hippocampal retrieval success and CA1 mismatch detection. Learn Mem 18:523-528.

Cincotta CM, Seger CA (2007) Dissociation between striatal regions while learning to categorize via feedback and via observation. J Cogn Neurosci 19:249-265.

Cools R (2006) Dopaminergic modulation of cognitive function-implications for L-DOPA treatment in Parkinson's disease. Neurosci Biobehav Rev 30:1-23.

Cools R, Clark L, Owen AM, Robbins TW (2002) Defining the neural mechanisms of probabilistic reversal learning using event-related functional magnetic resonance imaging. J Neurosci 22:4563-4567.

Corkin S, Sullivan EV, Carr FA (1984) Prognostic factors for life expectancy after penetrating head injury. Arch Neurol 41:975-977.

Dagher A, Owen AM, Boecker H, Brooks DJ (2001) The role of the striatum and hippocampus in planning: a PET activation study in Parkinson's disease. Brain 124:1020-1032.

Davachi L, Wagner AD (2002) Hippocampal contributions to episodic encoding: insights from relational and item-based learning. J Neurophysiol 88:982-990.

Delgado MR (2007) Reward-related responses in the human striatum. Ann N Y Acad Sci 1104:70-88.

Delgado MR, Locke HM, Stenger VA, Fiez JA (2003) Dorsal striatum responses to reward and punishment: effects of valence and magnitude manipulations. Cogn Affect Behav Neurosci 3:27-38.

Delgado MR, Miller MM, Inati S, Phelps EA (2005) An fMRI study of reward-related probability learning. Neuroimage 24:862-873.

Delgado MR, Nystrom LE, Fissell C, Noll DC, Fiez JA (2000) Tracking the hemodynamic responses to reward and punishment in the striatum. J Neurophysiol 84:3072-3077.

193

Devan BD, White NM (1999) Parallel information processing in the dorsal striatum: Relation to hippocampal function. Apr 1999. The Journal of Neuroscience.19:pp.

Djonlagic I, Rosenfeld A, Shohamy D, Myers C, Gluck M, Stickgold R (2009) Sleep enhances category learning. Learn Mem 16:751-755.

Doeller CF, King JA, Burgess N (2008) Parallel striatal and hippocampal systems for landmarks and boundaries in spatial memory. Proc Natl Acad Sci U S A 105:5915-5920.

Doll BB, Jacobs WJ, Sanfey AG, Frank MJ (2009) Instructional control of reinforcement learning: a behavioral and neurocomputational investigation. Brain Res 1299:74-94.

Domesick VB (1969) Projections from the cingulate cortex in the rat. Brain Res 12:296-320.

Eichenbaum H, Bunsey M (1995) On the binding of associations in memory: Clues from studies on the role of the hippocampal region in paired-associate learning. Feb 1995. Current Directions in Psychological Science.4:pp.

Eldridge LL, Knowlton BJ, Furmanski CS, Bookheimer SY, Engel SA (2000) Remembering episodes: a selective role for the hippocampus during retrieval. Nat Neurosci 3:1149-1152.

Fagan AM, Olton DS (1986) Learning sets, discrimination reversal, and hippocampal function. Behav Brain Res 21:13-20.

Foerde K, Knowlton BJ, Poldrack RA (2006) Modulation of competing memory systems by distraction. Proc Natl Acad Sci U S A 103:11778-11783.

Foerde K, Poldrack RA, Knowlton BJ (2007) Secondary-task effects on classification learning. Mem Cognit 35:864-874.

Forman SD, Cohen JD, Fitzgerald M, Eddy WF, Mintun MA, Noll DC (1995) Improved assessment of significant activation in functional magnetic

194

resonance imaging (fMRI): use of a cluster-size threshold. Magn Reson Med 33:636-647.

Frank MJ (2005) Dynamic dopamine modulation in the basal ganglia: a neurocomputational account of cognitive deficits in medicated and nonmedicated Parkinsonism. J Cogn Neurosci 17:51-72.

Frank MJ, Claus ED (2006) Anatomy of a decision: striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal. Psychol Rev 113:300-326.

Frank MJ, Seeberger LC, O'Reilly R C (2004) By carrot or by stick: cognitive reinforcement learning in parkinsonism. Science 306:1940-1943.

Frey U, Schroeder H, Matthies H (1990) Dopaminergic antagonists prevent long-term maintenance of posttetanic LTP in the CA1 region of rat hippocampal slices. Brain Res 522:69-75.

Frey U, Matthies H, Reymann KG (1991) The effect of dopaminergic D1 receptor blockade during tetanization on the expression of long-term potentiation in the rat CA1 region in vitro. Neurosci Lett 129:111-114.

Gerfen CR (1984) The neostriatal mosaic: compartmentalization of corticostriatal input and striatonigral output systems. Nature 311:461-464.

Geweke J (1982) Measurement of Linear Dependence and Feedback between Multiple Time Series. J Am Stat Assoc 77:304-313.

Gluck MA, Bower GH (1988) From conditioning to category learning: an adaptive network model. J Exp Psychol Gen 117:227-247.

Gluck MA, Myers CE (1993) Hippocampal mediation of stimulus representation: a computational theory. Hippocampus 3:491-516.

Gluck MA, Shohamy D, Myers C (2002) How do people solve the "weather prediction" task?: individual variability in strategies for probabilistic category learning. Learn Mem 9:408-418.

195

Gluck MA, Meeter M, Myers CE (2003) Computational models of the hippocampal region: linking incremental learning and episodic memory. Trends Cogn Sci 7:269-276.

Goebel R, Esposito F, Formisano E (2006) Analysis of functional image analysis contest (FIAC) data with brainvoyager QX: From single-subject to cortically aligned group general linear model analysis and self-organizing group independent component analysis. Hum Brain Mapp 27:392-401.

Goebel R, Roebroeck A, Kim DS, Formisano E (2003) Investigating directed cortical interactions in time-resolved fMRI data using vector autoregressive modeling and Granger causality mapping. Magn Reson Imaging 21:1251-1261.

Gotham AM, Brown RG, Marsden CD (1988) 'Frontal' cognitive function in patients with Parkinson's disease 'on' and 'off' levodopa. Brain 111 ( Pt 2):299-321.

Groenewegen HJ, Vermeulen-Van der Zee E, te Kortschot A, Witter MP (1987) Organization of the projections from the subiculum to the ventral striatum in the rat. A study using anterograde transport of Phaseolus vulgaris leucoagglutinin. Neuroscience 23:103-120.

Haber SN (2003) The primate basal ganglia: parallel and integrative networks. J Chem Neuroanat 26:317-330.

Haber SN, Knutson B (2010) The reward circuit: linking primate anatomy and human imaging. Neuropsychopharmacology 35:4-26.

Hampson RE, Pons TP, Stanford TR, Deadwyler SA (2004) Categorization in the monkey hippocampus: a possible mechanism for encoding information into memory. Proc Natl Acad Sci U S A 101:3184-3189.

Haruno M, Kawato M (2006) Different neural correlates of reward expectation and reward expectation error in the putamen and caudate nucleus during stimulus-action-reward association learning. J Neurophysiol 95:948-959.

Hasselmo ME (2006) The role of acetylcholine in learning and memory. Curr Opin Neurobiol 16:710-715.

196

Heeger DJ, Ress D (2002) What does fMRI tell us about neuronal activity? Nat Rev Neurosci 3:142-151.

Heimer L, Alheid GF, de Olmos JS, Groenewegen HJ, Haber SN, Harlan RE, Zahm DS (1997) The accumbens: beyond the core-shell dichotomy. J Neuropsychiatry Clin Neurosci 9:354-381.

Holm S (1979) A simple sequentially rejective multiple test procedure. Scand J Stat 6:65-70.

Hopkins RO, Myers CE, Shohamy D, Grossman S, Gluck M (2004) Impaired probabilistic category learning in hypoxic subjects with hippocampal damage. Neuropsychologia 42:524-535.

Jay TM, Glowinski J, Thierry AM (1989) Selectivity of the hippocampal projection to the prelimbic area of the prefrontal cortex in the rat. Brain Res 505:337-340.

Joel D, Niv Y, Ruppin E (2002) Actor-critic models of basal ganglia: New anatomical and computational perspectives. Neural Networks.15:pp.

Jung Y, Hong S, Haber SN (2003) Organization of Direct Hippocampal Projections to the Different Regions of the Ventral Striatum in primate. The Korean J Anat 36:67-76.

Kelley AE, Domesick VB (1982) The distribution of the projection from the hippocampal formation to the nucleus accumbens in the rat: an anterograde- and retrograde-horseradish peroxidase study. Neuroscience 7:2321-2335.

Knowlton BJ, Squire LR, Gluck MA (1994) Probabilistic classification learning in amnesia. Learn Mem 1:106-120.

Knowlton BJ, Mangels JA, Squire LR (1996) A neostriatal habit learning system in humans. Science 273:1399-1402.

Knutson B, Adams CM, Fong GW, Hommer D (2001) Anticipation of increasing monetary reward selectively recruits nucleus accumbens. The Journal of Neuroscience.21:pp.

197

Kolb B, Buhrmann K, McDonald R, Sutherland RJ (1994) Dissociation of the medial prefrontal, posterior parietal, and posterior temporal cortex for spatial navigation and recognition memory in the rat. Cereb Cortex 4:664-680.

Krayniak PF, Meibach RC, Siegel A (1981) A projection from the entorhinal cortex to the nucleus accumbens in the rat. Brain Res 209:427-431.

Lagnado DA, Newell BR, Kahan S, Shanks DR (2006) Insight and strategy in multiple-cue learning. J Exp Psychol Gen 135:162-183.

Law JR, Flanery MA, Wirth S, Yanike M, Smith AC, Frank LM, Suzuki WA, Brown EN, Stark CE (2005) Functional magnetic resonance imaging activity during the gradual acquisition and expression of paired-associate memory. J Neurosci 25:5720-5729.

Lee AS, Duman RS, Pittenger C (2008) A double dissociation revealing bidirectional competition between striatum and hippocampus during learning. Proc Natl Acad Sci U S A 105:17163-17168.

Li J, Delgado MR, Phelps EA (2011) How instructed knowledge modulates the neural systems of reward learning. Proc Natl Acad Sci U S A 108:55-60.

Li S, Cullen WK, Anwyl R, Rowan MJ (2003) Dopamine-dependent facilitation of LTP induction in hippocampal CA1 by exposure to spatial novelty. Nature Neuroscience.6:pp.

Lisman JE, Grace AA (2005) The hippocampal-VTA loop: controlling the entry of information into long-term memory. Neuron 46:703-713.

Liu P, Bilkey DK (1996) Direct connection between perirhinal cortex and hippocampus is a major constituent of the lateral perforant path. Hippocampus 6:125-135.

Lynd-Balta E, Haber SN (1994) The organization of midbrain projections to the ventral striatum in the primate. Neuroscience 59:609-623.

Macmillan NA, Creelman CD (1991) Detection Theory: A User's Guide. New York: Cambridge University Press.

198

Marco-Pallares J, Muller SV, Munte TF (2007) Learning by doing: an fMRI study of feedback-related brain activations. Neuroreport 18:1423-1426.

Marston HM, Everitt BJ, Robbins TW (1993) Comparative effects of excitotoxic lesions of the hippocampus and septum/diagonal band on conditional visual discrimination and spatial learning. Neuropsychologia 31:1099-1118.

Mattfeld AT, Stark CE (2010) Striatal and Medial Temporal Lobe Functional Interactions during Visuomotor Associative Learning. Cereb Cortex.

McClure SM, Berns GS, Montague PR (2003) Temporal prediction errors in a passive learning task activate human striatum. Neuron 38:339-346.

McDonald RJ, White NM (1993) A triple dissociation of memory systems: hippocampus, amygdala, and dorsal striatum. Behav Neurosci 107:3-22.

McDonald RJ, White NM (1994) Parallel information processing in the water maze: evidence for independent memory systems involving dorsal striatum and hippocampus. Behav Neural Biol 61:260-270.

McDonald RJ, White NM (1995) Hippocampal and nonhippocampal contributions to place learning in rats. Behavioral Neuroscience.109:pp.

McGeorge AJ, Faull RL (1989) The organization of the projection from the cerebral cortex to the striatum in the rat. Neuroscience 29:503-537.

Meeter M, Myers CE, Shohamy D, Hopkins RO, Gluck MA (2006) Strategies in probabilistic categorization: results from a new way of analyzing performance. Learn Mem 13:230-239.

Meeter M, Radics G, Myers CE, Gluck MA, Hopkins RO (2008) Probabilistic categorization: how do normal participants and amnesic patients do it? Neurosci Biobehav Rev 32:237-248.

Meyer-Lindenberg A, Kohn PD, Kolachana B, Kippenhan S, McInerney-Leo A, Nussbaum R, Weinberger DR, Berman KF (2005) Midbrain dopamine and

199

prefrontal function in humans: interaction and modulation by COMT genotype. Nat Neurosci 8:594-596.

Middleton FA, Strick PL (2000) Basal ganglia output and cognition: evidence from anatomical, behavioral, and clinical studies. Brain Cogn 42:183-200.

Mitchell JA, Hall G (1988) Learning in rats with caudate-putamen lesions: unimpaired classical conditioning and beneficial effects of redundant stimulus cues on instrumental and spatial learning deficits. Behav Neurosci 102:504-514.

Myers CE, Deluca J, Hopkins RO, Gluck MA (2006) Conditional discrimination and reversal in amnesia subsequent to hypoxic brain injury or anterior communicating artery aneurysm rupture. Neuropsychologia 44:130-139.

Myers CE, Shohamy D, Gluck MA, Grossman S, Kluger A, Ferris S, Golomb J, Schnirman G, Schwartz R (2003) Dissociating hippocampal versus basal ganglia contributions to learning and transfer. J Cogn Neurosci 15:185-193.

Myers CE, Hopkins, R.O., Kesner, R.P, Monti, L., Gluck, M.A. (2000) Conditional spatial discrimination in humans with hypoxic brain injury. Psychobiology 28:275-282.

Nagy H, Keri S, Myers CE, Benedek G, Shohamy D, Gluck MA (2007) Cognitive sequence learning in Parkinson's disease and amnestic mild cognitive impairment: Dissociation between sequential and non-sequential learning of associations. Neuropsychologia 45:1386-1392.

Newell BR, Lagnado DA, Shanks DR (2007) Challenging the role of implicit processes in probabilistic category learning. Psychon Bull Rev 14:505-511.

O'Doherty J, Dayan P, Schultz J, Deichmann R, Friston K, Dolan RJ (2004) Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science 304:452-454.

O'Doherty JP (2004) Reward representations and reward-related learning in the human brain: insights from neuroimaging. Curr Opin Neurobiol 14:769-776.

200

O'Doherty JP, Deichmann R, Critchley HD, Dolan RJ (2002) Neural responses during anticipation of a primary taste reward. Neuron 33:815-826.

O'Reilly R, C., Munakata Y (2000) Computational Explorations in Cognitive Neuroscience: Understanding the Mind by Simulating the Brain. Cambridge, MA: MIT Press.

Olmos A, Kingdom FA (2004) A biologically inspired algorithm for the recovery of shading and reflectance images. Perception 33:1463-1473.

Packard MG (1999) Glutamate infused posttraining into the hippocampus or caudate-putamen differentially strengthens place and response learning. Proc Natl Acad Sci U S A 96:12881-12886.

Packard MG, White NM (1989) Memory facilitation produced by dopamine agonists: Role of receptor subtype and mnemonic requirements. Jul 1989. Pharmacology, Biochemistry and Behavior.33:pp.

Packard MG, White NM (1991) Dissociation of hippocampus and caudate nucleus memory systems by posttraining intracerebral injection of dopamine agonists. Behav Neurosci 105:295-306.

Packard MG, McGaugh JL (1992) Double dissociation of fornix and caudate nucleus lesions on acquisition of two water maze tasks: Further evidence for multiple memory systems. Behavioral Neuroscience.106: 439-446.

Packard MG, McGaugh JL (1994) Quinpirole and d-amphetamine administration posttraining enhances memory on spatial and cued discriminations in a water maze. Mar 1994. Psychobiology.22:pp.

Packard MG, McGaugh JL (1996) Inactivation of hippocampus or caudate nucleus with lidocaine differentially affects expression of place and response learning. Neurobiol Learn Mem 65:65-72.

Packard MG, Knowlton BJ (2002) Learning and memory functions of the Basal Ganglia. Annu Rev Neurosci 25:563-593.

201

Packard MG, Hirsh R, White NM (1989) Differential effects of fornix and caudate nucleus lesions on two radial maze tasks: evidence for multiple memory systems. J Neurosci 9:1465-1472.

Pagnoni G, Zink CF, Montague PR, Berns GS (2002) Activity in human ventral striatum locked to errors of reward prediction. Nat Neurosci 5:97-98.

Penfield W, Milner B (1958) Memory deficit produced by bilateral lesions in the hippocampal zone. AMA Arch Neurol Psychiatry 79:475-497.

Pessiglione M, Seymour B, Flandin G, Dolan RJ, Frith CD (2006) Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature 442:1042-1045.

Ploghaus A, Tracey I, Clare S, Gati JS, Rawlins JN, Matthews PM (2000) Learning about pain: the neural substrate of the prediction error for aversive events. Proc Natl Acad Sci U S A 97:9281-9286.

Poldrack RA, Packard MG (2003) Competition among multiple memory systems: converging evidence from animal and human brain studies. Neuropsychologia 41:245-251.

Poldrack RA, Rodriguez P (2004) How do memory systems interact? Evidence from human classification learning. Neurobiol Learn Mem 82:324-332.

Poldrack RA, Prabhakaran V, Seger CA, Gabrieli JD (1999) Striatal activation during acquisition of a cognitive skill. Neuropsychology 13:564-574.

Poldrack RA, Fletcher PC, Henson RN, Worsley KJ, Brett M, Nichols TE (2008) Guidelines for reporting an fMRI study. Neuroimage 40:409-414.

Poldrack RA, Clark J, Pare-Blagoev EJ, Shohamy D, Creso Moyano J, Myers C, Gluck MA (2001) Interactive memory systems in the human brain. Nature 414:546-550.

Reber PJ, Knowlton BJ, Squire LR (1996) Dissociable properties of memory systems: differences in the flexibility of declarative and nondeclarative knowledge. Behav Neurosci 110:861-871.

202

Rempel-Clower NL, Zola SM, Squire LR, Amaral DG (1996) Three cases of enduring memory impairment after bilateral damage limited to the hippocampal formation. J Neurosci 16:5233-5255.

Rice WR (1989) Analyzing Tables of Statistical Tests. Evolution 43:223-225.

Roebroeck A, Formisano E, Goebel R (2005) Mapping directed influence over the brain using Granger causality and fMRI. Neuroimage 25:230-242.

Rolls ET (1994) Neurophysiology and cognitive functions of the striatum. Aug-Sep 1994. Revue Neurologique.150:pp.

Sadeh T, Shohamy D, Levy DR, Reggev N, Maril A (2011) Cooperation between the hippocampus and the striatum during episodic encoding. J Cogn Neurosci 23:1597-1608.

Samejima K, Ueda Y, Doya K, Kimura M (2005) Representation of action-specific reward values in the striatum. Science 310:1337-1340.

Scatton B, Simon H, Le Moal M, Bischoff S (1980) Origin of dopaminergic innervation of the rat hippocampal formation. Neurosci Lett 18:125-131.

Schacter DL, Wagner AD (1999) Medial temporal lobe activations in fMRI and PET studies of episodic encoding and retrieval. Hippocampus 9:7-24.

Schmitt-Eliassen J, Ferstl R, Wiesner C, Deuschl G, Witt K (2007) Feedback-based versus observational classification learning in healthy aging and Parkinson's disease. Brain Res 1142:178-188.

Schoenbaum G, Setlow B (2003) Lesions of nucleus accumbens disrupt learning about aversive outcomes. J Neurosci 23:9833-9841.

Schott BH, Seidenbecher CI, Fenker DB, Lauer CJ, Bunzeck N, Bernstein HG, Tischmeyer W, Gundelfinger ED, Heinze HJ, Duzel E (2006) The dopaminergic midbrain participates in human episodic memory formation: evidence from genetic imaging. J Neurosci 26:1407-1417.

203

Schultz W (1997) Dopamine neurons and their role in reward mechanisms. Curr Opin Neurobiol 7:191-197.

Schultz W (2002) Getting formal with dopamine and reward. Neuron 36:241-263.

Schultz W, Dayan P, Montague PR (1997) A neural substrate of prediction and reward. Science 275:1593-1599.

Scoville WB, Milner B (1957) Loss of recent memory after bilateral hippocampal lesions. J Neurol Neurosurg Psychiatry 20:11-21.

Seger CA (2006) The basal ganglia in human learning. Neuroscientist 12:285-290.

Seger CA (2008) How do the basal ganglia contribute to categorization? Their roles in generalization, response selection, and learning via feedback. Neurosci Biobehav Rev 32:265-278.

Seger CA, Cincotta CM (2005) The roles of the caudate nucleus in human classification learning. J Neurosci 25:2941-2951.

Seger CA, Cincotta CM (2006) Dynamics of frontal, striatal, and hippocampal systems during rule learning. Cereb Cortex 16:1546-1555.

Seger CA, Miller EK (2010) Category learning in the brain. Annu Rev Neurosci 33:203-219.

Setlow B, Schoenbaum G, Gallagher M (2003) Neural encoding in ventral striatum during olfactory discrimination learning. Neuron 38:625-636.

Sherry DF, Schacter DL (1987) The evolution of multiple memory systems. Psychological Review 94:439-454.

Shohamy D, Wagner AD (2008) Integrating memories in the human brain: hippocampal-midbrain encoding of overlapping events. Neuron 60:378-389.

204

Shohamy D, Adcock RA (2010) Dopamine and adaptive memory. Trends Cogn Sci 14.

Shohamy D, Myers CE, Onlaor S, Gluck MA (2004a) Role of the basal ganglia in category learning: how do patients with Parkinson's disease learn? Behav Neurosci 118:676-686.

Shohamy D, Myers CE, Kalanithi J, Gluck MA (2008) Basal ganglia and dopamine contributions to probabilistic category learning. Neurosci Biobehav Rev 32:219-236.

Shohamy D, Myers CE, Grossman S, Sage J, Gluck MA (2005) The role of dopamine in cognitive sequence learning: evidence from Parkinson's disease. Behav Brain Res 156:191-199.

Shohamy D, Myers CE, Geghman KD, Sage J, Gluck MA (2006) L-dopa impairs learning, but spares generalization, in Parkinson's disease. Neuropsychologia 44:774-784.

Shohamy D, Myers CE, Hopkins RO, Sage J, Gluck MA (2009) Distinct hippocampal and basal ganglia contributions to probabilistic learning and reversal. J Cogn Neurosci 21:1821-1833.

Shohamy D, Myers CE, Grossman S, Sage J, Gluck MA, Poldrack RA (2004b) Cortico-striatal contributions to feedback-based learning: converging data from neuroimaging and neuropsychology. Brain 127:851-859.

Shrager Y, Kirwan CB, Squire LR (2008) Activity in both hippocampus and perirhinal cortex predicts the memory strength of subsequently remembered information. Neuron 59:547-553.

Sorensen KE, Witter MP (1983) Entorhinal efferents reach the caudato-putamen. Neurosci Lett 35:259-264.

Squire LR (1992a) Memory and the hippocampus: a synthesis from findings with rats, monkeys, and humans. Psychol Rev 99:195-231.

205

Squire LR (1992b) Declarative and nondeclarative memory: Multiple brain systems supporting learning and memory. Journal of Cognitive Neuroscience 4:232-243.

Squire LR, Zola SM (1996) Structure and function of declarative and nondeclarative memory systems. Proc Natl Acad Sci U S A 93:13515-13522.

Squire LR, Stark CE, Clark RE (2004) The medial temporal lobe. Annu Rev Neurosci 27:279-306.

Stark CE, Squire LR (2001) When zero is not zero: the problem of ambiguous baseline conditions in fMRI. Proc Natl Acad Sci U S A 98:12760-12766.

Stubley-Weatherly L, Harding JW, Wright JW (1996) Effects of discrete kainic acid-induced hippocampal lesions on spatial and contextual learning and memory in rats. Brain Res 716:29-38.

Sutherland RJ, Kolb B, Whishaw IQ (1982) Spatial mapping: definitive disruption by hippocampal or medial frontal cortical damage in the rat. Neurosci Lett 31:271-276.

Sutherland RJ, Whishaw IQ, Kolb B (1988) Contributions of cingulate cortex to two forms of spatial learning and memory. J Neurosci 8:1863-1872.

Swainson R, Rogers RD, Sahakian BJ, Summers BA, Polkey CE, Robbins TW (2000) Probabilistic learning and reversal deficits in patients with Parkinson's disease or frontal or temporal lobe lesions: possible adverse effects of dopaminergic medication. Neuropsychologia 38:596-612.

Swanson LW (1981) A direct projection from Ammon's horn to prefrontal cortex in the rat. Brain Res 217:150-154.

Swanson LW (1982) The projections of the ventral tegmental area and adjacent regions: a combined fluorescent retrograde tracer and immunofluorescence study in the rat. Brain Res Bull 9:321-353.

206

Swanson LW, Kohler C (1986) Anatomical evidence for direct projections from the entorhinal area to the entire cortical mantle in the rat. J Neurosci 6:3010-3023.

Talairach J, Tournoux P (1988) Co-Planar Stereotaxic Atlas of the Human Brain. New York: Thieme Medical Publishers, Inc.

Tanaka SC, Doya K, Okada G, Ueda K, Okamoto Y, Yamawaki S (2004) Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nat Neurosci 7:887-893.

Thierry AM, Gioanni Y, Degenetais E, Glowinski J (2000) Hippocampo-prefrontal cortex pathway: anatomical and electrophysiological characteristics. Hippocampus 10:411-419.

Toni I, Ramnani N, Josephs O, Ashburner J, Passingham RE (2001) Learning arbitrary visuomotor associations: temporal dynamic of brain activity. Neuroimage 14:1048-1057.

Tricomi E, Fiez JA (2008) Feedback signals in the caudate reflect goal achievement on a declarative memory task. Neuroimage 41:1154-1167.

Tricomi EM, Delgado MR, Fiez JA (2004) Modulation of caudate activity by action contingency. Neuron 41:281-292.

Vanni-Mercier G, Mauguiere F, Isnard J, Dreher JC (2009) The hippocampus codes the uncertainty of cue-outcome associations: an intracranial electrophysiological study in humans. J Neurosci 29:5287-5294.

Voermans NC, Petersson KM, Daudey L, Weber B, Van Spaendonck KP, Kremer HP, Fernandez G (2004) Interaction between the human hippocampus and the caudate nucleus during route recognition. Neuron 43:427-435.

Wagner AD, Schacter DL, Rotte M, Koutstaal W, Maril A, Dale AM, Rosen BR, Buckner RL (1998) Building memories: remembering and forgetting of verbal experiences as predicted by brain activity. Science 281:1188-1191.

207

Watkins CJCH (1989) Learning from delayed rewards. In. England: University of Cambridge.

White NM, McDonald RJ (2002) Multiple parallel memory systems in the brain of the rat. Neurobiol Learn Mem 77:125-184.

Wise RA (2004) Dopamine, learning and motivation. Nat Rev Neurosci 5:483-494.

Wittmann BC, Schott BH, Guderian S, Frey JU, Heinze HJ, Duzel E (2005) Reward-related FMRI activation of dopaminergic midbrain is associated with enhanced hippocampus-dependent long-term memory formation. Neuron 45:459-467.

Yin HH, Knowlton BJ (2006) The role of the basal ganglia in habit formation. Nat Rev Neurosci 7:464-476.

Zeineh MM, Engel SA, Thompson PM, Bookheimer SY (2003) Dynamics of the hippocampus during encoding and retrieval of face-name pairs. Science 299:577-580.

Zola-Morgan S, Squire LR (1984) Preserved learning in monkeys with medial temporal lesions: sparing of motor and cognitive skills. J Neurosci 4:1072-1085.

Zola-Morgan S, Squire LR (1985) Medial temporal lesions in monkeys impair memory on a variety of tasks sensitive to human amnesia. Behav Neurosci 99:22-34.

Zola-Morgan S, Squire LR (1986) Memory impairment in monkeys following lesions limited to the hippocampus. Behavioral Neuroscience 100:155-160.

Zola SM, Mahut H (1973) Paradoxical facilitation of object reversal learning after transection of the fornix in monkeys. Neuropsychologia 11:271-284.

208

Figures Figure 3.1 Schematic of the Task used in the Experiment 1

209

Figure 3.2 Experiment 1 Behavioral Results

210

Figure 3.3 Experiment 1 Neuroimaging Results: Main Effect of Cue Difficulty in the Striatum and Hippocampus

211

Figure 3.4 Experiment 1 Neuroimaging Results: Granger Causality Analysis – Functional Connectivity in the Caudate Nucleus using the Hippocampus as the Seed Region

212

Figure 3.5 Experiment 1 Neuroimaging Results: Prediction Error Analysis in the Hippocampus and Striatum

213

Figure 4.1 Schematic of the Task used in Experiment 2

214

Figure 4.2 Experiment 2 Behavioral Results

215

Figure 4.3 Experiment 2 Neuroimaging Results: Learning Phase Main Effect of Cue Difficulty in the Striatum and MTL – Feedback Group

216

Figure 4.4 Experiment 2 Neuroimaging Results: Learning Phase Main Effect of Cue Difficulty in the Striatum and MTL – Observation Group

217

Figure 4.5 Experiment 2 Neuroimaging Results: Update Phase Main Effect of Cue Difficulty in the Striatum and MTL – Feedback and Observation Groups

218

Figure 4.6 Neuroimaging Results: Granger Causality Analysis – Effective Connectivity using the Midbrain as the Seed Region

219

Figure 4.7 Experiment 2 Neuroimaging Results: Prediction Error Analysis

220

Figure 5.1 Schematic of the Task used in Experiment 3

221

Figure 5.2 Experiment 3 Behavioral Results

222

Figure 5.3 Experiment 3 Neuroimaging Results: ROI Analysis during the Stimulus Presentation Period

223

Tables

Table 3.1 Learning Phase ANOVA: Main Effect of Cue Difficulty Talairach Coordinates Region of

Activation Brodmann Area (BA)

Laterality x y z

# Voxels FStat

Postcentral gyrus BA 5 Left -27 -37 61 257 21.84Superior frontal gyrus BA 6 Right 6 14 52 169 29.79Inferior parietal lobe BA 40 Left -33 -37 46 170 34.36

Postcentral gyrus BA 40 Right 42 -28 43 162 21.32

Precuneus BA 7 Right 3 -43 43 242 20.16

Precentral gyrus BA 6 Right 45 -1 43 161 19.80

Precentral gyrus Right 30 17 37 334 25.42

Medial frontal gyrus BA 32 Left -3 20 37 319 21.72

Middle frontal gyrus BA 9 Left -48 5 37 162 17.04

Cingulate gyrus BA 24 Right 21 17 25 209 39.44

Middle frontal gyrus BA 10 Right 36 44 22 758 56.88

Insula BA 13 Right 27 -34 22 239 24.35Superior temporal lobe BA 22 Left -21 -40 22 408 53.63Claustrum Right 22 -13 22 136 29.74

Insula Right 42 -4 19 49 22.32Inferior frontal gyrus BA 44 Left -60 14 19 219 31.51Superior temporal lobe Right 36 -46 13 125 22.02

Precentral gyrus BA 44 Left -42 14 10 76 17.75Caudate nucleus (body) Left -15 20 7 198 35.40

Lingual gyrus Right 9 -52 1 335 32.75

Insula BA 13 Right 45 14 1 218 27.25

Hippocampus Left -36 -28 -8 96 24.66Middle temporal gyrus BA 20 Right 54 -31 -11 166 23.69

Cerebellum BA 37 Left -21 -40 -14 570 49.03

224

Table 3.2 Learning Phase ANOVA: Main Effect of Learning Type

Talairach Coordinates Region of Activation

Brodmann Area (BA)

Laterality x y z

# Voxels FStat

Superior parietal lobe BA 7 Left -18 -70 58 175 22.45Middle frontal gyrus BA 6 Left -27 8 58 153 20.61Superior parietal lobe BA 7 Left -24 -52 55 245 28.86Superior frontal gyrus BA 8 Right 6 26 52 206 27.13Frontal medial gyrus BA 6 Right 12 17 46 215 34.85Inferior parietal lobe BA 40 Left -39 -37 43 122 21.09Cuneus BA 18 Left -3 -82 16 225 17.93

Thalamus Right 3 -7 13 623 42.32Middle frontal gyrus BA 10 Right 33 50 10 505 34.07Middle occipital gyrus BA 18 Right 27 -85 4 250 19.62

Caudate Right 6 3 4 354 35.41

Globus pallidus Left -12 2 4 571 35.44

Lingual gyrus BA 18 Left -9 -73 1 502 26.84Superior temporal gyrus BA 22 Left -48 -7 1 153 33.19Superior temporal gyrus BA 21 Left -67 -10 1 262 33.11Superior temporal gyrus BA 22 Right 54 2 -2 186 21.03Middle temporal gyrus BA 21 Left -63 -37 -2 349 25.40Claustrum Right 30 11 -2 464 28.67

Insula BA 13 Left -33 14 -2 347 31.27Inferior temporal gyrus BA 37 Right 48 -61 -8 287 37.79

Fusiform gyrus BA 19 Right 30 -64 -11 148 21.50Parahippocampal gyrus Right 18 -37 -11 242 18.04

Cerebellum Right 27 -46 -17 663 27.78

Cerebellum Left -30 -64 -17 269 23.91

225

3.3 Test Phase Contrast of Previously Studied vs. Novel Material

Talairach Coordinates

Region of Activation Brodmann Area (BA)

Laterality

x y z

# Voxels tstat

Previously Studied > Novel

Paracentral lobe BA 4/5 Left -12 -37 61 140 4.30 Medial frontal gyrus BA 6 Left -9 -7 49 77 4.41 Postcentral gyrus BA 1 Left -51 -19 49 422 5.29 Inferior parietal lobe BA 40 Right 51 -28 46 62 3.95 Postcentral gyrus BA 2 Right 45 -16 46 126 4.15 Precuneus BA 7 Left -12 -49 46 69 4.78 Cingulate gyrus BA 31 Left -15 -31 40 127 4.64 Precentral gyrus BA 4 Right 45 -16 34 106 4.35 Inferior parietal lobe BA 40 Left -51 -31 31 148 5.53 Cingulate gyrus BA 23 Right 6 -34 28 71 4.15 Insula BA 13 Left -33 -31 25 56 4.67 Precentral gyrus BA 4 Left -51 -7 22 204 4.35 Postcentral gyrus BA 40 Left -54 -22 19 125 4.36 Superior temporal gyrus BA 41 Right 45 -34 16 221 4.38 Superior temporal gyrus BA 22 Right 39 -46 16 51 4.07 Caudate nucleus Right 6 5 16 88 4.75 Superior temporal gyrus BA 22 Right 51 -46 13 176 4.77 Thalamus Left -18 -25 13 241 5.60 Middle occipital gyrus BA 18 Right 33 -88 10 80 3.84 Superior temporal gyrus BA 42 Left -60 -31 10 247 4.47 Caudate nucleus (tail) Right 27 -40 7 120 4.73 Insula Left -36 -13 7 64 4.24 Thalamus Right 0 -16 4 57 4.42 Middle temporal gyrus BA 21 Left -60 -25 1 124 4.58 Medial temporal lobe BA 28 Left -24 -22 -5 101 4.14 Parahippocampal gyrus BA 36 Left -21 -40 -8 233 4.78 Cerebellum Right 12 -52 -14 758 7.93 Middle occipital gyrus BA 19 Right 30 -79 -17 217 4.25 Cerebellum Left -15 -91 -17 182 4.70 Cerebellum Right 36 -70 -23 126 4.43 Cerebellum Right 15 -22 -23 189 5.12

226

Table 4.1 Learning Phase ANOVA: Main Effect of Cue Difficulty - Feedback Group

Talairach Coordinates Region of Activation

Brodmann Area (BA)

Laterality x y z

# Voxels FStat

Superior Parietal Lobule BA 7 Right 16 -65 60 206 10.42Superior Parietal Lobule BA 7 Left -10 -71 57 251 9.63 Paracentral Lobule BA 5 Left -13 -32 54 223 11.19Superior Parietal Lobule BA 7 Left -31 -53 54 213 11.38Precuneus BA 7 Left -19 -56 51 139 9.58 Precentral Gyrus BA 6 Right 44 4 45 1657 12.97Inferior Parietal Lobule BA 40 Right 41 -50 39 509 12.28Precuneus BA 19 Right 32 -65 39 1025 11.43Inferior Parietal Lobule BA 40 Left -40 -59 39 676 15.47Precentral Gyrus BA 6 Right 38 1 36 152 10.70

Precuneus BA 31 Right 11 -47 36 388 11.32

Cingulate Gyrus BA 32 Right 1 22 30 4355 19.08Middle Frontal Gyrus BA 9 Left -43 19 30 751 16.81Middle Frontal Gyrus BA 10 Right 29 58 21 388 12.82Medial Frontal Gyrus BA 9 Right 2 49 21 206 9.43 Middle Temporal Gyrus BA 19 Left -49 -80 19 2468 18.79Superior Temporal Gyrus BA 39 Right 41 -56 18 261 9.07

Insula Right 41 -23 18 158 12.09Medial Frontal Gyrus BA 10 Left -7 55 18 2897 12.43

Caudate Nucleus Right 5 1 12 243 9.39 Posterior Cingulate BA 23 Left -13 -56 9 6733 20.07Middle Frontal Gyrus BA 46 Right 38 49 6 617 12.71

Insula Right 29 16 3 1954 19.17

Insula Left -34 10 3 2287 25.01Inferior Temporal Gyrus BA 37 Left -34 -65 0 146 11.85MTL/Hippo/Amygdala Left -22 -8 -15 567 12.96

Cerebellum Left -37 -59 -33 156 10.32

Brainstem Left -1 -35 -36 207 8.51

227

Table 4.2 Learning Phase ANOVA: Main Effect of Cue Difficulty - Observation

Group Talairach

Coordinates Region of Activation

Brodmann Area (BA)

Laterality

x y z

# Voxels FStat

Medial Frontal Gyrus BA 6 Left -4 -23 66 195 9.19 Superior Parietal Lobule BA 7 Right 17 -59 57 237 10.23

Precuneus BA 7 Right 5 -53 57 713 10.76Middle Frontal Gyrus BA 6 Right 32 -2 54 170 9.86 Inferior Parietal Lobule BA 40 Right 47 -53 48 1066 10.67

Precuneus BA 7 Left -10 -53 42 339 11.65

Cingulate Gyrus BA 32 Left -1 13 39 3922 19.60Middle Frontal Gyrus BA 9 Right 47 10 36 1384 14.44

Angular Gyrus BA 39 Right 35 -62 30 327 10.25Middle Frontal Gyrus BA 46 Right 38 52 15 328 19.49

Caudate Nucleus Right 5 -2 15 459 13.66Inferior Frontal Gyrus BA 46 Left -52 40 12 184 10.27Caudate Nucleus/Putamen Left -13 1 9 831 13.47Middle Frontal Gyrus BA 46 Left -34 49 9 872 12.37

Thalamus Right 9 -11 6 1402 15.54Inferior Frontal Gyrus BA 44 Right 29 43 0 715 13.41

Insula BA 13 Left -37 13 -3 198 11.52

Hippocampus Left -25 -20 -12 145 10.25

Cerebellum Right 17 -80 -18 287 12.67

228

Table 4.3 Test Phase 1: Contrast of Previously Studied vs. Novel - Feedback Group

Talairach Coordinates

Region of Activation Brodmann Area (BA)

Laterality

x y z

# Voxels tstat

Previously Studied > Novel

Superior Parietal Lobule BA 7 Left -34 -47 60 438 4.61 Superior Parietal Lobule BA 7 Right 14 -62 57 724 4.85 Superior Parietal Lobule BA 7 Left -31 -44 48 441 7.24

Postcentral Gyrus BA 4 Right 47 -14 42 187 5.11

Precuneus BA 7 Right 5 -59 42 211 4.57

Cingulate Gyrus BA 31 Right 8 -38 39 368 5.02 Superior Temporal Gyrus BA 22 Right 56 -38 21 394 5.10 Superior Temporal Gyrus BA 22 Left -55 -44 15 154 4.75

Insula BA 13 Right 41 -14 3 337 5.59 Middle Temporal Gyrus BA 21 Left -37 -26 0 440 6.40

Novel > Previously

Studied

Medial Frontal Gyrus BA 8 Left -7 19 48 13762 -6.92 Superior Parietal Lobule BA 7 Right 38 -65 45 221 -4.41

Inferior Parietal Lobule BA 40 Left -46 -56 42 1997 -5.71

Precuneus BA 19 Left -37 -77 39 212 -3.90

Middle Frontal Gyrus BA 9 Right 44 22 33 141 -4.49 Caudate Nucleus/Putamen Left -16 7 12 474 -7.33 Inferior Frontal Gyrus BA 45 Left -46 40 3 1948 -5.89

Cerebellum Left -4 -38 -9 150 -4.50 Inferior Temporal Gyrus BA 37 Right 56 -44 -15 280 -4.42 Inferior Temporal Gyrus BA 20 Left -61 -32 -15 401 -5.47 Inferior Temporal Gyrus BA 21 Right 62 -8 -18 219 -5.49

Cerebellum Right 23 -77 -39 270 -5.48

229

Table 4.4 Test Phase 1: Contrast of Previously Studied vs. Novel - Observation Group

Talairach Coordinates Region of Activation Brodmann Area (BA)

Lateralityx y z

# Voxels tstat

Previously Studied > Novel Paracentral Lobule BA 4 Right 17 -41 57 196 4.12 Paracentral Lobule BA 5 Right 8 -35 51 286 5.30 Precuneus BA 7 Right 14 -53 48 225 5.01 Precuneus BA 7 Left -13 -50 48 512 5.84 Inferior Parietal Lobule BA 40 Right 50 -32 24 413 5.35 Cuneus BA 19 Left -13 -92 24 318 4.32 Cingulate Gyrus BA 20 Right 20 -50 18 209 4.72 Cuneus BA 18 Left -4 -83 15 500 4.14 Superior Temporal Gyrus BA 22 Left -49 -41 15 1382 6.53 Superior Temporal Gyrus BA 22 Left -61 -23 15 470 7.05 Lingual Gyrus BA 19 Left -10 -47 0 464 4.88 Lingual Gyrus BA 18 Left -13 -71 -6 170 4.91 Cerebellum Left -16 -50 -18 276 4.82 Novel > Previously Studied Superior Frontal Gyrus BA 8 Left -19 25 48 34672 -9.28 Precuneus BA 19 Right 35 -65 42 507 -5.10 Middle Frontal Gyrus BA 8 Left -31 25 39 608 -6.70 Angular Gyrus BA 39 Left -52 -68 30 407 -4.58 Thalamus Left -19 -11 18 140 -6.47 Middle Frontal Gyrus BA 46 Right 35 49 15 142 -4.12 Inferior Frontal Gyrus BA 44 Left -61 19 6 882 -5.30 Middle Frontal Gyrus BA 10 Left -40 49 3 550 -5.39 Insula BA 13 Right 38 19 0 941 -5.20 Claustrum Left -31 16 0 262 -5.20 Inferior Temporal Lobe BA 37 Right 50 -44 -3 889 -7.17 Middle Frontal Gyrus BA 47 Right 38 40 -3 1130 -5.67 Middle Frontal Gyrus BA 48 Left -49 43 -3 668 -4.77 Middle Temporal Gyrus BA 21 Right 44 10 -33 189 -5.14

230

Table 4.5 Update Phase ANOVA: Main Effect of Cue Difficulty

Feedback & Observation Groups Combined Talairach Coordinates Region of Activation Brodmann

Area (BA)Laterality

x y z # Voxels FStat

Precentral Gyrus BA 4 Left -31 -23 58 130 10.98Superior Parietal Lobule BA 7 Right 14 -63 54 493 11.19Precuneus BA 7 Left -16 -59 48 178 8.76 Inferior Parietal Lobule BA 7 Right 41 -62 43 339 9.25 Inferior Parietal Lobule BA 40 Right 26 -41 39 230 10.61Precuneus BA 7 Right 11 -50 39 378 10.28Cingulate Gyrus BA 31 Right 8 -29 39 225 10.10Cingulate Gyrus BA 32 Left -1 16 39 1581 15.57Precuneus BA 31 Left -19 -41 36 187 9.17 Inferior Parietal Lobule BA 40 Right 53 -23 33 237 8.22 Inferior Frontal Gyrus BA 44 Right 44 13 31 1881 17.54Inferior Parietal Lobule BA 40 Left -64 -29 30 9298 20.81Lateral Sulcus Right 44 -32 21 192 9.11 Insula BA 13 Right 32 -29 21 380 12.50Middle Occipital Gyrus BA 18 Left -16 -83 21 10268 16.91Inferior Frontal Gyrus BA 45 Right 53 -41 18 1638 11.66Middle Temporal Gyrus BA 39 Right 53 -62 15 3619 17.36Middle Occipital Gyrus BA 18 Right 11 -89 15 279 10.23Middle Occipital Gyrus BA 19 Left -37 -80 15 231 10.12Insula Left -37 -14 15 256 9.11 Superior Frontal Gyrus BA 10 Right 26 55 12 3461 13.09Putamen/Insula Right 29 -14 12 220 12.40Anterior Cingulate BA 32 Left -13 37 12 1604 10.98Middle Temporal Gyrus BA 39 Left -43 -71 12 361 10.57Middle Frontal Gyrus BA 10 Left -34 52 9 159 10.00Superior Temporal Gyrus BA 42 Left -43 -26 9 329 10.71Middle Temporal Gyrus BA 22 Left -61 -41 6 176 10.06Insula Right 35 -14 3 191 11.64Putamen/Insula Right 26 13 3 618 11.04Lingual Gyrus BA 18 Right 8 -50 3 492 14.92Insula Left -31 13 3 254 8.54 Middle Frontal Gyrus BA 10 Left -40 49 3 667 13.70Middle Temporal Gyrus BA 21 Right 47 -11 -6 2254 12.77Cerebellum Right 20 -50 -12 352 12.75Cerebellum Right 23 -65 -15 512 10.59Cerebellum Right 11 -71 -15 165 9.56 Fusiform Gyrus BA 20 Left -25 -35 -15 133 10.64Inferior Temporal Gyrus BA 21 Left -58 -11 -15 792 17.63Parahippocampal Gyrus Right 20 -20 -18 242 13.36Cerebellum Left -28 -53 -18 186 9.54 Parahippocampal Gyrus BA 28 Left -22 -14 -21 187 8.60

231

Table 4.6 Update Phase ANOVA: Main Effect of Learning Type

Feedback & Observation Groups Combined Talairach CoordinatesRegion of Activation Brodmann

Area (BA)Laterality

x y z # Voxels FStat

Postcentral Gyrus BA 2 Right 33 -23 48 275 19.13Paracentral Lobule Left -1 -13 48 149 16.86Cingulate Gyrus BA 24 Left -10 -5 36 221 19.45Angular Gyrus BA 39 Right 41 -62 35 523 24.68Cingulate Gyrus BA 31 Right 11 -26 33 248 28.32Angular Gyrus BA 39 Left -49 -62 30 229 15.48Middle Temporal Gyrus BA 39 Left -43 -71 28 142 22.55Orbital Gyrus BA 19 Left -34 -77 27 169 26.06Thalamus Left -22 -14 18 193 16.81Superior Temporal Gyrus BA 42 Right 56 -29 15 1134 23.84Insula Right 35 2 9 441 16.64Middle Temporal Gyrus BA 21 Left -64 -45 9 211 26.85Insula Right 41 7 6 318 19.61Lingual Gyrus BA 18 Left -22 -56 6 203 15.97Superior Temporal Gyrus BA 42 Left -48 -23 6 3049 37.20Middle Temporal Gyrus BA 22 Left -43 -38 4 270 15.19Superior Temporal Gyrus BA 21 Right 57 -11 0 213 16.09

232

Table 4.7 Update Phase ANOVA: Main Effect of Group

Talairach Coordinates Region of Activation Brodmann Area (BA)

Lateralityx y z

# Voxels FStat

Precentral Gyrus BA 4 Right 14 -23 54 411 15.63Thalamus Left -4 -20 0 212 15.45

233

Table 4.8 Test Phase 2: Contrast of Previously Studied vs. Novel - Feedback Group

Talairach Coordinates Region of Activation Brodmann Area (BA)

Lateralityx y z

# Voxels tstat

Previously Studied > Novel Precentral Gyrus BA 4 Left -28 -23 63 895 5.11 Inferior Parietal Lobule BA 40 Left -22 -38 54 300 6.28 Precuneus BA 7 Right 2 -56 51 4334 7.48 Inferior Parietal Lobule BA 40 Left -43 -50 51 139 3.90 Precentral Gyrus BA 4 Left -19 -20 48 211 6.43 Cingulate Gyrus BA 31 Left -4 -35 42 611 5.82 Inferior Parietal Lobule BA 40 Left -28 -44 39 370 6.24 Inferior Parietal Lobule BA 40 Right 56 -29 30 610 5.14 Postcentral Gyrus BA 2 Left -64 -20 30 317 4.64 Inferior Parietal Lobule BA 40 Left -64 -41 24 195 4.02 Superior Temporal Gyrus BA22 Left -61 -50 15 277 4.85 Superior Temporal Gyrus BA 22 Left -61 -32 12 182 4.61 Parahippocampal Gyrus Right 20 -50 3 149 4.25 Cerebellum Right 2 -44 -3 189 5.22 Novel > Previously Studied

Precentral Gyrus BA 9 Left -37 4 36 131 -4.54 Middle Frontal Gyrus BA 46 Left -40 43 18 838 -5.07 Middle Frontal Gyrus Left -28 46 3 138 -4.74 Insula BA 13 Left -37 13 0 353 -4.40

234

Table 4.9 Test Phase 2: Contrast of Previously Studied vs. Novel - Observation Group

Talairach Coordinates Region of Activation Brodmann Area (BA)

Lateralityx y z

# Voxels tstat

Previously Studied > Novel

Precuneus BA 31 Left -22 -41 33 526 5.90 Precuneus BA 7 Left -22 -56 30 260 4.77 Superior Temporal Gyrus BA 22 Left -34 -50 15 428 4.72 Cerebellum Right 5 -38 0 553 5.30 Cerebellum Right 5 -41 -39 282 5.05

Novel > Previously Studied Medial Frontal Gyrus BA 6 Right 2 4 51 276 -5.47Medial Frontal Gyrus BA 6 Right 17 10 48 198 -4.20Middle Frontal Gyrus BA 6 Right 32 -5 45 140 -4.08Precentral Gyrus BA 6 Left -37 -8 45 353 -4.94Middle Frontal Gyrus BA 9 Right 35 19 33 163 -4.20Precentral Gyrus BA 6 Left -49 7 12 295 -8.48Middle Temporal Gyrus BA 21 Left -52 -38 -9 292 -4.65

235

Table 5.1 Test Phase: Contrast of Previously Studied vs. Novel

Talairach Coordinates Region of Activation Brodmann Area (BA)

Laterality x y z

# Voxels tstat

Novel > Previously Studied

Middle Frontal Gyrus BA 9 Right 41 16 36 436 -4.17 Middle Frontal Gyrus BA 9 Left -22 37 36 429 -5.15 Medial Frontal Gyrus BA 9 Left -7 34 33 4164 -6.08 Angular Gyrus BA 39 Left -43 -59 33 303 -3.78

236

Appendices

Appendix 1: Neuroimaging Test Phase Results for Experiments 1-3

Experiment 1 Neuroimaging Test Phase Analysis:

Test Phase: Contrast of previously studied versus novel information

In the test phase, the primary analysis examined differences in BOLD

responses to learned versus novel material. A contrast of studied (observation

and feedback) versus non-studied (novel) cues was used to identify areas

involved in memory processes (Table 3.3). This contrast produced one area of

activation within the medial temporal lobe – the parahippocampal gyrus (x, y, z =

-21, -40, -8). Post-hoc t-tests on the mean parameter estimates extracted from

the parahippocampal region revealed no significant difference between

observation and feedback information (t(15) = 0.37; p > 0.05). An area just

outside the hippocampus (x, y, z = -24, -22, -5), and a region within the right

caudate nucleus (x, y, z = 6, 5, 16) were also observed in this contrast, but did

not survive correction at the cluster level. This analysis revealed that the

parahippocampal gyrus was most active in response to previously learned

information compared to novel information.

Experiment 2 Neuroimaging Test Phase Analyses:

Test phase 1: Contrast of previously studied versus novel information

As in experiment 1, in the test phases in experiment 2, the analyses of

interest examined differences in BOLD responses to learned versus novel

237

material. Two separate analyses were performed, one for the feedback group

and one for the observation group.

Feedback group: a contrast of previously learned (feedback) versus novel

cues was performed. In this group, a network of regions was involved showing

greater activation to previously learned versus novel information, including a

region in the middle temporal gyrus, just dorsal to the hippocampus (x, y, z = -37,

-26, 0) as well as the insula (x, y, z = 41, -14, 3), and cingulate gyrus (x, y, z = 8,

-38, 39). Several regions also showed greater activation to novel cues, including

a region in-between the caudate nucleus and putamen (x, y, z = -16, 7, 12)

(Table 4.3).

Observation group: a contrast of previously learned (observation) versus

novel cues was performed. A somewhat similar pattern of results was observed

for the observation group, including a region of the cingulate gyrus (x, y, z = 20, -

50, 18) which exhibited greater responses to previously learned compared with

novel cues. Regions exhibiting greater BOLD responses to novel cues included

the insula (x, y, z = 38, 19, 0) (Table 4.4).

Test phase 2: Contrast of previously studied versus novel information

For both the feedback and observation groups, a contrast of previously

learned versus novel cues was performed.

Feedback group: a host of regions showed greater activation for

previously learned information compared with novel information including the

right parahippocampus (x, y, z = 20, -50, 3) and cingulate gyrus (x, y, z = -4, -35,

238

42). Only a few regions showed greater activation to novel stimuli including the

left insula (x, y, z = -37, 13, 0), precentral gyrus, and the middle frontal gyrus

(Table 4.8).

Observation group: the superior temporal gyrus and precuneus were

engaged more strongly for previously learned versus novel information, while

areas of the middle and medial frontal gyri were more engaged when processing

novel information (Table 4.9).

Experiment 3 Neuroimaging Test Phase Analysis:

Test phase: Contrast of previously studied versus novel information

As in experiments 1 and 2, in the test phase in experiment 3, the analysis

of interest examined differences in BOLD responses to learned versus novel

material. A contrast of previously learned (feedback control + perception

decision + declarative memory interference) versus novel cues was performed

(balancing for the unequal comparison). Results revealed activation within the

medial frontal gyrus, middle frontal gyrus, and angular gyrus exhibiting greater

responses to novel cues than previously studied information. These regions

were the only active areas observed in the SPM that survived cluster threshold

correction (Table 5.1).

Appendix 2: Experiment 3 Correlations Observed across the Independent

Region of Interest Analysis

239

1) In the feedback control session, the following positive correlations were

observed for the:

Hard cues:

- Left hippocampus – right ventral striatum (r = 0.416, p < 0.05) and left

caudate nucleus (r = 0.467, p < 0.05)

- Left ventral striatum – right hippocampus (r = 0.584, p < 0.005) and left

VTA (r = 0.469, p < 0.05)

Easy cues:

- Left VTA - left ventral striatum (r = 0.415, p < 0.05)

2) In the perception decision session the following positive correlations were

observed for the:

Hard cues:

- Right VTA – left hippocampus (r = 0.477, p < 0.05); right hippocampus (r =

0.430, p < 0.05); left ventral striatum (r = 0.507, p < 0.05); right ventral

striatum (r = 0.410, p < 0.05)

- Left VTA – left ventral striatum (r = 0.480, p < 0.05)

Easy cues:

- Left hippocampus – left ventral striatum (r = 0.429, p < 0.05); right ventral

striatum (r = 0.527, p < 0.01); left VTA (r = 0.589, p < 0.005); right VTA (r =

0.587, p < 0.005)

- Right hippocampus – right ventral striatum (r = 0.522, p < 0.01) and left

VTA (r = 0.448, p < 0.05)

240

- Left ventral striatum – left VTA (r = 0.606, p < 0.005) and right VTA (r =

0.698, p < 0.001)

- Right ventral striatum – left VTA (r = 0.561, p < 0.005) and right VTA (r =

0.538, p < 0.01)

- Left caudate nucleus – left hippocampus (r = 0.419, p < 0.05) and left VTA

(r = 0.447, p < 0.05)

3) In the memory session the following positive correlations were observed for

the:

Hard cues:

- Left hippocampus – left (r = 0.425, p < 0.05) and right ventral striatum (r =

0.529, p < 0.01) and left caudate nucleus (r = 0.530, p < 0.01)

- Right hippocampus – left (r = 0.621, p < 0.005) and right ventral striatum (r

= 0.517, p < 0.05); and the right VTA (r = 0.462, p < 0.05)

- Left ventral striatum – right VTA (r = 0.667, p < 0.001)

Easy cues:

- Right hippocampus – left (r = 0.483, p < 0.05) and right ventral striatum (r

= 0.584, p < 0.005); and the left VTA (r = 0.453, p < 0.05)

- Left ventral striatum – left (r = 0.677, p < 0.001) and right VTA (r = 0.672, p

< 0.001)

- Right ventral striatum – left (r = 0.671, p < 0.001) and right VTA (r = 0.647,

p < 0.005)

241

Appendix 3: Behavioral Questionnaires Administered in Experiments 1-3

Experiments 1 & 2: Pre-Experiment Motivation Questionnaire:

Now that you understand the game, I’d like to ask you a few questions before we begin:

1. How interested are you in playing this game? 1 2 3 4 5 6 7 Not interested, Extremely Here for the money Interested 2. How motivated are you to succeed at this game? 1 2 3 4 5 6 7 Not at all Extremely Motivated 3. How excited are you to play this game?

1 2 3 4 5 6 7 Not at all Extremely Excited 4. How excited are you about the money you could win based on your performance? 1 2 3 4 5 6 7 Not at all Extremely Excited

242

Experiment 1: Behavioral Questionnaire Administered Following MRI Task:

Object

H or L 1 2 3 4 5 6 7

ConfidenceValue

H or L

H or L

H or L

H or L

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

H or L 1 2 3 4 5 6 7

H or L

H or L

H or L

H or L

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

243

Experiment 2: Behavioral Questionnaire Administered Following MRI Task:

Object ConfidenceValue

H or L 1 2 3 4 5 6 7Very Somewhat Very unsure sure

H or L

H or L

H or L

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

H or L 1 2 3 4 5 6 7

H or L

H or L

H or L 1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

Very Somewhat Very unsure sure

244

Object

Percent of time you thought this shape was H?________Percent of time you thought this shape was L?________

Percentage

Percent of time you thought this shape was H?________Percent of time you thought this shape was L?________

Percent of time you thought this shape was H?________Percent of time you thought this shape was L?________

Percent of time you thought this shape was H?________Percent of time you thought this shape was L?________

Percent of time you thought this shape was H?________Percent of time you thought this shape was L?________

Percent of time you thought this shape was H?________Percent of time you thought this shape was L?________

Percent of time you thought this shape was H?________Percent of time you thought this shape was L?________

Percent of time you thought this shape was H?________Percent of time you thought this shape was L?________

245

Object Like Scale

1 2 3 4 5 6 7Dislike Neutral Like

very much very much

1 2 3 4 5 6 7Dislike Neutral Like

very much very much

1 2 3 4 5 6 7Dislike Neutral Like

very much very much

1 2 3 4 5 6 7Dislike Neutral Like

very much very much

1 2 3 4 5 6 7Neutral Like

very much

1 2 3 4 5 6 7Neutral Like

very much

1 2 3 4 5 6 7Neutral Like

very much

1 2 3 4 5 6 7Neutral Like

very much

Dislikevery much

Dislikevery much

Dislikevery much

Dislikevery much

246

• Did you use any strategies to solve this task? Please try to describe how

you played this game:

• Did you find it difficult to learn from both the arrows and the checks and Xs

in the second part? Which one was easier?

• When trying to determine the value of each object, did you rely more on

what you learned in the first or the second part of the game?

• What did you think happened in the second part of the game? Did you

think the values changed? If so, did you think they all changed or only

some of them?

247

Experiments 1 & 2: Post-Experiment Motivation Questionnaire:

Now that you’ve played the game, I’d like to ask you a few questions:

1. How interested were you when playing this game? 1 2 3 4 5 6 7 Not interested, Extremely Here for the money Interested 2. How motivated were you to succeed at this game? 1 2 3 4 5 6 7 Not at all Extremely Motivated 3. How excited were you to play this game?

1 2 3 4 5 6 7 Not at all Extremely Excited 4. How excited are you about the money you just won based on your performance? 1 2 3 4 5 6 7 Not at all Extremely Excited

248

Experiment 3: Pre-Experiment Motivation Questionnaires Administered on Days

1 &2:

Now that you understand the game, I’d like to ask you a few questions before we begin:

1. How interested are you in playing this game? 1 2 3 4 5 6 7 Not interested, Extremely Here for the money Interested 2. How motivated are you to succeed at this game? 1 2 3 4 5 6 7 Not at all Extremely Motivated 3. How excited are you to play this game?

1 2 3 4 5 6 7 Not at all Extremely Excited

249

Experiment 3: Behavioral Questionnaire Administered Following MRI Task:

ValueShape Confidence Scale

H or L1 2 3 4 5 6 7

Very Somewhat Very unsure sure

H or L 1 2 3 4 5 6 7Very Somewhat Very unsure sure

H or L 1 2 3 4 5 6 7Very Somewhat Very unsure sure

H or L1 2 3 4 5 6 7

Very Somewhat Very unsure sure

H or L 1 2 3 4 5 6 7Very Somewhat Very unsure sure

H or L1 2 3 4 5 6 7

Very Somewhat Very unsure sure

H or L1 2 3 4 5 6 7

Very Somewhat Very unsure sure

H or L 1 2 3 4 5 6 7Very Somewhat Very unsure sure

250

Shape Estimate of Percentage

Percent of time you thought this shape was high?_____Percent of time you thought this shape was low? _____

Percent of time you thought this shape was high?_____Percent of time you thought this shape was low? _____

Percent of time you thought this shape was high?_____Percent of time you thought this shape was low? _____

Percent of time you thought this shape was high?_____Percent of time you thought this shape was low? _____

Percent of time you thought this shape was high?_____Percent of time you thought this shape was low? _____

Percent of time you thought this shape was high?_____Percent of time you thought this shape was low? _____

Percent of time you thought this shape was high?_____Percent of time you thought this shape was low? _____

Percent of time you thought this shape was high?_____Percent of time you thought this shape was low? _____

251

Shape Like Scale1 2 3 4 5 6 7Dislike Neutral Like very much very much

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

1 2 3 4 5 6 7

Dislike Neutral Like very much very much

Dislike Neutral Like very much very much

Dislike Neutral Like very much very much

Dislike Neutral Like very much very much

Dislike Neutral Like very much very much

Dislike Neutral Like very much very much

Dislike Neutral Like very much very much

252

• Did you use any strategies while trying to determine the value of the shapes?

Please try to describe how you played this game:

• Which learning scenario was most difficult for you: shape learning session,

remember the scenes session, or the perception decision session, and why?

• How difficult did you find it to try to simultaneously learn the value of the shape

while recalling whether or not the natural scene was old or new?

• How accurate do you think you were at remembering which natural scenes were

old and which are new?

253

Experiment 3: Post-Experiment Motivation Questionnaire Administered on Day 1 (Scene Encoding Session): Now that you’ve played the game, I’d like to ask you a few questions:

1. How interested were you when playing this game? 1 2 3 4 5 6 7 Not interested, Extremely Here for the money Interested 2. How motivated were you to succeed at this game? 1 2 3 4 5 6 7 Not at all Extremely Motivated 3. How excited were you to play this game?

1 2 3 4 5 6 7 Not at all Extremely Excited 4. How successful do you think were you at attending to the natural scenes?

1 2 3 4 5 6 7 Not at all Extremely Successful 5. Did you use any strategies to try to learn and remember the natural scenes? If so

please describe what you did during the task to try to learn the scenes:

254

Experiment 3: Post-Experiment Motivation Questionnaire Administered on Day 2

(MRI Session):

Now that you’ve played the game, I’d like to ask you a few questions:

1. How interested were you when playing this game? 1 2 3 4 5 6 7 Not interested, Extremely Here for the money Interested 2. How motivated were you to succeed at this game? 1 2 3 4 5 6 7 Not at all Extremely Motivated 3. How excited were you to play this game?

1 2 3 4 5 6 7 Not at all Extremely Excited 4. How successful do you think were you at attending to the natural scenes?

1 2 3 4 5 6 7 Not at all Extremely Successful 5. How successful do you think you were at remembering which scenes are old and

which scenes are new?

1 2 3 4 5 6 7 Not at all Extremely

Successful 6. Did you use any strategies to try to remember the natural scenes? If so please

describe what you did during the task to try to remember the scenes:

7. Since the shapes were presented to you in pairs ([circle square] [club triangle] [diamond sun]) did you assume that one shape in each of the pairs was higher than 5 and one was lower than 5? Or did you treat each shape independently?

255

Vita

Kathryn Cain Dickerson

1983 Born January 16th in Syracuse, New York.

2001 Graduated from Tully High School, Tully, New York.

2001-2006 Attended the University of Rochester, Rochester New York.

2005-2006 Take 5 Scholar at the University of Rochester, Rochester New

York.

2006 B.A. in Brain and Cognitive Sciences, University of Rochester,

Rochester, New York.

2006-2011 Graduate work in Behavioral and Neural Science, Rutgers

University, Newark, New Jersey.

2008 Article: “Distinct mechanisms of impairment in cognitive ageing and

Alzheimer’s disease,” Brain, vol. 131(6), p. 1618-1629.

2010 Article: “Parallel contributions of distinct human memory systems

during probabilistic learning,” NeuroImage, vol. 55(1), 266-276.

2011 Article: “Reward-related learning via multiple memory systems”,

Biological Psychiatry, submitted.