14
Parsing a mental program: Fixation-related brain signatures of unitary operations and routines in natural visual search Juan E. Kamienkowski a, b, 1 , Alexander Varatharajah c, 1 , Mariano Sigman d , Matias J. Ison e, * a Laboratory of Applied Articial Intelligence, Computer Science Institute, School of Exact and Natural Science, University of Buenos Aires National Council of Science and Technology, Buenos Aires, Argentina b Physics Department, School of Exact and Natural Science, University of Buenos Aires, Buenos Aires, Argentina c Department of Engineering, University of Leicester, Leicester, United Kingdom d Torcuato Di Tella University, Buenos Aires, Argentina e School of Psychology, University of Nottingham, Nottingham, United Kingdom ARTICLE INFO Keywords: Visual search EEG Eye movements Attention Natural scenes Oscillations ABSTRACT Visual search involves a sequence or routine of unitary operations (i.e. xations) embedded in a larger mental global program. The process can indeed be seen as a program based on a while loop (while the target is not found), a conditional construct (whether the target is matched or not based on specic recognition algorithms) and a decision making step to determine the position of the next searched location based on existent evidence. Recent developments in our ability to co-register brain scalp potentials (EEG) during free eye movements has allowed investigating brain responses related to xations (xation-Related Potentials; fERPs), including the identication of sensory and cognitive local EEG components linked to individual xations. However, the way in which the mental program guiding the search unfolds has not yet been investigated. We performed an EEG and eye tracking co-registration experiment in which participants searched for a target face in natural images of crowds. Here we show how unitary steps of the program are encoded by specic local target detection signatures and how the positioning of each unitary operation within the global search program can be pinpointed by changes in the EEG signal amplitude as well as the signal power in different frequency bands. By simultaneously studying brain signatures of unitary operations and those occurring during the sequence of xations, our study sheds light into how local and global properties are combined in implementing visual routines in natural tasks. 1. Introduction Most daily tasks for example reading this sentenceinvolve a sequence of operations. During the last few decades, there has been an accumulation of knowledge about how the brain computes unitary op- erations. In contrast, the mechanisms by which unitary operations are assembled in programs and routines in the brain have seen comparatively very little investigation (Ullman, 1984; Fujii and Graybiel, 2005; Roelf- sema, 2005; Jubault et al., 2007; Lange et al., 2010; Zylberberg et al., 2011, 2013). One prominent example of a mental program comes from natural viewing, where two levels of processing must be integrated in real-time: visual processing of each stimulus, and the integration of in- formation along the sequence to achieve the goal of the whole task. However, little is known about brain activity in natural viewing as obtaining EEG potentials is intrinsically difcult because eye movements heavily contaminate brain signals. In xed-gaze scenarios, sequential-decision making has been one specic case in which the iteration of unitary operations of information updating has been studied in human neurophysiology (Lange et al., 2010). Lange and colleagues showed a gradual reduction of cortical ac- tivity with each sample: activity was inversely related to the accumulated evidence, which was interpreted as a reection of top-down inuence on sensory processing (Platt and Glimcher, 1999). These results built on single neuron studies in macaques, reporting that neuronal activity was modulated by the evidence accumulated throughout a sequence of simple operations in neurons in the parietal cortex (Yang and Shadlen, 2007) and in more complex visual tasks by recordings in the primary visual area (Moro et al., 2010). In humans, early ERP studies have shown that in xed-gaze oddball experiments the amplitude of the P3 component, emerging when the target is detected, is modulated by the inter-stimulus * Corresponding author. E-mail address: [email protected] (M.J. Ison). 1 These authors contributed equally to this work. Contents lists available at ScienceDirect NeuroImage journal homepage: www.elsevier.com/locate/neuroimage https://doi.org/10.1016/j.neuroimage.2018.08.010 Received 3 May 2018; Received in revised form 24 July 2018; Accepted 6 August 2018 Available online 8 August 2018 1053-8119/© 2018 Published by Elsevier Inc. NeuroImage 183 (2018) 7386

Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

NeuroImage 183 (2018) 73–86

Contents lists available at ScienceDirect

NeuroImage

journal homepage: www.elsevier.com/locate/neuroimage

Parsing a mental program: Fixation-related brain signatures of unitaryoperations and routines in natural visual search

Juan E. Kamienkowski a,b,1, Alexander Varatharajah c,1, Mariano Sigman d, Matias J. Ison e,*

a Laboratory of Applied Artificial Intelligence, Computer Science Institute, School of Exact and Natural Science, University of Buenos Aires – National Council of Scienceand Technology, Buenos Aires, Argentinab Physics Department, School of Exact and Natural Science, University of Buenos Aires, Buenos Aires, Argentinac Department of Engineering, University of Leicester, Leicester, United Kingdomd Torcuato Di Tella University, Buenos Aires, Argentinae School of Psychology, University of Nottingham, Nottingham, United Kingdom

A R T I C L E I N F O

Keywords:Visual searchEEGEye movementsAttentionNatural scenesOscillations

* Corresponding author.E-mail address: [email protected] (

1 These authors contributed equally to this work

https://doi.org/10.1016/j.neuroimage.2018.08.010Received 3 May 2018; Received in revised form 24Available online 8 August 20181053-8119/© 2018 Published by Elsevier Inc.

A B S T R A C T

Visual search involves a sequence or routine of unitary operations (i.e. fixations) embedded in a larger mentalglobal program. The process can indeed be seen as a program based on a while loop (while the target is notfound), a conditional construct (whether the target is matched or not based on specific recognition algorithms)and a decision making step to determine the position of the next searched location based on existent evidence.Recent developments in our ability to co-register brain scalp potentials (EEG) during free eye movements hasallowed investigating brain responses related to fixations (fixation-Related Potentials; fERPs), including theidentification of sensory and cognitive local EEG components linked to individual fixations. However, the way inwhich the mental program guiding the search unfolds has not yet been investigated. We performed an EEG andeye tracking co-registration experiment in which participants searched for a target face in natural images ofcrowds. Here we show how unitary steps of the program are encoded by specific local target detection signaturesand how the positioning of each unitary operation within the global search program can be pinpointed by changesin the EEG signal amplitude as well as the signal power in different frequency bands. By simultaneously studyingbrain signatures of unitary operations and those occurring during the sequence of fixations, our study sheds lightinto how local and global properties are combined in implementing visual routines in natural tasks.

1. Introduction

Most daily tasks –for example reading this sentence– involve asequence of operations. During the last few decades, there has been anaccumulation of knowledge about how the brain computes unitary op-erations. In contrast, the mechanisms by which unitary operations areassembled in programs and routines in the brain have seen comparativelyvery little investigation (Ullman, 1984; Fujii and Graybiel, 2005; Roelf-sema, 2005; Jubault et al., 2007; Lange et al., 2010; Zylberberg et al.,2011, 2013). One prominent example of a mental program comes fromnatural viewing, where two levels of processing must be integrated inreal-time: visual processing of each stimulus, and the integration of in-formation along the sequence to achieve the goal of the whole task.However, little is known about brain activity in natural viewing asobtaining EEG potentials is intrinsically difficult because eye movements

M.J. Ison)..

July 2018; Accepted 6 August 2

heavily contaminate brain signals.In fixed-gaze scenarios, sequential-decision making has been one

specific case in which the iteration of unitary operations of informationupdating has been studied in human neurophysiology (Lange et al.,2010). Lange and colleagues showed a gradual reduction of cortical ac-tivity with each sample: activity was inversely related to the accumulatedevidence, which was interpreted as a reflection of top-down influence onsensory processing (Platt and Glimcher, 1999). These results built onsingle neuron studies in macaques, reporting that neuronal activity wasmodulated by the evidence accumulated throughout a sequence of simpleoperations in neurons in the parietal cortex (Yang and Shadlen, 2007)and in more complex visual tasks by recordings in the primary visual area(Moro et al., 2010). In humans, early ERP studies have shown that infixed-gaze oddball experiments the amplitude of the P3 component,emerging when the target is detected, is modulated by the inter-stimulus

018

Page 2: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

interval (ISI) (Fitzgerald and Picton, 1981; Woods et al., 1980; Gonsalvezand Polich, 2002). In many studies, this has been related to the concept ofexpectancy and surprise and led to the prediction that P3-like compo-nents could act as a cortical index to a discrete sequence of accumulatedprocesses (Zylberberg et al., 2013). Beyond discrete event-related po-tentials, some experimental evidence suggests the involvement oflow-frequency oscillations (theta, alpha and beta) in integrative brainprocesses, while high-frequency oscillations (gamma-band oscillations)are sought to reflect the circuit-level mechanisms mediating localencoding processes (either sensory or motor), although both couldinvolve neuronal activity within the same cortical region (Donner andSiegel, 2011). For instance, top-down attention has a different spectralprofile from the local encoding of sensory features in primary visualcortex (Siegel et al., 2008), and decision-making has a different spectralprofile from that of motor planning (Donner et al., 2007; Gross et al.,2004; Hipp et al., 2011; Pesaran et al., 2008). Large modulations in thealpha-band have been associated to both event-related desynchroniza-tion (ERD) and synchronization (ERS). The most common observation isthat brain areas that are activated during a task exhibit ERD, whereasareas associated with task irrelevant and potentially interfering processesexhibit ERS. Thus, the latter process is usually associated with an inhi-bition, and the former one with the release of that inhibition. This hasbeen observed, for instance, in tasks that vary stimulus modality (e.g.,Visual vs. Auditory) (Foxe et al., 1998), stimulus processing domain (e.g.,related to the ventral vs. dorsal processing stream) (Jokisch and Jensen,2007) or stimulation side (e.g., stimulation of the right vs. left visualhemifield) (Thut et al., 2006; Worden et al., 2000). The changes inalpha-band frequency power are also progressive, as shown by the tem-poral spectral evolution (TSE) waveforms in many tasks. For instance,when a warning signal preceded a target or non-target in an oddball task,the alpha-band oscillations not only responded to the identity of thestimulus but also to the expectancy of the upcoming stimulus (Klimeschet al., 1998). Desynchronization of alpha/beta activity has also beenreported when the anticipation of an impending target increases (Baueret al., 2014).

Eye movement and modeling approaches to visual search tend tofocus on general planning of the scanpath, and how image propertiesmodulate both fixation duration and position distributions (Deubeland Schneider, 1996; Findlay et al., 2001; Findlay, 1997; Henderson,2003; Ludwig and Gilchrist, 2002; Najemnik and Geisler, 2005; Wolfe,2007). Other experimental approaches include fixed-gaze neuro-imaging studies with a single spatially spread stimuli presentation, orsequentially centered stimuli presentations. Single presentation EEGstudies tend to focus on the neural basis of spatial covert attention andits mechanisms (Ehinger et al., 2015; Weaver Matthew and ClaytonvanZoest Wieske, 2017), while serial visual presentations (fixed-gaze) aremostly used to study the neural correlates of target detection/identi-fication with overt visual attention (Gonsalvez and Polich, 2002;Polich, 2007). Regardless of the methodological approach, much ofwhat is known in the field comes presenting ‘synthetic’ stimuli,ranging from simple bars and sinusoidal gratings to other typicallaboratory stimuli including pre-segmented objects and faces. Conse-quently, there has been an increasing interest in visual search of nat-ural scenes as the complexity of natural scenes may provide a greaterdepth of the behavior for investigation (Mack and Eckstein, 2011),particularly in free-viewing visual search (Otero-Millan et al., 2008).

Recent studies have started to combine EEG and eye movements(Dimigen et al., 2009, 2011; Kamienkowski et al., 2012a; Ossand�onet al., 2010; Yuval-Greenberg et al., 2008). They have shown that it ispossible to reliably measure brain activity related to the visual infor-mation retrieved from each fixation, which has begun to close the gapbetween these approaches. This opened the possibility to combinethese lines of research and study human neurophysiology of visualovert exploration of natural scenes. In particular, previous studieshave investigated brain activity elicited on each fixation, showing thata clear P3 arises from both synthetic (Kamienkowski et al., 2012a;

74

Brouwer et al., 2013; K€orner et al., 2014; U�s�cumli�c and Blankertz,2016) and natural (Kaunitz et al., 2014) stimuli. Moreover, the P3component is observed both when a target stimulus is embedded in themiddle (Kamienkowski et al., 2012a) or at the end of the stream offixated items (Brouwer et al., 2013; U�s�cumli�c and Blankertz, 2016;Kaunitz et al., 2014; Wenzel et al., 2016), and it is robust enough to bedetected in a trial-by-trial basis (Brouwer et al., 2013; U�s�cumli�c andBlankertz, 2016; Kaunitz et al., 2014; Wenzel et al., 2016). Thiscomponent has been used as part of a brain computer interface (BCI)system for assistive technologies (Wenzel et al., 2016), as well as toinvestigate other “real world” applications such as understanding in-formation systems (L�eger et al., 2014), or the hazard perceptiondriving test (Savage et al., 2013). Most studies have focused on thecomparison between target and distractor responses (but see (K€orneret al., 2014) for sequential differences among distractors) (Ehingeret al., 2015; Weaver Matthew and Claytonvan Zoest Wieske, 2017). Afew co-registration studies have also focused on eye movements overnatural scenes but in scene perception and memorization, anddescribed the early processing of the fixated patches (Ossand�on et al.,2010; Simola et al., 2013, 2015; Staudigl et al., 2017). In a recentstudy (Kaunitz et al., 2014), we showed that there is a high similaritybetween fERPs recorded in an eye movement visual search task andERPs recorded in a fixed-gaze experiment with similar stimuli. Thiswas reflected both in early visual potentials, but also in a late potentialthat discriminated targets from distractors. One pitfall of that study isthat participants were explicitly instructed to perform long fixations.This aimed at eliciting comparable number of long epochs with fixa-tions to targets and distractors at the expense of ecological validity,since in natural viewing the processing of each single stimulus has tobe combined with the information integrated throughout the task.

Here, we aimed to investigate the existence of brain signatures forunitary operations and mental programs in a completely free viewingvisual search task. To date, this has only been addressed in very fewstudies and they all used fixed-gaze paradigms. We hypothesized thatlocal processing in unitary operations under fixed-gaze would beconserved in natural viewing, and could be captured by fixation-related components. Furthermore, we also hypothesized that signa-tures of global integration of information along the task would bereflected in modulations in the P3 amplitude and low-frequency os-cillations. We propose a schematic framework to link the identifiedelectrophysiological signatures with underlying integrative processesin natural viewing, which could form the basis of future theoreticalframeworks of gaze control integrating electrophysiology and eyemovements in natural viewing.

2. Materials and methods

2.1. Participants

Seventeen subjects participated in the experiment (13 male/4 female;aged between 21 and 31 years). All subjects were naïve to the objectivesof the experiment, had normal or corrected to normal vision, and pro-vided written informed consent according to the recommendations of thedeclaration of Helsinki to participate in the study.

2.2. Apparatus

Stimuli were presented in a 21-inch cathode ray tube (CRT) monitorwith a screen resolution of 1024� 768 pixels and at a refresh rate of75Hz. Subjects sat 60 cm from the monitor in a chair, with their headsstabilized using a specially designed ‘cheek rest’ (to avoid EEG artifactsfrom muscular activity from the jaw) and responses were made on astandard ‘qwerty’ keyboard. All stimuli were presented in MATLAB(MathWorks, 2000) using the Psychophysics Toolbox extensions (Brai-nard, 1997; Pelli, 1997).

Page 3: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

Fig. 1. Experimental Design. (A) The target face (inset) is presented for 3 s. Then, a fixation dot appears in a random position for 1 s. The search starts when thefixation dot disappears and the crowds image is presented. Once the subject finds the target face; they fixate on it for 1 s to end the trial. The crowd image shown hereis the same as the one shown in panel B. (B) Search example: The scan path (black line) and fixations (color dots) are superimposed to the crowd image and the target(highlighted with a brown square). The color of the dots represents the fixation rank of the distractors, -i.e. the fixation number- and the diameter size represents theduration of the fixation. (C) Eye traces of the same trial for both horizontal (black) and vertical (gray) positions of the right eye. The vertical red line shows the onset ofthe fixation to the target. (D) Distribution of the number of fixations preceding the target; these are all the fixations preceding the target regardless of their position. (E)Distribution of fixation durations made to distractors. (F) Distribution of distractor saccade amplitudes.

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

2.3. Stimuli

In the database there were 60 Gy-scale images. The images were offootball crowds in stadiums, each image was 800� 768 pixels (23.4 by22.4 degrees of visual angle) and contained between 23 and 35 dis-tractors (30.68 mean average) (Figs. S1B and D). The luminance profileof each image was equalized and the pictures were converted to gray-scale to constrain the variability of visual salience across global dis-plays (i.e. to avoid characteristics of the image that could be particularly

75

salient). From each image, 3 faces of size 80� 80 pixels were chosen astargets and the rest were marked as distractor faces (Figs. S1C and D).The selected target locations did not follow any specific pattern andincluded different angles, expressions and genders. The spatial distribu-tion of all faces and particularly the targets covered the whole display.

2.4. Experimental procedure

We followed a similar procedure as in Kaunitz et al. (2014) (Kaunitz

Page 4: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

Fig. 2. FERPs for matched eye movement properties across midline channels. (A) Fixations onset at 0 ms and the fERPs were baseline corrected [�200 -100].Channels Fz, Cz, Pz and Oz are shown for the target (red) and distractor (blue) conditions. The horizontal and vertical saccade amplitude, as well as the saccadeduration preceding the fixation of interest; were matched between targets (N¼ 1895) and distractors (N¼ 1895) for fixations longer than 400ms. The p-values for thesignificant differences (p< 0.01) are represented with black bars at the top of each channel plot, and were estimated using a cluster-based permutation test from theFieldTrip toolbox (Maris and Oostenveld, 2007). The boundaries of the significant intervals observed in the panel A insets are: Fz: {[121, 164] ms, [254 400] ms}, Cz:{[117, 207] ms, [238, 400] ms}, Pz: {[148, 215] ms, [230, 400] ms}, Oz: {[184, 203] ms, [281, 297] ms, [309, 324] ms, [332, 348] ms, [363, 400] ms}. (B)Significant channels within 30ms time windows centered in {0ms, 50ms, 100ms, 170ms, 240ms, 310ms, 380ms}, after the cluster-based permutation test. (C)Scalp topographies of the difference wave (Targets – Distractors). Each topography corresponds to the average activity of the same intervals as panel (B).

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

et al., 2014). Briefly, at the beginning of each trial subjects pressed thespace bar and were presented with a target face for 3 s (Fig. 1A). Thetarget faces were first cropped to a square of size 2.04� � 2.04�. To pre-vent subjects from facilitating their visual search based on the size of thetarget face, in each trial we rescaled the original target face to a randomsize between 1.75� � 1.75� and 2.63� � 2.63�. After this time a fixationdot was presented on the screen at a random location (Fig. S1A). Subjectshad to fixate at the dot location for 1 s for the image of a crowd to appearon the screen (Fig. 1A). The task was to search for the target face withinthe crowd and to fixate on it for 1 s once they have found it (Fig. 1A–C).Trials ended when subjects found the target or after 20 s of visual search.The 60 images were presented in pseudo-random order as a block. Be-tween blocks subjects took 5min resting breaks. In each block the targetface for each image was different from previous blocks. In total subjectsperformed 180 trials (3 different targets per crowd image for the wholeexperiment).

The main difference from Kaunitz et al. (2014) was that in the presentstudy participants were neither trained to perform slower fixations in aprevious session, nor were the instructions biased towards accuracyinstead of speed (Kaunitz et al., 2014). Subjects were simply left toexplore and search for the target as they naturally would. The rationale ofthe procedure in the previous experiment was to get a large enoughsample of distractors to obtain measurable fERPs. In the present experi-ment we aimed to explore naturally occurring fixations, and we showthat even in this situation it is possible to observe robust fERPs.

2.5. Eye movements and EEG data acquisition

Eye movements were registered with an EYELINK 1000 system (SRResearch, Ontario, Canada). The ET was used in binocular mode withstabilised-head and sampling rate of 500Hz in each eye. Saccades weredetected using an adapted version of velocity-based Engbert and Kliegl'salgorithm (Engbert and Kliegl, 2003; Mergenthaler and Engbert, 2007);using the parameters described in Kamienkowski et al. (2012)(Kamienkowski et al., 2012a). Only saccades larger than 1� were kept for

76

the analyses of the data, as saccades below this threshold were consid-ered microsaccades (Otero-Millan et al., 2008). For all the experiments adrift correction was made every 10 trials, and a recalibration of the ETevery 60 trials (before the beginning of a new block).

The EEG data was recorded with a 64-channel 10–20 montage usingthe Biosemi Active-Two System (Biosemi, Amsterdam, Holland) at1024Hz. The Data were imported into Matlab using the EEGLAB toolbox(Delorme and Makeig, 2004) with linked mastoids as the reference. ForfERPs, the datasets that were created were down-sampled at 256Hz andband-pass filtered between 0.1 and 40Hz (six order elliptic filter). Thestart of the fixation on the distractor or target face, identified from theeye tracker data, was taken as the onset of the trial. The brain responsesfor the target from each crowd image were analyzed, as were the fixa-tions to distractors. For the comparisons between targets and distractors,only fixations that lasted longer than 400ms were kept and the EEG datawas aligned to fixation onset and epoched between [-0.2 0.8] secondsfrom the start of the fixation. For constructing the fERPs related to dis-tractors early processing, we kept fixations longer than 200ms. In bothcases, we excluded fixations to distractors larger than 1000ms, whichrepresented (0.6� 0.1)% of the fixations to distractors. For the fERP localanalyses, we applied a baseline correction to each epoch in the timewindow [-200 -100]ms from fixation onset. For the global fERP analyses(Fig. 4), no baseline correction was used.

EEG and ET data were synchronized as in Kaunitz et al. (2014)(Kaunitz et al., 2014). Briefly, an analog card was used to convert thedigital eye position into analog voltage channels. The temporal offsetbetween the signals was corrected by realigning the eye tracking data tothe frontal electrodes that exhibited a sharp saccadic spike potentialshortly after saccade onset.

Data are fully available on reasonable request.

2.6. Matching procedure

Fixations to targets and distractors were match-selected, based on theeye-movement properties of each fixation (Kamienkowski et al., 2012a;

Page 5: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

Fig. 3. Global effects on target detection. (A) MidlineTarget P3 Amplitude as a function of Short and Long trials.The amplitude was calculated as the mean average in the[250 400] ms windows extracted from the individual targetfERPs, averaged across all trials and subjects. The gray barsrepresent the Short trials (<7 fixations in a trial, N ¼ 865)and the black bars represent the Long trials (�7 fixations,N ¼ 1030). Bonferroni-corrected rank-sum tests showed sig-nificant differences for Fz, Cz, and Pz (p < 0.00001) but notfor Oz (p ¼ 0.4210). n. s.: p > 0,05, and ***: p < 0,001. (B)fERPs to targets as function of time, for different fixationranks. (C) Denoised single trial P3 amplitudes vs. fixationrank at Cz electrode. Amplitude of the P3 distribution (topleft) and fixation rank (botton right) histograms. Two-dimensional histogram of those variables (top right); eachbin of the histogram contains single trial amplitudes of theP3. Amplitudes of the P3 and fixation ranks were negativelycorrelated (Pearson's R¼�0.11, p< 0.00001).

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

Kliegl et al., 2014). This was to avoid any baseline differences createdfrom the eye movements, so that the fERPs for targets and distractorscould be compared without artifactual components affecting the results.In short, we used as matching parameters the preceding saccade hori-zontal (dx) and vertical (dy) amplitudes, as well as its duration (dt). Weimplemented K-nearest neighbors (KNN) algorithm as a robust matchingmethod. This algorithm finds the ‘nearest neighbor’ for an mx-by-n ma-trix X in each point of an my-by-n matrix Y. The method is exhaustive anduses replacement; first calculating the distance of each point and thenfinding the smallest distance. Once matched, the element is placed backin the pool to be matched for distance again. A standardized Euclideandistance metric (see Equation (4.1)) was used, where xs is a columnvector from X that corresponds to yt a column vector from Y:

dst ¼ ðxs � ytÞV�1ðxs � ytÞ' (4.1)

Here, V is the n-by-n diagonal matrix whose jth diagonal element iss(j)2, where s is the vector with the sample standard deviation; this scalesthe difference between rows xs and yt by dividing the corresponding el-ements by the standard deviation. Following thematching procedure, thedifferent parameters matched were shown to have similar distributionsacross both target and distractor condition (see Fig. S2). The matchingprocedure has shown that the parameters have no significant differencesin each of the three variables used: Saccade Duration (D: 32.0ms([24.0ms 38.0ms]), T: 32.0ms ([24.0ms 38.0ms]), Wilcoxon rank-sumtest: p¼ 0,96), Horizontal Saccade Amplitude (D: 4.7 deg ([2.3 deg 8.1deg]), T: 4.6 deg ([2.3 deg 8.2 deg]), Wilcoxon rank-sum test: p¼ 0,86),and Vertical Saccade Amplitude (D: 2.4 deg ([1.1 deg 4.7 deg]), T: 2.5

77

deg ([1.2 deg 4.8 deg]), Wilcoxon rank-sum test: p¼ 0,81).In earlier studies, explicit instructions were given to participants in

order to increase the length of fixations, producing more predictable and“matchable” eye movements (Kamienkowski et al., 2012a). Here wefound that it is possible to match properties with unrestrictedfree-viewing eye movements in a robust way.

2.7. Single-trial analysis

Single-trial ERPs were extracted using a denoising algorithm based onwavelet decomposition (Ahmadi and Quian Quiroga, 2013; Navajaset al., 2013; Quian Quiroga, 2000). The first step was to project thesingle-trial traces into the wavelet space. Then, the wavelet coefficientsrelated to the evoked responses were selected automatically for eachchannel and subject using the algorithm implemented in the ep_denpackage (Ahmadi and Quian Quiroga, 2013). Finally, the ERPs weredenoised by reconstructing the signal using only those waveletcoefficients.

2.8. Time-frequency analysis

For the Time-Frequency analysis, the EEG data sampled at 1024 Hzwas band-pass filtered at 0.1–95 Hz (butterworth), and band-stop filteredbetween 49 and 51Hz (FIR). The data were epoched into trials (including10 s before the presentation of the stimuli and 10 s after the end of thetrial). Then, Infomax ICA was calculated only on successful trials. Theartifactual ICs related to muscular, erroneous electrodes and eye

Page 6: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

Fig. 4. Global effects on baseline and visual processing. (A) Midline fERPs to distractors as function of the fixation rank. fERPs were epoched between [-500 500]ms and no baseline correction were applied in order to observe its slow progression along the trial. From top to bottom: Fz, Cz, Pz, and Oz. The gray shadow in the toppanel shows the [�500–100] ms period analyzed in panel B). Vertical solid lines represent the onset of each fixation, and vertical dashed lines represent theboundaries of each epoch. (B) Average amplitude in the [�500–100] ms period for each electrode. Error bars represent the �s.e.m. interval. (C) Early fERPs todistractors in the electrode Oz as a function of fixation rank. Baseline correction between [�200–100] ms were applied. The gray shadow shows the [75 125] msperiod analyzed in panel D. The vertical solid lines represent the onset of each fixation. (D) Average amplitude in the [75 125] ms period for each electrode. Error barsrepresent the �s.e.m. interval.

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

movements were selected and removed based on ‘ADJUST’ (Mognonet al., 2011) and ‘EyeCatch’ (Bigdely-Shamlo et al., 2013) methods andsupervised by an expert. Across all participants we removed (8� 5) in-dependent componenets (mean� stdev). Due to the potential confoundof removing aspects of (fixation-related) cognitive processes of interest,we were particularly conservative in discarding any ICs, for instance thesole appearance of an alpha peak was an argument for keeping the IC.

Time-Frequency transform was performed using a method based on

78

the convolution of the signal with a complex wavelet (‘mtmconvol’method) from the FieldTrip toolbox (Oostenveld et al., 2011), usingHanning tapers. The time was sampled on windows of 0.5 s, covering thewhole trial with a step of 0.1 s. The frequency was sampled from 0.5 to30 Hz with a step of 0.5 Hz.

Spectral profiles, defined as the change in the cortical power spectrumdue to a functional process, have been proposed to be a functional linkbetween the MEG/EEG signals and cognition (Donner and Siegel, 2011;

Page 7: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

Klimesch, 1999). Spectral profiles account for changes of the frequencyspectrum with a given experimental manipulation, stimulus variation ortask progression. In our case, we aimed to explore the cognitive corre-lates of progression of the task by using their fingerprints on the spectralprofile, i.e. we calculated the correlation between the frequency powerand the fixation rank to distractors during the visual search.

In particular, for the present analysis we focused on two frequencybands: theta ((L�eger et al., 2014) Hz) and alpha ([8 13] Hz). Thus, thepower of frequencies falling into those bands were averaged for eachelectrode and time window. Then, the trials were epoched into fixations,averaging for each fixation the power estimated for all the windows thatfall into the fixation. Thus, we ended up with two values of power foreach fixation (sorted by fixation rank) and electrode, and we averagedthose values first within participants and then across participants for theelectrodes of interest.

3. Results

3.1. Experimental paradigm and behavior

The time course of an exemplary trial is shown in Fig. 1A. Acrossall subjects, we obtained a total of 19 722 fixations larger than 50ms.The overall distribution of the eye movement variables during thesearch presented a typical pattern. The mean and standard deviation ofthe fixation duration were (240� 103) ms (Fig. 1E). The 5% lowerlimit of the saccade amplitude was 1.28� (Fig. 1C), and the percentageof saccades that were actually smaller than the target face (2.04�) was18.66%. Participants found the target most of the times -P(targetfound) ¼ (0.86� 0.02)- and performed very few fixations to the targetbefore the one that finished the trial -percentage of trials with re-fixations to the target ¼ (4.3� 0.6)%-. Altogether, this suggests thatsubjects were successful at performing this task and did not typicallyperform two consecutive fixations on the same face (Fig. 1B and C).Another important global variable of the search is the number of fix-ations performed. The median number of fixations needed to find thetarget (before the last fixation to the target) was 6 (interquartile range:[3 13]) (Fig. 1D)). To assess the influence of individual differences inbehavioral performance, we compared the variance in two dependentvariables (Response Time over correct trials –RTc- and probability offinding the target, PT) across participants (N¼ 17) and compared itwith the ones obtained across trials (N¼ 180). We found that both thevariances in the mean RTc and PT across participants: (var(RTcjpart)¼ 0.36, and var (PTjpart)¼ 0.004) were smaller than theones across trials: (var (RTcjtrials)¼ 1.99, and var (PTjtrials)¼ 0.012).To test the hypothesis of lower variability across participants (H1), westatistically compared the observed variance across participants withthe ones obtained from a distribution of 100 000 surrogates, createdby randomly taking subsamples of 17 images (same as the number ofparticipants). We determined the p-value as the fraction of surrogateswhich showed a smaller variance than the original test statistic. Thenull hypothesis of equal variances were rejected with p-values: P(RTcjpart >(RTcjtrials)¼ 0.00024 and P(PTjpart> PTjtrials)¼ 0.01374.

3.2. Fixation event-related potentials in unrestricted visual search

The dynamics of fERP responses to targets and distractors, summa-rized in Fig. 2A, were broadly consistent with what has been observed inprevious studies using synthetic stimuli or prolonged fixations: A strongP1 at occipital electrodes, a central positivity at ~170ms which re-sembles the Vertex Positive Potential (VPP) in response to faces (Jeffreys,1989), and a large target-specific P3 peaking after 300ms spread fromcentral electrodes (Polich, 2007; Kamienkowski et al., 2012a; Kaunitzet al., 2014).

The VPP was confirmed by analyzing all fixations to distractors largerthan 200ms (N¼ 10 466), changing the reference from linked mastoidsto average, and observing that the central positive peak turns into a

79

parieto-occipital negativity, i.e. the N170, with a slight lateralization tothe right (Wilcoxon rank-sum test: PO7 vs. PO8 amplitudes in the[150 200] ms time window: p< 0.05, zval¼�2.00, Cohen'sd (N¼ 34)¼�0.34) (Mognon et al., 2011; Bigdely-Shamlo et al., 2013)(Fig. S3).

In order to control for differences in eye movements for the com-parison between targets and distractors, we considered: First, only thefixations to distractors longer than 400 ms -where we analyzed onlythe first 400 ms-; and second, we matched the properties of theincoming saccade -the duration, and the horizontal and vertical am-plitudes- (Fig. S2). As expected, we found that the strongest differenceappeared after 300 ms and was more prominent in the centro-parietalelectrodes but widely spread across most of the scalp (Fig. 2B and C,cluster-based permutation test with αcluster<0.01 (Maris and Oos-tenveld, 2007)). Interestingly, consistent significant differencesemerge across several electrodes much earlier (around 170 ms), whichsuggestively matches the latency (Fig. 2A, black bars; the limits of theearly significant interval in the Cz electrode are [117, 207] ms) andscalp distribution of the VPP (Fig. 2C). Further investigation of thewaveforms for fixations that were longer than 400ms showed nosignificant differences with shorter fixations (between 300 ms and400 ms (Fig. S4).

3.3. Emergence of properties of the P3 component in free-viewing

In fixed-gaze oddball experiments, two major subcomponents ofthe P3, the P3a and the P3b are typically reported (Squires et al.,1977). The P3a is elicited at the more frontal electrode sites and islarger for novel stimuli, whereas the P3b component is elicited incentral and parietal regions, and is larger for the detection of a target.Importantly, a modulating property that positively affects the ampli-tude of the P3 is the ISI (Fitzgerald and Picton, 1981; Woods et al.,1980; Gonsalvez and Polich, 2002). This has been related to theconcepts of expectancy and surprise. As the participant is anticipatinga target to be presented, a build-up of expectancy is sought to occur;this effect is usually related to the P3b subcomponent of the P3(Polich, 2007). Conversely, the surprise effect is usually related to theP3a subcomponent of the P3 (Gonsalvez and Polich, 2002; Polich,2007). Therefore, a question raised as to whether fERPs infree-viewing visual search tasks show properties that relate to classicconcepts of expectancy and surprise.

In the current design, there was no control of the ISI -i.e. the intervalbetween fixations-as the participants can freely decide when and whereto fixate next. However, the number of distractors presented before thetarget has also been used to study the properties of the P3 component(Gonsalvez and Polich, 2002). In the current setup, the number of fixa-tions made prior to finding the target could give an indication of thecomplexity of a scene and indicate how difficult it was to find the target.Hence, we can make a parallel with an oddball paradigm -a fixed-gazesequential visual search- (Gonsalvez and Polich, 2002).

As a first step, we collapsed the trials into two categories based on thenumber of fixations needed to find the target: ‘Short’ (less than 7 fixa-tions; N¼ 865) and ‘Long’ trials (more or equal than 7 fixations;N¼ 1030). Significant differences between these categories were foundin frontal, central and parietal midline electrodes (Fz: p< 0.00001,zval¼ 6.61, Cohen's d (N¼ 1895)¼ 0.15; Cz: p< 0.00001, zval¼ 5.77,Cohen's d (N¼ 1895)¼ 0.13; Pz: p< 0.00001, zval¼ 4.80, Cohen'sd (N¼ 1895)¼ 0.11) but not in the occipital electrodes (Oz: p¼ 0.42,zval¼ 1.25, Cohen's d (N¼ 1895)¼ 0.03) (Fig. 3A).

The amplitude of the P3 component on the central electrode showed acontinuous decrease as a function of the number of fixations to the target(Fig. 3B). Moreover, there was a significant negative correlation of thesingle-trial P3 amplitudes with the number of fixations to the target(Fig. 3C; Pearson's correlation coefficient R¼�0.11, p< 0.00001). Thisdecrease is also in agreement with a gradual reduction of the surpriseeffect.

Page 8: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

Fig. 5. Spectral profile. (A) Correlation between the frequency power during each fixation and the fixation rank as a function of frequency, for each electrode (Fz, Cz,Pz, and Oz). The upper panel shows the Bonferroni-corrected significance of the correlations (considering 240 comparisons, i.e. 60 frequency points and 4 electrodes)and thresholded by 0.01. Red and blue shadows highlight the theta ([3.5 8.0] Hz) and alpha bands ([8.0 13.5] Hz). (B) Power in theta and alpha frequency bands asfunction of fixation rank. Error bars represent the �s.e.m. interval, and the straight line the general trend. (C) Time-Frequency image plots for each electrode, wherethe timecourse of the exploration is indexed by the fixation rank.

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

3.4. Global properties of fERPs

In the previous section we started the investigation with local prop-erties of the fERPs, such as their relation with face processing and target-face detection, and ended with an analysis of more global properties suchas the dependence of the P3 with the length of the trial (where the lengthis the number of fixations performed) -and thus, the number of facesseen-. The dependence of the target detection on the number of previous

80

fixations led us to investigate how the development of the whole trial isencoded in each distractor's fERP, before the target was detected, andhow the target response was built-up.

In order to examine slow changes across the whole trial, we studiedthe relationship between fixation rank (the number of fixations up to theone being considered) and baseline activity -i.e. the activity before thesaccade-when no baseline correction was applied. To investigate theencoding of each stimulus, we also examined the relationship between

Page 9: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

fixation rank and the amplitude of the P1 component with baselinecorrection. Fig. 4A and B shows a strong dependence of the baselineactivity on the fixation rank of the distractor. This dependence appearsmore prominent in the centro-frontal electrodes, although significantcorrelations were observed along midline channels (Pearson's correlationcoefficient R (Fz)¼�0.95, p< 0.000001; R (Cz)¼�0.92, p< 0.00001;R (Pz)¼�0.90, p< 0.00001; R (Oz)¼�0.85, p< 0.0001). In order torule out that these correlations reflected the time elapsed since the startof the trial (which correlates with the fixation rank), we compared thecorrelations of the baseline activity with time using different windowsizes to group fixations along the search -between 100ms and 500ms-.The correlations with time were always smaller than the correlationswith the fixation rank, failing in many cases to reach significance(Fig. S5). This suggests that global changes in baseline activity are relatedto the sequence of fixations as processing units, and not with the meretime elapsed. Conversely, the encoding of each distractor stimulus didnot change with the progression of the trial; as it can be inferred from theconstant amplitude of the P100 as function of fixation rank (Fig. 4C andD; R (Oz)¼�0.27, p¼ 0.33). Thus, there were global changes in baselineand target detection, but not in the encoding of an individual stimulus.

3.5. Global changes in frequency spectrum

Changes of the frequency power with the fixation rank of the dis-tractors, measured as the correlation between these two variables,showed intervals of positive and negative correlations (Fig. 5). Theseintervals, although not significant for all frequencies, fit nicely into thestandard classification of EEG low frequency bands (delta, theta, alpha,and beta) for the different electrodes (Fig. 5A). Significant correlationsappeared in the four frequency bands at different electrodes (Fig 5A, toppanel; p< 0.01, Bonferroni corrected for multiple comparisons for 240tests, i.e. 60 frequency steps x 4 electrodes).

A large positive correlation between theta power and fixation rank,and a decrease in the (upper) alpha band appeared in the occipitalelectrode (Fig. 5B, lower panel). This was also evident in the time-frequency plots (where time is represented in terms of fixation rank;Fig. 5C, lower panel). These changes are usually observed together(Klimesch, 1999), and are sought to reflect increasing demands of visualattention. In contrast, the parietal, central and frontal electrodesexhibited a pronounced negative correlation in the alpha frequency band–i.e. ramping towards the presentation of the target–, but no significantcorrelation on the theta band (Fig. 5B and C). This effect is slightly spreadtowards the beta band, and vanished beyond 23Hz; which is consistentwith the broad frequency range that Donner and Siegel (2011) attributeto the more integrative cognitive processes. In particular, the negativecorrelation of the alpha frequency band is consistent with the findings ofKlimesch and collaborators (Klimesch et al., 1998; Klimesch, 1999), whosuggest that the alpha band must be negatively modulated by the ex-pectancy of an upcoming stimulus.

4. Discussion

We developed a free-viewing paradigm that allowed us to investigatethe brain correlates of unrestricted visual search with natural images. Weshowed that the local dynamics following each stimulus/foveation ex-hibits the fixed-gaze ERP components (P1, N170/VPP, P3). These brainevents are then signatures of the specific unitary operation performed ateach stage of a mental program. A global analysis of the brain activitythroughout the task allowed us to investigate signatures related to theaccumulation of evidence during the task. In particular, we showed anincreased demand of visual attention, changes in the expectancy of thestimulus and robust sequential effects from the analysis of baseline andthe spectral profile. In what follows, we discuss our results in the contextof local dynamics of unitary operations, global dynamics and mentalprograms, their interaction and the implications that our findings mighthave for future concurrent EEG and eye movement studies.

81

4.1. Local dynamics: fERPs during natural tasks

Brain activity elicited after each fixation during visual search hasbeen studied with both synthetic (Kamienkowski et al., 2012a; Brouweret al., 2013; K€orner et al., 2014; U�s�cumli�c and Blankertz, 2016; Wenzelet al., 2016) and natural (Kaunitz et al., 2014) stimuli. These studies havedescribed a robust P3-like potential evoked by the object that theparticipant is looking for. In our experiment we observed a significantdifference between the signals in response to targets and distractors,extending our previous finding using the same stimuli (Kaunitz et al.,2014). Interestingly, in the present study the difference between targetsand distractors became significant about ~100ms earlier (~170ms vs.~250ms). Two possible explanations could account for this difference:Firstly, both protocols differ in the speed-accuracy tradeoff that governedthe search. In our previous experiment, participants were instructed -andbriefly trained-to perform the search slowly; prioritizing the identifica-tion of the fixated face before moving on. In the current experiment,participants were only instructed to search for the targets, which led toshorter fixations. Nevertheless, we observed very fewmisses on the target(situations in which the participant fixated in the target but continuedsearching, as they failed to identify the target), as this occurred in(4.3� 0.6) % of the trials per subject -(7.7� 1.1) trials-. Thus, we canspeculate that, in order to perform the search at this pace, participantsmust take an early decision on whether they have to continue movingtheir eyes or stop. The early detection that we observed in the fERPscould be the brain correlate of this process, although the full identifica-tion could occur later. Secondly, in the present study, as in Kamienkowskiet al. (2012) but not in Kaunitz et al. (2014), we matched the fixations totargets and distractors by the eye movement properties before perform-ing statistical comparisons. This procedure could allow the detection ofmore subtle differences between these signals, which may have other-wise been indistinguishable in Kaunitz et al. (2014).

As in previous studies with faces (Joyce and Rossion, 2005; Rossionand Jacques, 2008), we observed the VPP/N170 potential in response tofaces, which was confirmed by changing the reference from linkedmastoids to average, which led to a shift from a central positive peak(VPP) to a parieto-occipital negativity (N170), with a slight lateralizationto the right (Joyce and Rossion, 2005). These results further support thevalidity of EEG and eye movement co-registration in the study of neuralcorrelates of natural visual processing, and conversely extend the pre-vious knowledge to more natural environments. However, an in-depthunderstanding of face processing in natural environments will requirespecific future experiments using faces and other objects embedded inthe same natural images, to allow a comparison between faces and otherstimuli.

Studies focusing on the early detection of different objects withinrapid presentations of natural complex scenes -in which a cue is pre-sented before the stimulus and rapid responses are required-have shownthat identity stimulus information (contingent upon recognition) is ableto be conveyed quite early, ranging from 150 to 300ms (Carmel andBentin, 2002; Johnson and Olshausen, 2003; VanRullen and Thorpe,2001). In line with this, one recent study reported an early target relatedpotential in free-viewing, but not fixed-gaze conditions (Kamienkowskiet al., 2012a), while another study found that it is possible to identify thetarget location even when fixating on the target for less than 10ms in afree-viewing search, but not in a fixed-gaze condition (Kotowicz et al.,2010). The short timings for target identification can be related toextrafoveal detection. Target identification would trigger a saccade to-wards the target location, or less strictly, it would suggest that there is anongoing computation of the probability to find the target that primes agiven location (Najemnik and Geisler, 2005). This is consistent with anextrafoveal detection of the target in an easy visual search, and with anearly foveal detection in a difficult visual search (Dias et al., 2013). Sincehigh attentional engagement is naturally present in free-viewing, fromfixation onset or even earlier (Corbetta et al., 1998; Melcher and Colby,2008), another possible theoretical explanation for this early

Page 10: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

identification in free-viewing is the temporal modulation of attention(Kamienkowski et al., 2012a, 2012b; Navajas et al., 2014). Indeed, pre-vious behavioral experiments have shown homogeneous performance oftarget detection within almost the entire fixation but, strikingly, confi-dence judgments varied along the fixations. These results suggest that,while it is possible to detect a target accurately in the first tens of mil-liseconds of a fixation, the integration process continues along the wholefixation (Navajas et al., 2014). Moreover, previous studies have shownthat each fixation is consistent with an attentional episode, where sac-cades accentuate the episodic borders (Kamienkowski et al., 2012b), inagreement with a discrete structuring of attention; sampling informationfrom temporal episodes during which several items can access theencoding process (VanRullen, 2016; Wyble et al., 2009, 2011).

Previous EEG experiments have shown that temporal attention has aneffect on the amplitude and latency of the N2 (Correa et al., 2006), the P3component (Miniussi et al., 1999), and on the amplitude of the P1component. Targets appearing at attended moments, close to a cue,evoke a larger P1 (Correa et al., 2006). Furthermore, early componentsseemed to be affected only in highly demanding perceptual processingtasks (Correa et al., 2006). These studies suggest that high attentionalengagement could fasten the processing of the stimulus, and that changeis measurable in the EEG signal as a modulation in early components.Following this idea, we can speculate that early target detection is moreprominent in highly demanding perceptual processing tasks. We previ-ously reported early target detection in tasks involving prolonged fixa-tions when participants were asked to detect a subtle change in asynthetic stimulus (Kamienkowski et al., 2012a), but not when they hadto produce long fixations to find a hidden target face in a crowded scene(Kaunitz et al., 2014). The results presented in this study, with significantearly target detection in free-viewing, might be associated to a higherattentional engagement occurring in a free-viewing visual search taskwith natural images. Additionally, they are in accordance with percep-tual load theories, which state that earlier responses are expected onhigher perceptual load tasks, subtle differences in crowded syntheticstimulus or faster searches in natural scenes; since they all involve higherattentional resources (Lavie et al., 2014).

Furthermore, the early differences found in this natural visual searchparadigm are in line with fast saccade response time (SRT) experiments,in which participants are asked to decide between two stimuli, andrespond making a saccade towards the target as fast as possible. In theseexperiments, participants are able to respond in less than 150ms andstarting as early as around 100ms to detect human faces and animals(Crouzet et al., 2010; Kirchner and Thorpe, 2006).

4.2. Global dynamics: monitoring the progress of the task

We found a significant decrease in the baseline amplitude in central-frontal regions, which followed the progression of the search. Similargradual changes in the baseline were observed in other tasks in whichparticipants required to accumulate some evidence in order to achieve adecision (Lange et al., 2010). In our case, the amount of evidence isdifficult to estimate in a direct way. Current visual search models basedon eye movements tend to predict the sequence of fixations based on theconstruction of saliency maps on static images (i.e. disregarding theamount of evidence gained in each fixation) (Borji and Itti, 2013; Tor-ralba et al., 2006). One interesting exception is the model of Najemnikand Geisler (2005), which estimates how the probability of finding thetarget at the end of the search changes in every fixation (Najemnik andGeisler, 2005). Although the fixation rank is naturally correlated withtime, we found that the correlations between the baseline activity andtime were smaller than those between the baseline activity and fixationrank. This suggests that global changes in baseline activity are indexingthe sequence of processing steps within the task, rather than the timeelapsed since the start of the trial.

To better understand the neural and cognitive processes involved inthe slow dynamics throughout the trial, we explored the contribution of

82

different frequency bands to the EEG signal. In particular, we calculatedthe correlation between the power in narrow bands and the fixation rankto build a spectral profile as proposed by Donner and Siegel (2011)(Donner and Siegel, 2011). Consistent with fixed-gaze studies, the broaddecrease in alpha and increase in occipital theta activity could suggest asurge in the resources involved in visual attention (Klimesch, 1999). Forinstance, an increase in theta activity in the occipital cortex was reportedwhen viewing images that were later remembered (Osipova et al., 2006).A negative modulation in the alpha band has been previously related tothe expectancy of an upcoming stimulus (Klimesch et al., 1998; Klimesch,1999; Freunberger et al., 2008), which comes from an increasing prob-ability of finding the target over time. This is consistent with our results,since we observed a negative correlation in the alpha frequency band -i.e.ramping towards the presentation of the target-mainly over the parietal,central and frontal electrodes. Interestingly, the negative correlation weobserved spread beyond the alpha range, up to 23Hz in thecentro-parietal sites. This broad frequency range is sometimes referred toas “low frequencies” (e.g., see (Donner and Siegel, 2011)) and has beenparticularly linked to cognitive integrative processes.

4.3. Interactions between local and global dynamics

Despite the fact that estimates of “surprise” and “expectancy” cannotbe directly derived from our experiment, we can consider that shortertrials will likely be more associated with surprise. On the other hand, themore fixations a subject made within a trial, the more the target will beexpected. Therefore, we hypothesized that trials that required very fewfixations to find the target will have a larger surprise associated, whilelong trials will have a larger expectancy. Our results suggest a strongereffect of surprise over expectancy in the P3, since the difference in theamplitude favored shorter trials. Moreover, this result was supported bythe more frontal distribution of shorter versus longer trials, consistentwith the observations of Polich (2007) (Polich, 2007). These late changeswere observed despite the fact that the early feedforward visual pro-cessing of each stimulus, tagged by the P1 component, was preservedthroughout the search. Together with the fact that low frequencies, butnot high frequencies, correlated with fixation rank, this suggests thatonly the integrative cognitive processes are related to the progression ofthe search.

An alternative explanation for the influence of ISI on P3 in fixed-gazeexperiments is the Recycling Cycle Hypothesis (Fitzgerald and Picton,1981; Gonsalvez and Polich, 2002), which suggests that the P3 compo-nent is reduced with short ISIs because the system requires time torecover from recent ERP production. In our experiment, this would meanthat the P3 is reduced in later fixations only because the underlyinggeneration processes lack the resources to fully recover within amultiple-fixation-trial. However, this alternative explanation is unlikelyfor three reasons: First, participants only saw one target, which impliesthat the resources related to target processing should be fully availablefor the whole trial. Second, we have previously shown that the infor-mation regarding the salient target in an attentional blink task recoversrapidly in successive fixations, since saccades accentuate episodicboundaries of the temporal episodes (Kamienkowski et al., 2012b). Itfollows that the alternative explanation would be unlikely even if morethan one target was present. Third, resources related to distractor pro-cessing should already vanish before 300ms -according to mainstreamtheories such as the Global Workspace Theory, which proposes earlyprocessing of a new stimulus could start in parallel (Dehaene and Nacc-ache, 2001; Dehaene et al., 2014; Sigman and Dehaene, 2008)-. This isalso in agreement with the episodic theory of attentional deployment,which proposes that information is sampled from temporal episodes ofabout 400ms (Wyble et al., 2009, 2011).

4.4. Towards an integrative framework in natural viewing

From an algorithmic perspective –in line with Marr's levels at which

Page 11: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

Fig. 6. Integrative framework of natural visual search. Schematic framework of the identified electrophysiological signatures and their interactions with keyprocesses in a free-viewing visual search task.

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

the brain as an information-processing device must be understood(Marr, 2010)– the schematic diagram shown in Fig. 6 can be used to putour findings into a more general framework. At the bottom level (visualencoding in Fig. 6), every fixation produces the typical sequences ofvisual ERPs towards a face, including the P1 and N170/VPP, irre-spective of the fixation rank. These fERPs share the topographies andlatencies of their fixed-gaze counterparts (see also (Kamienkowskiet al., 2012a; Kaunitz et al., 2014)), thus the initial feedforward pro-cessing of the stimulus in each fixation seems to be conserved and isindependent of the task. The eye movement control level (motor pre-p./exec in Fig. 6) can be separated into the parallel processing of the“when” and “where” streams plus an inhibition, as proposed by thewell-established model of saccade generation of Findlay and Walker(1999) (Findlay and Walker, 1999). Some existent models alreadyinclude these ideas, for instance, in the CRISP model (Nuthmann et al.,2010), the “when” stream is constantly generating saccades that couldbe inhibited, while the “where” is implemented as a two-stage saccadeprogramming composed by an initial, labile stage that is subject tocancellation (inhibition of the “when” stream), followed by a non-labilestage (Nuthmann et al., 2010; Reichle et al., 1998). Thus, there is acontinuous crosstalk between saccade preparation and visual-cognitiveprocessing, which allows difficulties in online visual and cognitiveprocessing to immediately inhibit (i.e. delay) saccade initiation, leadingto longer fixation durations. The early target detection potential that weobserved could be interpreted as a cancellation signal during the firstlabile stage of the saccade preparation, where the next saccade iscanceled at early latencies leading to longer fixations (Nuthmann et al.,2010). Successful early target detection is followed by a global P3component, reflected in higher and later activity, which also resemblesthe P3 recorded in fixed gaze conditions (Polich, 2007; Kamienkowskiet al., 2012a; Kaunitz et al., 2014).

Interestingly, the preparation of the upcoming saccade could beaccompanied with a modulation of cover attention towards the potentiallanding site. These processes have previously been studied with crowdedscenes in fixed-gaze reaction time experiments (Wolfe, 2007), and insaccade reaction time experiments, which allowed studying the interac-tion between covert attention and saccade execution (Ehinger et al.,2015; Weaver Matthew and Claytonvan Zoest Wieske, 2017). However,it is important to distinguish these processes from the overt allocation ofattention that follows the eye movement. The interplay between covertand overt allocation of attention is one of the fundamental open issues forwhich the co-registration of brain activity and eye movements couldlargely contribute in future studies.

83

At the top level (global integration in Fig. 6), the integration of theinformation about the whole image acquired so far could involve, forinstance, computing the target position probabilities based on existentevidence, which necessarily includes information about the surfacealready explored as well as past events and the current position(Najemnik and Geisler, 2005; Kliegl et al., 2014; Koechlin and Sum-merfield, 2007). These integrative processes also serve to build a generalmodel of the visual scene, and to guide future eye movements, as anothercrosstalk between the saccade preparation and visual-cognitive process-ing. Although it could run on top of visual encoding and saccade prep-aration processes, it should interchange information with them. Hence, itshould be in pace with the fixations. This accumulated evidence isindexed by changes in baseline, which scales with the fixation rank betterthan with time itself, and is further reflected by changes in alpha/thetaband oscillatory activity and modulations in the amplitude of the P3component.

It is well established that human behavior in complex tasks isstrongly shaped by individual differences in certain capacities andstrategies. For instance, measures of visual working memory (VWM)capacity are strongly related to performance in both visual searchwithout eye movements (Luria and Vogel, 2011; Luck and Vogel,2013) and constrained eye movement paradigms (Theeuwes et al.,2009). Also, VWM capacity correlates with slow potentials such as thecontralateral delay activity (CDA) (Vogel et al., 2005) that have asimilar behavior as the baseline changes during search. These esti-mates of VWM capacity are potentially appealing because they havebeen proven to be very robust within a single participant (for instance,they could yield a test-retest reliability of 0,77 after 1.5 years later)and consistent predictors of many complex visual tasks (Luck andVogel, 2013). By comparing the variance in two behavioral dependentvariables, we showed that the major source of variability in thisexperiment were the trials and not the participants. However, futurework on both complex visual search models should include VWM orother performance measures, which could influence markers ofcognition and decision-making, as co-variables.

4.5. Alternative theoretical frameworks

The main focus of the present paper is related to the deployment ofovert visual attention rather than covert visual attention, since eyemovements are an essential part of behaviour. Although these processescould share some mechanisms, these are not necessarily the same(Hulleman and Olivers, 2017). Some of these shared mechanisms could

Page 12: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

be part of the global processes of the task. For instance, the CompetitiveGuided Search model (Moran et al., 2013) -which is an updated versionof the Guided Search model (Wolfe, 2007; Wolfe et al., 2010)-, hasincluded a module that evaluates whether to quit the search or continue.This “quit unit” establishes an accumulation process that runs in parallelwith the actual search, and competes to terminate the task. However,since in our task participants were not able to finish the trial beforefinding the target or reaching the 20 s maximum trial period and therewas not a forced-choice for target present/absent, response times are notas relevant as in other experiments so it is unlikely that a quit unit wouldplay a major role. Interestingly, the integrative framework we presenthere could potentially combine more than one integrative processes inthe global workspace, leaving space for contribution from other theo-retical models.

4.6. Concluding remarks

Our findings contribute to the study of human physiology andcognition in natural environments in three fundamental ways: First, itpresents robust fERPs in unconstrained visual search with real-worldimages. This extends previous fERP work but also highlights potentialdifferences, as early target detection was only previously found withsynthetic stimuli, giving a more complete understanding of how thebrain processes unitary operations in natural viewing. Second, byfocusing on brain oscillations at different frequencies and modulationof evoked components throughout the trial, we were able to confirmthat specific brain signatures, previously identified for fixed-gaze par-adigms as indexes of accumulation of evidence, can be robustly ob-tained in natural viewing. Finally, it allows us to introduce a data-driven framework to link oscillations with the underlying mentalprogram.

The schematic diagram that we have considered here is, of course, asimplification of all the complexities involved in natural viewingeveryday scenes. However, we believe it can be a starting point forconsidering general neurophysiology data-driven frameworks to eluci-date how complex mental operations are performed in natural viewing.

Acknowledgments

We are grateful to Dr Lisandro Kaunitz for invaluable assistance in thecreation of the stimuli and data acquisition and Dr Markus Bauer forhelpful comments on the manuscript. Part of this project was funded bythe EPSRC (UK, EP/1016899/1) and CONICET (Argentina).

Appendix A. Supplementary data

Supplementary data related to this article can be found at https://doi.org/10.1016/j.neuroimage.2018.08.010.

Author contributions

J.E.K., and M.J.I. designed the experiments with contributions fromM.S. A.V. conducted the experiments. J.E.K. and M.J.I. developed theanalysis approach. J.E.K. and A.V. analyzed the data with contributionsfrom M.J.I. J.E.K., and M.J.I. wrote the paper with input from allauthors.

Additional information

Supplementary information is available for this paper.

Conflicts of interest

The authors declare no competing interests.

84

References

Ahmadi, M., Quian Quiroga, R., 2013. Automatic denoising of single-trial evokedpotentials. Neuroimage 66, 672–680. https://doi.org/10.1016/j.neuroimage.2012.10.062.

Bauer, M., Stenner, M.-P., Friston, K.J., Dolan, R.J., 2014. Attentional modulation ofalpha/beta and gamma oscillations reflect functionally distinct processes. J. Neurosci.34, 16117–16125. https://doi.org/10.1523/JNEUROSCI.3474-13.2014.

Bigdely-Shamlo, N., Kreutz-Delgado, K., Kothe, C., Makeig, S., 2013. EyeCatch: data-mining over half a million EEG independent components to construct a fully-automated eye-component detector. In: Engineering in Medicine and Biology Society(EMBC), 2013 35th Annual International Conference of the IEEE. IEEE,pp. 5845–5848. Available: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber¼6610881.

Borji, A., Itti, L., 2013. State-of-the-Art in visual attention modeling. IEEE Trans. PatternAnal. Mach. Intell. 35, 185–207. https://doi.org/10.1109/TPAMI.2012.89.

Brainard, D.H., 1997. The psychophysics toolbox. Spatial Vis. 10, 433–436. https://doi.org/10.1163/156856897X00357.

Brouwer, A.-M., Reuderink, B., Vincent, J., Gerven MAJ van, Erp JBF van, 2013.Distinguishing between target and nontarget fixations in a visual search task usingfixation-related potentials. J. Vis. 13 https://doi.org/10.1167/13.3.17, 17–17.

Carmel, D., Bentin, S., 2002. Domain specificity versus expertise: factors influencingdistinct processing of faces. Cognition 83, 1–29. https://doi.org/10.1016/S0010-0277(01)00162-7.

Corbetta, M., Akbudak, E., Conturo, T.E., Snyder, A.Z., Ollinger, J.M., Drury, H.A., et al.,1998. A common network of functional areas for attention and eye movements.Neuron 21, 761–773. https://doi.org/10.1016/S0896-6273(00)80593-0.

Correa, �A., Lupi�a~nez, J., Madrid, E., Tudela, P., 2006. Temporal attention enhances earlyvisual processing: a review and new evidence from event-related potentials. BrainRes. 1076, 116–128. https://doi.org/10.1016/j.brainres.2005.11.074.

Crouzet, S.M., Kirchner, H., Thorpe, S.J., 2010. Fast saccades toward faces: face detectionin just 100 ms. J. Vis. 10 https://doi.org/10.1167/10.4.16, 16–16.

Dehaene, S., Naccache, L., 2001. Towards a cognitive neuroscience of consciousness:basic evidence and a workspace framework. Cognition 79, 1–37. https://doi.org/10.1016/S0010-0277(00)00123-2.

Dehaene, S., Charles, L., King, J.-R., Marti, S., 2014. Toward a computational theory ofconscious processing. Curr. Opin. Neurobiol. 25, 76–84. https://doi.org/10.1016/j.conb.2013.12.005.

Delorme, A., Makeig, S., 2004. EEGLAB: an open source toolbox for analysis of single-trialEEG dynamics including independent component analysis. J. Neurosci. Meth. 134,9–21. https://doi.org/10.1016/j.jneumeth.2003.10.009.

Deubel, H., Schneider, W.X., 1996. Saccade target selection and object recognition:evidence for a common attentional mechanism. Vis. Res. 36, 1827–1837. https://doi.org/10.1016/0042-6989(95)00294-4.

Dias, J.C., Sajda, P., Dmochowski, J.P., Parra, L.C., 2013. EEG precursors of detected andmissed targets during free-viewing search. J. Vis. 13 https://doi.org/10.1167/13.13.13, 13–13.

Dimigen, O., Valsecchi, M., Sommer, W., Kliegl, R., 2009. Human microsaccade-relatedvisual brain responses. J. Neurosci. 29, 12321–12331. https://doi.org/10.1523/JNEUROSCI.0911-09.2009.

Dimigen, O., Sommer, W., Hohlfeld, A., Jacobs, A.M., Kliegl, R., 2011. Coregistration ofeye movements and EEG in natural reading: analyses and review. J. Exp. Psychol.:General. American Psychological Association 552. https://doi.org/10.1037/a0023885.

Donner, T.H., Siegel, M., 2011. A framework for local cortical oscillation patterns. TrendsCognit. Sci. 15, 191–199. https://doi.org/10.1016/j.tics.2011.03.007.

Donner, T.H., Siegel, M., Oostenveld, R., Fries, P., Bauer, M., Engel, A.K., 2007.Population activity in the human dorsal pathway predicts the accuracy of visualmotion detection. J. Neurophysiol. 98, 345–359. https://doi.org/10.1152/jn.01141.2006.

Ehinger, B.V., K€onig, P., Ossand�on, J.P., 2015. Predictions of visual content across eyemovements and their modulation by inferred information. J. Neurosci. 35,7403–7413. https://doi.org/10.1523/JNEUROSCI.5114-14.2015.

Engbert, R., Kliegl, R., 2003. Microsaccades uncover the orientation of covert attention.Vis. Res. 43, 1035–1045. https://doi.org/10.1016/S0042-6989(03)00084-1.

Findlay, J.M., 1997. Saccade target selection during visual search. Vis. Res. 37, 617–631.https://doi.org/10.1016/S0042-6989(96)00218-0.

Findlay, J.M., Walker, R., 1999. A model of saccade generation based on parallelprocessing and competitive inhibition. Behav. Brain Sci. 22, 661–674 discussion674–721.

Findlay, J.M., Brown, V., Gilchrist, I.D., 2001. Saccade target selection in visual search:the effect of information from the previous fixation. Vis. Res. 41, 87–95.

Fitzgerald, P.G., Picton, T.W., 1981. Temporal and sequential probability in evokedpotential studies. Can. J. Psychol. 35, 188–200.

Foxe, J.J., Simpson, G.V., Ahlfors, S.P., 1998. Parieto-occipital approximately 10 Hzactivity reflects anticipatory state of visual attention mechanisms. Neuroreport 9,3929–3933.

Freunberger, R., Klimesch, W., Griesmayr, B., Sauseng, P., Gruber, W., 2008. Alpha phasecoupling reflects object recognition. Neuroimage 42, 928–935. https://doi.org/10.1016/j.neuroimage.2008.05.020.

Fujii, N., Graybiel, A.M., 2005. Time-varying covariance of neural activities recorded instriatum and frontal cortex as monkeys perform sequential-saccade tasks. Proc. Natl.Acad. Sci. U. S. A. 102, 9032–9037. https://doi.org/10.1073/pnas.0503541102.

Gonsalvez, C.J., Polich, J., 2002. P300 amplitude is determined by target-to-targetinterval. Psychophysiology 39, 388–396. https://doi.org/10.1017/S0048577201393137.

Page 13: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

Gross, J., Schmitz, F., Schnitzler, I., Kessler, K., Shapiro, K., Hommel, B., et al., 2004.Modulation of long-range neural synchrony reflects temporal limitations of visualattention in humans. Proc. Natl. Acad. Sci. U. S. A. 101, 13050–13055. https://doi.org/10.1073/pnas.0404944101.

Henderson, J.M., 2003. Human gaze control during real-world scene perception. TrendsCognit. Sci. 7, 498–504.

Hipp, J.F., Engel, A.K., Siegel, M., 2011. Oscillatory synchronization in large-scale corticalnetworks predicts perception. Neuron 69, 387–396. https://doi.org/10.1016/j.neuron.2010.12.027.

Hulleman, J., Olivers, C.N.L., 2017. On the brink: the demise of the item in visual searchmoves closer. Behav. Brain Sci. 40 https://doi.org/10.1017/S0140525X16000364.

Jeffreys, D.A., 1989. A face-responsive potential recorded from the human scalp. Exp.Brain Res. 78, 193–202. https://doi.org/10.1007/BF00230699.

Johnson, J.S., Olshausen, B.A., 2003. Timecourse of neural signatures of objectrecognition. J. Vis. 3 https://doi.org/10.1167/3.7.4, 4–4.

Jokisch, D., Jensen, O., 2007. Modulation of gamma and alpha activity during a workingmemory task engaging the dorsal or ventral stream. J. Neurosci. 27, 3244–3251.https://doi.org/10.1523/JNEUROSCI.5399-06.2007.

Joyce, C., Rossion, B., 2005. The face-sensitive N170 and VPP components manifest thesame brain processes: the effect of reference electrode site. Clin. Neurophysiol. 116,2613–2631. https://doi.org/10.1016/j.clinph.2005.07.005.

Jubault, T., Ody, C., Koechlin, E., 2007. Serial organization of human behavior in theinferior parietal cortex. J. Neurosci. 27, 11028–11036. https://doi.org/10.1523/JNEUROSCI.1986-07.2007.

Kamienkowski, J.E., Ison, M.J., Quiroga, R.Q., Sigman, M., 2012. Fixation-relatedpotentials in visual search: a combined EEG and eye tracking studyFixation-relatedpotentials in visual search. J. Vis. 12 https://doi.org/10.1167/12.7.4, 4–4.

Kamienkowski, J.E., Navajas, J., Sigman, M., 2012. Eye movements blink the attentionalblink. J. Exp. Psychol. Hum. Percept. Perform. 38, 555–560. https://doi.org/10.1037/a0027729.

Kaunitz, L.N., Kamienkowski, J.E., Varatharajah, A., Sigman, M., Quiroga, R.Q.,Ison, M.J., 2014. Looking for a face in the crowd: fixation-related potentials in an eye-movement visual search task. Neuroimage 89, 297–305. https://doi.org/10.1016/j.neuroimage.2013.12.006.

Kirchner, H., Thorpe, S.J., 2006. Ultra-rapid object detection with saccadic eyemovements: visual processing speed revisited. Vis. Res. 46, 1762–1776. https://doi.org/10.1016/j.visres.2005.10.002.

Kliegl, R., Dambacher, M., Dimigen, O., Sommer, W., 2014. Oculomotor control, brainpotentials, and timelines of word recognition during natural reading. In: Horsley, M.,Eliot, M., Knight, B.A., Reilly, R. (Eds.), Current Trends in Eye Tracking Research.Springer International Publishing, pp. 141–155. Available: http://link.springer.com/chapter/10.1007/978-3-319-02868-2_10.

Klimesch, W., 1999. EEG alpha and theta oscillations reflect cognitive and memoryperformance: a review and analysis. Brain Res. Rev. 29, 169–195. https://doi.org/10.1016/S0165-0173(98)00056-3.

Klimesch, W., Doppelmayr, M., Russegger, H., Pachinger, T., Schwaiger, J., 1998. Inducedalpha band power changes in the human EEG and attention. Neurosci. Lett. 244,73–76. https://doi.org/10.1016/S0304-3940(98)00122-0.

Koechlin, E., Summerfield, C., 2007. An information theoretical approach to prefrontalexecutive function. Trends Cognit. Sci. 11, 229–235. https://doi.org/10.1016/j.tics.2007.04.005.

K€orner, C., Braunstein, V., Stangl, M., Schl€ogl, A., Neuper, C., Ischebeck, A., 2014.Sequential effects in continued visual search: using fixation-related potentials tocompare distractor processing before and after target detection. Psychophysiology51, 385–395. https://doi.org/10.1111/psyp.12062.

Kotowicz, A., Rutishauser, U., Koch, C., 2010. Time course of target recognition in visualsearch. Front. Hum. Neurosci. 4 https://doi.org/10.3389/fnhum.2010.00031.

Lange, FP de, Jensen, O., Dehaene, S., 2010. Accumulation of evidence during sequentialdecision making: the importance of top–down factors. J. Neurosci. 30, 731–738.https://doi.org/10.1523/JNEUROSCI.4080-09.2010.

Lavie, N., Beck, D.M., Konstantinou, N., 2014. Blinded by the load: attention, awarenessand the role of perceptual load. Phil Trans R Soc B 369, 20130205. https://doi.org/10.1098/rstb.2013.0205.

L�eger, P.-M., S�enecal, S., Courtemanche, F., Guinea, AO de, Titah, R., Fredette, M., et al.,2014. Precision is in the eye of the beholder: application of eye fixation-relatedpotentials to information systems research. J. Assoc. Inf. Syst. Online 15. Available:http://aisel.aisnet.org/jais/vol15/iss10/3.

Luck, S.J., Vogel, E.K., 2013. Visual working memory capacity: from psychophysics andneurobiology to individual differences. Trends Cognit. Sci. 17 (8), 391–400.

Ludwig, C.J.H., Gilchrist, I.D., 2002. Stimulus-driven and goal-driven control over visualselection. J. Exp. Psychol. Hum. Percept. Perform. 28, 902–912. https://doi.org/10.1037/0096-1523.28.4.902.

Luria, R., Vogel, E.K., 2011. Visual search demands dictate reliance on working memorystorage. J. Neurosci. 31, 6199–6207.

Mack, S.C., Eckstein, M.P., 2011. Object co-occurrence serves as a contextual cue to guideand facilitate visual search in a natural viewing environment. J. Vis. 11 https://doi.org/10.1167/11.9.9, 9–9.

Maris, E., Oostenveld, R., 2007. Nonparametric statistical testing of EEG- and MEG-data.J. Neurosci. Meth. 164, 177–190. https://doi.org/10.1016/j.jneumeth.2007.03.024.

Marr, D., 2010. Vision: a Computational Investigation into the Human Representation andProcessing of Visual Information, Second. MIT Press.

Melcher, D., Colby, C.L., 2008. Trans-saccadic perception. Trends Cognit. Sci. 12,466–473. https://doi.org/10.1016/j.tics.2008.09.003.

Mergenthaler, K., Engbert, R., 2007. Modeling the control of fixational eye movementswith neurophysiological delays. Phys. Rev. Lett. 98, 138104. https://doi.org/10.1103/PhysRevLett.98.138104.

85

Miniussi, C., Wilding, L.E., Coull, T.J., Nobre, et al., 1999. Orienting attention intimeModulation of brain potentials. Brain 122, 1507–1518. https://doi.org/10.1093/brain/122.8.1507.

Mognon, A., Jovicich, J., Bruzzone, L., Buiatti, M., 2011. ADJUST: an automatic EEGartifact detector based on the joint use of spatial and temporal features.Psychophysiology 48, 229–240.

Moran, R., Zehetleitner, M., Müller, H.J., Usher, M., 2013. Competitive guided search:meeting the challenge of benchmark RT distributions. J. Vis. 13 https://doi.org/10.1167/13.8.24, 24–24.

Moro, S.I., Tolboom, M., Khayat, P.S., Roelfsema, P.R., 2010. Neuronal activity in thevisual cortex reveals the temporal order of cognitive operations. J. Neurosci. 30,16293–16303. https://doi.org/10.1523/JNEUROSCI.1256-10.2010.

Najemnik, J., Geisler, W.S., 2005. Optimal eye movement strategies in visual search.Nature 434, 387–391. https://doi.org/10.1038/nature03390.

Navajas, J., Ahmadi, M., Quiroga, R.Q., 2013. Uncovering the mechanisms of consciousface perception: a single-trial study of the N170 responses. J. Neurosci. 33,1337–1343. https://doi.org/10.1523/JNEUROSCI.1226-12.2013.

Navajas, J., Sigman, M., Kamienkowski, J.E., 2014. Dynamics of visibility, confidence,and choice during eye movements. J. Exp. Psychol. Hum. Percept. Perform. 40,1213–1227. https://doi.org/10.1037/a0036321.

Nuthmann, A., Smith, T.J., Engbert, R., Henderson, J.M., 2010. CRISP: a computationalmodel of fixation durations in scene viewing. Psychol. Rev. 117, 382–405. https://doi.org/10.1037/a0018924.

Oostenveld, R., Fries, P., Maris, E., Schoffelen, J.-M., 2011. FieldTrip: open sourcesoftware for advanced analysis of MEG, EEG, and invasive electrophysiological data.Intell Neurosci. https://doi.org/10.1155/2011/156869, 2011: 1:1–1:9.

Osipova, D., Takashima, A., Oostenveld, R., Fern�andez, G., Maris, E., Jensen, O., 2006.Theta and gamma oscillations predict encoding and retrieval of declarative memory.J. Neurosci. 26, 7523–7531. https://doi.org/10.1523/JNEUROSCI.1948-06.2006.

Ossand�on, J.P., Helo, A.V., Montefusco-Siegmund, R., Maldonado, P.E., 2010.Superposition model predicts EEG occipital activity during free viewing of naturalscenes. J. Neurosci. 30, 4787–4795. https://doi.org/10.1523/JNEUROSCI.5769-09.2010.

Otero-Millan, J., Troncoso, X.G., Macknik, S.L., Serrano-Pedraza, I., Martinez-Conde, S.,2008. Saccades and microsaccades during visual fixation, exploration, and search:foundations for a common saccadic generator. J. Vis. 8 https://doi.org/10.1167/8.14.21, 21–21.

Pelli, D.G., 1997. The VideoToolbox software for visual psychophysics: transformingnumbers into movies. Spatial Vis. 10, 437–442. https://doi.org/10.1163/156856897X00366.

Pesaran, B., Nelson, M.J., Andersen, R.A., 2008. Free choice activates a decision circuitbetween frontal and parietal cortex. Nature 453, 406–409. https://doi.org/10.1038/nature06849.

Platt, M.L., Glimcher, P.W., 1999. Neural correlates of decision variables in parietalcortex. Nature 400, 233–238. https://doi.org/10.1038/22268.

Polich, J., 2007. Updating P300: an integrative theory of P3a and P3b. Clin.Neurophysiol. 118, 2128–2148. https://doi.org/10.1016/j.clinph.2007.04.019.

Quian Quiroga, R., 2000. Obtaining single stimulus evoked potentials with waveletdenoising. Phys. Nonlinear Phenom. 145, 278–292. https://doi.org/10.1016/S0167-2789(00)00116-0.

Reichle, E.D., Pollatsek, A., Fisher, D.L., Rayner, K., 1998. Toward a model of eyemovement control in reading. Psychol. Rev. 105, 125–157.

Roelfsema, P.R., 2005. Elemental operations in vision. Trends Cognit. Sci. 9, 226–233.https://doi.org/10.1016/j.tics.2005.03.012.

Rossion, B., Jacques, C., 2008. Does physical interstimulus variance account for earlyelectrophysiological face sensitive responses in the human brain? Ten lessons on theN170. Neuroimage 39, 1959–1979. https://doi.org/10.1016/j.neuroimage.2007.10.011.

Savage, S.W., Potter, D.D., Tatler, B.W., 2013. Does preoccupation impair hazardperception? A simultaneous EEG and Eye Tracking study. Transport. Res. F TrafficPsychol. Behav. 17, 52–62. https://doi.org/10.1016/j.trf.2012.10.002.

Siegel, M., Donner, T.H., Oostenveld, R., Fries, P., Engel, A.K., 2008. Neuronalsynchronization along the dorsal visual pathway reflects the focus of spatialattention. Neuron 60, 709–719. https://doi.org/10.1016/j.neuron.2008.09.010.

Sigman, M., Dehaene, S., 2008. Brain mechanisms of serial and parallel processing duringdual-task performance. J. Neurosci. 28, 7585–7598. https://doi.org/10.1523/JNEUROSCI.0948-08.2008.

Simola, J., Torniainen, J., Moisala, M., Kivikangas, M., Krause, C.M., 2013. Eye movementrelated brain responses to emotional scenes during free viewing. Front. Syst.Neurosci. 7 https://doi.org/10.3389/fnsys.2013.00041.

Simola, J., Le Fevre, K., Torniainen, J., Baccino, T., 2015. Affective processing in naturalscene viewing: valence and arousal interactions in eye-fixation-related potentials.Neuroimage 106, 21–33. https://doi.org/10.1016/j.neuroimage.2014.11.030.

Squires, K.C., Donchin, E., Herning, R.I., McCarthy, G., 1977. On the influence of taskrelevance and stimulus probability on event-related-potential components.Electroencephalogr. Clin. Neurophysiol. 42, 1–14. https://doi.org/10.1016/0013-4694(77)90146-8.

Staudigl, T., Hartl, E., Noachtar, S., Doeller, C.F., Jensen, O., 2017. Saccades are phase-locked to alpha oscillations in the occipital and medial temporal lobe duringsuccessful memory encoding. PLoS Biol. 15 https://doi.org/10.1371/journal.pbio.2003404 e2003404.

Theeuwes, J., Belopolsky, A., Olivers, C.N., 2009. Interactions between working memory,attention and eye movements. Acta Psychol. 132 (2), 106–114.

Thut, G., Nietzel, A., Brandt, S.A., Pascual-Leone, A., 2006. α-Bandelectroencephalographic activity over occipital cortex indexes visuospatial attention

Page 14: Parsing a mental program: Fixation-related brain ...users.df.uba.ar/juank/files/Kamienkowski2018NeuroImage.pdf · Parsing a mental program: Fixation-related brain signatures of unitary

J.E. Kamienkowski et al. NeuroImage 183 (2018) 73–86

bias and predicts visual target detection. J. Neurosci. 26, 9494–9502. https://doi.org/10.1523/JNEUROSCI.0875-06.2006.

Torralba, A., Oliva, A., Castelhano, M.S., Henderson, J.M., 2006. Contextual guidance ofeye movements and attention in real-world scenes: the role of global features inobject search. Psychol. Rev. 113, 766–786. https://doi.org/10.1037/0033-295X.113.4.766.

Ullman, S., 1984. Visual routines. Cognition 18, 97–159. https://doi.org/10.1016/0010-0277(84)90023-4.

U�s�cumli�c, M., Blankertz, B., 2016. Active visual search in non-stationary scenes: copingwith temporal variability and uncertainty. J. Neural. Eng. 13, 016015. https://doi.org/10.1088/1741-2560/13/1/016015.

VanRullen, R., 2016. Perceptual cycles. Trends Cognit. Sci. 20, 723–735. https://doi.org/10.1016/j.tics.2016.07.006.

VanRullen, R., Thorpe, S.J., 2001. The time course of visual processing: from earlyperception to decision-making. J. Cognit. Neurosci. 13, 454–461. https://doi.org/10.1162/08989290152001880.

Vogel, E.K., McCollough, A.W., Machizawa, M.G., 2005. Neural measures revealindividual differences in controlling access to working memory. Nature 438 (7067),500.

Weaver Matthew, D., Clayton, Hickey, van Zoest Wieske, 2017. The impact of salienceand visual working memory on the monitoring and control of saccadic behavior: aneye-tracking and EEG study. Psychophysiology 54, 544–554. https://doi.org/10.1111/psyp.12817.

Wenzel, M.A., Golenia, J.-E., Blankertz, B., 2016. Classification of eye fixation relatedpotentials for variable stimulus saliency. Front. Neurosci. 10 https://doi.org/10.3389/fnins.2016.00023.

Wolfe, J.M., 2007. Guided Search 4.0. Integrated Models of Cognitive Systems. OxfordUniversity Press.

86

Wolfe, J.M., Palmer, E.M., Horowitz, T.S., 2010. Reaction time distributions constrainmodels of visual search. Vis. Res. 50, 1304–1311. https://doi.org/10.1016/j.visres.2009.11.002.

Woods, D.L., Courchesne, E., Hillyard, S.A., Galambos, R., 1980. Recovery cycles of event-related potentials in multiple detection tasks. Clin. Neurophysiol. 50, 335–347.https://doi.org/10.1016/0013-4694(80)90001-2.

Worden, M.S., Foxe, J.J., Wang, N., Simpson, G.V., 2000. Anticipatory biasing ofvisuospatial attention indexed by retinotopically specific α-bankelectroencephalography increases over occipital cortex. J. Neurosci. 20. RC63–RC63.

Wyble, B., Bowman, H., Nieuwenstein, M., 2009. The attentional blink provides episodicdistinctiveness: sparing at a cost. J. Exp. Psychol. Hum. Percept. Perform. 35,787–807. https://doi.org/10.1037/a0013902.

Wyble, B., Potter, M.C., Bowman, H., Nieuwenstein, M., 2011. Attentional episodes invisual perception. J. Exp. Psychol. Gen. 140, 488–505. https://doi.org/10.1037/a0023612.

Yang, T., Shadlen, M.N., 2007. Probabilistic reasoning by neurons. Nature 447,1075–1080. https://doi.org/10.1038/nature05852.

Yuval-Greenberg, S., Tomer, O., Keren, A.S., Nelken, I., Deouell, L.Y., 2008. Transientinduced gamma-band response in EEG as a manifestation of miniature saccades.Neuron 58, 429–441. https://doi.org/10.1016/j.neuron.2008.03.027.

Zylberberg, A., Dehaene, S., Roelfsema, P.R., Sigman, M., 2011. The human Turingmachine: a neural framework for mental programs. Trends Cognit. Sci. 15, 293–300.https://doi.org/10.1016/j.tics.2011.05.007.

Zylberberg, A., Paz, L., Roelfsema, P.R., Dehaene, S., Sigman, M., 2013. A neuronal devicefor the control of multi-step computations. Pap. Phys. 5, 1–15. https://doi.org/10.4279/PIP.050006.