Social-Status Ranking: A Hidden Channel to Gender Inequality … · 2018. 10. 18. · ranking among the other participants in the group. This allows the peer to compare performances

Social-StatusRanking:

AHiddenChanneltoGenderInequalityunderCompetition

ArthurSchram,JordiBrandts,KlaritaGërxhani

January14,2018

AbstractCompetition involves twomaindimensions, a rivalry for resources and the ranking of relativeperformance. If socially recognized, the latter yields a ranking in terms of social status. Therivalry for resources resulting from competitive incentives has been found tonegatively affectwomen’sperformancerelativetothatofmen.However,littleisknownaboutgenderdifferencesin the performance consequences of social-status ranking. In our experimentswe introduce anoveldesignthatallowsustoisolatetheeffectsofstatusrankingfromthosecausedbyarivalryfor resources. Subjectsdoa time-limited taskwhere theyneed to search fornumbersandaddthem up. Performance is straightforwardly measured by the number of correct summations.When there is no status ranking we find no gender differences in the number of attemptedsummations or in performance. By contrast, when there is status ranking men significantlyincrease the number of attempted summations aswell as the number of correct summations.Remarkably, when women are subjected to status ranking, they significantly decrease thenumberof attempted summations.Thenet result is striking.With status rankingmenattemptmoresummationsandcorrectlysolvemanymorethanwomen.Thesedifferencesaremarkedlylarge and statistically highly significant. Our results suggest that increased participation incompetitiveenvironmentscouldharmwomen’slabormarketsuccessalongahiddenchannel. Keywords: Status, competition, gender, experiments JEL codes: C91, J16 Acknowledgments Muchoftheworkreportedinthispaperwasdonewhilethefirstandthirdauthorswerevisitingthe University of Pompeu Fabra and the Institut d’Anàlisi Econòmica in Barcelona. We aregrateful to both institutions for their hospitality. We also thank the Research Priority AreaBehavioral Economics of theUniversity ofAmsterdam, the SpanishMinistry of Economics andCompetitiveness throughGrant:ECO2014-59302-Pand through theSeveroOchoaProgram forCentersofExcellenceinR&D(SEV2015-0563)andtheGeneralitatdeCatalunya(Grant:2014SGR510)forfinancialsupport,VeronicaBenet-MartinezforprovidingusaccesstothelaboratoryoftheUniversitat Pompeu Fabra inBarcelona, andPablo Lopez-Aguilar, EvaMaciocco, Elia SolerPastor,SilviaSorianoandImmaTrianoforhelpinorganizingtheexperiments.Wearegratefultoseminar participants at University Ca’ Foscari in Venice, University of Padua, University ofAmsterdam,HeidelbergUniversity,EuropeanUniversityInstitute,andtheUniversidadCarlosIIIdeMadrid,andJosBosch,ThomasBuser,MariiPaskov,ChristinaRott,AljazUle,andMatthijsvanVeelenforcommentsatvariousstagesofthisproject. Authors Arthur Schram (corresponding author)

Jordi Brandts Klarita Gërxhani

Robert Schumann Center for Advanced Studies, EUI, Italy and CREED, Amsterdam School of Economics University of Amsterdam, P.O. Box 15867 1001 NU Amsterdam The Netherlands

Institut d’Anàlisi Econòmica (CSIC) and Barcelona GSE Campus UAB 08193 Bellaterra (Barcelona) Spain

Department of Political and Social Sciences European University Institute Via dei Roccettini 9 50014 San Domenico di Fiesole (FI), Italy

phone +31-20-525.4252 [email protected]



1

1.Introduction

Gender differences in behavior under competitive pressures on the one hand,

andinattitudestowardscompetitionontheother,havebeenrecognizedforover

adecadenow(Gneezyetal.2003,NiederleandVesterlund2007,Balafoutasand

Sutter2012,Wozniaketal.2014,Brandtsetal.2015).Theexistingliteraturehas

predominantly focusedononeparticulardimensionof competition,which is a

rivalry for resources (Stigler1987).There is,however, anotherdimension that

seems to have escaped scholarly attention. Competition typically entails a

rankingofrelativeperformance,sincehigh-rankingperformancedeterminesthe

winner(s) in competitive environments. If socially recognized, such a perfor-

mancerankingyieldsarankingintermsofsocialstatus,asdefinedbyBalletal.

(2001). This ‘social’ aspect of the recognition is important. If a ranking is only

privatelyknown,thennosocialstatusisinvolved.

Competition often creates a social-status ranking amongst the competitors.

For example, competition for highly regarded jobs or promotions involves a

rivalry for resourceswhere some people are successful and others not; but it

also implies applicants being ranked relative to others with the successful

applicant obtaining higher social status than thosewho did not get the job or

promotion.Thisrankingissocialbecausethesuccessfulapplicantisknownand

theemployer(andoftenothers)knowsthosewhodidnotsucceed.1

Inthispaperwestudygenderdifferencesinthereactiontostatusrankingin

isolation from the rivalry dimension of competition. In particular,we focus on

how performance is affected by the anticipation that onewill be compared to

othersbyapeerandcomparethis tocaseswherenosuchsocialrankingtakes

place.

Little isknownabouttheconsequencesforgender inequalityofthestatus-

ranking dimension of competition as such. In natural environments rivalry for

resourcesandstatusrankingaretypicallyinterlinked.However,therearemany

instances inwhichthestatusranking ismuchmoresalientthaninothers.This

holds, forexample, inprofessions thatareat leastpartiallyprotected from the

market like the judiciary, themilitary, NGOs, the churches and universities. In

1Intheremainder,wewilloftenusethesingleterm‘status’whenreferringtosocialstatus.

2

theseorganizationspeopleinhighpositionstypicallyenjoyhighstatus,whereas

the payoff differences with respect to people in lower positions are often not

large.2

Evenifonecouldnotdistinguishinthefieldbetweenrivalryforresourcesand

status ranking, the two are in principle distinct phenomena that can have

differential effects and, hence, could affect men’s and women’s behavior

differently. A better understanding of gender differences in performance

thereforerequiresananalyticaldistinctionbetween the twodimensions.Aside

fromimprovingourunderstanding,itisimportanttonotethatthisdistinctionis

alsohighly relevant fromapolicyperspective.The likelihoodof successofany

policyaimingtodiminishthegendergapmaydependverymuchonwhethera

rivalry for resources or status ranking are causing performance differences.

Considerapolicyaiming to reduce the rivalry for resourceswhilemaintaining

the social ranking of performance. An organization could, for example, reduce

the restrictions on the number of promotions and base them solely onmerit,

independently of how many others are promoted. Being promoted will then

increaseone’ssocialstatuswithoutreducingotherhighperformers’chances,i.e

withouttherebeinganyrivalryforresources.Ifthecauseofthegendergaplies

intheeffectsofsocialranking,thenthispolicyislikelytofail.

Itiswelldocumentedthatattitudestowardsstatusdifferacrossgender,with

menusually found to attributemore importance to status thanwomen (Frank

1999,Carlssonetal.2009,MujcicandFrijters2013),thoughthereversehasalso

beenreported(Johansson-Stenmanetal,2002,Alpizaretal.2005).Here,wedo

notfocusontheimportanceattributedtostatusperse.Instead,weaddressthe

complementary matter of gender differences in performance given that one

knows that a performance comparison will take place that will reflect one’s

status ranking. The anticipation of a social ranking has been shown to affect

performance (De Botton 2004, Wilkinson and Picket 2010), but gender

differences in this effect have not been addressed. What we do know is that,

whenmenandwomenare´forced´tocompeteforresourcesandtherankingof

performanceisnotmadesalient(i.e.,thestatusdimensionofcompetitionisnot 2Of course, the status related to certainpositionsmightyield futuremonetarybenefits. Statusperse,however,canbean importantmotivatingfactor forperformance.Forevidencefromthefield,seeBlanes-i-Vidal&Nossol(2011)andBarankay(2012).

3

obvious),thenperformancediffersacrossgenderforsomeenvironmentsbutnot

for others (Niederle and Vesterlund 2011). This leaves open the question

whether the performance differs between men and women when the status-

rankingdimensionismoresalient.

We use laboratory experiments to isolate the effects of status ranking. Our

designmakesitpossibletoholdconstanttherivalry-for-resourcesdimensionof

competitionand tovary thedimensionweare interested in.Ourexperimental

designhastwotreatments,differingonlyinthesecondofthreeparts.3Forboth

treatments, part 1 consists of a task where participants’ monetary payoff is

based purely on the individual score (i.e., performance), so that there is no

competitive aspect to the incentive scheme. There are two groups of

participants. One group does the task of part 1 and then skips part 2. Their

performanceonthetaskservesasabenchmarktowhichwecomparethatofthe

participants intheothergroup.Theparticipants intheothergroupalsodothe

taskinpart1andtheninpart2havetoreporttheirscorestoapeerseatedina

separateoffice.Thispeerdoesnotknowwhattaskwasundertaken.

We conduct two treatments in a between-subject design. In the ‘Status

Ranking’ treatment (SR), each participant in part 2 individually and privately

reportstothesamepeerand(truthfully)readsaloudhis/herscoreaswellasthe

ranking among the other participants in the group. This allows the peer to

compareperformancesinthetask.Thisparticularwayofmakingtherankpublic

(‘social’)aimsatcreatingsocialrecognitionbymakingitsalientandtangibleto

participants. As argued above, status ranking does not stem from receiving

feedbackaboutrelativeposition;itistherecognitionofone’srankingbyothers

thatcreatesasocialstatus.

Inthe‘Conformity’treatment(CF),eachparticipantreportstoadifferentpeer

and (truthfully) reads aloud the score, but not the rank. This treatment

distinctionuses the fact that status is inherentlypositional to isolate themere

effects of having to report one’s result to a stranger from the effects of social-

statusranking,i.e.,beingcomparedtoothersbyastranger(apossibilitypointed

outbyHeffetzandFrank2008).Importantly,inbothtreatmentsallparticipants 3Part3 consists of dictator games (Hoffmanet al. 1994).Thiswasdesigned to investigate theconsequencesofhavingbeenpubliclyrankedforsubsequentbargainingenvironments.Becauseitisbeyondthescopeofthepaper,thispartisdescribedandanalyzedinappendixB.

4

who have to report to a peer are informed about this before starting on the

summationtask.

Ourresultsshowmarkedlydistinctoutcomesformenandwomen.Forthose

participantswhodonothavetoreporttoapeerandforthoseunderconformity

(CF),genderdifferencesinperformancearesmallandinsignificant.Incontrast,

understatusranking(SR)menattemptmany(andsignificantly)moresumma-

tionsandsolvemanymorecorrectly thanwomen.Whenwomenknowbefore-

handthatasocialrankingoftheirperformancewilltakeplace,theyreducethe

number of attempted summations. In this sense men perform better than

women.Wecanunequivocallyattributetheobservedperformancedifferencesto

thesocial ranking,becausenogenderdifference isobservedwhen theydonot

report theirscore,norunderconformity,whereparticipantsreport theirscore

toathirdpartywhocannotcomparethisscoretothatofothers.

Theremainderofthispaper isorganizedasfollows.Thenextsectionbriefly

reviews the literatures on gender differences in preferences for competition,

stereotypethreat,andstatusrankingsandrelatesthemtothisstudy.Section3

presentsour experimentaldesignandprocedures, and section4describesour

results.Aconcludingdiscussionisofferedinsection5.

2.StateoftheArt

There is by now an extensive literature on gender differences in behavior in

relationtocompetition(foroverviews,seeCrosonandGneezy2009orNiederle

andVesterlund2011). This has addressedbothperformancedifferenceswhen

men andwomen compete and genderdifferences in thewillingness to enter a

competitive environment. In this literature, the focus is on the rivalry-for-

resources aspect of competition. A competitive environment typically involves

one or a few of the best performers obtaining amonetary prize, whereas the

otherparticipantsdonotearnanything.

Regardinggenderdifferencesinbehaviorundercompetitivepressures,afirst

influential study(Gneezyetal.2003)shows foramaze-solving task thatwhen

forced to compete for resourceswomen do not perform better than in a non-

competitive environment where earnings are based solely on individual

5

performance. In contrast, such competition strongly improves performance by

men.Thisresultisonlyobservedwhenmenandwomenparticipateinamixed-

gender competition, however. A similar effect is observed when 10-year olds

competeinrunningcontests(GneezyandRustichini2004).4Withrespecttothe

issueofgenderdifferences inattitudes towardscompetition, theseminalwork

by Niederle and Vesterlund (2007) establishes that women have a lower

willingnesstoentercompetitionthandomen.

The firststudies linkingexperimentalmeasuresofcompetitiveness toactual

education and labor market outcomes have only recently started to appear.

Theseshowthat(differencesin)competitivenesshelpexplainwhywomensort

out of jobswith competitive compensation regimes (Flory et al. 2014);predict

whetherChinesestudentschoosetoparticipateinacompetitiveentryexamfor

prestigious universities (Zhang 2013); predict future salary expectations of

Americancollegestudents(Reubenetal.2017);andcanpartiallyexplaingender

differencesinacademiccareerchoicesofDutchhighschoolstudents(Buseretal.

2014).5

To the best of our knowledge, nothing is yet known about the differential

gender impact of the status-ranking dimension of competition. In previous

studies,thestatusrankingaspectofcompetitionwasinasense‘hidden’,withthe

focus being primarily on the possibility ofwinning amonetary prize by being

amongstthebestperformers.

Thepsychologyliteraturehasofferedvariousexplanationsforeffectsofstatus

rankingperseandforgenderdifferencesintheseeffects.Webrieflydiscussthe

related conceptsof ‘social evaluative threats’ and ‘stereotype threat’.A concei-

vableeffectofstatusrankingforbothmenandwomenisthatitcreatesanxiety

aboutananticipatedcomparison.Thisanxietycanbecausedby‘socialevaluative

threats’, i.e., situations where the social self in humans is endangered. Such

threatsgiverise to large levelsof individualcortisolresponsesduetoa fearof

4 Subsequent research has shown that these performance effects depend on the task underconsideration(Güntheretal.2010,Shurchkov2012,Bohnetetal.2016).5 Various policies have been suggested to address the gender gap in entry into competition.Theseincludequota(BalafoutasandSutter2012,Niederleetal.2013),theprovisionoffeedbackon relative performance (Wozniak et al. 2014), reduced time pressure (Shurchkov 2012),participation in teams (Dargnies 2012), advice (Brandts et al. 2015), and ‘evaluation nudges’(Bohnetetal.2016).Alloftheseaddresstheeffectsobservedwhenthereisrivalryforresources.

6

failureintheeyesofothers(DickersonandKemeny2004).Thereis,however,no

evidence that these physiological responses are gender related.6 This, and the

lackofpreviousstudiesongender-specificperformanceeffectsof(social)status

anxiety are somewhat surprising, because there is ample evidence of gender

differences in ‘stereotype threat’, i.e., cultural beliefs about gender-specific

performance.Suchstereotypethreatscancausedistinctsocialevaluativethreats

formenandwomenandmaythereforedifferentiallyaffectperformance.Indeed,

stereotypethreatisconsideredtobeanimportantcauseofgenderdifferencesin

self-assessment of ability and career aspirations (with men scoring higher in

both;Correl2004,Thébaud2010,Reubenetal.2012).

Stereotypethreatmayleadtoevaluationanxietywhenconductingtasksthat

are considered to be negatively associated with one’s gender (Steele 1997).

Simply knowing that a negative gender stereotype exists may be sufficient to

causeanxiety(Goffman1963,HowardandHammond1985,SteeleandAronson

1995), which inhibits performance (Sarason 1972, Hunt and Hillery 1973,

Michaelsetal.1982,WigfieldandEccles1989,O’BrienandCandall2003).Hence,

stereotype threat could conceivably cause gender differences, both in the

performance under competition for resources and in the effects of anticipated

status ranking. In our study,we exclude this possibility.We are careful not to

primestereotypethreat.Furthermore,ourdesignallowsustoisolateanyeffects

of pre-existing stereotype threats related to gender. The results, however,

indicatenoevidenceofsucheffects.

There are a few non-laboratory studies that look at the effects of giving

rankinginformationtoworkerswithoutpecuniaryconsequences.Blanes-i-Vidal

andNossol(2011)studydatafrompersonnelrecordsforwarehouseworkersof

aGermanwholesaleandretailorganization, inwhichworkerswerepaidpiece

rates and received private ranking information on their pay and productivity.

Using a quasi-experimental research design they find that providing this

information leads to a large increase in workers’ productivity. In contrast,

6Inongoingresearch,twooftheauthorscollaboratewithCarstendeDreutolookmorespecifi-callyatwhethervariation inperformancecanbeexplainedbyphysiologicalreactionstostatusranking.Inalaboratoryenvironmentsimilartotheoneusedhere,implementedattheUniversityof Amsterdam, saliva samples were collected to enable a study of hormonal reactions. SeeAppendixCformoreinformation.

7

Barankay(2012)findsanegativeeffectofprovidingrankingaboutfeedback.He

presentstheresults fromarandomizedcontrol trialwithfurnituresalespeople

whoareprivatelyinformedabouttheirperformancerank.Hefindsthatprivately

giving rank information without any pecuniary consequences decreases sales

considerably formen, but not for women. Note that the private nature of the

ranking information inbothof these fieldexperimentsmeans that theydonot

measuretheimpactofsocial-statusrankingthatweareinterestedin.

3.ExperimentalProceduresandDesign

The experiment was run at the laboratory of the Universitat Pompeu Fabra

(UPF)inBarcelonabetweenApril2014andMay2016.Thereweresixsessions

with13 and sixwith18participants, for a total of 186participants; 144were

‘active’participants (A-andB-players;seebelow),while theresthadapassive

role(C-players;seebelow).Allparticipantswererecruitedonavoluntarybasis

fromtheUPFsubjectpoolusingtheORSEErecruitmentsoftware(Greiner2004).

If more volunteers showed up than needed for the session, participants were

randomlyselectedandtheremainderwassentoffwitha€7show-upfee.

Theexperimentwaspartly computerized.7 Instructionswerehandedouton

paper and are reproduced in part I of the Supplementary Material (SM). The

experiment consists of three parts. In part 1 (computerized), participants

undertake an individual task. In part 2 (not computerized), some active

participantsarerequiredtoreporttheirresulttootherwiseinactiveplayers.Part

3(computerized;discussedinAppendixB)involvespairsofparticipantsplaying

dictatorgames.Instructionsforparts2and3weredistributedaftercompletion

ofthepreviouspart.

Sessionslastedapproximately50minutes.Attheendofeachsession,partici-

pants were paid their earnings (which were contingent on their decisions in

parts1and3)inprivate.Foractiveparticipants,averageearningsincludingthe

€7show-upfeewere€23.47(€24.08),ex(in)cludingtwooutliers(asexplained

below).Inactiveparticipantsreceiveda€20participationfee.

7TheexperimentalsoftwarewasdevelopedinDelphiattheCenterforResearchinExperimentalEconomics and political Decision making (CREED) by CREED programmer Jos Theelen. It isavailableuponrequest.

8

3.1.PlayerTypes

Beforeenteringthelaboratory,participantsarerandomlyallocatedtothethree

typesofplayers,denotedbyA,BandC.OnlytypesAandBenterthelaboratory

and do the tasks described below. C-players are taken to separate rooms and

remain inactive throughout the experiment. In every session there are six A-

players and six B-players. Depending on the treatment (see below), there are

eithersixoroneC-player.

3.2.Task

Part1 is thesame inallsessionsand is takenfromWeberandSchram(2016).

Participantsarepresentedwithasequenceofpairsof10x10matricesfilledwith

two-digit numbers. Thesematrices appear at the lower half of their computer

monitor(Figure1).

Figure1:ScreenshotPart1

Notes. The instructions inform participants that the numbers in the cells were ‘randomlygenerated’(cf.SM).Drawingfromauniformdistributionwouldhaveledtoahighprobabilityofveryhighsums.Toavoidthis,foreachcell,wefirstdrewarandomnumberbetween40and99,sayX.Then,wedrewa randomnumber (uniformly)between10andX.This gives a far lowerprobabilityofhighnumbers(thechanceofanumberbeing75ormoreisapproximately0.06).

Foreachpairofmatriceseachparticipanthastoindividuallysearchtofindthe

highestnumberintheleftmatrixandthehighestnumberintherightmatrixand

9

to calculate the sum of these two numbers. This summust be entered in the

windowatthecenter-topofthemonitor.8Acorrectansweryieldsoneeuro.We

applythispiece-rateremunerationinallofourtreatments.Afteranumberhas

been entered, two new matrices appear, regardless of whether the sum was

correctornot.Thetaskcontinuesfor15minutes.Thepiecerateremuneration

that we apply aims at minimizing the rivalry for resources in all of our

treatments.Anytreatmentdifferencesthatmightoccurcanthenbeattributedto

thesocialstatusdimensionofcompetition.

B-players are instructed about the summation task and perform the task

withoutfurtherinteractionwithotherplayers.A-playersareinformedbeforethe

task that theywill be required to report their performance to a C-player after

completion.Performanceismeasuredasthenumberofcorrectsummations.The

A-playerinstructionsalsoemphasizetheimportanceofdoingwellinthistaskby

mentioning that it has been shown to correlate positively with success in

professionallife.9Participantsweretoldthatwewouldprovideevidenceofthis

claim upon request after the experiment. For this purpose, we had available

copiesofKoedelandTyhurst(2012),whichisaresumestudylinkingmathskills

tolabormarketoutcomes.

After finishing the instructions, each A-player is individually taken to a C-

playerandreadsaloudatextstatingthats/hewillreturnafterthetasktoreport

her/hisscore(i.e.,performance).Thisisdonetocreatetheanticipationofhaving

8 Alternatively, we could have used the summation task applied in Niederle and Vesterlund(2007).Shurchkov(2012, fn21),however, reportsevidenceofastereotype threat in this task,wherewomenfeelapriorithatmenhaveanadvantage.Toavoidthis,wedecidedtouseataskthatoneofushassuccessfullyappliedbefore(WeberandSchram2016).InthispreviousstudytherewasnoevidenceofgenderdifferencesandourdataforB-playersconfirmthis.Thisiswhywebelieve there tobenostereotype threat for the taskweused.Thisbelief findssupport inarecentapplicationofthesametaskinanexperimentweraninBologna.There,wealsoelicitedbeliefsaboutmaleandfemaleperformanceinthistaskbylettingsubjectsguesswhethermenorwomenhadthehighermeanscore(witha fiveeuroprize foracorrectguess).Thisshowednoevidence of expected performance differences; out of 30 participants, 17 (13) thoughtwomen(men)wouldscorebetter.9Thisemphasiswasmadetostresstheimportanceofstatusrankingbasedontheperformancein the particular task we used. After an analysis of results obtained in early sessions, somecolleaguessuggestedthat the fact thatA-playersbutnotB-playersweregiventhis informationmightcauseastereotypethreatthataffectsgenderdifferencesobservedamongstA-players.Forthisreason,inlatersessionstheB-playerswerealsoprimedwiththistextinthesamewayastheA-players.Theydidnotparticipateinpart2(hence,didnotreporttotheirpeers).Weobservednogendereffectsfortheseparticipants(moredetailsareavailableuponrequest).Weconcludethattheemphasisdoesnotinitselfinducestereotypethreat.Thisalsosuggeststhatinretrospectitwasunnecessarytoprovidethisinformationaltogether.

10

tolaterreporttotheC-player.ThetextusedisgiveninSM.Theexperimenters

takingtheA-playerstoseethecorrespondingC-playerwerealwaysamananda

woman.

3.3.Treatments

Westartwiththedistinctionbetweentwotreatmentsthatdifferonlyinwhether

C-playersareabletocomparetheperformanceofA-players.Thesearedenoted

as the ‘Status Ranking’ (SR) treatment and the ‘Conformity’ treatment (CF-NR,

which denotes ‘Conformity-No Ranking’). In SR, there is only one C-player. In

part2of theexperiment,eachA-playerreports(oneatatime)tothisC-player

andreadsaloudthenumberofcorrectsummationsandtheownrankamongst

theA-players(cf.theupperpanelofFigure2).

The conformity treatmentwasdesignedwith the idea that simply reporting

one’s score to a peer might already induce social evaluative threat and affect

behavior. In section 3.3 of their excellent overview of the literature on status,

Heffetz and Frank (2008)write: “Indeed ifwe assume that status depends on

actions, status-seeking individuals are expected to change their behavior in

predictablewaysdependingonwhethertheiractionsarevisible toothers.The

observationthattheyoftendo,however,isconsistentnotonlywithpreferences

for status,butalsowithanypreferenceswhereothers’opinionsare important

(e.g. because of considerations of reputation, shame, fear of punishment, etc.).

This shouldbeborne inmindwhen interpreting theevidencebelow.” Inother

words, our status-ranking treatmentmay confound the effects of social status

withothereffectsrelatedtoawishto‘conform’toapeer’sopinions.10Tostudy

thestatuseffectinSR,weuseCFtoisolatesuchothereffects.

To control for such ‘conformity’ effects,weuse theCF-NR treatment,where

there are six C-players, each seated in a separate room. Each A-player in this

treatment reports (one at a time) to a different C-player and reads aloud the

number of correct summations, but does not report anything related to the

player’sranking(seethelowerpanelofFigure2).Whenreporting,A-playersuse

printed(truthful)textsprovidedbyus(cf.SM).InbothSRandCF-NR,B-players

10 We believe the term ‘conformity’ to be adequate to capture the idea that people may beinfluencedbyothers’opinionsindependentlyfromanystatusconcerns.

11

Figure2:ExperimentalDesign

Notes.A-andB-playersindividuallydothesummationtask.ThenA-playersreportprivatelytoC-player(s)(indicatedbyarrows).PanelAshowstheStatusRanking(SR)treatmentwhereeachA-player individually goes to the (same) C-player and reports his or her own score and rankamongst A-players. Panel B shows the Conformity (CF) treatment where each A-playerindividuallygoestohisorher‘own’C-playerandreportsthescore.

donotreporttoC-players.Theirperformanceservesasabehavioralbenchmark

ofisolatedplaywithoutreporting.

NoteatthisstagethattheremaybetwodifferencesbetweentheCF-NRand

SR treatments. In SR, the social ranking is not only known to others (i.e., C-

players),butalsototheA-playersthemselves.InCF-NR,A-playersdonotknow

(and, hence, cannot report) their social ranking. To separate the effects of

reportingandknowingtheownsocialranking,weaddatreatmentinwhicheach

A-player is informed about her own rank but knows that every A-player will

report to a distinct C-player, i.e., there is no social ranking. We denote this

treatmentbyCF-PR(‘Conformity-PrivateRanking’).

Inalltreatments,C-playerinstructionsinformthemthattheywillbetoldthe

result of either one (CF-NR/CF-PR) or six (SR) participants. They are not in-

B:Conformity

A:StatusRanking

C

A1 A2 A3 A4 A5 A6

B1 B2 B3 B4 B5 B6

C

A1 A2 A3 A4 A5 A6

B1 B2 B3 B4 B5 B6

C C C C C

12

formedaboutthetask,butaretoldthathighscoresindicatebetterperformance

thanlowscores.11A-playersknowthattheC-playersdonotknowthetask.After

allA-playershavereportedtheirscores,C-playersarepaid€20anddismissed.

Thechoice to induce social rankingviaa face-to-faceencounterwithapeer

deserves further discussion. Of crucial importance is that –as argued above–

social status requires that the ranking is public (i.e. socially recognized).12 An

alternativewouldhavebeentoorganizetheinteractionbetweentheA-andthe

C-playerthroughthecomputer.Thiswouldhave,however,seriouslyreducedthe

saliencyofthesocialaspectofstatusintheSRtreatment.Adisadvantageofour

approach may be that face-to-face interaction introduces various possible

channels through which our main results might emerge. We hope to have

diminished the number of channels by introducing only minimal contact

betweenthetwoparticipantsinvolved(theA-playerreadsaloudaone-linetext

prepared by us and the C-player is not allowed to respond). We consider a

further investigation of possible channels bywhich this face-to-face encounter

mightcausetreatmenteffectsaninterestingtopicforfutureresearch.13

3.4.Pilot

Before running the 12 sessions of this experiment, we organized four pilot

sessions (in March 2014). These differed from the final experiment on two

accounts.First,participantsweregiventenminutesinsteadof15minutestodo

the summation task. We increased the amount of time given to create more

leeway for differences in performance. Second, A-players did not go to the C-

playersbetweenreadingtheinstructionsforpart1andstartingthesummation

task.We introduced this tomake the reporting of their result to a peermore

prominent.

11WedonotinformC-playersaboutthetaskinordertoavoidthemformingopinionsaboutwhatisa‘good’score.SuchopinionscouldgenerateafeelingofrankingevenintheCFtreatments,inthesenseofaperformancelevelabove/belowacertainlevelbeingjudgedasgood/bad.12 Social rankingmight conceivably also occur via the experimenters. The sessionswere orga-nized in away, however, thatmade it obvious to the participants that no experimenter couldobservetheirrank.Moreinformationisavailableuponrequest.13 As suggested by an anonymous reviewer, the face-to-face encounter between the twoparticipantsmightcreatearivalry forresources in theSR treatment ifA-playersbelieve thatahighrankinthetaskmightaftertheexperimentbringthemfavorsbytheC-player.Thoughwedonot believe that thiswould cause the large treatment effects thatwe report below,we cannotexcludethispossibility.

13

4.Results

Ourpresentationoftheresultsfocusesongenderdifferencesinperformancein

the various treatments, distinguishing between attempted summations and

performance (i.e., thenumberof correct summations).Because all tests reflect

pairwise comparisons between independent samples of individuals, we use

(two-sided) permutation (a.k.a. randomization) t-tests using Monte-Carlo

resampling with 5000 repetitions (henceforth, PtT) throughout the analysis.14

PtT do not make assumptions about the underlying distributions and the

numberofobservationsneededfortrustworthyinferenceis(much)lowerthan

for the testsmore commonly used in experimental work. For example,Moir’s

(1998)studyinthisjournalalreadyshowsthesuccessofthesetestswithasfew

as eight observationsper treatment cell.Ournumbersof observationsper cell

vary between 16 and 52 (note that by designwe havemore observations for

players of typeB) and all tests of ourmainhypotheses are basedon26 to72

observations.WeprovideafurtherdiscussionofourtestsinAppendixC,which

also provides supportive evidence for our results using data from related

experimentsinAmsterdam(cf.fn.6).

In presenting our results, we first investigate whether privately knowing

one’sownrankhasaneffectonthenumbersofattemptsandperformance.We

then continue with considering the effects of social-status ranking on the

numberofattemptsandperformance.Anoverviewofoursummarystatisticsis

presented in Appendix A and Appendix B reports the effects of experienced

statusrankingonchoicesinthedictatorgame.

4.1.TheEffectsofPrivateRankingInformation

To check whether knowing one’s relative position (without anyone else

knowing)hasaneffect,wecomparetheCF-NRandCF-PRtreatments.Figure3

comparesattempsandperformanceacrossgender for these two treatments. It

showsthattheorderingbetweenmenandwomenonbothmeasuresisreversed 14See,forexample,Moir(1998).Wepreferthepermutationt-testoverthemorecommonMann-Whitneytestbecausethelattertestsfordifferencesindistributionsoftwoindependentsamples.Wearemorepreciselyinterestedindifferencesinthemeansofthedistributions.Nevertheless,the results presented here are robust to using Mann Whitney or t-tests instead of thepermutationt-test.

14

whensubjectsknowthattheywillbeprivatelyprovidedwithinformationabout

theirrankingamongsttheA-players.Differencesaresmall,however.Noneofthe

within-genderdifferencesinattemptsorperformancebetweenCF-NRandCF-PR

arestatisticallysignificant(PtT;allp>0.24;N=20forwomen,N=16formen).

More importantly, there are no significant gender differences in attempted

summationsorperformanceforeitherconformitytreatment(PtT;attempts: in

CF-NRp=0.374,N=18;inCF-PRp=0.292,N=18;performance:inCF-NRp=

0.242,N=18;inCF-PRp=0.509,N=18).Forthisreason,wepoolthedatafor

theCF-PRandCF-NRtreatmentsfromhereonwards,unlessindicatedotherwise.

Figure3:AttemptsandPerformanceinConformityTreatments

Notes. Bars show number of attempts at calculating summations (left) andperformance (number of correct summations, right), separately for women andmen.CF-NR:Conformitytreatmentwithoutknowingownrank;CF-PR:Conformitytreatmentwithknowingownrank.Errorbarsshow95%confidenceintervals.

4.2.TheEffectsofAnticipatedStatusRanking

Whenfurtheranalyzingthedata,weleaveouttwooutliersintheSRtreatment

with more than 100 attempted summations (see SM, part II). Including them

wouldfurtherstrengthenourresults.Figure4presentsthemainresultsofthis

paper. The results for type B show that women make insignificantly more

attemptsandhaveinsignificantlylowerperformancethanmenwhentheydothe

summationtaskswithouthavingtovisitaC-player(PtT;p=0.757forattempts,

p = 0.887 for performance; in both cases N = 72). This is an important

7911131517192123

CF-NR CF-PR CF-NR CF-PR

attempts performance

women

men

15

benchmark indicating that for this task our participants experience no

unaccounted-forstereotypethreatrelatedtogender(cf.section2).15

In the conformity treatments –i.e., when participants know that they will

reporttheirresulttoapeerbutalsoknowthatthisC-playerwillnotbeableto

compare this result to others’ performance– the differences betweenmen and

womenareverysmallandstatisticallyinsignificant(PtT;p=0.951forattempts,

p=0.658forperformance;inbothcasesN=36).

Figure4:AttemptsandPerformance

Attempts Performance

Notes.Barsshowthenumberofattemptsatcalculatingsummations(leftpanel)andperformance(number of correct summations, right panel), separately for women andmen. CF: Conformity(CF-NRandCF-PRpooled);SR:StatusRanking.Errorbarsshow95%confidenceintervals.

The most remarkable result is observed for the treatment where A-players

report to a C-player and know that this peer will be able to compare their

performancetoothers(SR).Here,womenmakemanyfewerattemptsandhave

much lower performance than men and these gender differences are highly

significantforbothattempts(PtT;p<0.001;N=34)andperformance(PtT;p<

0.001;N=34).TheobservedgenderdifferenceinperformanceinSRisadirect

consequence of the difference in attempts because the fraction of attempted

15ForconformityorstatusrankingtohaveadifferentialeffectitisnotpersenecessarythatthereisnogendergapinthebehavioroftheBplayers.BeforewestartedourworkourmotivationwastofindoutwhetherCorSRleadstodifferentbehaviorthaninthebenchmarkwithouthavingaclearhypothesisaboutwhatbenchmarkbehaviorwouldbe.

12141618202224

CF SR

typeB typeA

women

men

8

10

12

14

16

CF SR

typeB typeA

16

summationsthatiscorrectdoesnotdifferbetweenmenandwomeninSR(PtT;p

=0.789;N=34).16

The ‘dif-in-dif’ result shown in Figure 4 is a direct consequence of the

differenceinthewaymenandwomenreacttotheintroductionofconformityor

status ranking.When introducing conformity inCF (having to report toothers

withoutbeingcompared),womenslightlyincreasetheirattemptsbuthavelower

performance(comparedtothebehaviorofB-playerswhodonotreport).These

differences are far from statistically significant, however (PtT; p = 0.548 for

attempts, p = 0.692 for performance; in both cases N = 72).17 Men (slightly)

increasetheirnumberofattemptsandhavealmostthesameperformance;again

theseeffectsarestatisticallyinsignificant(PtT;p=0.496forattempts,p=0.933

forperformance;inbothcasesN=36).18

When introducing social-status ranking in SR, a comparison to the ‘non-

reporting’ B-players shows that women reduce their number of attempts and

performance, while men strongly increase attempts and performance. For

women,thefirsteffectisstatisticallysignificant(PtT;p=0.044,N=68)whilethe

effect on performance is insignificant (PtT, p = 0.172, N=68). For men, both

effects are statistically significant (PtT; p = 0.003 for attempts, p = 0.050 for

performance; in both casesN=38). These results allowus to conclude that the

gender difference we observe in a situation where anticipated status ranking

may affect behavior is caused by men increasing the number of attempted

summationsandwomendecreasingit.

To investigate the effect of status ranking within gender, the most direct

comparison is between our treatments CF-PR and SR. Recall that the only

differencebetweenthesetwoisthateachparticipantreportshisorherscoreto

adifferentpeerinCF-PRwhileallsixparticipantsreporttothesameC-playerin

SR.Theeffectswefindareremarkable.Ananticipationofstatusrankingmakes

women significantly reduce the number of summations they attempt (PtT for

attempts,p=0.010,N=26).Thereductioninperformanceisnotsignificant(PtT

16 In the follow-up experiments in Amsterdam (see fn. 6), the patterns observed here wereconfirmed(cf.AppendixC).17AsimilarlackofsignificanteffectsisobservedwhenconsideringtheCFtreatmentsseparately(inCF-NR:p=0.731,p=0.428,N=62,respectively;inCF-PR:p=0.163,p=0.854,N=62).18 Similarly, there areno significant effectswhen considering theCF treatments separately (inCF-NR:p=0.163,p=0.370,N=28,respectively;inCF-PR:p=0.875,p=0.411,N=28).

17

for correct, p = 0.221, N = 26). For men, the numbers of attempts and

performancebothincreasesignificantly(PtT,p=0.047forattempts,p=0.028for

correct,inbothcasesN=26).

5.Conclusions

Ourexperimentalstudyabstractsfromrivalryforresourcesandfocusesonthe

effectsofsocialstatusresultingfromthesocialrankingofperformances.Wefind

thatmenmakemoreattemptsandincreasetheirperformanceinanticipationof

status ranking.Women, on the other hand,make fewer attempts and perform

morepoorlywhentheyknowtheywillbecomparedtoothers.Thisresultsina

largeandstatisticallyhighlysignificantgendergap.

Our findings suggest that anticipated ranking of social status alone is an

important element in observed gender differences in real-world competitive

environments. Previous studies have shown that women tend to ‘opt out’ of

competitive situations (Niederle and Vesterlund 2007). Our results imply that

findingwaystomakewomen‘optin’maynotsufficetobridgethegendergap.In

fact, our study shows that –if the status ranking inherent to competition is

salient–forcinganopt-inwillmakewomenslowdownintryingtoperformtheir

taskandwillmakemenexcel.

Thoughbeingcomparedtoothersisparticularlydisadvantageoustowomen,

the aggregate effect across men and women may not be negative. In our

experiments, total productivity (measured by the total number of correct

summations for oneman and onewoman) is on average 22.8 for participants

who do not report to anyone, 22.5 for those in conformity and 23.7 for those

anticipatingstatusranking.Thissuggeststhatsuchrankinghasnegativeeffects

on gender equalitywithout negatively affecting economic efficiency. Efficiency

and equity could both be enhanced if one could diminish the effect of social-

statusrankingonwomenwhilemaintainingthestimulatingeffectithasonmen.

Ourfocusinthispaperhasbeenongeneratingcausalevidenceonthegender

effects of social-status ranking. The question arises what are the mechanisms

underlying thisphenomenon.Tentative interpretationsofour findingsare that

either women choke under status pressure, or that status ranking with peers

demotivateswomen.Thefactthatinanticipationofstatusrankingwomenmake

18

fewerattemptshintsatthelatter,thoughitraisesthefollow-upquestionofwhy

this demotivation occurs. It is also possible that women simply becomemore

careful inperformingtheirtask, inthesenseofponderingtheirdecisionsmore

before submitting them to the computer. Finally, the observability inherent in

socialstatuscomparisonmightinduceinwomenadesiretoconformtoagender

norm(similartothe‘actingwife’phenomenonreportedinBursztynetal2017).

Atthisstageitisunclear,however,whythiswouldappearinourStatusRanking

treatment and not in Conformity. A solid explanation of the effects we find is

beyond the scope of this paper, but it deserves further investigation in future

research.

Given the increasing labor participation of women, such gender differences

and the ‘hidden’ factor of social-status ranking under competition need to be

addressed. A first step would be to reduce for women the performance

comparisonwithothersinworkingenvironments.Thiscanbedone,forexample,

via fixed promotion standards based on individual performance without

comparisontopeers.AnexampleofthispracticeisthatinmanyNorthAmerican

universities, tenure decisions are not made in direct comparison to other

candidates who are simultaneously up for tenure, but to a set of standards

expected for a tenuredposition.Our results suggest that that if thisprocedure

reduces thesalienceofstatusranking, it lead tobetterperformancebywomen

than in universities where they have to applyand compete for vacant tenure

positions(asisoftenthecaseinEurope).

Finally, to thebest of ourknowledge, this is the first study that isolates the

effectof social-status ranking from the rivalrydimensionof competition.More

researchisneededtoestablishtheconsequencesofwhatwefindhere,thateven

whenrivalryforresourcesisheldconstant,simplybeingcomparedtoothershas

an opposite effect on men and women, leading to gender differences in

performance and resource allocations. An interesting direction for future

researchwouldbetomakebothdimensionsofcompetitionsalient.Ourhunchis

that they would reinforce each other in creating more advantageous

environmentsformenthanforwomen.

19

References

F. Alpizar, F. Carlsson, O. Johansson-Stenman (2005). How much do we care aboutabsolute versus relative income and consumption? Journal of Economic Behavior andOrgization56,405-421.L.Balafoutas,M.Sutter(2012).Affirmativeactionpoliciespromotewomenanddonotharmefficiencyinthelaboratory.Science335,579-582.S.B. Ball, C.C. Eckel (1996). Buying Status: Experimental Evidence on Status inNegotiation.PsychologyandMarketingXIII:381-405.Ball,SherylB.,andCatherineC.Eckel(1998).TheEconomicValueofStatus. JournalofSocio-EconomicsXXVII:495-514.S.B. Ball, C.C. Eckel, P.J. Grossman, W.R. Zame (2001). Status in markets. QuarterlyJournalofEconomics116,161-188.Barankay(2012).Rankincentives.EvidencefromaRandomizedWorkplaceExperiment.WorkingPaper.J. Blanes-i-Vidal, M. Nossol (2011). Tournaments without prizes: Evidence frompersonnelrecords.ManagementScience57,1721-1736.J. Brandts, V. Groenert, C. Rott (2015). The impact of advice on women’s and men’sselectionintocompetition.ManagementScience,61(5),1018-1035.I.Bohnet,A.vanGeen,M.H.Bazerman(2016).WhenPerformanceTrumpsGenderBias.JointVersusSeparateEvaluation.ManagementScience,62(5),1225-1234.

L.Bursztyn,T.Fujiwara,andA.Pallais(2017).‘ActingWife’:MarriageMarketIncentivesandLaborMarketInvestments. American Economic Review 107, 3288-3319 .

Th.Buser,M.Niederle andH.Oosterbeek (2014).Gender, competitiveness and careerchoices.QuarterlyJournalofeconomics129,1409-1447.S. Correll (2004). Constraints into preferences: Gender, status, and emerging careeraspirations.AmericanSociologicalReview69,93-113.R.CrosonandU.Gneezy(2009).Genderdifferencesinpreference. JournalofEconomicLiterature47,1–27.A.DeBotton(2004).StatusAnxiety.HamishHamilton,London.S.Dickerson,M.Kemeny (2004).Acute stressors and cortisol responses:A theoreticalintegrationandsynthesisoflaboratoryresearch.PsychologyBulletin130,355-391.

F. Carlsson, G. Gupta, O. Johansson-Stenman (2009). Keeping up with the Vaishyas?CasteandrelativestandinginIndia.OxfordEconomicPapers61,52-73.M-P. Dargnies (2012). Men Too Sometimes Shy Away from Competition: The Case ofTeamCompetition.ManagementScience58:1982-2000.

20

J.A. Flory, A. Leibbrandt, J.A. List (2014). Do competitive work places deter femaleworkers? A large-scale natural field experiment on gender differences in job-entrydecisions.ReviewofEconomicStudies(onlinefirst)R.H.Frank(1999).LuxuryFever.PrincetonUniversityPress,NewJersey.E.Goffman(1963).Stigma.Simon&Schuster,NewYork.U.Gneezy,M.Niederle,A.Rustichini(2003).Performanceincompetitiveenvironments:genderdifferences.QuarterlyJournalofEconomics118,1049–1074U. Gneezy, A. Rustichini (2004). Gender and competition at a young age. AmericanEconomicReview4,377–381.B.Greiner(2004).PapersonStrategicInteraction2003-10,MPIofEconomics.C. Gunther, N.A. Ekinci, C. Schwieren, M. Strobel (2010). Women can’t jump? Anexperiment on competitive attitudes and stereotype threat. Journal of EconomicBehaviorandOrganization75,395–401O.Heffetz,R.Frank(2008).Preferencesforstatus:Evidenceandeconomicimplications.In J. Benhabib M.Jackson, & A. Bisin (eds.).Handbook of Social Economics 1A. 69-91.Elsevier,Amsterdam.E.Hoffman,K.McCabe,K.Shachat,V.Smith(1994). ,Preferences,propertyrights,andanonymityinbargaininggames.Games&EconomicBehavior7,346-380.J. Howard, R. Hammond (1985). The hidden obstacles to black success – rumors ofinferiority.TheNewRepublic19317-21.P.J.Hunt,J.M.Hillery,(1973).Socialfacilitationinacoactionsetting:Anexaminationoftheeffectsoverlearningtrials.JournalofExperimentalSocialPsychology9,563-571.O.Johansson-Stenman,F.Carlsson,D.Daruvala(2002).Measuringfuturegrandparents’preferencesforequalityandrelativestanding.EconomicJournal112,362-383.C.Koedel,E.Tyhurst (2012).Mathskillsand labor-marketoutcomes:Evidence fromaresume-basedfieldexperiment.EconomicsofEducationReview31,131-140.J.W. Michaels, J. M. Blommel, R. M. Brocato, R. A., Linkous, J. S., Rowe (1982). Socialfacilitationandinhibitioninanaturalsetting.ReplicationsinSocialPsychology2,21-24.R.Moir (1998). A Monte Carlo Analysis of the Fischer Randomization Technique:Reviving Randomization for Experimental Economists. Experimental Economics 1, 87-100.R. Mujcic, P. Frijters (2013). Economic choices and status: Measuring preferences forincomerank.OxfordEconomicPapers65,47-73.M. Niederle, L. Vesterlund (2007). Do women shy away from competition? Do mencompetetoomuch?QuarterlyJournalofEconomics122,1067-1101.

21

M.Niederle,L.Vesterlund(2011).GenderandCompetition.AnnualReviewinEconomics3,601–630.M.Niederle,C.Segal,L.Vesterlund(2013).HowCostlyIsDiversity?AffirmativeActioninLightofGenderDifferencesinCompetitiveness.ManagementScience:59:1–16L. T. O’Brien, C. S. Candall (2003). Stereotype threat and arousal: Effects onwomen'smathperformance.PersonalityandSocialPsychologyBulletin,29,782-789.E.Reuben,P.Rey-Biel,P.Sapienza,L.Zingales(2012).Theemergenceofmaleleadershipincompetitiveenvironments.JournalofEconomicBehavior&Organization83(1),111–117.E.Reuben,M.Wiswall,B.Zafar(2017).PreferencesandBiasesinEducationalChoicesandLaborMarketExpectations:ShrinkingtheBlackBoxofGender.EconomicJournal127(604),2153–2186.I.G.Sarason(1972).AbnormalPsychology.Appleton,Century,Crofts,NewYork.O.Shurchkov(2012).Underpressure:genderdifferencesinoutputqualityandquantityunder competitionand time constraints. Journal of theEuropeanEconomicAssociation10,1189–1213.C.M.Steele(1997).Athreat intheair:Howstereotypesshape intellectual identityandperformance.AmericanPsychologist52,613–629.C.M.Steele,J.Aronson(1995).StereotypethreatandtheintellectualtestperformanceofAfrican-Americans.JournalofPersonalityandSocialPsychology69,797–811.G. Stigler (1987). Competition. In J. Eatwell, M. Milgate, P. Newman (eds). The NewPalgrave:ADictionaryofEconomics,,531.TheMacmillanPress,London.S. Thébaud (2010). Gender and entrepreneurship as a career choice: Do self-assessmentsofabilitymatter?SocialPsychologyQuarterly73,288-304.

M.WeberandA.Schram(2016).TheNon-EquivalenceofLaborMarketTaxes:AReal-EffortExperiment.ForthcominginTheEconomicJournal.A. Wigfield, J. S. Eccles (1989). Test anxiety in elementary and secondary schoolstudents.EducationPsychology24,159-183.R.G.Wilkinson,K.E.Picket(2010).TheSpiritLevel.PenguinBooks,London.D. Wozniak, W. Harbaugh, U. Mayr (2014). The menstrual cycle and performancefeedback alter gender differences in competitive choices. Journal of LaborEconomics32,161-198.Y.J.Zhang(2013).CanExperimentalEconomicsExplainCompetitiveBehaviorOutsidetheLab? Working paper, Hong Kong University of Science and Technology.

i

OnlineAppendices

A. SummaryStatistics

B. StatusRankingandEntitlementinBargaining

C. Permutationt-TestsandtheRobustnessofourResults

ii

Appendices

A.SummaryStatistics

TableA.1showsparticipants’characteristicsacrosstypesandtreatments.

TableA.1:Participants’Characteristics

TypeB TypeA-CF/NR

TypeA-CF/PR

TypeA-SR

Women 0.72 0.56 0.56 0.47Economics 0.37 0.33 0.36 0.44Age 21.9 21.8 23.2 21.7N 72 18 18 36Notes. Women: fraction of female participants; Economics:fractionofparticipantswithamajorineconomicsorbusiness;Age:averageage;N:numberofparticipants.

Table A1 shows that the fractions of participants with a major in economics andbusinessaresimilaracrosstypes.Wedistinguishbetweenthesemajorsbecausethesearethefieldsinwhichstudentsaremostlikelytohaveexperiencedtaskssimilartothesummation task. The differences are statistically insignificant (Fisher’s exact test,p=0.11). The average age is also statistically indistinguishable across types (Kruskal-Wallis,p=0.44).The distribution of women is similar across A-types. The differences are statisticallyinsignificant (Fisher’s exact test, p=0.86). Women seem somewhat over-representedamongst B-types however. Across all four categories, the differences are marginallysignificant(Fisher’sexacttest,p=0.07).Notethatthisdoesnotcauseproblemsforouranalyses,becausetheyalleithercomparegenderswithintypesorcomparetypeswithingender.

iii

B.ExperiencedStatusRankingandBargaining

Themaintextfocusesontheeffectsofananticipatedstatusranking.Here,wediscusstheeffectsofexperiencedstatusranking.Theliteratureonsucheffectsisscarce.MostinfluentialineconomicshasbeentheworkbyBallandEckel(1996,1998)andBalletal(2001).Intheirexperiments,highstatusisinducedtosomeparticipantsbycallingthemforward and awarding themwith a gold star. In subsequent interactions, high-statusparticipantsobtainalargershareoftheresourcesthanlow-statusparticipants.Thisisobservedbothintheultimatumgame(BallandEckel1996,1998)andinmarkets(Balletal.2001).Theinterpretationgiveninthesepapersisthatahighsocialstatuscreatesafeelingof‘entitlement’toresources,evenifthestatusisunrelatedtothetaskinwhichthe resources are generated (Ball et al. 2001). These studies do not address possiblegenderdifferencesinthiseffect.DesignInpart3ofourexperiment,eachA-playerispairedwithaB-playerandeachB-playerispairedwithadifferentA-player.This is illustrated inFigureB.1.Thispairing schemeaims at avoiding direct-reciprocity influences on participants’ behavior. Each partici-pant plays two dictator games, once as a dictator, once as a recipient. The dictatordivides€10betweenherselfandtherecipientwithwhichsheispaired.Forexample,A2divides€10betweenherselfandB2andB6divides€10betweenherselfandA1.Afteralldecisionshavebeenmade,arandomdrawdetermineswhetherdictatordecisionsbytheA-playersortheB-playersarepaidout.

FigureB.1:Dictatorpairing

Notes.Arrowsgivepairings,pointingfromthedictatortotherecipient.

ResultsWe first investigate whether our data replicate Ball et al. (1996, 1998, 2001)’s‘entitlement’ results. In our experiment bargaining is represented in its most simpleform: the dictator game. The Ball et al. entitlement results predict that having beenrankedhighly in thesummationtaskwillmakeoneoffer lessasadictator.Wewouldthen expect lower offers by the top-ranked participants of type A-SR than for top-ranked typesBorA-CF.This isbecause the former typeknows that they scoredwellwhen being socially ranked and the latter types scored well but were not sociallyranked(infact,typesA-CF-NRandtypesBdidnotevenknowtheirranks).Onaverage,theamountofferedbythesetop-rankedtypesis3.00,3.33and1.88fortypeB,typeA-CF and type A-SR, respectively. As predicted, we find that those who were socially

A1 A2 A3 A4 A5 A6

B1 B2 B3 B4 B5 B6

iv

rankedofferedleastinthesubsequentbargaining.iThepairwisedifferencesindictatorgiving between the socially ranked (and top-3) dictators and each of the other twocategoriesismarginallystatisticallysignificant(PtT,p=0.065,N=55whencomparingtypeA-SRtotypeB;p=0.095,N=34whencomparingtypeA-SRtotypeA-CF).ThisisinlinewiththeentitlementresultsbyBalletal.(1996,1998,2001).FigureB.2showsperplayertypetheaverageamount(outof€10)givenbymenand

womentotherecipient,separatelyforplayerswitharankinthetop3andthoseinthebottom3.

FigureB.2:DictatorAllocations

Notes.Barsindicatetheamountineurosgivenbythedictatortotherecipient(s)hewaspairedto.Rank1,2,3(4,5,6)indicatesthattheparticipantwasamongstthetop3(bottom3)inowngroup in terms of performance. Recall that only type A-SR and type A-CF-PR players knewtheir rank. The numbers of observations are larger than or equal to eight in all categories,exceptmen in typeA-SRwithrank4-6(N=5)andwomenintypeA-SRwithrank1-3(N=3).Tieswere treatedas follows. InCFandSR theparticipant that reached the (tied)numberofcorrect solutions firstwas rankedabove theother. For typeB, all those tiedweregiven thesamerank.Errorbarsshow95%confidenceintervals.

At first sight, the largest differences between men and women are observed forparticipantswhohadbeensubjectedtosocialranking(notethatthenumbersofhighlyrankedwomenandlowlyrankedmeninSRarelow,however).Irrespectiveofwhetherthey scored highly or lowly, men seem to give less than women after having beensocially ranked.We again test for genderdifferencesusingpermutation t-tests. TableB.1presentsthep-valuesthatthesetestsyield.

iKnowing (only)privately thatonehasahighrankdoesnotmakeonegive less.Thenineparticipantswithatop-3rankintypeA-CF-PRgaveonaverage4.0inthedictatorgame,whilethenineparticipantswithatop-3rankintypeA-CF-NRgave2.67.Thisdifferenceisstatisticallyinsignificant(PtT,p=0.363,N= 18). This suggests that the social aspect of ranking is important for Ball et al’s entitlement effect tooccur.NotethatthesocialaspectisakeypartofthestatusinducementprocedureinBalletal.(2001).

0

1

2

3

4

5

6

rank1,2,3 rank4,5,6 rank1,2,3 rank4,5,6 rank1,2,3 rank4,5,6

typeB typeA-CF typeA-SR

women men

v

TableB.1:TestResultsforGenderEffectsinDictatorAllocations

TypeB TypeA-CF TypeA-SRAll 0.616

N=720.300N=36

0.001N=34

Top3 0.264N=39

0.458N=18

0.180N=16

Lower3 0.660N=33

0.564N=18

0.017N=18

Note.Cellsshowthep-valueofanPtTtestonequalmeancontributionsbymenandwomenforthetypedepictedinthecolumnandtheranksdepictedintherow.

The results in Table B.1 show only few gender effects in dictator giving. The oneimportant exception is thatmenwho have scored badly in the summation task (i.e.,lower half in the ranking) offer significantly less than women after having beensubjectedtosocialranking.Thesizabledifferencebetweenmenandwomenobservedfortop-rankedsubjects(cf.FigureB.2)inSRisstatisticallyinsignificant,possiblyduetothe lownumberofwomen in thesample. Inshort,mentendtogive less thanwomenwhenranked,butespeciallysoafterhavingbeenpubliclyrankedlowly.

vi

AppendixC:RobustnessofourResultsThe numbers of observationswe use for our statistical tests are at the lower end ofwhat one typically observes in the experimental literature. In this appendix, we firstarguethattheyneverthelessallowforvalid inferencewhenusingpermutationt-tests.Then, we use additional data obtained from a related experiment to show that ourconclusionsarerobusttoenlargingthedataset.Permutationt-testsPermutation (a.k.a. randomization) tests (Fisher 1935) are based on reshufflingtreatmentlabelsinadataset.Considerthecaseofat-testfordifferencesinmeans.Theideaforreshufflingstartsfromnotingthatanobservedt-statisticmaybeseenasadrawfromall possible t-statistics. TableC.1provides an example.Assume thatweobservetheheight(incm)ofthreemenandthreewomenasdepictedinthefirstrow.Wewanttoinvestigatewhethermenaretallerthanwomen.

TableC.1:Anexampleofpermutationt-tests Men Women t-statistics p-valueObserved 176,182,190 164,168,170 3.47 0.03Shuffle2 164,182,190 176,168,170 0.91 0.41….. Shuffle20 176,168,170 176,182,190 3.47 0.03Applying a t-test to the observed heights would lead us to believe that men aresignificantlytalleratthe5%-level.Thelownumberofobservations(six)shouldmakeone doubt the normality assumption underlying this t-test, however. There are 20possibledistributionsofthesixheightsbetweenthreemenandthreewomen.Ofthese(only)theobserveddistributionand‘shuffle20’giveat-valueof3.47orhigher.Evenifheightsaredistributedrandomlybetweenmenandwomen,thereisthena10%chancethatat-testwouldconcludethatmenaretaller.Thisisanexactprobability.ThisFishertestshowsusthat–basedonthesesixobservations–wecannotconcludethatmenaretallerthanwomen.iiForoursample, thereare toomanyobservations tocheckallpossiblereshufflesof

thedata.Inthiscase,onecanrandomlydrawasetofthese(inourcasewetook5000reshuffles, or ‘permutations’) and use the distribution of the resulting t-statistics toinvestigate how likely the observed value is to occur. Because of this Monte-Carloresampling,p-values are estimatedwith amargin of error. In caseswhere the upperboundof the95%confidence interval for thep-valueexceeded thechosensizeof thetest,wethereforeincreasedthenumberofreplicationsuntilthestandarderrorofpwasbelow0.0015.Becausethistestisbasedonexactstatistics,thenumbersofobservationsneededis

muchlowerthanintraditionalparametricandnon-parametrictests.Forgivensamplesize, the testhas thehighestpower in comparison to related tests (Siegel1956,Moir1998).Moir(1998)reportsaMonte-Carlostudythatshowsveryreliableresultsforasfewaseightobservationspertreatmentcategory.Thesmallestcellcountwebaseourtestsonis16.

iiThisexampleismerelyanillustrationofhowtheexacttestworks.Infact,withsixobservationsandtwocategories,onecannotachieveahighersignificancelevelthan10%.

vii

TheAmsterdamSessionsWeranarelatedexperimentattheCREEDlaboratoryinAmsterdaminJune,2015.Themainpurposeof this study is to investigate thephysiologicalmechanisms involved inthe effects of status ranking. For this reason, we also gathered saliva samples.Otherwise, the sessions and treatmentswere structured exactly like in theBarcelonaexperiment.Theresultsof theAmsterdamexperimentwillbepresented inaseparatestudy. We thank our co-author Carsten de Dreu for agreeing to let us report somebehavioralfindingshere.96subjectsparticipatedineightsessions(fourCF-PRandfourSR).Addingtheseto

theBarcelonadatagivesus236observationsandcellcountsbetween23and69.FigureC.1showstheobservednumbersofattemptsandperformance.Byandlarge,theresultsmirrorthoseobservedinBarcelonaalone.

FigureC.1:AttemptsandPerformance

Attempts Performance

Bothforattemptedsummationsandperformance,whenthereissocial-statusranking,thedifferencebetweenmenandwomen ishighlysignificant (PtT,p<0.01,N=58 inboth cases). The biggest difference between Figure E.1 and Figure 4 in themain textappearstobethata(smaller)genderdifferencemayalsoexistfortypesBandA-CF.Forattempts,thedifferencesobservedinFigureE.1arenotsignificant,however(PtT, p=0.25,N =119 for typeB;PtT, p = 0.66,N =60 for typeA-CF).Forperformance, thegendereffectsaremarginallysignificant(PtT, p= 0.09,N=119 for typeB;PtT, p=0.09,N=60fortypeA-CF).Whenaggregatingthedataacrossthetwolocations,oneneedstotakeintoaccount

that there were differences between the two subject pools. In particular, Dutchparticipantswereacross-the-boardbetter atdoing the summation taskand relativelymoreeconomistsparticipatedinAmsterdamthaninBarcelona.Becausesuchdifferen-cesmight interactwith gender effects,we ran linear regressionsof performance (i.e.,thenumberofcorrectsummations)onaseriesofbackgroundvariables,includinggen-der.Wedidsoseparatelyforeachtypeofplayer.TheresultsarepresentedinTableC.2.These results provide further evidence of the effects observed in the Barcelona

experiment.Aftercorrectingforbackgroundvariables,weonly findsignificantgenderdifferences when participants were subjected to social-status ranking. There are nogendereffects inperformancewhensubjectsdonotreport theirscore toanyone,norwhen they each report todifferentC-players.When there is ranking of social status,women performmuchworse thanmen. Themarginal effect is more than five fewer

14 16 18 20 22 24 26 28

CF SR

type B type A

women men

9 10 11 12 13 14 15 16 17 18

CF SR

type B type A

viii

correct summations. The results also show that Spanish participants have lowerperformancethantheDutchinallroles.

TableC.2:Performance

TypeB TypeA-CF TypeA-SRConstant 12.47*** 14.74*** 17.73***Economist 2.63*** 1.33 –2.16Barcelona –1.67*** –3.35** –3.13*Female 0.99 –0.76 –5.29**FemaleEconomist –3.00* –2.23 5.54N 102 60 40Notes. Cells report the coefficients of linear regressions of the number of correct summations on theindependentvariablesdepictedintherows.*/**/***depictsstatisticalsignificanceatthe1%/5%/10%-level.Thetotalnumberofobservationsisreducedduetomissingobservationsonbackgroundvariables.All in all, the additional data from Amsterdam provide further evidence that ourconclusionsinthemaintextcannotbeattributedtothenumbersofobservations.Reference(notincludedinthemaintext)Siegel,S.(1956).NonparametricStatisticsfortheBehavioralSciences,McGraw-Hill,Toronto.

Documents

Social-Status Ranking: A Hidden Channel to Gender Inequality … · 2018. 10. 18. · ranking among the other participants in the group. This allows the peer to compare performances