Segregation as overexposure - adjusting for covariates when units are small

Segregation as overexposure- adjusting for covariates when units are small

Oskar Nordström SkansIFAU and Uppsala University

Segregation Separation of groups (e.g. minority/majority) across

units (occupations, schools, firms, families…) Host of segregation indices (Gini, Duncan, Hutchens,..)

All measure the distance between the actual distribution and a distribution where the groups are equally represented in all units

With small (measured) units, groups will not be equally represented within each unit, even if randomly allocated

Standard solution to small unit bias

Generate ”counterfactual segregation” by randomly allocating individuals across the units, keeping the group sizes constant This counterfactual segregation is huge if, e.g.,

looking at segregation across firmsMeasure non-random segregation as the distance

between actual and random segregation.

𝑍መ= 𝑍− 𝐸[𝑍]1− 𝐸[𝑍]

What about covariates/confounders?

Suppose that you want to analyze the extent of segregation that cannot be explained by differences in the distribution of education and place-of-residence within the different groups.

In Åslund and Skans, Journal of population economics, 2009, we propose

Measure the exposure to minority workers (D=1) as the fraction of coworkers (i.e. excluding self) that belong to the minority

Under random allocation, average exposure among both minority and majority workers is (trivially) equal to the minority share

Hence, the distance between the minority share and average exposure among minority workers is a measure of segregation

Again, what about covariates..

We want to contrast the minority status of actual ”coworkers”, with coworkers of a similar kind.

We could imagine all jobs being filled by predetermined ”types” of workers defined by some covariates.

Think of the counterfactual (non-segregated) world as providing random coworkers, conditional on their ”types” defined by some covariates

Introduce covariates

Replacing actual exposure by exposure to minority propensities and calculate expected exposure to these propensities instead.

We estimate the propensities using averages within cells

Measure segregation as the distance between averages of actual exposure and conditional expected exposure

Convenient, do not require simulations.Easily extended to account for multiple groups.

Some stata* Individual level cross section, with unit identifiers, minority status, and X:s *Minorities are Dj==1, majority Dj=0, * Units and UnitSize:bysort UnitID: gen UnitSize = _N

* Calculate exposurebysort UnitID: egen Dsum=sum(Dj)gen Exposure=(Dsum-Dj)/(UnitSize-1) /* Subtract self */

* Average among minority workerssum Exposure if Dj==1, meanonlyglobal ActEx=r(mean)

Some stata* Define a set of covariates (all are chategorical variables)global Xvar "IndustryId RegionID Edulevel AgeCategory Female"

* calculate immigrant propensitybysort $Xvar: egen Px=mean(Dj)

* Calculate expected exposure bysort UnitID: egen Psum=sum(Px)gen ExpectedExposure$model=(Psum-Px)/(UnitSize-1) /* Subtract self */

* Sum over minority workerssum ExpectedExposure$model if Dj==1, meanonlyglobal Eeps$model=r(mean)

Extensions

1) Use Px as a threshold and randomly allocate minority status across the population:

gen Rand=uniform()gen FakeDj=Rand<Px

• Calculate alternative segregation indices based on Dj and FakeDj• Without covariates back to standard solution to small-unit bias• Calculate exposure to confirm that the intuition is right…

2) Calculate Px semi-parametrically to avoid over-fitting: probit[logit] Dj [varlist] \ predict Px

3) To expand into a multi-group setting, simply calculate exposure to the own group, and then average over the groups to get the average own-group exposure.

Simulation-based resultsWorkplace segregation, Sweden 2000 - with counterfactual simulations

Duncan Gini Hutchens ExposureActual 0.47 0.65 0.29 0.22

Expected 0.26 0.40 0.17 0.10

ConditionalExpected 0.27 0.41 0.17 0.10(Human Capital)

ConditionalExpected 0.41 0.57 0.24 0.16(HC, Industry, Region)

NMinoritiesUnits/Firms

--- 3,457,951 ------ 340,041 ------ 219,235 ---

Overexposure results, by durationWorkplace segregation, Sweden 2000 - with nonsimulated counterfactuals, by duration

All immigrants Own group Other groupsRecent immigrants Actual 0.27 0.07 0.20

Expected 0.18 0.025 0.09Odds ratio 2.58 3.24 1.93

Nonrecent immigrants Actual 0.21 0.06 0.14Expected 0.15 0.03 0.12Odds ratio 2.00 2.27 1.55

NMinoritiesUnits/Firms

--- 3,457,951 ------ 340,041 ------ 219,235 ---

Associations between overexposure and economic outcomes, by origin (Å&S, Ind Lab Rel Rev 2011)

To sum up…

The overexposure framework is a simple, fast and powerful tool to measure segregation

The framework has nice properties in terms of interpretation

It is straightforward/trivial to implement in Stata, relying on sums by groups

Segregation as overexposure - adjusting for covariates when units are small

Documents

Mitigating Overexposure in Viral Marketing · 2017-11-10 · Mitigating Overexposure in Viral Marketing Rediet Abebe 1, Lada A. Adamic2, and Jon Kleinberg 1Cornell University 2University

OVEREXPOSURE INCIDENT REPORT Calumet Testing ….1 OVEREXPOSURE INCIDENT REPORT Page 3 '.,,..r. Dose Calculations-' '-Specifth Gamp.a Ray.C'onsdnt}fc[r IF-19(fsf.8R cm2'hr-lmC l *

Segregation By: Ismael, Ana, Darlene SEGREGATION Pea Plant

Administrative Segregation, Isolation, A and Segregation... · Administrative Segregation, ... Liman overview segregation June 25, 2013 final 1 The Project and Its Goals ... Arthur

ARMANI’S FEBRUARY DATE/2 THE OVEREXPOSURE INDEX/12 … · ARMANI’S FEBRUARY DATE/2 THE OVEREXPOSURE INDEX/12 ... First copy of new subscription will be mailed within four weeks

Lecture 15: Time Varying Covariates Time-varying covariates

Accidental Overexposure of Radiotherapy Patients in - Publications

Cyclosporine (Equoral ) Population Pharmacokinetics and ... · models were tested to describe the residual variability. Covariates Analysis The screening and selection of covariates

Dynamic Pricing with Demand Covariates - Stanford …web.stanford.edu/~bayati/papers/dpdc.pdf · Dynamic Pricing with Demand Covariates Sheng Qiang Stanford University Graduate School

Health Effects of Overexposure to Respirable Silica Dust · 27/09/2010 · Health Effects of Overexposure to Respirable Silica Dust Silica Dust Control Workshop Elko, Nevada September

Safety Data Sheet Corrosion Preventive Compound, Quart SDS/8030013470981.pdf · Extreme overexposure may result in unconsciousness and possibly death. SIGNS AND SYMPTOMS OF OVEREXPOSURE:

Comparing Groups & Covariates ANOVAandMANOVA

Accidental Overexposure of Radiotherapy Patients in Białystok

Carbon Dioxide (CO2) Safety Program · 5 CHAPTER 5: Health Effects from Overexposure to Carbon Dioxide 28 Accidental Overexposures to Carbon Dioxide 28 Symptoms of Overexposure 29

Research on the overexposure of Amazon credentials in mobile apps

ARMANI’S FEBRUARY DATE/2 THE OVEREXPOSURE INDEX/12 … · 2020. 6. 24. · ARMANI’S FEBRUARY DATE/2 THE OVEREXPOSURE INDEX/12 Women’s Wear Daily • The Retailers’ Daily Newspaper

Regression Discontinuity Designs Using Covariates: Supplemental … · Regression Discontinuity Designs Using Covariates: Supplemental Appendix Sebastian Calonicoy Matias D. Cattaneoz

Predictive Quantile Regression with Persistent Covariates · PDF filePredictive Quantile Regression with Persistent Covariates ... ods, I examine the empirical predictability of monthly

Adjustment of nonconfounding covariates in case-control

Cointegration Test with Stationary Covariates