Background Data harmonization Data output Web: Variable documentation system Web: Data extract system IPUMS Dissemination System


Background

Data harmonization

Data output

Web: Variable documentation system

Web: Data extract system

IPUMS Dissemination System

Variable Harmonization

MARST Marital Status

Code  Label                    China 1982       Colombia 1973       Kenya 1989    Mexico 1970            U.S.A. 1990
                               CN82A403         CO73A411            KN89A413      MX70A402               US90A425
100   SINGLE/NEVER MARRIED     1=never married  4=single            1=single      9=single               6=never married
200   MARRIED/IN UNION         -                -                   -             -                      -
210   Married (not specified)  2=married        2=married           3=monogamous  -                      1=married
211   Civil                    -                -                   -             3=only civil           -
212   Religious                -                -                   -             4=only religious       -
213   Civil and religious      -                -                   -             2=civil and religious  -
214   Polygamous               -                -                   3=polygamous  -                      -
220   Consensual union         -                1=free union        -             5=free union           -
300   SEPARATED/DIVORCED       -                3=sep. or divorced  -             -                      -
310   Separated                -                -                   6=separated   8=separated            3=separated
321   Legally separated        -                -                   -             -                      -
322   De facto separated       -                -                   -             -                      -
330   Divorced                 4=divorced       -                   5=divorced    7=divorced             4=divorced
400   WIDOWED                  3=widowed        5=widowed           4=widowed     6=widowed              5=widowed
999   UNKNOWN/MISSING          0=missing        6=unknown           B=blank       1=unknown              -
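
The MARST slide amounts to a per-sample lookup from source codes to harmonized codes. The following is a minimal sketch of that idea, not the project's actual implementation. It assumes that each source code belongs to the sample in which it is unique, which is a reading of the slide. Kenya is left out because the slide text shows code 3 for both monogamous and polygamous. The names `MARST_TRANSLATION`, `harmonize`, and `general_code` are hypothetical, and treating the first digit as the general category is an assumption based on the 100/200/300/400 headings.

```python
from typing import Dict

# Hypothetical lookup tables transcribed from the MARST slide.
# Keys are source codes (kept as strings); values are harmonized codes.
MARST_TRANSLATION: Dict[str, Dict[str, int]] = {
    "CN82A403": {"0": 999, "1": 100, "2": 210, "3": 400, "4": 330},
    "CO73A411": {"1": 220, "2": 210, "3": 300, "4": 100, "5": 400, "6": 999},
    "MX70A402": {"1": 999, "2": 213, "3": 211, "4": 212, "5": 220,
                 "6": 400, "7": 330, "8": 310, "9": 100},
    "US90A425": {"1": 210, "3": 310, "4": 330, "5": 400, "6": 100},
}


def harmonize(source_variable: str, source_code: str) -> int:
    """Translate one source code into its harmonized MARST code.

    Raises KeyError for a variable or code that is not in the table,
    so unmapped values are never silently recoded.
    """
    return MARST_TRANSLATION[source_variable][source_code]


def general_code(harmonized_code: int) -> int:
    """Collapse a detailed code to its general category (assumed: first digit)."""
    if harmonized_code == 999:
        return 999  # unknown/missing has no detail level
    return (harmonized_code // 100) * 100


if __name__ == "__main__":
    for variable, code in [("CN82A403", "1"), ("CO73A411", "4"),
                           ("MX70A402", "9"), ("US90A425", "6")]:
        print(variable, code, "->", harmonize(variable, code))
```

Under these assumptions, the four "single" or "never married" source codes all translate to 100, which is what makes pooled comparison across samples possible.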

IPUMS Microdata

Home Ownership
Relation to Head
Age
Marital Status
Occupation

Data extract

Pooled Data Extracts

1. Select samples
   Argentina 2001 (3.6 million)
   Chile 2002 (1.5 million)
   Cuba 2002 (1.1 million)

2. Select variables
   Water supply
   Sex
   Education

3. Submit extract

Extract Engine

1 dataset
3 censuses
4 variables
6.2 million records
Harmonized codes

Extract records (columns: sample, water, sex, education)

19212023311
19212023311
19211212211
19211214400
17051612211
17051212211
17051223310
17051214400
03241214400
03242014400
03242023310
03242013310
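
The three steps on this slide (select samples, select variables, submit extract) can be sketched as a function that pools the chosen samples into a single dataset. This is only an illustration under stated assumptions: the `SAMPLES` dictionary, its toy values, and the function `build_extract` are hypothetical, and the real extract engine is not described in the slides.

```python
from typing import Dict, List

Record = Dict[str, str]

# Toy stand-ins for three harmonized samples. All values are made up
# for illustration; real samples hold millions of records.
SAMPLES: Dict[str, List[Record]] = {
    "Argentina 2001": [
        {"water": "12", "sex": "1", "education": "400", "age": "34"},
        {"water": "20", "sex": "2", "education": "310", "age": "27"},
    ],
    "Chile 2002": [
        {"water": "16", "sex": "1", "education": "211", "age": "51"},
    ],
    "Cuba 2002": [
        {"water": "20", "sex": "2", "education": "311", "age": "45"},
    ],
}


def build_extract(sample_names: List[str], variables: List[str]) -> List[Record]:
    """Pool the selected samples into one dataset with only the selected variables.

    A 'sample' column is added to every row so that records from
    different censuses stay distinguishable after pooling.
    """
    pooled: List[Record] = []
    for name in sample_names:
        for record in SAMPLES[name]:
            row = {"sample": name}
            row.update({variable: record[variable] for variable in variables})
            pooled.append(row)
    return pooled


if __name__ == "__main__":
    extract = build_extract(list(SAMPLES), ["water", "sex", "education"])
    for row in extract:
        print(row)
```

Because the variables are already harmonized, the pooled rows share one coding scheme and need no per-country recoding after download.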

Variable Documentation System

Q: How can we give researchers the information they need without overwhelming them?

Q: How can we best encourage comparative research?

A: Organize information by variable, not by sample

A: Let users filter out unnecessary information

A: Give access to full detail when it is wanted

1. Exploring the Database

Variables Page

Variables Page

159 samples

Sample Filtering

Variables Page – Filtered

2. Variables – Codes

Variable Codes (Marital status)

Variable Codes (Marital status)

Variable Codes (Marital status)

3. Variable Descriptions

Variable Description (Marital status)

Comparability Discussion (Marital status)

4. Variables – Deep Documentation

Enumeration Text

Enumeration Text (Marital status, Cambodia)

Variable Description (Unharmonized source variables)

Unharmonized Variables (Source data for marital status)

Data Extract System

Make it easy to get only the variables and samples that a user needs.

Pool the data across time and countries.

Provide tools to help users manage the size of the data.

Provide advanced features to empower researchers to do new kinds of research.

Extract – Select Samples

Extract – Select Samples

Extract – Select Variables

Advanced Extract Features

1. Case selection

2. Customized sample size

3. Attached characteristics

4. Extract revisions
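
The slides name the first two features without describing how they work. The sketch below shows one plausible reading and is not the system's actual behavior. It assumes that case selection keeps only records whose value on a chosen variable is in an allowed set, and that a customized sample size draws a random subsample of whole households. The field names (`serial`, `sex`, `age`) and both functions are hypothetical.

```python
import random
from typing import Dict, List, Set

Person = Dict[str, int]

# Made-up person records; 'serial' identifies the household.
PERSONS: List[Person] = [
    {"serial": 1, "sex": 1, "age": 40},
    {"serial": 1, "sex": 2, "age": 38},
    {"serial": 2, "sex": 2, "age": 71},
    {"serial": 3, "sex": 1, "age": 25},
    {"serial": 3, "sex": 2, "age": 3},
]


def select_cases(persons: List[Person], variable: str, allowed: Set[int]) -> List[Person]:
    """Case selection: keep persons whose value on `variable` is in `allowed`."""
    return [person for person in persons if person[variable] in allowed]


def subsample_households(persons: List[Person], fraction: float, seed: int = 0) -> List[Person]:
    """Customized sample size: keep each household with probability `fraction`.

    Sampling whole households (rather than persons) keeps every
    retained household complete.
    """
    rng = random.Random(seed)  # seeded so the draw is reproducible
    serials = sorted({person["serial"] for person in persons})
    kept = {serial for serial in serials if rng.random() < fraction}
    return [person for person in persons if person["serial"] in kept]


if __name__ == "__main__":
    print(select_cases(PERSONS, "sex", {2}))
    print(subsample_households(PERSONS, 0.5))
```

Both operations shrink the extract before download, which is one way to serve the goal of helping users manage the size of the data.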

Case Selection

Customize Sample Sizes

Customize Sample Sizes

Customize Sample Sizes

Attached Characteristics

Constructed “Pointer” Variables

Pernum  Relationship  Age  Sex     Marst      Chborn  Spouse’s  Father’s  Mother’s
                                                      Location  Location  Location
1       head          53   female  separated  6       0         0         0
2       child         28   male    single     n/a     0         0         1
3       child         22   male    single     n/a     0         0         1
4       child         21   male    single     n/a     0         0         1
5       child         25   female  married    2       6         0         1
6       child-in-law  28   male    married    n/a     5         0         0
7       grandchild    3    male    single     n/a     0         6         5
8       grandchild    1    male    single     n/a     0         6         5
9       non-relative  32   female  separated  2       0         0         0
10      non-relative  10   male    single     n/a     0         0         9
11      non-relative  5    female  single     n/a     0         0         9

Attached Characteristics

Age of spouse

Employment status of father

Occupation of father
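
An attached characteristic such as "age of spouse" can be built by following a pointer variable to another person in the same household and copying one of that person's values. The sketch below uses the 11-person household from the pointer-variable slide. The spouse pointers are as the slide gives them, while the mother pointers are a reading of the slide's garbled columns and should be treated as an assumption. The field names `sploc` and `momloc` and the function `attach` are hypothetical names chosen for this illustration.

```python
from typing import Dict, List, Optional

Person = Dict[str, Optional[int]]

# One household; a pointer of 0 means "no such person in the household".
HOUSEHOLD: List[Person] = [
    {"pernum": 1, "age": 53, "sploc": 0, "momloc": 0},
    {"pernum": 2, "age": 28, "sploc": 0, "momloc": 1},
    {"pernum": 3, "age": 22, "sploc": 0, "momloc": 1},
    {"pernum": 4, "age": 21, "sploc": 0, "momloc": 1},
    {"pernum": 5, "age": 25, "sploc": 6, "momloc": 1},
    {"pernum": 6, "age": 28, "sploc": 5, "momloc": 0},
    {"pernum": 7, "age": 3, "sploc": 0, "momloc": 5},
    {"pernum": 8, "age": 1, "sploc": 0, "momloc": 5},
    {"pernum": 9, "age": 32, "sploc": 0, "momloc": 0},
    {"pernum": 10, "age": 10, "sploc": 0, "momloc": 9},
    {"pernum": 11, "age": 5, "sploc": 0, "momloc": 9},
]


def attach(household: List[Person], pointer: str, source: str, new_name: str) -> List[Person]:
    """Return copies of the records with `source` of the pointed-to person attached.

    Persons whose pointer is 0 get None for the new variable.
    """
    by_pernum = {person["pernum"]: person for person in household}
    result: List[Person] = []
    for person in household:
        target = by_pernum.get(person[pointer])  # pernum 0 never exists
        enriched = dict(person)
        enriched[new_name] = target[source] if target is not None else None
        result.append(enriched)
    return result


if __name__ == "__main__":
    for row in attach(HOUSEHOLD, "sploc", "age", "age_sp"):
        print(row["pernum"], row["age_sp"])
```

The same function would attach a father's employment status or occupation, given a father pointer and those variables on each record.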

Download or Revise Extract

END

http://international.ipums.org

Matt Sobek
Minnesota Population Center

[email protected]