28
EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Embed Size (px)

Citation preview

Page 1: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

EPUNet Training Course 2005

Day 2

Tutors: Olaf Jürgens and Christian

SchmittBerlin, April 11th to April 15th

2005

Page 2: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Overview

Cross-Sectional Matching Transcription Routines (Raw Data into

Statistics-Package-Format) Exploring Data-Sets

EPUNet 2005 Training Course

Page 3: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

ECHP Overview - Data Sets

Contents of the ECHP UDB Personal File Household File Register File Relational File

Above: One file per wave Country File Link File

Page 4: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

ECHP Overview - Data Sets

Contents of the ECHP UDB

Personal File All person related information in the ECHP Only adult respondents Similar variable structure across countries Similar variable structure across waves

Page 5: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

ECHP Overview - Data Sets

Contents of the ECHP UDB

Household File Unit of analysis: Household General information that is applicable for all

household members Similar variable structure across countries Similar variable structure across waves

Page 6: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

ECHP Overview - Data Sets

Contents of the ECHP UDB

Register File Unit of analysis: persons General information Regardless of age or participation Similar variable structure across countries Similar variable structure across waves

Page 7: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

ECHP Overview - Data Sets

Contents of the ECHP UDB

Relational File Unit of analysis: persons (caution: repeated

observations of persons!) Display of the relation-matrix of persons within a

household Kinship, parent-child relationships, sibblings, etc. Similar variable structure across countries Similar variable structure across waves

Page 8: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

ECHP Overview - Data Sets

Contents of the ECHP UDB

Country File Unit of analysis: country Display of general country specific

information (PPP, Exchange rates, Population)

Single file One set of variables per wave

Page 9: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

ECHP Overview - Data Sets

Contents of the ECHP UDB

Link File “Heart” of the ECHP Unit of analysis: persons Regardless of age or participation General information for cross-sectional and longitudinal

matches Household membership in a given wave Sampling information Weighting information Single file

Page 10: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Cross-Sectional Matching Procedures

Cross Sectional Matches Person-level Matching Household-level Matching Relational MatchingCentral identifiers for all of the above:

country pid/hid

Base for all matches: ECHP Link File

EPUNet 2005 Training Course

Page 11: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Cross-Sectional Matching Procedures

Matching: Logical order First: country variable (country) Second: household identifier (HID) Third: personal identifier (PID) Always use this logical order!‘sort by country hid pid’(hid may be left out if no household based

information is included in the data generation)

Base for all matches: ECHP Link FileEPUNet 2005 Training Course

Page 12: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Cross-Sectional Matching Procedures

Person-level matching I - Same Individual country pid

Examples of matches Matching information of one Person across

fileso Using personal information from the Register File and

the Personal Fileo Adding information from the Personal File to the Link

File

Page 13: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Cross-Sectional Matching Procedures

Person-level matching - Across Individuals

Examples of matches Matching information of a child to the mother - Unit

of analysis: Mother; additional child related information

Matching information of a husband to his wife - Unit of analysis: Wife; additional information of the husbands income, e.g.

Central information stored within the Relation-File

Page 14: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Cross-Sectional Matching Procedures

Identifiers within Files:

Unit of analysis Basic info / File structure Personal File

(Register File)Country

PID(HID)

Person Level Information

Country FileCountry

General country specific info

Link FileCountry

HID[waveN]PID

Linking across waves

Relationship FileCountry

PID(HID)

Linking across individuals

Household File

Country HID

Household Level Information

Page 15: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Lab Session Day 2

Transcription RoutinesTransformation from PDB to UDBExploring Data SetsCross-sectional matching

procedures

Page 16: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Transcription Routines (Raw Data into Statistics-Package-Format)

From PDB to UDB ‘Raw’ ECHP data comes in comma

separated ASCII format. ‘Raw’ ECHP data comes without any

labels! Transcription Routines for SPSS

In SPSS: open syntax file:

EPUNet 2005 Training Course

Page 17: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Transcription Routines (Raw Data into Statistics-Package-Format)

From PDB to UDB ‘Raw’ ECHP ASCII format without Transcription Routines for SPSS:

In SPSS: open syntax file:

‘UDB_readin.SPS’

‘UDB_label.SPS’

Adjust pathnames to fit your file structure

RunEPUNet 2005 Training Course

Page 18: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Transcription Routines (Raw Data into Statistics-Package-Format)

From PDB to UDB - Result: ECHP UDB Files in SPSS’ *.sav - format Link File (1 file ‘ulink’) Personal File (‘pfile[n]’ wave 1 to 8) Household File (‘hfile[n] wave 1 to 8) Register File (‘rfile[n] wave 1 to 8) Relationship File (‘relat[n] wave 1 to 8) Country File (1 file ‘ctryvars’)

EPUNet 2005 Training Course

Page 19: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Exploring Data Sets - See Doc-Pan 166

Personal File Open Pfile

Get file = ‘X:\path1\a_w8p.sav’.

Descriptives variables PE001.

For self defined employment status

Continue with ‘a_w7p.sav’, ‘a_w8h.sav’, ‘ulink.sav’, etc.

EPUNet 2005 Training Course

Page 20: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Exploring Data Sets

Personal File - contents Demographic information Employment and activity Calendar of activities Income Educational attainment Current education and training Health/Care Migration

Satisfaction EPUNet 2005 Training Course

Page 21: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Exploring Data Sets

Household File - contents Demographic information Household income Household related benefits Accommodation and housing situation Durables Persons in household.

EPUNet 2005 Training Course

Page 22: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Exploring Data Sets

Register File - contents Panel specific information (personal identifier

[PID], household identifier, weights, etc. Demographic information (age, sex, etc.)

EPUNet 2005 Training Course

Page 23: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Exploring Data Sets

Relationship File - contents Always lists two persons per case! Central: relation between person one and person

two (pid1 relation pid2)

EPUNet 2005 Training Course

Page 24: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Exploring Data Sets

Country File - contents One record for each country/panel One block of variables for each wave RATE: Exchange rates in Euro PPP: Purchasing power parities POPTOT: Total population in private households POP16P: Number of persons aged 16+ living in private households POPHHD: Number of private households

EPUNet 2005 Training Course

Page 25: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Exploring Data Sets

Link File - contents General structural information for linking

households and individuals within and across waves

All panel household members (regardless of age or panel participation)

Basic demographic information Cross sectional and longitudinal weights Sample status Wave specific household identifiers

EPUNet 2005 Training Course

Page 26: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Exploring Data Sets

General structure: Identical naming of variables across waves First letter of variable describes file (‘P’ for

Personal File) Second letter of variable describes module (‘PM

for Personal File, module migration) Subsequent numbers describe exact information

(PM001 for Personal File, module migration 001 for migration trajectory)

EPUNet 2005 Training Course

Page 27: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Exploring Data Sets

Getting information:

Central tool: Codebook (Doc-Pan 166) containing: List and description of all ECHP UDB variables

and

information on availability and comparability of variables

across countries and across waves

EPUNet 2005 Training Course

Page 28: EPUNet Training Course 2005 Day 2 Tutors: Olaf Jürgens and Christian Schmitt Berlin, April 11th to April 15th 2005

Files to use

Personal File Household File Register File Relationship File Country File Link File

UDB_readin.sps UDB_label.sps