28
Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 1 / 20 Usage and importance of DASP in Stata Abdelkrim Araar, Jean-Yves Duclos and Luis Huesca Comparisons of Stata to other software or use of Stata together with other software. Mexico, May 12, 2011

Usage and importance of DASP in Stata

  • Upload
    doantu

  • View
    227

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Usage and importance of DASP in Stata

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 1 / 20

Usage and importance of DASP in Stata

Abdelkrim Araar, Jean-Yves Duclos and Luis Huesca

Comparisons of Stata to other software or use of Stata together with othersoftware.

Mexico, May 12, 2011

Page 2: Usage and importance of DASP in Stata

Outline

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 2 / 20

DASP: a Stata package for distributive analysis

Conclusion

Page 3: Usage and importance of DASP in Stata

DASP: a Stata package fordistributive analysis

Outline

DASP: a Stata packagefor distributiveanalysis

DASPin a few words

DASPfeatures

OtherDASPfeatures

DASP’s main menu

DASP’s mainvariables

Using variables inDASP

InputtingDASPcommands

Applications and filesin DASP

Producing curveswith DASP

SavingDASPgraphs

Examples with theMexican data

Conclusion

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 3 / 20

Page 4: Usage and importance of DASP in Stata

DASP in a few words

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 4 / 20

� Stata enables programmers to provide specialized “.ado” routines to addto the power of the software.

� DASP, which stands forDistributive Analysis Stata Package, is mainlydesigned to assist those researchers and policy analysts that are interestedin conducting distributive analysis with Stata.

� DASPuses Stata for two main reasons:

� Stata is a powerful tool to store and manage household data surveys.CombiningDASPand Stata allows to use the same environment forprocessing and analyzing data.

� Stata easily allows adding specialized programs, making itpossiblefor programmers to add to its power and flexibility.

Page 5: Usage and importance of DASP in Stata

DASP features

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 5 / 20

DASPallows to:

� Estimate the most popular statistics (indices, curves) used for the analysisof poverty, inequality, social welfare, and equity;

� Estimate the differences in such statistics;� Estimate standard errors and confidence intervals by takingfull account

of survey design;� Perform the most popular distributive decomposition procedures;� Check for the ethical robustness of distributive comparisons;� Support distributive analysis on more than one data base at the same time.

Page 6: Usage and importance of DASP in Stata

Other DASP features

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 6 / 20

� Contains optimized algorithms for the estimation of distributive indices;� Unifies syntax and parameter use across various estimation procedures

for distributive analysis;� For eachDASPmodule, three types of files are provided1:

� *.ado file: contains the program of the module;� *.hlp file: contains the help material for the given module;� *.dlg file: allows the user to perform the estimation using the

module’s dialog box.

1For more information about DASP modules, see the user manual: (?)).

Page 7: Usage and importance of DASP in Stata

DASP’s main menu

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 7 / 20

DASP’s windows menu makes it possible to access quickly each of the dialogboxes. The latter are grouped by main themes.

Page 8: Usage and importance of DASP in Stata

DASP’s main variables

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 8 / 20

� VARIABLE OF INTEREST. This is the variable that usually captures livingstandards. It can represent, for instance, income per capita orexpenditures per adult equivalent.

� SIZE VARIABLE . This refers to the “ethical” or physical size of theobservation. This variable usually refers to the number of householdmembers.

� GROUP VARIABLE. Say that we wish to estimate poverty within acountry’s rural area or within female-headed families. Oneway to do thisis to forceDASPto focus on a population subgroup defined as those forwhom some GROUP VARIABLE(say, area of residence) equals a givenGROUP NUMBER(say 2, for rural area).

� SAMPLING WEIGHT. Sampling weights are the inverse of the samplingprobability. This variable should be set upon the initialization of the dataset.

Page 9: Usage and importance of DASP in Stata

Using variables inDASP

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 9 / 20

� DASPmakes it possible to use simultaneously more than one data file.� The user should initialize each data file before using it withDASP. This

initialization is done by:

� Labeling variables and values for categorical variables;� Initializing the sampling design with the commandsvyset;� Saving the initialized data file.

� It is useful to add a character such as “I” to the names of initialized files(Example: Uganda99I.dta) in order to distinguish them.

Page 10: Usage and importance of DASP in Stata

Inputting DASP commands

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 10 / 20

� Stata andDASPcommands can be entered directly into a commandwindow:

� An alternative is to use dialog boxes. For this, the commanddbshould betyped and followed by the name of the relevantDASPmodule. Example:db ifgt.

Page 11: Usage and importance of DASP in Stata

Applications and files in DASP

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 11 / 20

Two main types of applications are provided inDASP. For the first one, theestimation procedure uses only one data file, the data file in “memory” (or“loaded”). It is from that file that the relevant variables must be specified.

Page 12: Usage and importance of DASP in Stata

Applications and files in DASP

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 11 / 20

Two main types of applications are provided inDASP.For the second type of applications, two distributions are needed. For each ofthese two distributions, the user can specify the currently-loaded data file (theone in memory) or one saved on disk.

Page 13: Usage and importance of DASP in Stata

Producing curves withDASP

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 12 / 20

� DASPwas designed to facilitate the use of curves to display distributiveinformation.

� For instance, if we wish to graph Lorenz curves to compare inequalitybetween rural and urban areas, the following command line can be typed:clorenz exppc, hgroup(zone) hsize(size)where in this exampleexppc is per capitaexpenditures ,size ishousehold size andzone is the zone variable (1 = rural / 2= urban).

Page 14: Usage and importance of DASP in Stata

Producing curves withDASP

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 12 / 20

� After executing this command the following window appears:

Page 15: Usage and importance of DASP in Stata

Producing curves withDASP

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 12 / 20

� For many curves,DASPallows showing their confidence intervalsaccording to selected levels of statistical significance (this value is bydefault set to 95%).

� For instance, to draw confidence intervals around FGT curves, we can usethecfgtsm DASPmodule:cfgtsm exppc, alpha(0)hsize(size) hgroup(sex) max(100000)

Page 16: Usage and importance of DASP in Stata

Producing curves withDASP

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 12 / 20

� After executing this command the following window ap-pears: Drawing the confidence interval of distributive curves (FGT curves)

0.2

.4.6

.8

0 20000 40000 60000 80000 100000Poverty line

Male Female

Confidence interval(95%)FGT Curve(s)

Page 17: Usage and importance of DASP in Stata

SavingDASP graphs

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 13 / 20

� Graphs produced withDASPor Stata can be saved in many differentformats. Among them:

*.gph is Stata’s graphical format. It is useful to allow re-editing the graph(with Stata 10 or higher).

*.wmf is the Windows metafile format. This format may be easily insertedinto Word documents. The user can also copy a Stata graph and pasteit directly into a Word document.

*.eps is the encapsulated postscript format. This format can easily beinserted in Latex documents.

Page 18: Usage and importance of DASP in Stata

Examples with the Mexican data

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 14 / 20

−.1

5−

.1−

.05

0L_

x(p)

− C

_t(p

)

0 .2 .4 .6 .8 1Percentiles (p)

Null horizontal line Total

Rural Urban

Source: Author’s estimation using ENIGH 2008 and DASP.VAT has been adjusted by informal expenditures.

Mexican VAT by urban−rural area, 2008 (Ordering by gross percapita income)Progressivity curve(s). TR approach

Page 19: Usage and importance of DASP in Stata

Examples with the Mexican data

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 14 / 20

0−

.005

−.0

1−

.015

C_x

−t(

p)−

L_x

(p)

0 .2 .4 .6 .8 1Percentiles (p)

Null horizontal line Total

Rural Urban

Source: Author’s estimation using ENIGH 2008 and DASP.VAT has been adjusted by informal expenditures.

Mexican VAT by urban−rural area, 2008 (Ordering by gross percapita income)Progressivity curve(s). IR approach

Page 20: Usage and importance of DASP in Stata

Examples with the Mexican data

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 14 / 20

0.2

.4.6

.8C

_b(p

) +

C_t

(p)

− 2

L_x

(p)

0 .2 .4 .6 .8 1Percentiles (p)

Null horizontal line Total

Rural Urban

Source: Author’s estimation using ENIGH 2008 and DASP.VAT has been adjusted by informal expenditures & Oportunidades is the transfer.

Mexican VAT Vs Transfer by urban−rural area, 2008.Progressivity curve(s). TR approach

Page 21: Usage and importance of DASP in Stata

Examples with the Mexican data

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 14 / 20

0.0

08.0

16.0

24.0

32.0

4C

_x+

b(p)

− C

_x−

t(p)

0 .2 .4 .6 .8 1Percentiles (p)

Null horizontal line Total

Rural Urban

Source: Author’s estimation using ENIGH 2008 and DASP.VAT has been adjusted by informal expenditures & Oportunidades is the transfer.

Mexican VAT Vs Transfer by urban−rural area, 2008.Progressivity curve(s). IR approach

Page 22: Usage and importance of DASP in Stata

Examples with the Mexican data

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 14 / 20

0.0

05.0

1.0

15C

_x+

b(p)

− C

_x−

t(p)

0 .2 .4 .6 .8 1Percentiles (p)

Null horizontal line Total

Rural Urban

Source: Author’s estimation using ENIGH 2008 and DASP.VAT has been adjusted by informal expenditures & Oportunidades is the transfer.

Mexican VAT Vs Transfer’s recipients by urban−rural area, 2008.Progressivity curve(s). IR approach

Page 23: Usage and importance of DASP in Stata

Conclusion

Outline

DASP: a Stata packagefor distributiveanalysis

Conclusion

Summary

Relevant DASPcommands

Exercises with Stataand DASPAdditional literatureon DASP and DAD

References

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 15 / 20

Page 24: Usage and importance of DASP in Stata

Summary

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 16 / 20

� Stata is a popular software that provides powerful statistical applicationsand that is simple to use.

� Stata commands can be inputted through dialog boxes, do files, orcommands windows.

� DASPfacilitates the estimation of the most popular statistics used for theanalysis of poverty, inequality, social welfare, and equity, and providesvarious sophisticated statistical tools to check for the robustness and theprecision of such statistics.

� DASPunifies syntax and parameter use across various estimationprocedures for distributive analysis.

� DASPallows the use of two distributions at the same time, and simplifiesthe production of tables and graphs.

Page 25: Usage and importance of DASP in Stata

Relevant DASP commands

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 17 / 20

� FGT and EDE-FGT poverty indices (ifgt).� FGT CURVE with confidence interval (cfgts).� Lorenz and concentration curves (clorenz).

Page 26: Usage and importance of DASP in Stata

Exercises with Stata and DASP

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 18 / 20

� Exercises 1.1, 1.2, 1.3

Page 27: Usage and importance of DASP in Stata

Additional literature on DASP and DAD

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 19 / 20

� The material from this presentation draws largely fromAraar and Duclos (2009b) and Duclos and Araar (2006). See alsoAraar and Duclos (2009a), et Christian Toft (2006) and Zhang(2003) forthe DAD software.

Page 28: Usage and importance of DASP in Stata

References

Usage and importance of DASP in Stata 3rd STATA Users Group Meeting Mexico – 20 / 20

ARAAR, A. AND J.-Y. DUCLOS (2009a): “DAD: a Software for Povertyand Distributive Analysis,”Journal of Economic and Social Measurement,43, 175–189.

——— (2009b): “DASP: Distributive Analysis Stata Package,” PEP,CIRPÉE and World Bank, Université Laval.

DUCLOS, J.-Y. AND A. A RAAR (2006):Poverty and Equity Measurement,Policy, and Estimation with DAD, Berlin and Ottawa: Springer and IDRC.

ET CHRISTIAN TOFT, M. B. (2006): “Poverty and Equity. Measurementand Analysis Jean-Yves Duclos, Abdelkrim Araar and Carl Fortin,”Économie publique/ Public economics, 18-19, 5–12.

ZHANG, Q. (2003): “DAD, an innovative tool for income distributionanalysis,”Journal of Economic Inequality, 1, 281–284.