46
Data Evaluation and Methods Research Replication An Approachto the Analysisof Data From ComplexSurveys Development and evaluation of a replication technique for estimat- ing variance. DHEW Publication No. (PHS) 79-1269 U.S. DEPARTMENT OF HEALTH, EDUCATION, AND WELFARE Public Health Service Office of Health Research, Statistics, and Technology National Center for Health Statistics Hyattsville, Maryland 20782 Series 2 Number 14

VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Data Evaluation andMethods Research

ReplicationAn Approachto the Analysisof DataFrom ComplexSurveys

Development and evaluation of a replication technique for estimat-ing variance.

DHEW Publication No. (PHS) 79-1269

U.S. DEPARTMENT OF HEALTH, EDUCATION, AND WELFAREPublic Health Service

Office of Health Research, Statistics, and TechnologyNational Center for Health Statistics

Hyattsville, Maryland 20782

Series 2Number 14

Page 2: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

.. .----, ., . . .

Vital and Health Statistics Series 2-No. 14First issued as DHEW Publication No. (PHS) 66-1000

April 1966

Page 3: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

NATIONAL CENTER FOR HEALTH STATISTICS

DOROTHY P. RICE. Director

ROBERT A, ISRAEL, Deputy Director

JACOB J. FELDMAN, Ph. D., Associate Director for Analysis

GAIL F. FISHER, Ph. D., Associate Director for the Cooperative Health ,Statistics System

ROBERT A. ISRAEL, Acting Associate Director for Data System

JAMES T. BAIRD, JR., Ph.D., Associate Director for International Statistics

ROBERT C. HUBER, Associate Director for Mmagenzent

MONROE G, SIRKEN, Ph. D., Associate Director for Mathematical S tatistics

PETER L, HURLEY, Associate Directorfor Operations

JAMES M. ROBEY, Ph. D., .Associate Director for Program Development

PAUL E. LEAVERTON, Ph. D., Associate Director for Research

ALICE HAYWOOD, Inform~tion Officer

Library of Congress Catalog Card Number 66-60010

Page 4: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

PREFACE‘The theory of design of surveys has advanced greatly in the past

three decades. one result is that many surveys now rest upon complexdesigns involving such factors as stratification and poststratification,multistage cluster sampling, controlled selection, and ratio, regression,or composite estimation. Another result is a growing concern andsearch for valid and efficient techniques for analysis of the output fromsuch complex surveys.

A central difficulty is that most of the standard classical techniquesfor statistical analysis assume that observations are independent of oneanother and are the result of simple random sampling, often from auniverse of normal or other known distribution-a situation that does notprevail in modern complex design. This report reviews several aspectsof the problem and the limited literature on the topic. It offers a newmethod of balanced half-sample pseudoreplication as a solution toone phase of the problem.

The entire matter of how best to analyze data from complex sur-veys is nearly as broad as statistical theory itself. It encompasses notonly the technical features of analysis, but also relationships amongpurpose and design of the survey, and the character of inferenceswhich may be drawn about populations other than the finite universewhich was sampled. This report treats only a very small sector ofthe subject, but, it is believed, introduces a scheme which may bewidely useful. There is good reason to hope that the method, orpossibly variations of it, may have utility beyond the somewhat narrowarea with which it deals specifically.

The exploration and developments reported here are the outgrowthof discussions among a number of people, as is nearly always truewhen the subject is a pervasive one. But they are particularly theproduct of a study by Philip J. McCarthy of Cornell University undera contractual arrangement with the National Center for Health Sta-tistics. Contributions of the Center were coordinated by Walt R.Simmons. Professor McCarthy wrote the report. Garrie J. Losee ofthe Center was responsible for initial work on half-sample replication,as employed in NCHS surveys, and prepared the appendix to this re-port.

Page 5: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

CONTENTS

Preface ------ ------ ------------ ------------- ------------------------- --

Introduction ----------------------------------------------------------

Complex Sample Surveys and Problems of Critical Analysis- ---------------

General Approaches for Solving Problems of Critical Analysis --------------Two Extreme Approaches ------- ------- ------- ------- ----------------

Obtaining “Exact” Solutions ------------------------------------------Replication Methods of Estimating Variances ------------------------ ---

●Pseudoreplication -----------------------------------------------------

Half-Sample Replication Estimates of Variance From StratifiedSamples--Balanced ~lf-Sample Replication ------------------------------------Partially Balanced Half Samples --------------------------------------Half-Sample Replication andthe Sign Test ------------------------ _4-----

Jackknife Estimates of Variance From Smatified Samples ---------------HaIf-Sample Replication and the Jackknife Method With Stratified

Ratio-Type Estimators -------------------------------------------

Summary -------------------------------------------------------------

Bibli~raphy ----------------------------------------------------------

Appendix. Estimation of Reliability of Findings From the First Cycle of theHealth Examination Survey --------------------------------------------

Survey &si~------------------------------------------------------Requirementsof a Variance Estimation Technique ----------------------Developmentof theReplicationTechique ------------------------------Computer Output ---------------------------------------------------Illustration --------------------------------------------------------

Page

i

1

2

779

10,

131316182023

24

29

31

333333343737

Page 6: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

A key featwe of statistical techniques necessary to the analysis of datafrom complex surveys is the method of calculating variance of the sam-ple estimates. EaYliev direct computational procedures aye either in-appropm”ate OY much too difficult, even with high speed electronic com-pute~s, to cope with the elabovate stratijlcation, multistage clustey sawz-pling, and intricate estimation schemes found in many curyent samplesuvveys. A diffe~ent appvoach is needed.

A numbev of statisticians have attempted solution thyough a variety ofschemes which employ some foym of replication or random grouping ofobservations. These efforts are recalled in this repovt, as a part of thebackground Yeview of principal issues present in choice of analyticmethods suitable to the complex su?wey.

Among- the estimating schemes in recent use is a half-sample pseudo-replication technique adapted by the National Center fov Health Statis-tics from an approach developed by the U.S. Bureau of the Census, This,method is descyibed in detail in the repoyt, Typically, it involves sub-sampling a payent sample in such a way that 20-40 pseudoveplicated es-timat~s of any specified statistic are produced, with the precision of thecorresponding statistic from the parent sample being estimated from thevariability among the Replicated estimates.

One difficulty in using this method is that the 20-40 estimates aye cho-sen from among the thousands OY millions of possible replicates of thesame chavactey, and hence may yield an unstable estimate of the var-iability among the possible Replicates. The report presents a system fovcontrolled choice of a limited number of Pseudoveplicates —often nomoye than 20-40 foy a majoy nationul survey— such that for some classesof statistics the chosen small numbev of Replicates hus a vayiance al-gebraically identical with that of all possible replicates of the samechayacte?’ within the pavent sample, and the same expected value as thevariance of all possible replicates of the same character joy all possi-ble pavent samples of the same design. Illustrations of the technique andguides for its use ave included.

SYMBOLS

Data not available ------------------------ ---

Category not applicable ------------------ . . . IQuantity zero ---------------------------- -

Quantity more than O but less than 0.05 ----- 0.0

Figure does not meet standards ofreliability or precision ------------------ *

Page 7: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

REPLICATIONAN APPROACH TO THE ANALYSIS OF DATA

FROM COMPLEX SURVEYS

Philip J. McCarthy, Ph. D., Cornell University

INTRODUCTION

A considerable body of theory and practicehas been developed relating to the design andanalysis of sample surveys. This material isavailable in “such books as Cochran (1963),Deming (1950), Hansen, Hurwitz, and Madow(1953), Kish (1965), Sukhatme (1954), and Yates(1960), and in numerous journal articles. Muchof this theory and practice has the followingcharacteristics: the sampled populations containfinite numbers of elements; no assumptions aremade concerning the distributions of the pertinentvariables in the population; major emphasis isplaced on the estimation of simple populationparameters such as percentages, means, andtotals; and the samples are assumed to be “large”so that the sampling distributions of estimatescan be approximated “by normal distributions.Furthermore, it has frequently been appropriateto regard the principal goal of sample design astiat of achieving a stated degree of precision forminimum cost, or alternatively, of maximizingprecision for fixed cost.

Sample surveys in which major emphasis isplaced on the estimation of population parameterssuch as percentages, means, or totals have beenvariously called “descriptive” or “enumerative”surveys and, as noted above, the work in finite-population sampling theory has been primarilyconcentrated on the design of such surveys.Increasingly, however, one finds reference inthe sample survey literature to “analytical”

surveys or to the use of “analytical statistics. ”Cochran (1963, p. 4), for example, says:

In a” descriptive survey the objective issimply to obtain certain information aboutlarge groups: for example, the numbers ofmen, women, and children who view a tele-vision program. In an analytical survey,comparisons are made between different sub-groups of the population, in order to dis-cover whether differences exist among themthat may enable us to form or to verifyhypotheses ahout the forces at work in thepopulation. . . . The distinction between de-scriptive and analytical surveys is not, ofcourse, clear-cut. Many surveys providedata that serve both purpdses.

Although there are some differences in emphasis,Deming (1950, chap. 7), Hartley (1959), and Yates(1960, p. 297) make essentially the same dis-tinction. Kish (1957, 1965) more or less auto-matically assumes that data derived from mostcomplex sample surveys will be subjected tosome type of detailed analysis, and applies theterm “analytical statistical methods” to proce-dures that go much beyond the mere estimationof population percentages, means, and totals.

The Health Examination Survey (HES) of theNational Center for Health Statistics (NCHS) isone example of a sample survey that, in somerespects, might be classified as an enumerativesurvey, but whose principal value will undoubtedlybe in providing data for analytical purposes. Some

1

Page 8: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

of the basic features of Cycle I of HES are pre-sented in a publication of the National Center forHealth Statistics (Series 11, No. 1). A brief de-scription of the survey, quoted from the report,is as follows:

The first cycle of the Health ExaminationSurvey was the examination of a sample ofadults. It was directed toward the collectionof statistics on the medically defined prev-alence of certain chronic diseases and of aparticular set of dental findings and physicaland physiological measurements. The prob-ability sample consisted of 7,710 of all non-institutional, civilian adults in the age range18-79 years in the United States. Altogether,6,672 persons were examined during theperiod of the Survey which began in October1959 and was completed in December 1962.

A rather detailed account of the survey design hasbeen published by the Center (Series 1, No. 4).

The enumerative and analytical aspects ofthis survey, and the inevitable blending of oneinto the other, ‘are well illustrated in two reportsthat have been published on the blood pressureof adults (NCHS, Series 11, Nos. 4 and 5). Notonly does one find in these reports the distributionof blood pressure readings for the entire sample,but one also finds the comparison (with respectto blood pressure) of subgroups of the populationdefined by a variety of combinations of suchdemographic variables as age, sex, arm girth,race, area of the United States, and size ofplace of residence. It seems unnecessary toargue where the enumerative aspects end andthe analytical aspects begin. For all practicalpurposes. and by any definition one chooses toadopt, the survey is analytical in character. Thesame will be true of almost any sample surveythat one examines, at least as far as many usersof the data are concerned. Certainly this is theview of the staff at NCHS.

The principal goal of this report will be toexamine some of the problems that arise whendata from a complex sample survey operationare subjected to detailed and critical analysis,and to discuss some of the procedures that havebeen suggested for dealing with these problems.Particular emphasis will be placed on a pro-

cedure for estimating variances which is especial-ly suitable for sample designs similar to thoseused in the Health Interview Survey and the HealthExamination Survey.

COMPLEX SAMPLE SURVEYS AND

PROBLEMS OF CRITICAL ANALYSIS

Simple random sampling, usually withoutreplacement, provides the base upon which thepresently existing body of sample survey theoryhas been constructed. Major modifications ofrandom sampling have been dictated by one orboth of two considerations, These are as follows.

(1) One rarely attempts to survey a finitepopulation without having some prior knowledgeconcerning either individual elements in the pop-ulation, or subgroups of population elements, orthe population as an entity. This prior information,depending upon its nature, can be used in thesampIe design or in the method of estimation toincrease the precision of estimates over thatwhich would be achieved by simple random sam-pling. Thus we have such techniques as stratifica-tion, stratification after the selection of thesample (poststratification), selection with prob-abilities proportional to the value of some auxil-iary variable, ratio estimation, alld regressionestimation.

(2) Many finite populations chosen for surveystudy are characterized by one or both of thefollowing two circumstances: the ultimate popula-tion elements are dispersed over a wide geograph-ic area, and groups or “clusters” of elements canbe readily identified in ad;ance of taking the sur-vey, whereas the identification of individual pop-ulation elements would be much more costly.These circumstances have led to the use of multi-stage sampling procedures, where one firstselects a sample of clusters and then selects asample of elements from within each of the chosenclusters.

In addition to these two main streams ofdevelopment, whose results are frequently com-bined in any one survey undertaking, there is awide variety of related and special techniquesfrom which choices can be made for sampledesign and for estimation. Thus one can use

Page 9: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

systematic sampling, rotation sampling (in whicha population is sampled over time with somesample elements remaining constant from timeto time), two-phase sampling (in which the resultsof a preliminary sample are used to improvedesign or estimation for a second sample), un-biased ratio estimators instead of ordinary ratioestimators, and so on. Finally, it is necessaryto recognize that measurements may not be ob-tained from all elements that should have beenincluded in a sample, and that such nonresponsemay influence the estimation procedure and theinterpretation of results.

As sample design and estimation move fromsimple random sampling and the straightforwardestimation of population means, percentages, ortotals to a stratified, multistage design with ratioor regression forms of estimation, it becomesincreasingly difficult to operate in an “ideal”manner even for the purest of enumerative sur-veys. Ideally, one would like to be assured thatthe “best” possible estimate has been obtainedfor the given expenditure of funds, that the biasof the estimate is either negligible or measur-able, and that the precision of the estimate hasbeen appropriately evaluated on the basis of thesample selected. Numerous difficulties are en-countered in achieving this goal. Among theseare: (1) the expressions that must be evaluatedfrom sample data become exceedingly complex,(2) in many instances, these expressions are onlyapproximate in that their validity depends uponhaving “large” samples, and (3) most surveysprovide estimates for many variables—that is,they are multivariate in character—and this, inconjunction with the first point, implies an ex-tremely large volume of computations, even formodern electronic computing equipment. Thislast point is accentuated when one wishes to studythe relationships among many variables in numer-ous subpopulations. The foregoing difficulties arewell illustrated by the Health Interview Survey(No. A-2), the Health Examination Survey (Series1, No. 4), and the Current Population Survey(Technical Paper No. 7). The sample designsand estimation techniques for these three sur-veys are somewhat similar, although the CurrentPopulation Survey employs a composite estimationtechnique (made possible by the rotation of sampleelements) that is not employed in the other twosurveys.

We have at various points in the precedingdiscussion used the term “complex” samplesurveys, implying thereby that the sample designis in some sense or other complex. Little is tobe gained by arguing the distinction between sim-ple or complex under these circumstances, al-though several observations are perhaps in order.We are, of course, primarily concerned withthe complexities of analysis that result from theuse of a particular sample design and estimationprocedure. These complexities arise from variouscombinations of such factors as the following.The assumption of a functional form for the dis-tribution of a random variable over a finite pop-ulation is rarely feasible and thus analyticalpower for devising statistical procedures is lost.The selection of elements without replacement,or in clusters, introduces dependence among ob-servations. Estimators are usually nonlinear andwe are forced to use approximate procedures forevaluating their characteristics. Some designtechniques that are known to increase the pre-cision of estimates almost invariably lead to thenegation of assumptions required by such commonstatistical procedures as the analysis of variance(e.g., strata having unequal within-variances).Further comments on these points will be madelater in this report.

New dimensions of complexity, both concep-tual and technical, arise as one progresses froma truly enumerative survey situation to a purelyanalytical survey. Each of these will now bediscussed briefly.

On the conceptual side, the major questionconcerns the manner in which one chooses toview a finite population-either as a fixed set ofelements for which a statistical description isdesired, or as a sample from an infinite super-population to which inferences are to be made.In simplest terms, this can be viewed as follows.An infinite superpopulation, characterized by ran-dom variable Y with mean

E(y) = g, and with varianceE(y-p)2=u2,

is assumed as a basis for the sampling process.IV independent observations on Y lead to a finitepopulation with mean

(l/N~~lyi= ~, and with variance

3

Page 10: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

(l/fV-I) i~, (Yi -~)z-~z. while a

simple random sample, drawn without replace-ment, from the finite population has the observedmean

(] /n)l~l Yi = Y and variance

(l/.-l) i~l(yi-~)z=S2.

Ordinary sampling theory assumes that we wishto describe the realized finite population of N ele-ments, and we have

E (ylfixed N values of y) = ?

N-n S2v (ylfixed N values of y) - ~ ~

N–n S2; (ylfixed N values of y) = ~ ~

where the symbol A indicates an estimator of apopulation parameter, If, however, we wish todraw inferences for the infinite superpopulationfrom our observed sample, and therefore takeexpectations over an infinite set of finite popula-tions of N elements, then it is straightforward todemonstrate that

E(y)=p

V(y)= u2/n

;(~)=s 2/n

In effect, the only formal difference in the twoviews is that the finite population correction isomitted in the variance of Y and in the estimateof the variance of y. This point has been madeby Deming (1950, p. 251) and Cochran (1963,p. 37). Cochran says , in reference to the com-parison of two subpopulation means:

One point should be noted. It is seldom ofscientific interest to ask whether ~j = ~~ be-cause these means would not be exactly equalin a finite population, except by a rare chance,even if the data in both domains were drawnat random from the same infinite population.

Instead, we test the null hypothesis that the

two domains were drawn from infinite pop-ulations having the same mean. Consequentlywe omit the fpc when computing V(yj ) andV(jj). . . .

Actually, it can also be argued that one wouldrarely expect to find two infinite populationswith identical means. Careful accounts of statis-tical inference sometimes emphasize this factby distinguishing between “statistically significantdifference” and “practically significant differ-ence, ” and by pointing out that null hypothesesare probably never “exactly” true.

In practice, the survey sampler is ordinarilyin a position to control only the inference fromthe sample to the finite population. He may knowthat the finite population is indeed a sample,drawn in some completely unknown fashion froman infinite superpopulation, but when he tries tospecify this superpopulation, his definition willordinarily be blurred and indistinct. Professionalknowledge and judgment will therefore play amajor role in such further inferences. Further-more, comparisons with other studies, com-parisons among subgroups in his finite population,and a consideration of related data must be brought ‘to bear on the problem. There seems to be littlethat one can say in a definite way at present aboutthis general problem, but very perceptive com-ments on this subject have been made by Demingand Stephan (1941) and by Cornfield and Tukey(1956, sec. 5). Even the answer to the specificquestion of whether or not to use finite popula-tion corrections in the comparison of domainmeans would appear to depend upon the circum-stances.

The technical problems raised by the ana-lytical use of data from complex surveys differin degree but not really in kind from those facedin the consideration of enumerative survey data.These problems are primarily of two types:

1. As indicated earlier, most analytical usesof survey data involve the comparison of sub-groups of the finite population from which thesample is selected. These subgroups have beenfrequently referred to as “domains of study.”The basic difficulty raised by this fact is thatvarious sample sizes, which in ordinary sam -

4

Page 11: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

pling theory would be regarded as fixed frbmsample to sample, now become random variables.Furthermore, this occurs in such a manner thatit is usually not possible to use a conditionalargument—that is, it is not possible to considerthe drawing of repeated samples in which thevarious sample sizes are viewed as being equalto the size actually observed—as can be donewhen estimating the mean of a domain on thebasis of simple random sampling.

2. In making critical analyses of surveydata, one is much more apt to use statisticaltechniques that go beyond the mere estimationof population means, percentages, and totals(e.g., multiple regression). Ordinary surveytheory has attacked the problem of providingestimates of sampling error for certain esti.mates, e.g., ratio and regression estimates, butthe body of available theory leaves much to bedesired. An example of each of these problemswill now be described briefly.

One of the most frequently used techniquesfrom sampling theory is that of stratification.A population is divided into L mutually exclusiveand exhaustive strata containing IVl , Nz,. . . , iVLelements; random samples of predeterminedsize r71, nz, . . ..n L are drawn from the respec-tive strata; a value of the variable y is obtainedfor each of the sample elements; and the popula-tion mean is estimated by

where y~ is the mean of the n~ elements drawnfrom the hth stratum, N is the total size of thepopulation, and w~ = N# N. It is easily shownthat an unbiased estimate of the variance of 9is given by

where s; is the variance for the variable y asestimated in the hth stratum, and fh = ~h /~h isthe sampling fraction in the hth stratum. Forpurposes of illustration, let us assume that thestrata are geographic areas of the United States,that nh, Nh,and N refer to all adults (18 years of

age and over) and that the variable y is bloodmessure.

Suppose now that one wishes to estimatethe average blood pressure for males in the 40-45 year age range with arm girth between 38 and40 centimeters. This special group of adults is asubpopulation, or domain of study, with referenceto the total finite population, and elements of thedomain will be found in each of the defined strata.The weights for the strata and the fixed samplesizes do not refer to this subpopulation and, overrepeated drawings of the main sample, the numberof domain elements drawn from a stratum willbe a random variable. Furthermore, the totalnumber of domain elements in a stratum is un-known. Under these circumstances, let

n~,~= the number of sample elements inthe h th stratum falling in domain d.

Yhi ~= the value of the variable for the ith

sample element from domain d in theh t h stratum.

& ‘~~1 (1/fh) nh ~= the estimated tOtal

number of elementsin the domain.

‘h,d

~h,d-(llnh,d) i~l Yhi, d = S-pk mean,for h th stratum,of elements fall-ing in domain d.

Then an estimate of the domain mean and its es-timated variance are given by

t(;d)=z N;(l-fh) ~(Yhi d–jib d)z[fi{ h ‘h(nh–]) i ‘ ‘

where h= 1,2, . ... L and i=], z,....nh.~.

5

Page 12: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

These expressions have been presented anddis-cussed byanumber ofauthors–tirbin (1958, p.117), Hartley (1959, p. 15), Yates (1960, p. 202),Kish (1961, p. 383), and Cochran (1963, p. 149.

The factor (I /;j ) was evidently omitted in theprinting of this formula ).

Three points concerning these results areworthy of note in the context of the presentdiscussion. These are:

1. The estimate is actually a ratio estimate—technically, a combined ratio estimate. It is there-fore almost always biased for small sample sizes,and the variance formula is only approximatelycorrect.

2. The complexity of the formulas, as re-gards derivation and computation, has been in-creased considerably over that of ordinary strati-fication.

3. The variance has a between-strata com-ponent as well as a within-strata component and,if nh ~ is small as compared with n~ , this be-tween-strata component can contribute substan-tially to the variance. Thus we see that changingemphasis from the total population to a sub-population has introduced added complexities intheory and computations.

As regards the use of more advanced sta-tistical techniques in the critical analysis ofsurvey data, we shall simply refer to the diffi-culties that have been encountered in obtainingexact theory when ordinary regression techniquesare applied to random samples drawn from afinite population. Cochran (1963, p. 193) summa-i-izes this very well:

The theory of linear regression plays aprominent part in statistical methodology.The standard results of this theory are notentirely suitable for sample surveys becausethey require the assumptions that the popula-tion regression of y on x is linear, that theresidual variance of y about the regressionline is constant, and that the population isinfinite. If the first two assumptions areviolently wrong, a linear regression esti-mate will probably not be used. However, insurveys in which the regression of y on xis thought to be approximately linear, it ishelpful to be able to use 71, without having

to assume exact linearity or constant residualvariance.

Consequently we present an approachthat does not demand that the regression inthe population be linear. The results holdonly in large samples. They are analogousto the large-sample theory for the ratio es-timate.

Somewhat the same point is made by Hartley(1959, p. 24) in his paper on analyses for do-mains of study. He says:

. . . nevertheless we shall not employ regres-sion estimators. The reason for this is notthat we consider regression theory inappro-priate, but that this theory for finite popula-tions requires considerable development be-fore it can be applied in the present situation.

Some developments have. arisen, since Hartley’spaper and reference to these will be given ‘in thenext section.

As a final complicating factor, we note thatcertain techniques used in some sample surveydesigns are such that their effects on the pre-cision of estimates cannot be evaluated from asample, even in the case of an enumerative sur-vey. We refer specifically to the technique ofcontrolled selection, described by Goodman andKish (1950), and to instances in which only onefirst-stage sampling unit is selected from eachof a set of strata (Cochran, 1963, p. 141).

In order to provide a convenient illustrationof some of the foregoing points, the appendixpresents a brief description of the “complex”sample design and estimation procedure employ-ed in the Health Examination Survey, togetherwith a selection of examples that arose in themore or less routine analysis of informationcollected on blood pressure. The data given areestimates of the percentage of individuals withhypertension in various subclasses of the popu-lation of adults, with the subclasses definecl interms of such demographic variables as race,sex, age, family income, education, occupation,and industry of employment. These subclassescut’ across the strata used in the selection ofprimary sampling units, and the variances of theestimates are also affected by the various clus-tering and estimation features of the design.

6

Page 13: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Most of the cited cases refer to the estimationof variance for the percentage of hypertensionin a single subclass, although several examplesare given in which the percentages in two sub-classes are compared. The variances were es-timated by a replication technique that will beintroduced in “Balanced Half-Sample Replica-tion, ” a technique that to some extent overcomesthe problems of analysis that have just been raised.‘The results obtained through the application ofthis technique will be used for illustration atseveral points throughout this report.

GENERAL APPROACHES FOR

SOLVING PROBLEMS OF

CRITICAL ANALYSIS

Two Extreme Approaches

It is possible to identify two extreme viewsthat one may hold with respect to the problemsraised in the preceding section. First of all,one might conceivably argue that analytical workwith survey data should be done only “by design. ”That is, areas and methods of analysis shouldbe set forth in advance of taking the survey andthe sample should be selected so as to conformas closely as pssible to the requirements of thestated methods. On the other extreme, one mightdecide to throw up his hands in dismay, ignoreall the complicating factors of an already executedsurvey design, and treat the observations asthough they had been obtained by random sampling,presumably from some extremely ill-defined su-perpopulation.

The first approach, that of “design for a-nalysis, “ is certainly the most rational view thatone can adopt. No careful examination of theliterature was made to search out actual ex-periences on this point, but it would appear un-likely that one could find any examples of large-scale, complex, and multipurpose surveys inwhich this approach had been attempted. A pos-sible exception might be the Census EnumeratorVariation Study of the 1950 U.S. Census, as de-scribed by Hanson and Marks (1958), althoughthis study was based primarily on the completeenumeration of designated areas rather than ona sample of individuals selected in accordance

with a complex sample design. In other casesindividuals have been randomly selected from adefined population of adults to provide observa-tions for a complex “experimental design, ” asin the Durbin and Stuart (1951) experiment onresponse rates of experienced and inexperiencedinterviewers, but again this differs considerablyfrom the type of problem raised in the precedingsection. Another example in which an experi-mental design has been applied to survey data isprovided by Keyfitz (1953) and discussed byYates (1960, pp. 308-314). In this case, thesample elements were obtained by cluster sam-pling, but the author investigates the possibleeffects of the clustering and concludes that itcan be ignored in the analysis of variance.

Some recent work by Sedransk (1964, 1964a,1964b) bears directly on the problem of designfor analysis and it assumes that the primary goalof an analytical survey is to comp~re the weansof different domains of study. If Yi and Vi arethe estimated means for the ith and ;th do-mains, Sedransk places constraints on the vari-ance of their difference, for all z and j, andsearches for sample-size allocations that willminimize simple cost functions. A variety ofdifferent situations are considered. Random sam-ples can be selected from each of the domains;random samples can be selected from the over-all population, but the number of elements fallingin each domain then becomes a random variable;two-stage cluster samples can be selected fromeach of the domains; and two-stage clustersamples can be selected from the total population,but again the number of elements falIing in eachdomain is a random variable. In the second andfourth cases, the author considers double samplingprocedures and obtains approximate solutions toguide one in choosing sample sizes for samplingfrom the total population so as to satisfy the con-straints which are phrased in terms of all pos-sible pair-wise domain comparisons. Even ifone does not wish to impose constraints ondomain comparisons and on minimization of cost,the cited papers contain of necessity many de-velopments in theory that will be of assistancein attacking the problems raised in the precedingsection. It should be observed that the com-plexity of the designs considered is still farfrom that of the Current Population Survey orthe HeaIth Examination Survey.

7

Page 14: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Major difficulties in designing for analysisare and will continue to be encountered when theprimary goal of a survey is to describe a largeand dispersed population with respect to manyvariables, as the analytical purposes are some-what ill defined at the design stage. Thus thebro,ad primary purposes of HES were to providestatistics on the medically defined prevalence inthe total U.S. population of a variety of specificdiseases, using standardized diagnostic criteria;and to secure distributions of the general popula-tion with respect to certain physical and physio-logical measurements. Nevertheless, analysis ofrelationships among variables is also an impor-tant product of the survey.

A similar set of circumstances arises withrespect to data on unemployment collected by theCurrent Population Survey. Clearly the primarygoal is to describe the incidence of employmentand unemployment in the total U.S. population,and yet the data obtained must also be used forcomparison and analysis. Faced with difficultiesof analysis, as described in the preceding sections,one may wish to retreat to the opposite extremefrom design for analysis and view the observa-tions as coming from a simple random sample.

Actually, this type of retreat would appearto place the analyst in a difficult, if not un-tenable, position. Cornfield and Tukey (1956)speak of an inference from observations toconclusions as being composed of two parts,where the first part is a statistical bridge fromobservations to an island (the island being thestudied population) and the second part is asubject-matter span from the island to the farshore (this being, in some vague sense, a pop-ulation of populations obtained by changes intime, space, or other dimensions). The firstbridge is the one that can be controlled by theuse of proper procedures of sampling and ofstatistical inference. One may be willing to in-troduce some uncertainty into the position of theisland, for example by ignoring finite populationcorrections, in the hope of placing it nearer thefar shore than would otherwise be the case. How-ever, there seem to be no grounds for suggestingstatistical procedures that may, unbeknownst totheir user, succeed only in moving the island ashort distance from the near shore.

The Health Examination Survey was carriedout on a sample chosen to be broadly “repre-

sentative” of the total U.S. population. Amongother characteristics of design, the sample wasof necessity a highly clustered one. As is wellknown, a highly clustered sample leads to es-timates that have much larger standard errorsthan would be predicted on the basis of simplerandom sampling theory if the elements withinclusters tend to be homogeneous with respectto the variables of interest. Since geographicclustering leads to homogeneity y on such char-acteristics as racial background, socioeconomicstatus, food habits, availability and use of medi-cal care, and the like, it can therefore be ex-pected that there will also be homogeneity withrespect to many of the variables of interest inthe Health Examination Survey. A portion ofthis loss of precision is undoubtedly recoveredby stratification and poststratification, but thereis no guarantee that the two effects will balanceone another. Hence the ignoring of sample designfeatures might well lead to gross errors indetermining the magnitude of standard errorsof survey estimates. In effect, the situation mightbe viewed as one in which inferences are beingmade to some ill-defined population of adults,rather than to the population from which the sam -ple was so carefully chosen. These points havebeen emphasized by Kish (1957, 1959) as theyrelate to social surveys in general.

The effect of these sample design and es-timation features on the variances of estimatesfor the Health Examination Survey are illus-trated in the data presented in the appendix. Foreach of 30 designated subclasses, the varianceof the estimated percentage of adults in a sub-class with hypertension was estimated by twomethods: (1) the replication technique to be in-troduced in “Balanced Half-Sample Replication”was employed, thus accounting for most of thesurvey features, and (2) the observations fallingin a subclass were treated as if they had arisenfrom simple random sampling. In the secondcase, variances were computed as pqfn, wherep was the observed fraction of hypertensiveindividuals among the n sample individuals fallingin a particular subclass. The ratios of the firstof these variances to the second, for the 30comparisons, ranged from .45 to 2.87, with anaverage value of 1.31; the ratios of the standarderrors ranged from .67 to 1.69, with an averagevalue of 1.12. These ratios probably overes -

8

Page 15: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

timate slightly the true ratios, since the rep-lication technique uses the method of collapsedstrata and in this instance does not account forthe effects of controlled selection. Furthermore,they are subject to sampling variability.

Also included in the appendix are threeexamples which refer to the estimated differencebetween the percentages of hypertensive individ-uals in each of two subclasses. In this instance,the average ratio of variances is 1.51 while theaverage ratio of standard errors is 1.23. Thesecomparisons are not as “clean” as the ones forsingle subclasses since the random samplingvariances were computed on the assumption ofindependence of the two estimates, and this isnot necessarily the case. This set of data, limitedthough it may be, tends to confirm the generalexperience. that estimates made from stratifiedcluster samples will tend to have larger samplingvariances than would be the case for simple ran-dom samples of the same size, although thedifferences are not so pronounced as in situationsin which the intraclass correlation is strongerthan it is for this statistic.

Obtaining “Exactw Solutions

If one wishes to consider that the principalgoal of analytical surveys is either the estimationor the direct comparison of the means of variousdomains of study, then there already exists inthe literature a number of results that can assistin achieving this goaL This “exact” theory canbe generally characterized as follows.

1. Ratio estimates of population means areemployed, primarily because sample size is arandom variable as a result of sampling clusterswith unequal and unknown sizes. This of courseintroduces the possibility of bias for these es-timates, although empirical research-e. g., Kish,Namboodiri, and Pillai (1962)—indicates that theamount of bias is apt to be negligible.

2. Expressions for the variance of a singleestimate and the covariances of two or moreestimates are obtained from the Taylor seriesapproximation, and variance estimates are con-structed by direct substitution into these ex-pressions. Hence variance estimates are subjectto possible bias.

3. In multistage sampling, it is either as-sumed that the first-stage units are drawn withreplacement or that the first-stage samplingfractions are very small. This means that vari-ance estimates can be obtained without explicitlytreating within-first -stage unit sampling variabil-ityy.

4. The most powerful tool for deriving re-sults for domain-of-study estimates has been thatof the “pseudovariable” ‘–= ‘L- “-----– ‘----’-L’- “That is

Yiij ‘Yhlj,

mu me mum V3THDUS. ~”

if the j th element in theith first-stage unit ofthe h th stratum belongsto the domain of interest

= O, otherwise

Uhij =1, if the j th element in theith first-stage unit of theh th stratum belongs to thedomain of interest

= O, otherwise

Using this approach, which is related to Cor-nfield’s (1944) earlier work, it is possible tospecialize ordinary results to domain-of-studyresults.

A very brief summary of some of the litera-ture on these aspects of analysis is as follows,where no attempt has been made to assign

priorities to the various authors. Results that aredirectly phrased in terms of domains of studyare given by Cochran (1963), Durbin (1958),Hartley (1959), Kish (1961, 1965), and Yates(1960). Related work on the estimation of thevariance of a variety of functions of ratio esti-mators is presented by Jones (1956), Keyfitz(1957), Kish (1962), and Kish and Hess (1959).Aoyama (1955), Garza (1961), and Okamoto (1963)discuss chi-square contingency-table analysesin the presence of stratification. McCarthy (1965)considers the problem of determining distribu-tion-free confidence intervals for a populationmedian on the basis of a stratified sample.Finally, we observe that, in the case of “small”samples, not even the ordinary normality as-sumptions are able to do away with difficulties,

9

Page 16: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

even without domain-of- study complications .l%usunequal strata variances lead to difficulties inobtaining tests and confidence intervals for apopulation mean, although some approximatesolutions are available—e.g., Aspin(1949), Meier(1953), Satterthwaite (1946), and Welch (1947).If one has essentially unrecognized stratification—that is, normal variables with common variancebut differing means—then it is necessary towork with noncentral t, X2, and F distributions asdescribed by Weibull (1953).

Replication Methods of

Estimating Variances

As a result of the indicated theoretical andpractical difficulties associated with the esti-mation of variances from complex sample sur-veys, interest has long been evidenced in de-veloping shortcut methods for obtaining theseestimates. For example, we noted earlier thatKeyfitz (1957) and Kish and Hess (1959) haveemphasized the computational simplicity thatcan result when primary sampling units are drawnwith replacement from each of a number of strata,and when one can work with variate values as-sociated with the primary sampling units. Thereare, however, other approaches that have beensuggested and applied to accomplish these sameends. We refer in particular to methods thathave variously been referred to as interpene-trating samples, duplicated samples, replicatedsamples, or random groups. In the succeedingdiscussion, the term “replicated sampling” willbe used to cover all of these possibilities. Ref-erences will be made to the pertinent literature,but no attempt will be made to assign prioritiesor to be exhaustive. Deming has been a consistentand firm advocate of replicated sampling. Hefirst wrote of it as the Tukey plan (1950); hisrecent book (1960) presents descriptions of theapplications of replicated sampling to many dif-ferent situations, and contains a wide varietyof ingenious devices that he has developed forsolving particular problems.

In simplest form, replicated sampling isas follows. Suppose one obtains a simple random

sample of n observations—drawn with replace-ment from a finite population or drawn independ-

ently from an infinite population-and that theassociated values of the variable of interest are

Y1*Y2 *...7 ~n , Then, if y denotes the samplemean and Y the population mean, E (y) = ~ and

;“(J)4,(H -ja2/n (n - 1)

~n(y) provides an unbiased estimate of V(Y).Suppose now that n observations are randomlydivided into t mutually exclusive and exhaustivegroups, each containing (n/t) elements, and thatthe means of these groups are denoted by

Yl*72*. ... Yt. It is clear that

and that the variance of J can be estimated by

tt (y) -j:, (Yj -y)2/f(t - 1)

In this simple case, th: advantages gained byusing ~t (y) rather than V“(y) lie in the faCt thatone has to compute the sum of t squared de-viations instead of the sum of n squared de-viations. If t is considerably smaller than n andif such computations must be carried out formany variables, the savings in computationaltime may be substantial. Also, the kurtosis ofthe distribution of the ~i is less than that of

Yi, PossiblY offsetting some of the effect ofhaving fewer degrees of freedom to estimate V(Y).

There is, of course, a loss of informationassociated with the subsample approach for es-timating variances since ~t (ji) is subject togreater sampling variability than is fi (y). Avariety of ways have been suggested for measur-ing this loss of information. Hansen, Hurwitz,and Madow (1953, vol. I, pp. 438-449), who des-ignate this the random group method of esti-mating variances, make the comparison in termsof the relative-variance of the variance estimate.For example, they show by way of illustrationthat the relative-variance of a variance estimatebased on 1,200 observations drawni from a nor-mal distribution is 4.1 percent while the relative-variance based on a sample of 60 random groupsof 20 observations each is 18.3 percent. Actually,

10

Page 17: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

this approach places emphasis on the varianceestimate itself rather than on the fact that oneusually wants to use the variance estimate insetting confidence limits for a population meanor in testing hypotheses about a populationmean. Under these circumstances, Fisher (1942,sec. 74) suggests a measure for the amount ofinformation that a sample mean provides re-specting a population mean. Since his approachhas been questioned by numerous authors -e.g.,Bartlett (1936)—we shall simply adopt the ex-pedient of taking the ratio of, say, the 97.5percentiles of Student’s t distribution for 1,200and 60 degrees of freedom, which is .981, andinterpreting this as a measure of the relativewidth of the desired confidence intervals. Thisis evidently the approach used by Lahiri (1954,p. 307).

The foregoing is, of course, the familiarargument that a sample of roughly 30 or moreis a “large” sample when dealing with normalpopulations, since [g,~ for 30 degrees of free-dom is 2.042 and for a normal distribution is1.960. Ninety-five percent confidence intervalswill, on the average, be only about four percentwider when S2 is estimated with 30 degrees offreedom than when U2is known. A slightly dif-ferent measure has been proposed by Walsh(1949), formulated in terms of the power of at-test.

If one wishes to use replicated sampling inconjunction with drawing without Yeplacernentfrom a finite population, then two different pos-sibilities arise. One can first draw without re-placement a sample of (n/t) elements, then re-place these elements in the population and drawa second sample of (m/t) elements, and continuethis process until t samples have been selected.Denoting the sample means by 71, Y2, . . . . Yt ,we have

y= ; j7j/fj+

t(Y) -j~*(Y, -Y)7t (f-1)

and

E[tt (y)]= ‘-$’t] &

N- (n/f) S2.— —Nn

This type of replication makes the successivesamples independent of one another, but it doespermit the possible duplication of elements insuccessive samples and hence lowers the pre-cision of Y as compared with the original draw-ing of a sample of n elements without replace-ment. However, there may be a saving in the costof measuring the duplicated items. Hence aslightly larger sample could be drawn for thesame total cost, recapturing some of the loss ofprecision.

Finally, one can draw a sample of (n/t)

elements without replacement, a second samplewithout replacing the first sample, and so on.This is, of course, equivalent to drawing a sam-ple of n elements without replacement and thenrandomly dividing the sample into t groups. Itfollows that

; jijj=l

Y=y

; (yj - y)2N-n I=]01(y)= ~

t(t-1)

and

This latter variance is smaller than the precedingone because of the difference in finite populationcorrections. Note, however, that the independenceachieved by the first method of drawing makesit easy to apply nonparametric methods (e.g.,the use of order statistics) for estimation andhypothesis testing. These pints have been dis-cussed, in a more general framework, by Koop(1960) and by Lahiri (1954).

Although replicated sampling for simplerandom sampling, as described in the precedingparagraphs, does provide the possibility of a-chieving some gains in terms of computationaleffort, the principal advantages of replicationarise from other facets of the variance estimation

problem. Some of these facets may be identifiedas follows.

1. ‘There are instances of sample designsin which no estimate of sampling precision canbe obtained from a single sample unless certainassumptions are made concerning the population.Systematic sampling is a case in point. (See

11

Page 18: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Cochran, 1963, pp. 225,226.) If the total sample isobtained as the combination of a number of rep-licated systematic samples, then one can obtaina valid estimate of sampling precision. Thisapproach was suggested by Madow and Madow(1944, pp. 8,9) and has been discussed at greaterlength and with a number of variations by Jones(1956). In some instances, estimates made fromreplicated systematic samples may be less ef-ficient than from a single systematic sample, andthen one must cho”ose between loss of efficiencyand ease of variance estimation, as discussed byGautschi (1957).

2. As is well known, the ordinary Taylorseries approximation for obtaining the varianceand the estimated variance of a ratio estimate,even for simple random sampling, provides apossibly biased estimate of sampling precision.As an alternative, one can consider drawing anumber of independent random samples, com-puting a ratio estimate for each sample, andthen averaging these ratio estimates for the finalestimate. A valid estimate of sampling precisioncan then be obtained from the replicated valuesof the estimate. It is true, however, that thebias of the estimator itself is undoubtedly largerfor the average of the separate estimates thanit is for a ratio estimate computed for the com-plete sample since this bias is ordinarily a de-creasing function of sample size. Thus gainsmay be achieved in one respect, while lossesmay be increased in the other. As far as theauthor knows, no completed research is availableto guide one in making a choice between these twospecific alternatives. This problem is, however,related to some suggestions and work by Mickey,Quenouille, Tukey, and others, and their resultswill be discussed in some detail in the followingsection.

3. After an estimate and an estimated var-iance have been obtained, confidence intervalsare ordinarily set by appealing to large samplenormality and to the approximate validity ofStudent’s t distribution. Replication can some-times assist in providing “better” solutions.For example, consider a stratified population inwhich the variable of interest has a normal dis-tribution within each stratum, but where thevariance is different for the separate strata.Difficulty is then encountered in applying the chi-

square distribution to the ordinary estimate ofvariance, as discussed by Satterthwaite (1946),Welch (1947), Aspin (1949), and Meier (1953).However, the mean of a replicate will be nor-mally distributed, being a linear combination ofnormally distributed variables, and the chi-square distribution can be applied directly to avariance estimated from the means of a numberof independent replicates, This aspect of theproblem has been discussed at some length byLahiri (1954, p. 309).

4. If one is using a highly complex sampledesign and estimation procedure, and if independ-ent replicates can be obtained, then replicatedsampling permits one to bypass the extremelycomplicated variance estimation, formulas andthe attendant heavy programming burdens. Vari-ance estimates based upon the replicated esti-mates will mirror the effects of all aspects ofsampling and estimation that are permitted tovary randomly from replicate to replicate,, This,of course, includes the troublesome domain-of-study type of problem.

One major disadvantage of replicated sam-pling has been mentioned in the preceding para-graphs, namely that the variance estimate re-fers to the average of replicate estimates ratherthan to an estimate prepared for the entire sam-ple. If the estimates are linear in the individualobservations, the two will be the same. ThEy willnot be the same, however, for the frequently em-ployed ratio estimator and the other nonlinearestimators, and the average of the replicate es-timators may possibly be subject to greater biasthan is the case for the overall sample estimate.

Another major disadvantage arises from thedifficulty of obtaining a sufficient number ofreplicates to provide adequate stability for theestimated variance. Thus the commonly useddesign of two primary sampling units per stratum(frequently obtained by collapsing strata from eachof which only a single unit has been drawn) givesonly two independent replicates, and the resultingconfidence intervals for an estimate are muchwider than they should be or need to be. Somesuggestions have been made for attacking thisproblem, and they will be discussed in the follow-ing section.

Another, but subsidiary, problem ariseswith replication if one wishes to estimate com-

12

Page 19: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

ponents of variance— that is, to determine whatfraction of the total variance of an estimatearises from the sampling of primary samplingunits, what fraction arises from sampling withinprimary sampling units, and the like. This prob-lem does not appear to have been discussed atany great length in the sampling literature andwill not be considered here since it bears moredirectly on design than on analysis. Some ofSedransk’s work (1964, 1964a, and 1964b) doesrelate to the problem, and McCarthy (1961) hasdiscussed the matter in connection withfor the construction of price indexes.

PSEUDOREPLICATION

sampling

Half-Sample Replication Estimates of

Variance From Stratified Samples

If a set of primary sampling units is strati-fied to a point where the sample design calls forthe selection of two primary sampling units perstratum, there are only two independent repli-cates available for the estimation of sampling pre-cision. Conf@ence intervals for the correspond-ing population parameter will then be much widerthan they need to be. To overcome this difficulty,at least partially, the U.S. Bureau of the Censusoriginated a pseudoreplication scheme called half-sample replication. The scheme has been adaptedand modified by the NCHS staff and has been usedin the HES reliability measurements. A brief de-scription of this approach is given in a report ofthe U.S. Bureau of the Census (Technical PaperNo. 7, p. 57), and a reference to the Census Bu-reau method of half-sample replication was madeby Kish (1957, p. 164). We shall first present atechnical description of half-sample replicationas used by NCHS in the Health Examination Survey.The theoretical development duplicates, in part,work by Gurney (1962). We shall then suggestseveral ways in which the method can be modi-fied to increase the precision of variance esti-mates.

Consider a stratified sampling procedurewhere two independent selections are madein each stratum. Let the population and

sample characteristics be denoted asfollows:

Stratum Weight

1

2

h

L

WI

W2

Wh

.

w.L

-r

Popula-tion

mean

-,Y1

Y2

.

?h

1.-,L

Popula-

vam”ance

s;

s:

s;

s:L

Sample Sampleobservations mean

Yh ~YYh2 jh

YLIJ’L2

An unbia~ed estimate of the population mean~ is ~st ‘~~1 Wt ~h , and the ordinary sampleestimate of V (ySt) is

where dh = (Yhl-yhz).Under these circumstances, a half-sample

replicate is obtained by choosing one of Yll and

Y12* one of Y21 and Yzz, . . ., and one of YL~and YL*. The half-sample estimate of the popu-lation mean is

where i is eithey one or two for each h. Thereare 2 L possible half samples, and it is easy tosee that the average of all half-sample estimatesis equal to y$t. That is, for a randomly selectedhalf sample

13

Page 20: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

If one considers the deviation of the meandetermined by a particular half sample, for ex-ample ~~~,j= 2 ‘h Yhl > from the overall samplemean, the result is obtained that

= (1/2) z Wh(yh, - yh2)= (1/2) z J’Vhdh

In general, these deviations are of the form

(~~,-j,l) = (1/2)(f Wldl * W2d2 . . . f WL dL)

where the deviation for a particular half sampleis determined by making an appropriate choice ofa plus or minus sign for each stratum. In the ex-ample given above, each sign was taken as plus.The squared deviation from the overall samplemean is therefore of the general form

(yh,- ji,t)z= (1/4) Z W; d: + (1/2)~Z~~W, Wkdhd~ (4.1)

where the plus or minus signs in the cross-prod-uct summation are determined by the particularhalf sample that is used.

If the squared deviations of a half-sampleestimate from the overall sample mean aresummed over all possible half samples, then it iseasy to demonstrate that the cross-product termsappearing in the separate squared deviations can-cel one another. Thus, for a randomly selectedhalf sample

Since v (~,t) is known to be an unbiased estimateof V (ji~t) if one takes expected values over re-peated selections of the entire sample, we havethe result that

This also follows directly from (4. 1) if we notethat E (dh d,) = O because of the independence ofselections within strata. If the sampling fractionsare the same in all strata, then a finite popula-tion correction can easily be inserted at the endof the variance estimation process. The effect ofdiffering sampling fractions could be taken intoaccount by working with W; ‘s, where W; is equal

to Wh multiplied by the square root of the appro-priate finite population correction.

The foregoing properties of a half-sampleestimate of the population mean have been ex-ploited for variance estimation in the followingmanner.

Consider the population of 2L possiblehalf-sample estimates, for fixed values of

Y]1JY12... .JYL1TYL2. This population ,hasmean equal to y,t. Furthermore, the meanvalue of (y~~- y$J 2 is equal to v(y,t). Draw withreplacement, a sample of k half samples. Denotetheir means by Yks,l, yh~,2,. . . . y~~,~. Then take

(4.3)

as an estimate of V(y$t). Since the expected

value of each squared term is equal to V @St),the expected value of their average is also equalto V(Y~t). The different squares are not indepen-dent, since the ~~~,i ‘s have elements in common,but this does not influence the expected value.

Quite clearly, vh,,~ (y,t) is o@y an approxi-mation to v (ySt ) which, in turn, is only an ap-proximation to V(JSt). Let us therefore attemptto estimate the degree of approximation by com-

Puting the variance of vh,,~(y,t). The relationshipgiven in Hansen, Hurwitz, and Madow (1953, vol.II, p. 65)

v(u) = E, [v(ub)] + ~ [E(UIV)] (4.4)

will be applied to the variable u = (~h~ - ~~t) 2,

and the conditioning variable v will be replacedby the set of dh’s. The first term then becomes

,,...dL[V{(Yh,-y,,)’ Id,. . . dL}]E

By definition, where it is understood that alldh ‘S are fixed,

[2

v[(yh~-jiJ21=E (yfis-Yst 1)2- (1/4)Z W; d:

ZL

[ 1= ~ 2 (l\2)h,Zj ahj,i (Wh dh) (Wj dj)

22L i=l

where the summation on i is over all possiblehalf samples, and ahj, i is either plus or minus

14

Page 21: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

one, depending on the particular half sample andthe particular combination of h and j. This ex-pression is equal to

~L

[~~ ~(Wh dh)2 (Wj dj)z+ (sum of cross-2L+ZI=1~<j product terms, each

term containing atleast one dj to thefirst power) 1

It can be demonstrated that the cross-productterms, when summed over all 2L samples willvanish. Hence

V[(yh, -y,t)z Idl. . . dL]

= (1/4) ~:j (W~dJ2 (Wjdj)2

Since E (di12) = S: and dh and dj are independent,the expectation of this variance over sets of dh

is equal to

We shall now evaluate the(4.4). Since

(4.5)

second term in

E[@h,-~,~)2 \ d, . . . d~] = (1/4) ~ W;d;

We have

~[E(ulv)]=E[(l/4)Z W; d;- (1/2) 2 W:S:]2

= (1/4)13[z W/(+z- S;)lz (4.6)

= (1/4) Z W:E (~ -S;)2

where the cross-product terms vanish becausethe selections within strata are independent fromone stratum to another and because E (di /2) = S~

Putting expressions (4.5) and (4.6) together,using relative variance (rel-variance) instead ofabsolute variance, and taking account of the factthat we are averaging k half-sample estimates ofvariance (which affects only the first term) we have

Rel-var [vh,,k (~,t)] =(4. 7)

The exact value of this expression dependsupon the values of W: S: and the distribution ofthe y ij .s within strata. However, reasonable ap-proximations can be obtained. Thus the first tc t-m,in (4.7) is a maximum, for fixed value of Z Wh2~2,if W~S~= W~S~... = W~S~ = W2S2. Under thesecircumstances, the value of this term is

L (L–1)4 w’s’ 2 2 (L-1)—— — .k w4sd L’ kL

For the second term, assume that the strataweights are equal, that the strata variances areequal, and that B( = ~4/u’) is the same for eachstratum. Using the expression for the rel-vari-ance of the estimated variance (Hansen, Hurwitz,and Madow, 1953, vol. II, p. 99) the second termin (4.7) is then approximated by

/9+1 I—..n.

Then the rel-variance of Vh$,K(y$t) is approxi-mated by

2(L-1) + 9+1kL 2L

(4.8)

This expression, as earlier developed by Gurney(1962)., was used in obtaining values given in aU.S. Census report (Technical Paper No. 7, table29), where L was taken to be 85 and a table wasprepared for several different values of B.

The Health Examination Survey is based on42 strata, collapsed into 21 strata for the presentmodel. Hence we have the results given in table1 for 19=3 and 19=6.

Table 1. Rel-variance of a variance es t i-mated from k independent half -saznplereplications

(Number of strata, L , is ?1)

k

1 ------- ------- ------------ ------- -----

;-----------------------10------------------

------- ------- -----%--------------------100-----------------00 ------- ------- ---

t’3=3

2.00001.0476.4762.2857.1904.1428.1142.0952

P=6

2.07151.1191.5477.3572.2619.2143.1857.1667

15

Page 22: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Table 2. Equivalen; degrees of freedomobtained from independent half-sample replications

(Analysis of variance assumptions; 21 strata).—

Rel-varianceof estimated

variance

2.00001.0476.4762.2857.1904.1428.1142.0952

Equivalentdegrees of

freedom

1.01.9

/..::

14:017.521.0

t.975

12.7064.7402.7272.3652.2132.1452.1052.080

If one is committed to using independenthalf-sample replications ,towhich~eabove tablerefers, an appropriate number ofreplicatesmustbe chosen.Insome sense or other, one would liketo use a sufficient number of replicates so thatnot “too much” of the available information islost. On the other hand, this desire must rebal-anced against the cost of processing “too many”replicates. If a simple analysis ofvariancemodelis assumed, it is possible to obtain some idea ofthe amount of information that is lost for variousnumbers of replicates. For example, assumethat the strata are of equal weight, that thevari-able y has a normal distribution in eachstratum,that the strata means are possibly different, andthat the variances within strata have a commonvalue S2. Under these circumstances, theordi-nary within- stratum estimate of S 2 has 21 de-grees of freedom, and confidence intervals for apopulation mean would be based on Student’s tdistribution with 21 degrees of freedom.

For a normal distribution, the relative vari-ance of an estimated variance is 2/(n-1) -Coch-ran (1963, p. 43)—where n is the number ofindependent observations. If the relative variancesgiven in table 1 for P = 3 are set equal to 2/(n -J)and we solve for (n – I ), the result can beviewed as “equivalent degrees of freedom” foruse with Student’s t distribution. The valuesobtained from this process are given in table 2.

As previously noted, there are a variety ofways that one can make comparisons among these

Table 3. Relative width of confidence in-tervals obtained from k independenthalf-sample replications

(Analysis of variance assumptions; 21 strata)

k

1 -------- ----------------------------L--------------

1o-----------------20-----------------

1oo----------------00 .------- --------

Width of confidenceinterval relative

to the full 21degrees of freedom

6.1092,,2791(,31114,1371,,0641“0311“0121.000

various cases, but the simplest would appear tobe on the basis of width of the95 percent confi-dence interval, for fixed value of the estimatedvariance. This approach has evidently been usedby Lahiri (1954, p. 307). Table3 points this out.Remembering that thedetermination of therela-tive variance of the variance is probably anover-estimate, and that the effect of replication is lesspronounced for valuesof P greater than3, it wouldappear that between 20 and 40 independent repli-cations would work out quite satisfactorily inpractice. This statement must of course bequali-fied in terms of the cost of additional replications.

Balanced Half-Sample Replication

Although the point has not been explicitlyemphasized, the preceding development shows thatthe variability among half-sample estimates ofvariance arises from between-strata contribu-tions to these estimates, as is evident fromequations (4. 1) and (4 .5). These contributions comefrom the cross-product terms involving d~ d~ .These cross-product terms cancel one anotherover the entire set of 2L half samples, or whenone uses an infinite number of half-sample rep-lications. The question then arises whether onecan choose a relatively small subset of halfsamples for which these terms will also disap-pear. If this can be done, then the correspondinghalf-sample estimates of variance will contain allof the information (21 degrees of freedom in the

16

Page 23: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

simple situation being considered here) availablein the total sample. Some attempt at “balancing”is indicated in the U.S. Bureau of the Censuspublication (Technical Paper No. 7, p. 57) andthis is described in the following quotation:

For the non-self-representing strata, thestrata were grouped into 85 homogeneouspairs. For each of these 85 pairs, a set of20 random numbers between 1 and 40 wereselected without replacement. The first PSUin the pair was included in the 20 replica-tions corresponding to these 20 random num-bers, and the second PSU in the remaining20 replications. In this way, a set of 85PSU’S, one from each pair, was assigned toeach replication. It would have been possibleto make an independent selection from eachpair for each replication; however, it is be-lieved that the reliability of the estimateswas increased somewhat by insuring thateach PSU would appear in 20 of the 40 rep-lications.

It is unlikely, however, that this type of balancingwill appreciably affect the reliability of varianceestimates, since as noted above, this reliabilityis determined by the joint occurrence of elementsfrom pairs of strata.

A simple example will show that it is possibleto select a subset of half-sample replicationsthat will have the desired property. Considera three-strata situation with observations

(Y**>Y12). (Y21* Yzz)! and (Y31,Y32). Thereare 23= 8 possible half-sample replicates. Nowconsider the following subset of four replicates:

StratumReplicate 1 2 3 (Yh+ - i~t)

1Yll Y21 Y31 ‘1’2) (+WI ‘1+ ‘2 ‘2+ ‘3 ‘3)

2Y11 ’22 ’32

(1/2) (+W1dl-W2 d2-W3 d3)

3 y12 Y22 Y31(1/2) (-WI dl-W2 d2+ W3 d3)

4 ’12 ’21 ’32 (1/2)(-wl dl+ W2 d2-‘J’3d3)

The signs of the separate terms in thedeviations are determined by the definition ofdh = (y~~- y~.2). It is, of course, immaterialhow the two observations within a stratum arenumbered originally. Once the numbering is set,

however, as in the first replicate, it is main-tained in determining the remaining replicates.If these deviations are squared, the first part ofeach expression is W12d~/4+ W~d~/4 + W32d~/4,

which is the desired estimate of variance. Thesecond part of each expression contains the cross-product terms, and it can easily be checked thatall these cross-product terms cancel when thesquared deviations are added over the four rep-licates. This follows from the fact that thecolumns of the matrix of signs in the deviationsare orthogonal to one another. Thus this set ofbalanced half samples can be identified as

+++-+ -.-- +-+-

where a plus sign indicates Yh], whiIe a minussign denotes yhz. Notice that this particular setof replicates does have the property that each ofthe two elements in a stratum appears in half thesamples.

If one wishes to obtain a set of half samplesthat will have this feature of “cross-productbalance,” for any fixed number of strata, then itbecomes necessary to have a method of construct-ing matrices of + and - signs whose columns areorthogonal to one another. A method is describedby Plackett and Burman (1943-46, p. 323) forobtaining k x k orthogonal matrices, where k is amultiple of 4. Suppose, for example, that we have5, 6, 7, or 8 strata. The Plackett-Burman methodproduces the following 8 x 8 matrix, which is thesmallest that can be used for these cases becauseof the muItiple-of-4 restriction. The rows identifya half sample, while the columns refer to strata.

+ -- +-++--+-!- --+-+-

++ +--+--

- +-+-+---!--

+--l- +i----

-+- -!- ++--

-- +-+++--

----- ---

Any set of 5 columns for the 5 strata case, 6columns for the 6 strata case, 7 columns for the7 strata case, or the entire 8 columns for the 8

17

Page 24: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

strata case, defines a set of eight half-samplereplicates which will have the “cross-productbalance” property. If it is necessary to use theeighth column, the resulting set of half sampleswill not have each element appearing an equalnumber of times. This will not destroy the varianceestimating characteristics of the set of halfsamples, but it does mean that the average of theeight half-sample means will not necessarily beequal to the overall sample mean.

Since orthogonal matrices of plus and minusones can be obtained whenever the order of thematrix is a multiple of four, it is always possibleto find a set of half-sample replicates havingcross-product balance. It follows that the numberof half samples required will be at most threemore than the number of strata, The HES analysisis based on 21 strata and it is therefore necessaryto use 24 half-sample replicates. A possible setof balanced half samples for this particular caseis shown below. This design was obtained by usingthe first 21 columns of the construction given inthe Plackett-Burman paper. Any two columns ofthis design are orthogonal, and each element ap-pears in 12 of the 24 replicates. The entire patternis completely determined by the first column. Thesecond column is obtained from the first by mov-ing each sign down one position and placing the23d sign at the top of the second column. Thisrotation is applied repeatedly to obtain the remain-ing columns. The 24th position is always minusand is not involved in the rotation.

The staff of NCHS has adapted the foregoingmethod of balanced half-sample replication to theroutine analysis of data from HES, and this adap-tation is described in the appendix. ProfessorLeslie Kish of the Survey Research Center of theUniversity of Michigan has also employed themethod in estimating the sampling errors ofmultiple regression coefficients, although the re-sults of his study are not yet available for publi-cation.

Partially Balanced Half samples

If a complex sample design is based on alarge number of strata, say 80, then it wouldordinarily be undesirable to use a complete setof balanced half samples. As we have seen, acomplete set would consist of 80 samples, and the

processing of results for such a large number ofhalf samples might prove too costly. The questioncan then be raised of whether it is possible to de-vise a set of k partially balanced half sampleswhich will produce a more precise estimate ofvariance than can be obtained from k independenthalf -sample replicates. One way of accomplishingthis will now be described and evaluated.

To illustrate the approach, consider the caseof L = 4. There are 24 = 16 possible half sam-ples; a balanced set of four half samples can beconstructed as shown in the preceding section; wedesire a partially balanced set of two half samples.The following design was proposed by the staff ofNCHS:

St7-atumSample 1234

1T

2 +-l+-

A plus indicates yh, and a minus yh2. As before,‘h= (Yh]- ybz) . This desigh was obtainecl byusing orthogonal columns for the first two strataand then repeating this in the second two strata.The corresponding squared deviations of the half-sample means from the overall sample mean are:

+ (1/2)(-WI W2 d,dz + ~ W3 d, d3

– WIWhdld~ - W2W3 d2d3

+~~d2d4—W3~ d3d4)

Their sum, divided by two, is

(1/4) ~ W: d; + (1/2) (~ W3 d1d3 + W2W4 d2d4)

Thus we see that cross-product terms withinorthogonal sets have been eliminated, and that aportion of the cross-product terms between or-thogonal sets have been eliminated. Applying the

18

Page 25: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

.A set of balanced half samples for 21 stratal

Stratum

Half sample 1 2345678910

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

+--- -+ -+--

++----+-+-

+ + +----+--+

++++----+-

+ ++++----- i-

++ +++----

-1-- -F i-+++---

-+- +“+ +++--

a

i- -+- +++++-

-F+-+- ++-+++

-++-+-++++

-++-+--!-++

+ --++-+--+ i-

++ --++-+ -+

++- -++ - + -

-- ++- -++-+

+ - - ++ - - +-!- -

-+- -++- -++

+-+--++- -+

-+-+- -++--

-- i--+ --+i--

--- +--l---++

--- - +-+ - - +

--- --- --- -

11

+

+

-F

+

-1-

+

+

+

+

+

-!-

+

12

+

+

+

-1-

+

+

+

+

-1-

+

+

+

13

+

+

+

+

+

+

+

+

+

+

+

+

l-l

+

+

+

+

+

+

i-

+

+

+

+

+

15

+

+

+

+

+

+

+

+

+

-1-

+

+

16

+

+

+

+

+

+

-1-

+

+

+

+

+

17

+

+

+

+

+

+

4-

+

+

•!-

+

+

18

-1-

+

+

+

+

-!-

+

+

+

+

+

+

19

-1-

+

+

+

i-

+

-!-

+

+

+

+

+

20

-!-

+

+

+

+

+

-t

+

+

+

+

i-

21

+

+

+

+

+

-1-

+

+

+

-?-

-1-

+

lItisassumed thattwo e[eme~@ aredrawn independent] yfromeaCh 0f21 Strain- Aplussiw isarhitrarily assiwed

to one of the elements and a minus sign to theothdr. A row in this table then identifies a particular semple, the element

with the indicated sign being tsken from each of the21 strata.

19

Page 26: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

argument used in obtaining equation (4.8), the is the sum of the separate variances. If one nowrel-variance of the estimated variance is ap- assumes that the ~h’s are equal and that the S/ ‘sproximate by are all equal, the argument that led to expression

2x2 p+] I B+l (4.8) gives for the rel-variance of the estimated

If the two half-sample replicates had been chosenindependently this rel-variance would be, fromequation (4.8),

Hence a gain has been achieved, over the use ofindependent replicates, by using a partially bal-anced set of half samples.

This argument can be set forth in generalterms, Suppose there are L strata and that a bal-anced set of k half samples is used, where forconvenience we assume that L divided by k is aninteger, and the k-pattern of half samples is re- speated over each succeeding set of k strata. (Inthe preceding example, L =4, k =2, and L/k= 2.)

Then

variance

2( L–k)+O+lkL 2L

(4.10)

Since the first term in expression (4.8) is2 (L -1 )/kL, we see that the reduction in this por-tion of the rel-variance is like a finite popula-tion correction effect. By way of illustration, letus take L= 80, k=20 or40, and /3=3 or 6. Theresults are given in table 4. The gains for 20replications are modest, but they are essentiallyachieved at little or no expense.

In conclusion, we observe that Gurney (1964)

has approached this problem in a different fashion.Suppose again that there are 80 strata and thatone enumerates the complete set of 80 balancedhalf samples. If one now selects without replace-ment 20 replicates from the complete set of 80,Gurney shows that the gains over independent rep-lication are essentially the same as for the pro-cedure that has just been considered. Thus twoalternative ways of proceeding are available, bothgiving finite population factor gains in one com-ponent of the total variance,

where the second summation is taken over allHalf-Sample Replication and the Sign Test

pairs (h, j ) such that

h<j

h is from one set of k strata and j is froma second set of k strata

h and j represent corresponding columnsfrom the k orthogonal columns that make upa balanced set.

The remaining cross-product terms disappear be-cause of orthogonality conditions. It is easy tosee that there are

+(:-1)k ~L,k)c2= = L(L-k)

2 2k

terms in the summation.Since h and j are always different in the

second part of equation (4.9) and since E (d~ ) = O,

the covariance of the two parts of (4.9) is equalto zero. Therefore the variance of the expression

Suppose that the sample observations Yhland yh2 with which we have been dealing actuallyrepresent the differences of two other variables,say Zh,and Xhi. That is, yh,= (zh~- xl,,) and~~z= (~hz- Xhz). Under these circumstances,the population mean of the variable y is equal tothe difference of the population means of thevariables z and x. If the assumption of approxi-mate normality for Y$t is questioned, or if onewishes to avoid the computation of variances, itis tempting to think of combining the half-samplereplication technique and a nonparametric tech-nique in the following manner.

Obtain k half-sample estimates of ~. Asbefore, denote these by ~h~,~,~h~,z. . . . . Yh5,~.

Apply some type of nonparametric techniqueto these observations. For example, one mightuse a sign test to test hypotheses about the medianof the distribution from which the ~h~,i are drawn,or one might use the order statistics of this set of

20

Page 27: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Table 4. Comparison of components of rel-variance for independent replicates and forpartially balanced replicates

(L ==80)

/9=3 (3=6

Independent replicates .099+ .025 = .124 .099+ .044 = .143

k=20 Partially balanced .075+ .025 = .100 .075+ .044= .119replicates

(.100)/(.124) =81% (.119)/(.I.43) ’83%

I Independent replicates .049+ .025 = .074 .049+ .044 = .093

k=40 Partially balanced .025+ .025 = .050 .025+ .044 = .069replicates

(.050)/(.074) = 68% (.069)/(.093) = 74%

*The first term in each sum represents the component due to half-sample replication.The second term represents the component that arises in ordinary variance estimation.

~hs,i ‘s to determine confidence intervals forapopulation median.

The difficultiesassociated withsuchananaly-sis arise from the fact that the half-sample rep-licates are not independent of one another, whenthey previewed as samples from theoverallpop-ulation. As an illustration, let us consider thefollowing two half-sample estimates:

~h$,l= w’lYll+w2Y21 + W3Y31

~hs,2 = wlYII+ wzyz2 + ‘3y32

If repeated drawings of the entire sample aremade, then

E(~h,,i)=W,P,+W2T2+W3P3

cOvbhs,l, ~hs,2 )=E{[q(Y,l-q)

+ w’2(y21-q) + W3(Y31-F3)] x

m]}[W(YI1- q+ W*(Y22- 19+ w3(y32- j

All other cross-product terms titiecovariancedropout because oftheindependence ofselectionswithin and between strata. Finally, the correla-tion between ~h~,1 and ~h~,z is

‘~hs,l ● ~hs,2 = 1/3

and we see, in general, that the correlation be-tween any two half-sample means will be equalto the fraction of common elements, providedfiat the W~S~are allequal. ThUS ~cantakeonthe

values 0,1/L,21L,3/L,...,(1)lL/L ,1.If one makes independent selections ofhalf-

sample replicates, then the number ofeIementscommon to any two replicates will be arandomvariable whose average will be L/2, and thereislittle one cando ina preciseway about evaluatingthe effect of the correlation ona sign test. How-ever, the completely balanced set of replicatesthat was described in “Balanced Half-SampleReplication” does have “nice’’propertiesint hisrespect. For example, the set offour replicatesgiven in this section for L=3 is such that anYpair of replicates have a single common element.Therefore the correlation between any two of these~h,’s is 1/3, assuming equality of the Wh2&.

21

Page 28: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Furthermore, these y~, ‘s have the same expectedvalue and variance.

If one were to apply a sign test to these foury~,’s for the purpose of testing the hypothesisthat median of the distribution from which theyare drawn is zero, then one would look at thenumber of positive ~~~’s in the sample of four.

Ordinary theory assumes that the observations areindependent, and that the probability that any singleobservation is greater than zero is 1/2. Hence,if the hypothesis is rejected whenever there areeither zero or four positive observations, thelevel of significance is (1/2)4 + (1/2)4 = .125.This value does not hold in the present situationbecause observations are dependent upon oneanother.

It is possible, however, to obtain an appro-priate value in this case if one is willing to as-sume that the y~, IS have a normal multivariatedistribution. Gupta (1963, p. 817) provides a tablewhich gives the “probability y that N standard nor-mal random variables with common correlationp are simultaneously less than or equal to H!’ Ifwe take N =4, P =1/2, and H=o,we find thisprobability to be .14974. Therefore the level ofsignificance of the test procedure is 2(.14974)=.299 instead of the .125 that would be obtainedfrom independent observations. This is a clear in-dication that the nominal significance levels of asign test will be changed markedly if there iscorrelation among the observations.

Whether or not the analysis of the precedingparagraph can be generalized to a larger numberof strata depends upon the pattern of commonelements in the balanced half-sample replicatesand upon the availability y of tables. The situationregarding common elements can be inferred froman examination of the 8 x 8 orthogonal matrixgiven in “Balanced Half-Sample Replication.”Because the matrix is orthogonal, any pair ofrows of the full 8 x 8 matrix will contain fourelements in common and the common correla-tion between half-sample means will be P = 1/2,

assuming that the W~S~are constant. This will holdin general if the number of strata is a multipleof four. If the number of strata, L, is one lessthan a multiple of four, and if one deletes thecolumn whose sign entries are all the same thenany pair of rows will still have the same number

of elements in common. This will be ~ -1,and the common correlation will be

L+l ~—.L-1p= 2 .—

L 2.L

For L = 7 the number of required half-samplereplicates will be 8, the number of common ele-ments between pairs of replicates will be 3, and/l = 6/14 = .43. Using the Gupta tables forN = 8 and P = ,40, the probability of obtaining allplus or all minus signs is 2 x (.07909)= .158.The corresponding value for eight independentobservations is (1/2)8 + (1/2)8 = .0078. Againwe see the marked disparity between the twosignificance levels.

If the number of strata is two less than amultiple of four, pairs of half samples will nolonger have the same number of elements incommon. It is, however, easy to show the follow-ing

Numbev of common Numbev of pairs ofelements samples

L+2 z—-2

(L+ 2) (L+ 2)/4

L(L + 2)/4(L +1) (L+ 2)7

One might then consider using an average correla-tion coefficient. A similar analysis could be usedfor cases where the number of strata is threeless than a multiple of four, and one would thenfind three possible values for the “number ofcommon elements .”

At the present time, this type of analysisis not very practical because Gupta’s tables giveprobabilities only for the case where all valuesare positive or all are negative. For larger num-bers of strata, one would have to be able to movein from the extremes on the distribution of num-ber of plus signs in order to get meaningful levelsof significance. Lacking such tables, the precedinganalysis shows that levels of significance are“too far off” when obtained from the assumptionof independence. Actually, if one must assumenormality in order to obtain probabilities for

22

Page 29: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

making a sign test, then one might just as welluse normal tests for correlated observations, asdescribed by Walsh (1.947).

Jackknife Estimates of Variance From

Stratified Samples

Quenouille (1956) introduced a method for ad-justing the members of a certain class of biasedestimators in order to reduce the bias from order1/n to order I/n*. Numerous other authors—Jones(1956, 1965), Tukey (1958), Durbin (1959), Deming(1963), Lauh and Williams (1963), Robson andWhitlock (1964), Miller (1964), and BriHinger(1964)—have made contributions to the develop-ment and behavior of such estimators and to theestimation of their variances. Tukey (1958) gives,in an abstract, the following brief account of thesecharacteristics:

The linear combination of estimates based onall the data with estimates based on partsthereof seems to have been first treated inprint as a means of reducing bias by Jones(J. Am. statist. Ass., Vol. 51 (1956), pp.

54-83). Let y(.) be the estimate based on allthe data, Y(i~ that based on all but the ith

piece, Y(,, the average of the Y(, ~. Quen-ouille (Biovzetrika, Vol. 43 (1956), pp. 353-360) has pointed out some of the advantagesof Zly(.) - (n - l)ji(,) as such an estimate ofmuch reduced bias. Actually, the individualexpressions my(.)– ( m- l)Y(,) may, to agood approximation, be treated as though theywere independent estimates. Not only iseach nearly unbiased, but their average sumof squares of deviations is nearly n (n – 1)times the variance of their mean, etc. In awide class of situations they behave ratherlike projections from a non-linear situationon to a tangent linear situation. They maythus be used in connection with standard con-fidence procedures to set closely approximateconfidence Iimits on the estimate.

This general procedure, as decribed by Tukey,has been called the Jackknife method since “Likea boy scout jackknife, such a technique should beusable for anything. . . although, again like a jack-

knife, many of its jobs could be better done bythe corresponding specialized tool, if that toolwere only at hand. ”

The application of these ideas to stratifiedsampling, with two independent selections perstratum, is straightforward. Consider the simplestratified model described in “Half-Sample Rep-lication Estimates of Variance From StratifiedSamples.” For simplicity, let L = 3. Then if oneleaves out each “piece” in succession, and formsthe estimate based on the remaining “pieces,”we have the following six estimates of the popu-lation mean:

= y Yll + ~2(Y21 + Y~*)/2 + W3 (Y31 + Y3Z)/2‘(1)

0)/2 + W~(Y~, + y32)/2= q Y~2 + W2 (Y2] + J’2.‘(2)

q(3)= ~(y,l + y,2)/’2 + W2 yz, + w3(yJI + Y3zJ/z

‘(4) = w,(yll + y12)/2 + W2Y22+ ~3(y31 + y32)/Z

1$) =W] (y]~ + yl~)\2 + W2(Y71 + Yzz)/2 + ‘3Y31

W,(y,, + Y12)/2 + W2(Y21 + y22)\2 + ‘3y32‘(6) =

In general, there will be 2L such quantities. Theestimation procedure applied to the entire samplegives

f?= w, (Y~*+ YJ2 + w2(y21+ y22)/2 + W3(y31+ y32)/2

If the original sampIe had consisted of six inde-pendent observations, then the Jackknife proce-dure would call for obtaining six estimates of theform q~i)= 6q - Sq(i). In the present example,however, there are two independent selectionswithin each stratum. Hence it is appropriate toform the six estimates

q(;)= 2q – q(i)

The final estimate of the population mean is

23

Page 30: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

and the variance of q* is estimated by

i:, (q;) - 17*)Y2

In general, the mean will be estimated by

q’ =i:l q;)/2L

and its variance by

i:,(q;, – q*12/2

In the simple linear case that is being con-sidered here, it is easy to show that an analysisin terms of the q~,’s produces results that areidentical with those obtained by the standardanalysis. For example,

Q(l) =7st + q4h) f?;) = Y,t – y~~iz

q(z)= y~t – ~dl/2, q* = y~t + ~dl/2

(2)

q(5) = 7$t + W3d3/2, q“(5) = Yst - W3d3/2

‘(6)= y~t – W3 d3 /z , q;, = ji$t + W3d3/2

q = Yst Q* =, Y*t

Furthermore

~ (q: - q*)2/2 = (W;d; + W22d: + W;d:)/4i=l (1)

which is the ordinary estimate of the variance ofY,t, just as in the case of the balanced half-sample replicates.

There is another variant of the Quenouilletype of estimate which is closely related to thehalf -sample approach. Suppose that a particularhalf sample is chosen. Denote its elements by

Y1],Y21,...7YL1 and its mean by ~h~. Theset of remaining elements, one in each stratum,then constitutes an independent half sample whose

mean we denote by y~~.mate is then defined by

2y~, - (yh, +

A Quenouille-type esti-

y$ )/2

which, in the simple linear case that is beingconsidered here, is identically equal to Y,t. Thisapproach does not provide an estimate of vari-ance in the present instance since only one esti-mate is obtained. In more complicated situations,however, different half samples will provide dif-ferent values of the Quenouille half-sample esti-mate, and it might be possible to base estimatesof variance on these different values. This pos-sibility will be discussed briefly in the followingsection.

Half-Sample Replication and the

Jackknife Method With Stratified

Ratio-Type Estimators

We have introduced half-sample replicationand the Jackknife method in the setting of a simplelinear situation, where they obviously have no realutility. Under these circumstances, they merelyreproduce results that can be obtained by directanalysis. If, however, more complicated methodsof sampling and estimation are employed, thendirect methods of analysis may not be available,may require a prohibitive amount of computationin comparison with the methods being consideredhere, or may even give results that are in oneway or another inferior to those provided by half -sample replication and the Jackknife.

Although one may accept on intuitive groundsthe general premise that half- sample replicationand the Jackknife do permit the “easy” computa-tion of variance estimates that in one way oranother mirror most of the standard complexitiesof sample design and estimation, the exact char-acteristics of the resulting estimates and theircorresponding estimates of variance are, for themost part, unknown. This is particularly true forhalf-sample replication, even though the intuitiveappeal of this method may be more direct thanthat of the Jackknife. No published or unpublishedreferences to the behavior of half-sample rep-

24

Page 31: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

lication in complex situations were discovered,and the notion of balanced half samples wasintroduced in this report for the first time. Onthe other hand, there is a growing body ofliterature and data relating to the Jackknife. Weshall now summarize this material on the Jack-knife and then report the results of a very smallexperiment which compares results obtained bybalanced half-sample replication and by the Jack-knife.

Although QuenouiIle (1956) introduced hismethod of adjustment as a means for reducingthe bias of an estimator, our interest in the Jack-knife is primarily focused on its utility for vari-ance estimation. One is naturally interested inobtaining any reductions in bias that are possible,but there is a considerable body of empiricalevidence—notably in the work of Kish, Nam -boodiri, and Pillai (1962)—which indicates that the“combined ratio estimator” for population means,subpopulation means, and differences of sub-population means probably has negligible bias inmost practical surveys. On p. 863, Kish et al.say “Our empirical investigations, set in a theo-retical framework, show that the bias inmost prac-tical surveys is usually negligible; the ratio ofbias to standard error (B/a) was small in everytest, even those based on small sub-classes. ”

There is actually very little published ma-terial which has a direct bearing on our presentconcern. The pertinent items are briefly sum-marized.

1. Quenouille (1956) has shown by formalanalysis that the variance of his esti-mator, where such an estimator is ap-propriate, differs from the variance ofthe unadjusted estimator by terms oforder I/n2.

2. Durbin (1959) applied the method of Quen-ouille to the ratio estimator r = ~/.F,

where a random sample was divided intotwo groups of equal size. Thus he con-sidered the estimator, of E (Y)/E (x),

f= 2r – (rl + r2)/2

where ~ = {y/x), r] = (yl/xI), r2= (y2/x2),

y and x are sample totals, y, and x*

are half-sample totals, and Yzand X2 are

the other half -sample totals. Durbin con-siders two cases: (1) x is a normal vari-able with variance O (n-l), and the regres-sion of y on x is linear, not necessarilythrough the origin; (2) x is a gammavariable with mean m and the regressionof y on x is linear. For the first case,when terms of O(n- 4)are ignored, theresult is obtained that the variance of f

is smaller than the variance of r. Forthe second case, it was not necessary touse an approximate form of analysis.Durbin concludes that “. . . whenever thecoefficient of variation of x is less than1/4, which will be satisfied by all exceptthe most inaccurate estimators, Quen-ouille’s estimator has a smaller meansquare error than the ordinary ratio es-timator. ‘This is an exact result for anysample size. ”

3. Brillinger (1964) studies the propertiesof these estimators in relation to maxi-mum likelihood estimators. His conclu-sions are: “Summing up the results of thepaper, one may say that Tukey’s generaltechnique of setting approximate confi-dence limits is asymptotically correct,under regularity conditions, when appliedto maximum likelihood estimates and thatthe technique provides a useful method ofestimating the variance of an estimate.Also one may say that the estimate pro-posed by Quenouille will on many occa-sions have reduced variance, smallermean-squared error and closer to asymp-totic normality properties, when com-pared to the usual maximum Likelihoodestimates. ”

4. Robson and Whitlock (1964) apply Quen-ouille’s method of construction to obtainestimates of a truncation point of a dis-tribution. One of the interesting featuresof their work relates to the constructionof estimators that successively eliminatebias terms of order n-z, n ‘3, etc. Theyfind for their particular problem that thevariance of these estimators increasesas the bias is decreased.

5, Miller (1964) is concerned with conditionsunder which a JaclQcnife estimator, and its

25

Page 32: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

associated estimated variance, will as-ymptotically have a Student’s t distribu-tion. Both of the situations described byMiller are ones in which the unjackknifedestimator had a proper finite or limitingdistribution under weaker conditions thanrequired for the Jackknife.

The foregoing five references are concernedwith estimators and with their bias and variance.None deals with the problem of estimating vari-ances. However, Lauh and Williams (1963) dopresent some Monte Carlo results which relateprimarily to the estimation of variance. They areagain concerned with the estimation of a ratio,E(y)/.E(x), but the Quenouille procedure is ap-plied to the individual sample observations insteadof to half samples as in the DurbinThat is, they compare the behavior

investigation.of

q=(i:,Yi)/( $ xi)i =]

with the behavior of

q “=iq*/n1=, (i)

where

‘(+ J“j-_Vi)/(l~lX,-xi)q(l) j=l

and

~~l=nq–(n–l)q. (,)

This is similar to the estimatorposed for stratified sampling in

:hat was pro-the preceding

section. Lauh and Williams define two populationswhich are used for empirical sampling: (1) x isa normal variable, while the regression of Y onx is linear thwugh the origin; (2) x is a chi-square variable with 2 degrees of freedom, andthe regression of Y on x is linear through theovi~”n. Since the regressions are forced to gothrough the origin, both q and q* are unbiasedestimators of E(Y)/E (x). For each population1,000 samples of n are drawn, n = 2, 3, . . . . 9.and a variety of variance estimators are con-

sidered. In particular, the ordinary estimate ofvari;mce obtained from a Taylor series approxi-mation was employed, denoted by VI(q); also an es-timate of variance was obtained from the ~ {i~‘s,namely

v(q*) = i (q* – q*)2/r2(n– 1)i=l (’)

The results of this investigation are most in-teresting, particularly the fact that the precisionof v (q’) is much better than that of VI(g) when xhas an exponential distribution, and are sum-marized by the authors as follows:

From the results of these two studies, itmay be inferred that the bias of the esti-mator v,(q) is dependent upon the degreeof skewness of the original y and x popu-lations. Estimators of the true variance takenfrom higher order approximations lead onlyto slight improvements over the secondorder approximation v,(q), and in some casesthe estimate is actually worse. The preci-sion of v (q’) is nearly double that of VI(q)

for exponential x distributions and the bias ofv (9*) is smaller than that of V1(q). ThUSit appears that the split-sample estimator

cl” may be definitely preferable to q in somesituations.

Finally, we note that extensive Monte Carloinvestigations of many of these points have beeninitiated by Dr. Benjamin Tepping, Director of theCenter for Measurement Research in the U.S.Bureau of the Census. Results of these investiga-tions are not yet available.

The foregoing references are concerned onlywith random samples drawn from infinite popula-tions. Our principal concern is with stratifiedsamples drawn, usually without replacement, fromfinite populations where complex estimation pro-cedures are applied to the basic sample data. Un-der these circumstances, a careful investigationof the behavior of estimators and of variance es-timators would undoubtedly require a large-scale,Monte Carlo type of program, integrated with asmuch analytic work as possible. This was notfeasible within the confines of the present study,even in terms of planning. Nevertheless, we did

26

Page 33: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Table 5. Artificial population

I

Y x

3 T

4 6

11 20

Total 18 30

RI .= = .600030

Stratum

II

Y x

5 Y

9 8

24 23

38 35

R, = 38—= 1.085735

R-97-—= 1.154884

desire some small numerical model that wouldillustrate the various points that have been raised.

As an example, we started with the smallartificial population that is used for illustrativepurposes in Cochran (1963, p. 178, 179). However,since we wished to enumerate all possible samplesand thereby investigate the behavior of both bal-anced half -sample replication and the Jackknifemethod of variance estimation, and since com-putations were to be carried out on a desk cal-culator, we were not able to use the full popula-tion as given by Cochran. (It did not appear worth-while to invest computer programming time on oneisolated and artificial example.) Accordingly oneobservation was dropped from each stratum apdthe following population was used as shown intable 5.

For this population, all possible samples ofsix, r7~=2 in each stratum, were enumerated-33= 27 possible samples. For each sample,the following quantities were computed.

1. The combined ratio estimate.2. The estimate of variance based on the

ordinary Taylor series approximation tothe variance of the combined ratio es-timate.

3. The Quenouille estimate of the populationratio, using individual observations aspreviously described.

4.

5.

6.

7.

III

Y x

7 7

9 4

25 12

41 19

R .41_3

= 2.157919

The Jackknife estimate of variance, asdescribed in the preceding section.The average of four balanced half-sampleestimates, as described in “BalancedHalf-Sample Replication.” It was neces-sary to consider two sets of balanced halfsamples, one complementary to the other,,since this is a nonlinear situation. It isassumed that one of these two sets will bechosen randomly in practical applications.

The estimate of variance based on thefour balanced half-sample estimates,This estimate of variance is the sum ofsquared deviations of four half-sampleestimates about the combined ~atio esti-nzate, divided by four, and multiplied bythe finite population correction. This is themanner in which the half-sample esti-mate has been applied in the work of theU.S. Bureau of the Census and in HES.Again the estimate was made for each ofthe two sets of balanced half samples.The Quenouille estimate based on thebalanced half sampleq. That is, a Quen-ouille estimate was obtained from a halfsample and its complement. T’his wascarried out for each half sample and thenaveraged over the set of four balancedhalf samples. The results of these com-putations are summarized in table 6.

27

Page 34: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Table 6. Behavior of estimates and estimates of variance obtained by enumeratingsamples drawn from artificial population of table 5

Mean Average Variance

Estimate Bias Standard Variance square variance V::i:::eerror error estimate estimates

Combined ratio estimate------ +.0118 .122 .0148 .0149 .0099 ● 000034Quenouille estimate,

individual observations ----- +. 0034 .126 .0160 .0160 ,0110Balanced half-sample

.000040

estimate-------------------- +.0428 ● 104 .0109 .0127 .0122 ● 000047Quenouille estimate,half samples---------------- -.0198 .137 .0188 .0192 ... ...

In obtaining the variance estimates a finite pop-ulation correction of(l- -$) was applied uni-formly. It can be readily demonstrated that thisis appropriate for the Jackknife and balancedhalf-sample estimates of variance, atleast inthesimple linear case that was used to introducethese techniques. Jones (1965) describes amodi-fication of the Jackknife, whose purpose is tointroduce the finite population correction into the“bias-reducing” argument upon which the Quen-ouille adjustment rests. This modification wasnot used here.

Although it is clearly impossible to drawany general conclusions from one artificial ex-ample such as the above, theresults areinterest-ing, In particular, we find that the combined ratioestimate has almost negligible bias and that theordinary variance estimate seriously underesti-mates the true variance; that the Quenouille es-timate with individual observations does reducethe bias at the expense of increasing the vari-ance by about 8 percent, and that the Jackknifeestimate of variance again seriously underesti-mates the true variance; that the average of thefour balanced half-sample estimates has thesmallest variance and the largest bias, while at

‘the same time providing a reasonable estimateof the correspondifig variance. Thevarianceof thevariance estimates is largest for the estimatebased on the balanced half samples. Even ifonemakes the comparison onthebasisofmean squareerror, the balanced half-sample averageis stillsuperior to the other estimates in spite of itslarger bias, except for the variance of thevari-

ante estimate. As a final point, the Quenouilleadjustment applied to the balanced half samplesdoes reduce the bias, but at the expense ofamarked increase in terms of variance.

This example, trivial and artificial as itmay be, does raise one question that concernshalf-samplt replication. Theuseofthe Quenouilleestimate and the Jackknife method of estimatingvariances have usually been considered simulta-neously. On the other hand, the half-sample rep-lication method of estimating variances hasalways been used to estimate the variance of theestimate obtained from the entire sample, andnot the variance of the average of the half-sampleestimates. Of course when one does not have an“exhaustive” set of half samples, as in the casewhere they are drawn with replacement, theaverage of half-sample estimates would not beappropriate. Here, however, we do have an“exhaustive” set of balanced half-sample esti-mates, and we might well consider using theiraverage in place of the combined ratio estimate.In terms of our example, a variance estimatewith average value .0122 is being used to esti-mate the variance of the combined ratio esti-mate, whose true value is .0148. This is some-what better than the ordinary Taylor series var-iance estimate, whose average value is .0099,but not nearly as good as if one uses the half-sample estimate of variance to estimate themean square .error of the average of the balancedhalf-sample estimates—namely, a quantity whoseaverage is .0122 to estimate a quantity whosetrue value is .0127.

28

Page 35: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

A small amount of data which relates to theforegoing point is presented in table IV of theappendix. For each of the six subclasses forwhich comparisons of percentages are presented,the estimate obtained from the entire sample canbe compared with the average of the 16 balancedhalf-sample estimates. It follows from the argu-ment used in developing the Quenouille-type es-timate that the difference between the two can beviewed as an estimate of bias for the overallestimate. This approach has been used by Deming(1960, p. 425). These estimates of bias, expressedas fractions of the estimated standard errors,range from approximately .03 to about .38. Thesedata are reassuring, but they are also too frag-mentary to support any general conclusions con-cerning the bias of estimates for HES analysis.

In conclusion, attention should be drawn toanother avenue of approach to estimation andvariance estimation which is somewhat related tothe Jackknife method, although this relation hasnot been explored or even noticed in the litera-ture. Mickey (1959) presents a general methodfor obtaining finite population unbiased ratio andregression estimators building on the work ofGoodman and Hartley (1958). In addition, he con-structs unbiased estimates of the estimator vari-ance by the process of breaking up the sampleinto subsamples, more or less along the lines ofTukey’s general version of the Jackknife. Williams(1958, 1961) specializes these results to regres-sion estimators, and considers their propertiesin some detail. No detailed attention has beengiven to this topic in connection with the prepara-tion of the present report.

SUMMARY

Sampling theory provides a wide variety oftechniques which can be applied in sample designto obtain estimates having essentially maximumprecision for fixed cost. These techniques areparticularly um?ful when populations are spreadover wide geographic areas so that highly cluster-ed samples must be obtained, and when extensiveprior information about the population under studycan be used in sample selection or in estima-tion. Such complex sample designs do, however,require extremely complicated and only approxi-mate expressions for estimating from a sample

the variance of survey estimates. If an extremelylarge number of widely differing types of esti-mates are to be made from a single large-scalesurvey, the burden of developing appropriatevariance expressions, of programming these fora computer, and of carrying out the computationsmay become excessive.

The foregoing problems are intensified, al-though not .appr~ciably changed in kind, if surveyanalysis is to go much beyond the estimation ofpopulation means, percentages, and totals. l%isis particularly true when the goals of analysis areto compare and study the relationships amongsubpopulations, or domains of study. Investiga-tors are then interested in applying such standardstatistical techniques as multiple regression oranalysis of variance, and find that many of theassumptions required for the application of thesetechniques are violated by the complexities of thesample design. Some authors have used the term“analytical survey” to refer to any survey inwhich extensive comparisons are made amongsubpopulations; other authors reserve this termfor surveys that are specifically designed to con-trol the precision of these comparisons. Thereseemed to be little point in arguing this issue inthe present report, since most surveys are multi-purpose in character and it is usually impossibleto design for a specific comparison. The majorportions of the first three sections of the reportare devoted to a literature survey and discussionof these topics.

Survey design (as opposed to the analysis ofsurvey data) requires the use of “exact” varianceexpressions since it is necessary to balance theeffects on precision of a wide variety of samplingtechniques. It is possible, however, to bypass thecorresponding detailed variance estimation tech-niques in the actual analysis of survey data throughthe use of replication. This approach is discussedin “Replication Methods of Estimating Variances, ”where an attempt is made to set forth its advan-tages and limitations. Emphasis has been placedupon variance estimation, although it is clear thatcovariances can also be treated in the samemanner.

One of the most serious limitations of rep-lication as applied to the analysis of complexsample survey data arises from the difficulty ofobtaining a sufficient number of independent rep-

29

Page 36: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

lications to assure reasonably stable varianceestimates. This fact has beenparticularly obviouswhen a set of primary sampling units is stratifiedto a point where the sample design calls for theselection of two primary sampling units perstratum, thus leading to only two independent rep-licates, To overcome this difficulty the U.S.Bureau of the Census and the National Center forHealth Statistics have been using a pseudoreplica-tion method for variance estimation, called half-sample replication. This procedure is describedin “Half-Sample Replication Estimates of Vari-ances From Stratified Samples ,” and severalimprovements, balanced half-sample replicationand partially balanced half-sample replication,are introduced in “Balanced Half-Sample Repli-cation” and “Partially Balanced Half Samples.”Still another, but related, variance estimationtechnique, the Jackknife, is described in “Jack-knife Estimates of Variance From StratifiedSamples. ” The application of these methods isillustrated on an artificial set of data in “Half-Sample Replication and the Jackknife MethodWith Stratified Ratio-Type Estimators ,” and theappendix shows how balanced half- sample repli-cation has been used in analyzing data obtainedfrom the Health Examination Survey.

It would appear that replication and pseudo-replication are extremely useful procedures for

obtaining variance estimates when one is makingdetailed analyses of data derived from complexsample surveys. Nevertheless, there are manyunresolved problems relating to the applicationof these methods. Among these are the following:(1) The effects of certain sampling techniqueson variances will not be picked up—e.g., theselection of one primary unit per stratum andcontrolled selection; (2) The variance estimateordinarily refers to the average of the replicationestimates, whereas the ordinary procedure is touse an overall sample estimate, and the two willnot be the same except in the rare case that theestimate is linear in form; (3) No investigationshave been carried out of the applicability of theseapproaches to such problems as contingency tableanalyses and standard analysis of variance ap-proaches; and (4) It is extremely difficult toattack any of these problems analytically, andthe development of empirical approaches thatwill have widespread applicability seems mostdifficult.

As a final point,. we call attention to the prob-lems that arise when survey data, as opposed toexperimental data, are used to develop generalscientific conclusions. This topic has not beenmore than mentioned in this report, but referencemay be made to discussions by Yates (1.960),Kish (1959), and Blalock (1964).

30

Page 37: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

BIBLIOGRAPHY

~oyama, H.: A study of the stratified random samplin{..

Annak of the institute of Statrktical Mathematics YI(l): 1-36,

1954-55.

.Aspin, A.: Tables for use in comparisons whose accuracy

involves two variances, separately estimated. ?iometn”lc36:290-!93, 1949.

Bartlett, M. 5.: The information available in small samdes.Proc.Camb.Phil. Soc. 32:560, 1926.

Blalock, H. M., Jr.: Causal Inferences in NonexperimentalResearch. Chapel Hill. University of North Carolina ~ress,

1964.

f3rillinger, D. R.: The asymptotic behavior of Tu3iey’s wm-eral method of setting confidence levels (the Jack!ini fe) when

applied to maximum likelihood estimates. Fe z,icw of [nlerna.ticinal Str2tistical In.s!itute 32:20!2-206, 1964.

Cochran, W’. G.: Sampling Techniques, ed. 2. Ncw Yd.

John Wiley and Sons, Inc., 1?63.

Cornfield, J.: Cn samrles from finite porulatimr~. .?.; ,/,.stati,st. INS. 39(??6):?:36-!?3 !+, June 1944.

.CornfieId, tJ., and Tukej, J. 1~’.: iverrige values of mean

squares in factorials. inn. math. Statisl. S7:907-949, 1956.

Deming, W. E.: Swne Throry of ~ampling. Km York. John

Niley and ,%n,s. Inc., 1950.

fleming, !V. E.: Sample Design in r{usiness Pc..earch. FewYork. John !Viley and !%ns, Inc., 196(?.

!leming, W. E.: tJn the correction of r.nthematical bias byuse Of py]icaterj design: =.. )irtril-a 6:37. iz, 1$16X.

Ueming, W. F., and Sttwhan, ~. F.: CM the interpretationof censuses as samples. .1. .fm.statisf. 1.s,s. ‘36(913):4.5-49,

tdar. 1!311.

Durbin, J.: <ampling thwry for estimates based on fe~~er

individuals than the number selected. SU[[Ctin of tfie Inter-},~tiori.-l tc ‘j :fical In.itihtc Q(’: 1 ? “-11 II, 1958.

r.ilr!ilrl, ,;. : ‘. notc~n the appllc:~tiou of “uenouille’, s n,etfw

od of hiss reduction b the estimation of ratios. F’iwnetn”kz46:477-4/3(, 1959.

Durbin, J., and Stuart, .1.: Differences in response rates

of experiencer3 and inexperienced interviewers. JournaZ oftle Roya/ Statistical Societ,v, Series .A (General), 11-1, I_’t. II,

Pp. 168-206, 1951.

Fisher, R. .A.: TAe Design of Experiments, ed. 3. I,ondon.

Oliver and i?oyd, Ltd., 194!2.

~;autschi, W.: Some remarks on systematic sampling. Arm.math. Statist. 28:385-394, 1957.

:Carza-Hernandez, T.: An +fpproximatt’ Test of I!omogeneityon the ,~a,si.sof a Stra ified Random Sample. M. S. thesis, New

/York State School of ndustrial and Labor Relations, CorneIl

University, 1961.

Goodman, L. A., and Hartley, H. G.: The precision of un-biased ratio-type estimators. J..4m .statist..4ss. 53(282):49 1-508, June 1958.

(iOodman, ~., and Kish, 1..: Controlled selection–a tech-

niquein probability sampling. J..-lstatistst. 4ss. 45(251):350-37% Sept. 1956.

Gupta, S. S.: !%obability integrals of multivariate normal

and multivrmiate t. .%n.math.Statist. 34:792-828, 196’3.

Gurney, M.: The Variance of the I?eplication Method for~slimatinq Variances for the C’PSSample Oesign. Unrwblishedmemorandum, U.S. f3ureau of the Census, 1962.

Gurney, I,;.: UcCarthy’s Orthogonal Replications for ~sti-rnczting Variances, With Crouped Strata. Unpublished memo-

randum, U.S. Bureau of the Census, 1964.

Hansen, M. FJ.l Hurwitz, W’.N., and hfadowr, W. G.: San~pleSurvey Methods and Theory, VOIS. I and II. New York. John

Wiley and Sons, Inc., IW3.

Hanson, R. H., and hfadis, 3?. S.: Influence of the inter-

viewer on the accuracy of survey results. J. Am.statist.Ass.53(283):635-655, ~ept. 1958.

Hartley, H. O.: Analytic .Wwiies of Svrwy data. Institutodi Statistic, Rome, ~~olume in onora di Corrado Cini. Imes,

Iowa. Statistical Laboratory, Iowa State I.university of Science

and Technology. Reprint Series 63. 1959.

.Jones, H. L.: Investigatirwthe properties ofa samplemeanby employing random wbsample means. J..-fstati,stst. ~SS.51(’?73):54-S3, Mar. 1956.

Jones, H. L.: The Jackknife method, Proceedings ufthe1?!1 Scientific Computing Symposium on Statistics. White

Plains, New York. 1PM f?ata Processing Division, 1965.

Key fitz, N.: .1 factorial arrangement of comparisons of fam-

ily size. Am.J..Soc. 58(5):470-+80, Kiar. 1953.

Keyfitz, h’.: Estimates of sampling variance where two

units are selected from each stratum. J..4m.statist..4 ss. 52(% C):5O!V51O,Dec. 1957.

Kish, L.: Confidence intervals for clustered samples.

lmt-rican Sociological Fe eicw ?2(2): 154-165, Apr. 1957.

Kish, L.: Some statistical problems in research design.

,4mcrican Zoriological Re riera 24:3W33t5, June 1959.

Kish, L.: F’fficient allocation of a multi-purpose sample.

Econorvetn”ca 99:2c’I-28.5, 1961.

Kish, L.: t ariances for indexes frorr, complex samples.Proceedings of the Social Statistics Section of the 4rnencanStatistical Association, 1962, pp. 190-199.

3iish, L.: Survey Sampling. New York. John l~iley and

%ns, Inc., 1965.

Kish, L., and Hess, L: On variances of ratios and theirdifferences in multistage samples. J..4m.statist. ~ss. 54(266):416-446, June 1959.

Kish, L., Namboodiri, N. K., and Fillai, R. K.: The ratio

bias in surveys. J. Am.statist. $ss. 57(30G):$.36:3-S76,Pee. 1962.

Iioop, J. C.: On theoretical questions underlying the tech-

nique of rerdicated or interpenetrating samples. Proceedings

31

Page 38: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

of the Social Statistics Section of the Amerioan Statistical.Association, 1960, PD. 196-205.

Lahiri, D. B.: Technical paper on some aspects of the de-

velopment of the sample design, in P. C. Mahalanobis “Tech

nical Paper No. 5 on the National Sample Survey. ” Sankhya

14:264-316, 1954.

Lauh, E., and Williams, W. H.: Some small sample results

for the variance of a ratio. Proceedings of the SociaZ Statis-tics Section of the American Statistical Association, 1963,

pp. 273-283.

Madow, W. G., and Madow, L. H.: On the theory of system-

atic sampling. Ann.math.Statis t. XV:l-24, 1944.

McCarthy, P. J.: Sampling considerations in the construc-

tion of price indexes with particular reference to tbe United

States ~onsumer Price Index. U.S. Congress, Joint Economic

Committee, 6’overnment Price Statistics, Hearings. Washingt-on. U.S. Government Printing Office. Part 1, pp. 197-232,1961.

McCarthy, P. J.: Stratified sampling and distribution-free

confidence intervals for a median. 4ccepted for publication

in J. Am.statist.Ass., 1965.

Meier, P.: Variance of a weighted mean. Biometrics 9(l):

59-73, Mar. 1953.

Mickey, M. R.: Some finite population unbiased ratio andregression estimators. J. Am.statist. Ass. 54(287):594-612,

Sept. 1959.

Miller, R. C., Jr.: 4 trustworthy Jackknife. Ann ,math.Sta-

tiet. 35(4):1594-1605, 1964.

National Center for iiealth Statistics: Cycle I of the iiealth

Examination Survey, sample and response, United Statss,

1960-62. Vital and i/ea2th Statistics. PHS Fub. No. 1000-

Series 11-No. 1. Public Health Service. Washington. U.S.

Government Printing Office, Apr. 1964.

National Center for Health Statistics: Blood pressure of

adults by age and sex, United States, 1960-62. Vital and

Hea2th Statistics. FHS Pub. No. 1000-Series 11-No. 4. I’ub-

Iic Health Service. Washington. U.S. ‘government Printing

Gffice, June 1964.

National Center for Health Statistics: Blood pressure of

adults by race and area, United States, 1960-62. Vital and

Health Statistics. FHS Pub. No. 1000 -Seriee 11-No. 5. Fub-

lic Health Service. Washington. U.S. (:ovemment Printing

Office, July 1964.

National Center for Health Statistics: r Ian and initial pro-

gram of the Health Examination Survey. Vital and l!edthStatistics. PHS Pub. No. 1000-Series l-No. 4. Public HealthService. Washington. U.S. government Frinting Office, July

1965.

National Health Survey: The statistical design of the.Health

Household-Interview Survey. VeaZth Statistics. FHS Pub. No.

584- ;2. Public i!ealth Service. “/nshington. !l. <. t ovorn-

ment Erinting Office, JuIy 1958.

Ohamoto, ?,!.: Chi-square statistic based on the nooled

frequencies of several observations. Riometrika 5(}: 5~4-528,

1963.

Plackett, R. L., and %rrnan, J. F.: The’ design of optitr,um

multifactorial experiments. i~iometrika 33:305-32%, 1943-46,

Quenouille, M. H.: Notes on bias in estimation. :5’iometrika

43:35.3-360, 1956.

Robson, D. S., and Whitlock, J. H.: Estimation of a trun-

cation point. 13iometrika 51:33-WI, 1!364.

Satterthwaite, F. 1?.: 4n approximate distribution of esti-

mates of variance components. i?iometrics 2:110-114, 1946.

Sedransk, J.: Sample Size Determination in Analytical SuT-veys. Ph.D. dissertation, Harvard University, 1964.

Sedransk, J.: 4nalytical Surveys Vith ClusteT Sampling.Unpublished nay=, Iowa State University, 1964a.

Sedransk, J.: .4 Double Sampling $chtirne for ‘!wlyticolSwr,eys. IJnpublisherl paper, Iowa State t:niversity, 196.lb.

%kh?tmol T. F.; ::umplin.g ~he OTyOf ‘lLTV~l/3 .’fth 3pp~i-catiojts. ,Ames, [owa. Iowa State College Press, 1954.

Tuke::, J. ‘~.: T;ias and confidence in not-quite large sam-

!ies. ,Ahstrwcted in Ann,math.Statist. 29:614, 195/7.

U.S. Eureau of the Census: The Current Population Swuey,A Report on Methodology. Technical Paper No. 7. Washing-ton. U.S. Government Printing Office, 1963.

Walsh, J. E.: Concerning the effect of intraclass correla-

tion on certain significance tests. ,4rw.rnath. Statist. 18(1):

88-96, 1S47.

Walsh, J. E.: On the “information” lost by using a t-testwhen the population variance is known. J..4m, statisf .,4 ss,

44(245):122-125, hlar. 1949.

WeibuIl, M.: The distributions oft- and F-statistics and of

correlation and regression coefficients in stratified samples

from normal populations with different means. Skand. Aktuar-Tidskr. 3&(suppl. 1, 9):9-106, 1953.

Welch, B. L.: The generalization of “Student’s” problemwhen several different population variances are involved. Bio-

.aetri&a 34:28-35, 1947.

“’:illiams, W. H.: Unbia,eed Regression Estimators and T,hei?

Efficiencies. I’h. D. dissertation, Iowa State ~ollege, 1558.

lvilliams, W. H.: Generating unbiased ratio and regression

c sti m.ators. Biometrics 17(2):267-274, June 1961.

Yates, F.; Sampling Methods for Censuees and Surveys,

ed. 3. New York. Hafner Publishing Go., 1960.

000

32

Page 39: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

APPENDIX

ESTIMATION OF RELIABILITY OF FINDINGS FROM THE FIRST CYCLE OF

THE HEALTH EXAMINATION SURVEY

Survey Design

The sampling plan of the first cycle of the HealthExamination Survey followed a highly stratified, mul-tistage probability design in which a sample of thecivilian, noninstitutional population of the conterminousUnited States, 18-79 years of age, was selected. Inthe first stage of this design, the 1,900 primary sam-pling units” (PSU’S), geographic units into which theUnited States was divided, were grouped into 42 strata.Here a PSU is either a standard metropolitan statisticalarea (SMSA) or one to three contiguous counties. Byvirtue of their size in population, the six largest SMSA’Swere considered to be separate strata and were in-cluded in the first-stage sample with certainty. AsNew York was about three times the size of other strataand Chicago twice the average size, New York wascounted as three strata and Chicago as two, making atotal of nine certainty strata. one PSU was selectedfrom among the PSU’S in each of the 33 noncertaintystrata to complete the first-stage sample. Later stagesresulted in the random selection of clusters of typicallyfour persons from segments of households within thesample PSU’S. The total sampling included some 7,700persons in 29 different States.

All examination findings for sample persons areincluded in tabulations as weighted frequencies, theweight being a product of the reciprocal of the prob-ability of selecting the individual, an adjustment fornonresponse cases, a stratified ratio adjustment of thefirst-stage sample to 1960 Census population controlswithin 6 region-density classes, and a poststratifiedratio adjustment at the national level to independentpopulation controls for the midsurvey period (October1961) within 12 age-sex classes.

The sample design is such that each person hasroughly the same probability of selection. However,there were sufficient deviations from that principle inthe selection and through the technical adjustments toproduce the following distribution of sample weights as

required to inflate to U.S. civilian, noninstitutional pop-ulation levels:

ClassWeight class avewge

7,000-20,999 14,00021,000-34,999 28,00035,000-48,999 42,00049,000-62,999 56,00063,000-76,999 70,00077,13(XZ-90,999 84,000

l-di~”trelativeweight

1

23456

Peycentdistribution

78.7

18.41.90.60.0

0.4

A more detailed description of the sampling planand estimation procedures is included in Vital and

HeaZth Statistics, Series 11, No. 1, 1964: “Cycle I of theHealthExamination Survey, Sample and Response.”

Requirements of a Variance Estimation

Technique

‘The Health Examination Survey is obviously com-plex in its sampling plan and estimation procedure. Amethod for estimating the reliability of findings is re-quired which reflects both the losses from clusteringsample cases at two stages and the gains from strati-fication, ratio estimation, and Poststratification. Ideally,an appropriate method once programmed for an elec-tronic computer can be used for a wide range of sta-tistics with little or no modification to the program.This feature of adaptability is an important and specialrequirement in HES. The small staff of anaIysts in theDivision of Health Examination Statistics typically workson ordy a few sections of the examination and Ialmratoryresults at a time. Consequently tabulation specifications

33

Page 40: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

and edited input for a sizable variety of report topicsare not available until shortly prior to the need forestimates of sampling error. New tables of samplingerror have been prepared for each of the 12 reportspublished to date and at least a dozen more will beprepared before the Cycle I publication series is com-pleted.

Development of the Replication Technique

The method adopted for estimating variances inthe Health Examination Survey is the half-samplereplication technique. The method was developed atthe U.S. Bureau of the Census prior to 1957 and hasat times been giveq limited use in the estimation ofthe reliability of results from the Current PopulationSurvey. A description of the half-sample replicationtechnique, however, has not previously been published,although some references to the’ technique have ap-peared in the literature.

The half-sample replication technique is particu-larly well suited to the Health Examination Surveybecause the sample, although complex in design, isrelatively small (7,000 cases) in sample size. Onlya few minutes are required for a pass of all casesthrough the computer. This feature permitted the de-velopment of a variance estimation program which isan adjunct to the general computer tabulation program.Every data table comes out of the computer with atab~e of desired estimates of aggregates, means, or dis-

tributions togetherwith the estimated

with a table identical in format butvariances instead of the estimated

statistics. The computations required by the method areindeed simple and the internal storage requirements arewell within the limitation of an IBM 1401-1410 computersystem.

The variance estimates computed for the firatfew. reports of Cycle I findings were based on 20random half-sample replications. A half sample wasformed by randomly selecting one sample PSU fromeach of 16 pairs of sample PSU’S, the sample repre-sentatives of 16 pairings of similar noncertainty strata,and 8 of 16 random groups of clusters of sample per-sons selected from the 9 certainty strata and the SanFrancisco SMSA, the largest sample PSU of the 33noncertainty strata. The concept of balanced halfsamples is utilized in present variance estimates forHES. ‘The variance estimates are derived from 16balanced half-sample replications. The compositionof the 16 half samples, shown in tables I and H, wasdetermined by an orthogonal plan. In the tables an “X”indicates that the PSU or random group was includedin the half sample. The construction using 16 balancedhalf-sample replications results from viewing thecertainty and noncertainty strata as independent tmi-verses. This is only approximately true as the post-stratified ratio adjustment to independent populationcontrols is made across both certainty and noncertaintystrata. An alternative construction, and perhaps a slight-ly more accurate one, would have been to use 24 bal-

Table 1. Composition of the 16 balanced half-sample replicates-certainty strata

1

2

3

4

5

6

7

8

1 ------- ------- ------- ------- ------2-------- -------- -------- -------- --

------- . . . . . . . ------- ------- ------;----------------------------------

5-------- -------- -------- -------- --6-------- -------- -------- . . . . . . . . --

7-------- . . . . . . . . -------- -------- --8-------- -------- . . . ----- -------- --

9------- .- . . ..- .- . ..-. . . . . . . . ------10----------------------’-----------

11---------------------------------12---------------------------------

I

13---------------------------------14---------------------------------

15---------------------------------16---------------------------------

1

x

x

x

x

x

x

x

x—

2

x

x

x

x

x

x

x

x

3

x

x

x

x

x

x

x

x

Balanced half-sample replications—

4—

x

x

x

x

x

x

x

x—

5

x

x

x

x

x

x

x

x—

6—

x

x

x

x

x

x

x

x

.

7—

x

x

x

x

x

x

x

x

8—

x

x

x

x

x

x

x

x.

9—

x

x

x

x

x

x

x

x

10 11

x

x

x

x

x

x

x

x—

x

x

x

x

x

x

x

x—

.

L2—

x

x

x

x

x

x

x

x

L3

x

x

x

x

x

x

x

x

14

x

x

x

x

x

x

x

x

15

x

x

x

x

x

x

x

x—

16—

x

x

x

x

x

x

x

x

.

34

Page 41: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Table II. Composition of the 16 balanced half-sample replicates—noncertainty strata

Pair

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

Sample PSU

Pittsburgh, Pa., SMSA---Providence, R.I., SMSAl

Columbus, Ohio, SMSA----Akron, Ohio, SMSA-------

York, Pa., SMSA---------Muskegon-Ottawa,Mich---

Cayuga-Wayne, N.Y-------York, Me----------------

Baltimore, Md., SMSA----Louisville, Ky., SMSA---

Nashville, Term., SMSA--San Antonio, Tex., SMSA-

Savannah, Ga., SMSA-----Midland, Tex., SMSA-----

Barbour, Ala------------Independent cities inVirginia in 1950-------

Brooks-Echols-Lowndes, Ga------------Jackson-Lawrence,Ark---

Horry, s.c--------------Franklin-Nash, N.C------

Lafayette-Panola,Miss--E. Feliciana-St.Helena, La-------------

San Jose, Calif.,SMSA---M&n:polis-St. Paul,

-------------------

Ft. Wayne, Ind., SMSA---Topeka, Kans., SMSA-----

Grant, Wash-------------Apache-Navajo, Ariz-----

Dunklin-Pemiscot,Mo----Franklin-Jacksen-Williamson, Ill--------

Bates, Mo---------------Bayfield, Wis-----------

1—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

2—

K

K

K

K

x

x

x

K

x

x

x

x

x

x

x

x—

3—

x

x

x

x

x

‘x

x

x

x

x

x

x

x

x

x

x—

Balanced half-sample replications.

4—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x—

5—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x—

6—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

7—

K

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x—

8—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x—

.

9—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x—

!0

x

x I

x

x

x

x

x

x

x

x

x

x

x

x

x

x—

11—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

12—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x—

13—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x.

L4—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x—

15—

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x—

16

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

x

1Providence, R.I., SMSA figures into the variance computations as it is always a

part of the right hand size of the difference Z’i - Z’ in the variaIIceequa’tion”

35

Page 42: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Table III. Estimates of the percent of demographic subgroups of the U.S. adult population withhypertension and estimates of variance in percent

Demographic subgroup

Males aged 35-44 with income lessthan $2,000------------------------

Females aged 55-64 with income of$4,000-$6,999----------------------

Females with income of $10,000+-----White rmles with income of$4,000-$6,999----------------------

White females with income of$7,000-$9,999----------------------

Negroes with income of$2,000-$3,999----------------------

Males aged 18-24 with 9-12 yearsof school--------------------------

Females aged 25-34 with none orless than 5 years of school--------

Males with 5-8 years of school------~;i.o~les with 13+ years of

-----------------------------White females with 9-12 years ofschool-----------------------------

Negroes with 5-8 years of school----

Males aged 65-74 who are married----Females aged 35-44 who areseparated--------------------------

Females who are divorced------------White males.who are single----------White females who are widowed -------Negroes who are married -------------

Males aged 55-64 who are craftsmen--Females aged 35-44 who are privatehousehold workers ------------------

Males who are laborers--------------White males who are farmers orfarm managers ----------------------

White females who are clericaland sales workers ------------------

Negroes who are professionalworkers- ---------------------------

Males aged 25-34 who are employedin construction and mining ---------

Females aged 18-24 who are employedin wholesale and retail trade------

Males who are employed intransportation---------------------

Wh;;;n~les who are employed in, insurance and real estate-

White females who are employed inservices---------------------------

Negroes who are employed inGovernment-------------------------

HESestimate

ofpercent

QL_!J

19.69

25.7511.75

12.21

11.48

23.08

2.45

2.4917.82

9.34

10,3330.73

27.26

18.8013.479.0535.7727.79

15.15

10.6019.93

10.89

9.80

16.57

7.46

2.29

11.29

12.34

10.48

22.19

Replica-cationestimate

ofvariance

QQ..LL)

34.1056

17.22254.7524

1.2321

5.7600

10.2400

1.0609

5.33611.8225

2.3716

0.532910.3684

19.2721

78.322513.76411.932115.13215.1076

29.9209

21.252111.4921

2.8900

2.8561

31.2481

9.3025

6.8121

4.1209

24.5025

3.0276

9.3636

SRSestimate

ofpsrcent

~

22.22

24.4911.95

12.39

10.59

23.41

2.37

3.3317.92

9.01

9.5431.19

26.87

22.221;.;:

33:5528.79

14.29

10.6718.25

11.39

9.78

16.22

9.52

2.27

10.76

11.54

10.59

22.65

SRSestimate

ofvariance

@QAl

27.4348

18.86972.7326

1.2114

2.0066

8.7474

0.9151

10.74071.7805

1.4211

0.51877.4948

9.7751

96.02179.46582.23947.12233.8316

19.4363

12.70525.6730

6.3889

i.9522

36.7202

13.6775

500477

4.3067

13.0860

2.3324

9.6800

Samplepersonsexamined

@!Lzl

63

3::

896

934

205

253

8%

577

1,666286

201

1;:401313535

63

2::

158

451

37

63

44

223

78

406

243

Ratioof

variances

(CO1. 6)

1.2432

0.91271.7391

1.0171

2.8705

1.1706

1.1593

0.49681.0236

1.6688

1.02741.3834

1.9716

0.81571.45410.86282.12461.3330

1.5394

1.67272.0258

0.4523

1.4630

0.8510

0.6801

1.3495

0.9569

1.8724

1.2981

0.9673

36

Page 43: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

Table IV. Estimates of differences in percent between demographic subgroups of the U-S, adultpopulation with hy-pertension and estimates of variance in percent

Demographic subgroup

1. Adults with incomeless than $2,000-------

2. Adults with incomeof $10,000+------------

Difference (l-2)-----

3. Males with income$2,000-$3,999----------

4. Males with income$4,000-$6,999----------

Difference (3-4)-----

5. Females a ed 55-64 withtincome $ ,000-$6,999---

6. Females aged 55-64 withincome $7,000-$9,999---

Difference (5-6)-----

HEsestimate

ofpercent

Jk2XQ

26.23

11.75

14.48

15.36

12.83

2.53

25.75

25.25

0.50

Replica-cation

estimateof

variance@QQ.)

4.5796

1.3924

5.2850

4.2025

1.4641

4.5600

17.2225

99.8001

110.1069

SRSestimate

ofpercent

(Col. 3)

26.66

12.17

14.48

15.51

13.24

2.27

24.49

24.32

0.17

SRSestimate

ofvariance

(Col. 4)

1.7791

1.3933

3.1784

2.4499

1.1797

3.6296

18.8697

49.7500

68.6197

Samplepersonsexamined

(Col. 5)

1,099

764

535

975

98

37

Ratioof

Yariance

(CO1. 6)

2.5741

0.9994

1.6628

1.7154

1.2411

1.2563

0.9127

2.0060

1.6046

Averageof rep-licate

per-

m

26.06

11.34

15.72

14.75

12.35

2.40

25.91

26.67

0.66

anced half-sample replications, viewing the 16 pairsof dozen demographic variables for which informationnoncertainty strata and 8 pairs of randomly groupedclusters from the certainty strata as a single universe.

After the composition of each ofthe balanced halfsamples was determined, the resulting half sampleswere then separately subjected to all the estimationprocedures and tabulations used to produce the finalestimates from the entire sample.

An estimated variance S2Z~ of an estimated sta-tistic z’ofthe parameter z is obtained by applying theformula

where z; is the estimate of z based onithhalfsample and z“ is the estimate of z basedon theentire sample.

Computer Output

For the Health Examination Survey the variancetabulations and prepublication tabulations of estimatesare derived from the same computer output. Since thefindings are generally expressed as rates, means, orpercentages, each output “table” actually consists ofthree tables, the statistic of interest, such as thepercent of persons with hypertension, the numeratorof each cell in the “table, ” and the denominator of eachcell. The cells of the table are across-classificationof the statistic by age and sex with one of abouta

was collected in the survey. The analyst can alsoreceive a printout of the same three tables for eachof the 16 half-sample replications. lle replicationtables are useful when estimates of the variance ofestimated differences between statistics or of suchderived statistics as medians are needed or forevidence to support or refute a hypothesis concerningobserved patterns in the data. In additionto the “table”of findings, the output alsoincludes a’’table”of estimat-ed standard errors (of the statistic, its numerator,and its denominator), a “table” of estimated relativevariances (the estimated variance of a statistic dividedby the square of the statistic), and a “tabIe’’ofthenumber of sample observations on which the statistic,its numerator, and its denominator are based. l%elast table together with the others gives some insightinto the effect of the sampling plan and estimation pro-cedures.

Illustration

The figures in table HI are estimates from theHealth Examination Surveyofthepercent ofdemographicsubgroups of the adult population with hypertension andtheir estimated variances. ‘The officiaI HES estimatesbased on unbiased inflation factors adjusted for non-response and ratio adjusted to independent populationcontrols are shown in column 1. Estimates of theirvariance derived from 16 balanced half-sample repli-

37

Page 44: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

cations treating the estimated percent of replicatei as Z; are shown in column 2. For comparison, theestimates of percent and variance which would haveresulted if the 6,600 examined persons had been asimple random sample of the U.S. population and thesample size in each demographic subgroup or domainis considered to be fixed, are shown in columns 3 and4. The number of examined sample persons in the demo-graphic subgroup or domains (the bases of the percents)are shown in column 5. The ratios of the two varianceestimates are shown in column 6. These ratios are in-dicative of the net effect of clustering and stratificationin the sample design, deviations from equal probabilitiesof selection, and nonresponse and ratio adjustment inthe estimation procedures, and reflect as well thevariance of the estimated variance.

The median ratio of replication variance to simplerandom variance—i. e., of an appropriate variance to

a much cruder measure—is 1.30. The mean ratio is1.31. As one would expect, there is a tendency for theratio to be higher for larger values of the statistic, al-though this tendency is not very pronounced.

me Criteria for hypertension was 160 mm. Hg.

or over systolic blood pressure and 95 mm. Hg,, orover diastolic. The average of three blood pressurestaken over a 30-minute period was used for eachexamined person.

Table IV is similar to table HI but it also includesthe estimated difference in percent between two demo-graphic subgroups. Estimates of variance of the dif-ference between two estimated percents which wouldhave resulted if the sample had been a simple randomsample were obtained by summing, the estimatedvariances of the two estimated percents. The averageof the estimated percents over the 16 replicates isshown in column 7.

38 ~. S.GWEP.MENT PSINTING OFFICE : 1979 0-281-259/30

Page 45: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

VITAL AND HEALTH STATISTICS Series

Series 1. Programs and Collection Procedures. –Reports which describe the general programs of the NationalCenter for Health Statistics and its offices and divisions and data collection methods used and includedefinitions and other material necessary for understanding the data.

Series 2. Data Evaluation and Methods Research. –Studies of new statistical methodology incIuding experi-mental tests of new survey methods, studie~ of vital statistics colkction methods. new analyticaltechniques, objective evaluations of reliability of collected data, and contributions to statisticrd theory.

Series 3. Analytical Studies. –Reports presenting analytical or interprcti}re studies kmsed on vital and healthstatistics, carrying the analysis further than the exposito~ types of reports in the other series.

Series 4. Documents and Committee Reports. – Final reports of major committees concerned with vital andhealth statistics and documents such as recommended model vital registration laws and revistd birthand death certificates.

Series 10. Data From the Health Interuicw Swwy. –Statistics on illness, accidental injuries, disability, use ofhospital, medical, dental, and other services, and other health-related topics, all based on data collectedin a continuing national household intet-viev.’ survey.

Seri,> 11. [)ata lrorn the Health Examination Survey and the Heaith and iVutrition Examination SUrwy. –D~tafrom direct cxamirmtion, testing, and measurement of national samples of the civilian noninstitu-timmlized popzdation provide the basis for two types of reports: (1) estimates of the medically definedprevalence of specific diseases in the United States and the distributions of the population with respectto physical, physiological, and psychological characteristics and (2) analysis of relationships arnon$ thrvarious measurements without reference to an explicit finite universe of persons.

S.,ri, s 12. Daia From the institutionalized Population Surwys. –Discontinued effective 1975. Future reports frumthese surveys will be in Series 13.

Series 13. Data on Health Resources Utilization. –Statistics on the utilization of health manpower and facilitiesprovidin~ long-term care, ambulato~ care, hospital care, arzd family pkmning services.

Series 14, Data on Health Resources: .}lanpo uvr and Faci[itics. –Statistics on the numbers, geographic distri-bution, and characteristics of health resources including physicians, dentists, nurses. other healthoccupations, hospitals, nursing homes, and outpatient facilities.

Series 20. Data on Mortality. –Various statistics on mortality other than as included in regular annual or monthlyreports. Special analyses by cause of death, age, and other demof$waphic variubIes; geographic and timeseries analyses; and statistics on characteristic:; of deaths not available from the vital records based onsample surveys of those records.

Series 21. Data on Aratulity, ,Ilarriago, and Divorce. —Vzrious statistics on natality, marriage, and divorce other.’

than as included in regular unnual or monthly reports. Special amdyses by demographic variables;geographic and time series analyses: studies of fertility; and statistics on characteristics of births notavailable from the vital records based on sample surveys of those records.

.%ries 22. Data From the Nationai Mortality and i\Tatality Surzvys. –Discontinued effective 1975. Future reportsfrom these sample surveys based on vita] records v:ilI be included in Series 20 and 21, respectively.

Series 23. Data From the Nationa2 Survey o]’ Family GrozL~th.–Statistics on fertility, family formation and dis-solution, family pknnin,g, and related maternal and infant health topics derived from a biennizd surveyof a nationwide probability sample of ever-mamied women 15-44 years of age.

For a list of titles of reports published in these series, w’rite to: Scientific and Technical Information BranchNational Center for Health StatisticsPublic Health ServiceHyattwille, hfd. 20782

Page 46: VITAL AND HEALTH STATISTICS; SERIES 2, NO. 14 (4/66)

POST A(, E- APJO FEES PAIO

LJ S DEPARTMENT OF H E.W

HEW 396

THIRD CLASS

L)*-u.S.MAIL