Upload
others
View
3
Download
0
Embed Size (px)
Citation preview
.-- . t. orô
The mathematical analysis of
crossover designs
Sadegh Rezaei, B.Sc (Hons) (Ahwaz University), M.Sc (Tarbiat
Modarres University)
Thesis submitted for the degree of
Doctor of Philosophy
?,n
Stat'ist'ics
at
The Uniuersity of Adela'ide
(Faculty of Mathematicøl and Computer Sciences)
Department of Statistics
November 18, 1997
I
*tr|'
Contents
Contents
List of Tables
List of Figures
Summary
Signed Statement
Acknowledgements
1 Introduction and literature revlew
1.1 Introduction
1.2 Definitions .
1.3 Fields of application
I.4 Background and problems
1.5 Review of previous work on two-treatment designs
1.5.1 Optimality for crossover designs
1.6 Structure of the thesis
2 Models for crossover designs
2.1 Introduction
2.2 The model and information matrices
ll
vlll
xll
xlll
xv
xvt
1
1
,)
3
4
5
I
I
12
t2
ll
13
2.2.1. Nlixed linear model
2.2.2 Information matrices
2.3 Estimating the parameters of interest
2.4 Building the model and information matrices using averages
2.4.I Linear model for averages
2.4.2 Variance matrix of the groupxperiod means
2.4.3 Idempotent matrices for strata
2.5 Analysis of variance .
2.6 Analysis based on means
2.6.I The vector of means
2.6.2 Combining information about treatment parameters
2.6.3 SummarY of Chi's Paper
2.7 trqual gïoup sequence analYsis
2.8 Conclusion .
3 Analysis of two-treatment two-period crossover designs
3.1 Introduction
3.2 Two-treatment, two-period crossover design
3.2.1 Analysis of design when ) is present
3.2.2 Treatment information
3.2.3 Analysis of the design when ) is not present
3.2.4 Analysis of design when subject effect is fixed
3.2.5 A comparison of the three cases of 2 x 2 crossover design
3.3 Baseline measurements in the 2 x 2 crossover design
3.3.1 One baseline measurement in the design
3.3.2 Two baseline measurements in the design .
13
15
1,7
18
20
21.
22
22
23
23
26
27
28
30
31
31
32
33
35
JI
38
39
40
4l
45
483.3.3 Conclusion and discussion
lil
3.4 Balaam's design for tlvo treatments
3.4.1 Balaam's design rvhen subject effects are random and unequal
group slzes.
3.5 Tlvo-treatment, three-period crossover design
4 A Bayesian analysis for the general crossover design
4.1 Introduction
4.2 The statistical linear model
4.3 Likelihood and parameter estimation
4.4 Bayesian analysis
4.4.1 Xo is a nonsingular matrix
4.4.2 X6 is a singular covariance matrix
4.5 Choice of prior distribution
4.6 Two-treatment, two-period design
4.6.1 Posterior estimates
49
50
54
67
õ(
59
60
63
63
64
67
69
69
7t
t.)
74
/b
4.7
4.8
4.9
Two-treatment, two-period with one baseline measurement
Two-treatment, two-period with two baseline measurements
Bayesian analysis of Balaam's design
4.10 Three-period designs with two groups
5 Analysis of a two-period crossover design for the comparison of two
active treatments and placebo 78
5.1 Introduction 78
5.2 The linear model
5.3 Treatment information 81
5.4 Combination of the estimates
79
83
5.5 Conclusions
IV
84
6 Cohort designs for two-treatment crossover trials with one baseline
measurement 87
6.1 Introduction.
6.2 Age-period-cohort design
6.3 Building the model for standard and cohort designs
6.4 Standard design
6.4.1 Information matrices and parameter estimates
6.4.2 Analysis of variance for standard design
6.5 Cohort design
6.6 Treatment information
6.6.1 Information matrices or þt:10 * )6 and þz: ro- Ào
6.6.2 Where is the treatment information?
6.7 Combining information
6.7.1 Estimation of (re, Às) in cohort design only
6.7.2 Estimation of (rp,)p) in both designs
6.7.3 Estimates of treatment effect minus baseline
6.8 Limiting estimates in terms of p .
6.8.1 The estimates and covariance matrix when p -+ æ
6.8.2 Estimates and their covariance when p -+ 0
6.9 Conclusion .
7 Block structure of cohort designs
7.t Introduction .
7.2 Cohort design in general
7.3 Analysis of variance
7.4 Fitting periods and cohorts
7.5 James and Wilkinson theorem
7.5.7 Canonical variables
7.6 Expected mean squares of each block structure
7.7 Description of a function cc in S-PLUS to get the result of James and
Wilkinson's theorem
7.7.1 Projector matrices
\23
r25
129
t32
7.8 To get a pattern on p and the Cohort (elim. Periods) contrasts 133
7.9 Conclusions 134
I Treatment structure of cohort designs 135
8.1 Introduction 135
8.2 Treatment structure for cohort and standard designs in general 136
8.3 Projector matrices for cohort and standard designs in general 137
8.4
8.5
8.6
8.7
8.3.i Estimates of parameters and covariance matrix of estimates 139
Treatment information for two-treatment crossover design with two base-
line measurements and a corresponding cohort design i40
8.4.1 Treatment information when the first-order carryover is present 742
8.4.2 Treatment information when carryover effect is not present I44
Treatment information for two-treatment, extra-period crossover design
and its corresponding cohort design with one baseline measurement I45
8.5.1 Treatment information when first-order carryover effect is present I47
8.5.2 Treatment information when the first-order carryover is not present 151
Treatment information for three-treatment and three-period crossover design152
8.6.1 Treatment information when carryover effect is present 153
8.6.2 Treatment information when ca ryover effect is not present 155
Treatment information for two-treatment with one baseline measurement
with more than two cohorts 156
8.7.1 Treatment information when c: 3 and first-order carryover effect
is present
VI
r57
8.8 Conclusion
Appendices
A Some useful concepts from linear algebra
4.1 The Kronecker product .
A.2 Idempotent matrices
r59
161
161
161
r62
r62
t62
t62
164
165
165
167
r67
170
L76
Positive definite quadratic forms and matrices
Contrast and orthogonal contrasts .
ANOVA sum of squares as quadratic forms .
4.6 Summing vectors,, and -E-matnces
B Bayes'theorem
8.1 Normal prior for multinormal sample
C The S-PLUS program for cohort designs
C.l S-PLUS functions
Bibliography
4.3
4.4
A'.5
C.2 Use of S-PLUS functions for Chapter 6
vll
List of Tables
2.1 Layout of the design 13
2.2 Group x period means 19
ANOVA table for standard design, no treatment terms. 22
ANOVA table for standard design, using means, no treatment terms. 26
ANOVA table for standard design, equal size groups, no treatment terms. 29
2.3
)A
2.5
3.1 Notation and layout for the simple crossover design
3.2 Expectation of responses in 2 x 2 crossover design when ) is in the model 33
3.3 Weighted orthogonal matrices for four strata in the design 35
3.4 Estimates of parameters and their variances. 39
3.5 Two-treatment design with one baseline measurement 4l
3.6 Expectation of responses in 2 x 2 crossover design when À is in the model 42
3.7 Weighted orthogonal matrices for four strata in the design 43
3.8 Two-treatment crossover design with two baseline measurements 45
3.9 Weighted orthogonal matrices for four strata in the design 47
3.10 The layout of Balaam's design with the number of subjects in each group. 50
3.11 Expected value of mean in Balaam's design
3.12 Weighted orthogonal matrices for four strata in Balaam's design
3.13 Two-treatment,, three-period crossover design
4.I Layout of general crossover design .
32
50
51
54
58
vlll
4.2
4.3
4.4
4.5
Idempotent matrices of each stratum
Table of idempotent matrices with their ranks
Expected values of responses in two-treatment.l two-period design.
Expected values of responses when treatment B is a standard treatment
59
61
67
67
795.1 Layout of design with two active treatments and placebo
5.2 Expected values of responses in comparison of two active treatments and
placebo. 80
b.3 Idempotent matrices for four strata in the design including placebo. 8i
6.1 Two-treatment three-period crossover design with baseline measurements 91
6.2 Cohort design in two-treatment three-period crossover design with one
baseline measurement .
6.3 Two-treatment three-period crossover design with one baseline measure-
ment repeated as 4 groups of n.
6.4 Treatment effects for the 12 means of observations for two-treatment three-
period crossover design
6.5 Ana,lysis of variance for standard design
6.6 Projector matrices for cohort design for two orders of fitting 101
6.7 Matrices introduced in the table of projector matrices for cohort design 101
6.8 Information matrix in each stratum for both designs 103
92
92
93
97
6.9 Information matrix about Ér and B2 in each stratum for both designs
6.10 Percentage of information in each stratum about ro * )o and rs - )o for
the cohort design 105
6.11 Those strata which contribute to estimate some parameters of interest . 107
6.12 Contrasts for estimating parameters in cohort design 108
6.13 Contrasts for estimating parameters in standard design i08
7.1 ANOVA table for standard design, no treatment terms.
105
IX
i19
7.2 ANOVA table for general cohort design, no treatments term
7.3 Helmert contrasts for Periods
7.4 Helmert contrasts for Cohorts
7.5 Contrasts for three cohorts and three periods
7.6 Canonical correlation coefficients and orthogonal contrasts for two cohorts
and various periods
7.7 Canonical correlation coefficients and orthogonal contrasts for three co-
horts and various periods .
8.1 Projector matrices for general standard design
8.2 Projector matrices for general cohort design
8.3 Two-treatment ctossover design with two baseline measulements
8.4 Cohort design in two-treatment, two-period crossover design with two
120
130
131
t32
I ,J,)
734
138
138
r40
baseline measurements. T4T
8.5 Expected values of two-treatment, two-period crossover with two baseline
measurements when we consider the cohort design or double standard design.141
8.6 Information matrices for standard and cohort designs in two-treatment
design with two baseline measurements, when first-order camyover effect
is present 143
8.7 Orthogonal contrast on Cohort (elim. Periods) I44
8.8 Information matrices for two-treatment, two-period standard and cohort
designs when first-order carryover effect is not present. 145
8.9 Tlvo-treatment, extra-period crossover design with one baseline measurementl46
8.10 Cohort design for two-treatment, extra-period crossover design with one
baseline measurement t46
8.11 trxpected values for design of two-treatment, extra-period crossover trial. 147
8.12 Information matrices for standard and cohort two-treatment, extra-period
designs lvith one baseline measurement. .
X
r49
8.13
8.14
8.15
8.16
Treatment information for both standard and cohort two-treatment, extra-
period design with one baseline when the carryover effect is not present. . 151
Three-treatment, three-period crossover design 152
Cohort design in three-treatment and three-period crossover design 153
Expected values for three-treatment, three-period crossover and its corre-
sponding cohort design 153
8.17 Information matrices for standard and cohort designs for three-treatment,
three-period
8.18 Treatment information for both standard and the cohort designs in three-
treatment, three-period when the carryover effect is not present. 155
8.19 Two-treatment crossover design with one baseline measurement when c:3156
8.20 The 18 means for two-treatment with one baseline measurement when c : 3156
8.21 Information matrices for standard and cohort designs for two-treatment
with a baseline, c: 3 158
8.22 Information or rs and Às in three cases of design . . 159
. 754
XI
List of Figures
lú times variance of (i) direct treatment effect and of (ii) carry-over effect
estimatorfor 2x2 designinterms of p:0,1,2,5,10,20 and q : rtl(nt-ln2). 40
Conditional distribution of carryover effect given direct effect 68
Relation between posterior estimates when Ic :0.2 73
3.1
4.t
4.2
5.1 Standard deviation of î¡ as p changes
6.1 Diagram of cohort design and its main concepts in crossover trial
6.2 The relationship of projector matrices in cohort design
7.1 Full cohort design length cpgn .
7.2 Full cohort design, length cp . .
7.3 Cohort design for three periods in each cohort
99
86
89
118
tzt
130
xll
Summary
The mathematical analysis of crossover designs
Sadegh Rezaer
Department of Statistics
The common theme of this thesis is the theory and application of crossover designs. In
this thesis both classical and Bayesian approaches are considered. The thesis covers three
broad areas.
In the classical approach, the methodology of repeated measurements is used to de-
velop structure and models to describe the properties of crossover designs with different
numbers of periods and unequal numbers of subjects in each group. The common ap-
proach in this part is to look at the information available about treatment parameters
and to see in which strata this treatment information resides. Estimates are obtained
by combining the available treatment information from different strata, in particular,
from the between-subject and within-subject strata. The estimates are equivalent to the
generalised least squares estimates.
To obtain the Bayesian analysis of crossover designs, prior distributions are chosen for
carryover and direct treatment effects to reflect the current state of information about
these parameters and the relationship between them. The assumption made in this
thesis is that since the effect of a treatment dies away over time, we might expect that
the carryover effect in the next period is some proportion, less than 1, of the original
treatment effect. It is assumed that the a priori information is summarised by the fact
that lve expect À to be a proportion k of the treatment effect r, but with uncertainty
described by a variance oSl where afr ìs known and k is some small positive value less
than one. In addition it is supposed that the r has an un-informative uniform prior
xlll
distribution.
Traditional crossover designs, even if baseline measurements are included' still do
not allorv estimation of the difference between the average treatment effect and baseline,
unless one is prepared to make strong assumptions about the period effects, in terms of
their size or likely variance. In the third part of the thesis, an approach is developed in
which some subjects enter the trial with a delayed start. If the situation justifies the twin
assumptions that period effects are related to calendar time (e.g. time of year), and that
there are no effects due to the length of the time in the trial, referred to here as 'age', then
such designs allow the estimation of the average effect of treatment versus baseline. The
designs have similarities to age-period-cohort designs, and they are referred to here as
'cohort designs'. They represent an interesting example of designs with non-orthogonal
block structure.
XIV
Signed Statement
This work contains no material which has been accepted for the award of any other
degree or diploma in any university or other tertiary institution and, to the best of my
knolvledge and belief, contains no material previously published or written by another
person, except where due reference has been made in the text.
I consent to this copy of my thesis, when deposited in the University Library, being '
available for loan and photocopying'
DArE , . .L/.1L-/ .t? 2 7SIGNED:
XV
Acknowledgements
I wolld like to thank my supervisor, Prof. Richard Jarrett, for his support, encourage-
ment and enthusiasm throughout the development of this work. I also thank him for his
accessibility at all hours in considering aspects of the work, and the many hours spent in
discussion and proof-reading.
Many thanks go to Dr Bill Venables due to his assistance relating to computer pro-
grams in particular, S-PLUS and MAPLB enquiries.
I also would iike to acknowledge the financial assistance of the Ministry of Culture
and Higher Education (MCHtr) of Islamic Republic of Iran during the period 26lal92 to
26lI196 in the form of a University of Shahid Chamran Schoiarship.
XVT
Chapter 1
Introduction and literature revle\M
l-.1 Introduction
Clinical trials attempting to compare the efficacy of two treatments often use the two-
treatment two-period crossover design, in which each subject is randomly assigned to
receive both treatments in one of two sequences. This can lead to estimates of the
difference in treatment efficacy that have variances which are less than half of those
provided by the standa¡d parallel design, in which each subject receives, by random
allocation, one treatment only. This reduction of variance results from the elimination
of between-subject variability. Another reason for choosing the crossover design is the
reduced cost of performance of the experiment.
Designs in which experimental units receive more than one treatment application in
the course of the experiment are called crossover designs. Such designs are known by
different names in the literature: changeover designs, or repeated measurement
designs, in some cases. In this design, the experiment is split up into different periods.
Each subject receives one treatment in each period. Usually each subject is observed for
the same number of periods. Designs are composed of several treatment sequences, each
subject being allocated to a sequence at the start of the experiment. It is often assumed
that equal numbers of subjects are allocated to each sequence.
The response variable is often supposed to be continuous. The analysis of crossover
designs when the response is binary has been discussed in the literature by, for example,
Gart (1969), Prescott (1981), Farewell (1985), Jones & Kenward (1987), Kenward &
1
Jones (i9S7b), but little has been written concerning the design of such experiments.
Gart (1969) has given an exact test for comparing matched proportions in the analysis of
binary responses in the crossover designs. Layard & Arvesen (1973) discuss the analysis
of Poisson data in crossover designs, although this seems to be restricted for all practical
purposes to the two-treatment, two-period crossover trial.
As Hedayat & Afsarinejad (1973) pointed out, the need for this design cari be justified
in several ways.
1 Due to budget limitation, the experimenter has to use each subject for several tests
2. In some experiments, the subjects are human beings or animals and often the nature
of the experiment is such that it calls for special training over a long period of time.
Therefore, due to time limitation, one is forced to use each subject for several tests.
3. One of the objectives of the experiment is to find out the effect of different treat-
ments on the same subjects in drug, nutrition or learning experiments.
4. Sometimes the subjects are ra e, therefore the subjects have to be used repeatedly.
The crossover design is a special case of a randomised control trial and has some appeal
to statisticians, medical and psychological researchers. The crossover design allows each
subject to serve as their own control and this, in theory, should reduce the background
level of variation affecting treatment comperisons. Direct evidence moreover., can be
obtained about individual subject preference since each subject receives two or more
different treatments.
In the simple casc known as the two-period two-treatment crossover design or the
2 x 2 design, each subject receives A or B in the first period and the alternative in the
succeeding period. The order in which A and B are given to each subject is randomised.
Ideally, half of the subjects receive the sequence AB and half of the subjects receive
the sequen ce B A. This is so that any trend from first-period to second-period can be
eliminated in the estimate of differences in response. In any particular case, the numbers
in the groups may not be identically equal for a variety of reasons, including drop-out and
the nature of the randomisation which might necessarily be done sequentially as subjects
enter the trial over time.
2
I.2 Definitions
The area of crossover design like any other area of statistics contains certain terms rvhich
are not found or used elsewhere. Terms like "direct effect "and "residual effect"
or "carryover effect" ,, washout period and baseline measurement are the most
commonly used ones. These terms are defined belolv.
1. The effect that a treatment has during the period in which it is applied is referred
to the direct treatment effect.
2. The effect of a treatment that persists into the next treatment period is referred to
as the carryover effect.
3. Sometimes, steps are taken by the experimenter to prevent or make less severe
the occurrence of carryover effects by use of a waiting period, commonly called a
washout period, between applications of treatments.
4. ln some situations, the experimenter takes a measurement from the subject before
a treatment is giver i.: the subject. These measurements are known as baseline
rneasurements.
Many authors have discussed the design and analysis of the two-period and two-treatment
crossover design with and without baseline measurements or using washout period or
using extra treatment periods or extra groups in comparing two treatments and have
addressed the issue of carryover effects. The general conclusion of this work is that the
presence of carryover efiect invalidates the use of this crossover design, and that, unless
carryove effects are negligible, a parallel design should be employed, or,, rf a crossover
design has been used, that the analysis should be based only on first period data.
1.3 Fields of application
Crossover designs have had application over many years in a broad spectrum of research
a eas, including agriculture experiments, Cochran (1939), animal husbandry, Cochran
et al. (1941), bioassay procedures, Finney (1978)) food science, market research, medicine,
pharmacology, psychology. However, among applications in occupational psychology,
ù
Parkes (1982) gives an interesting example where the crossover design occurs naturally.
Various examples of the use of these designs in industry can be found throughout the
literature, for example Raghavarao (1989). An important area rvhere crossover designs
are often used is in clinical trials and the pharmaceutical industry. One particular de-
sign, the trvo-treatment, two-period design has been extensively used and lvidely studied
in the literature. The book by Jones & Kenward (1989) lists more than one hundred
papers that have been written on this subject. But there are still several challenging and
practically useful unsolved problems awaiting solution'
L.4 Background and problems
Despite the advantages mentioned above, the design has fallen somewhat into disrepute
because of the possibility of a carryover effect, or a period-by-treatment interaction. A
term for carryover treatment effect is introduced into the model to allow for the ab-
sence or inadequacy of washout periods. If the effect of treatment does persist into the
period following the period of administration, then a carryover term is inclucled in the
model and the estimates of the direct treatment effect will be based on all the data of
design. For a more detailed discussion of the issues involved, see Abeyasekera & Curnow
(1984). Unfortunately, if such a carryover effect is present, naive estimation of the direct
treatment effect will be potentially completely misleading and this design cannot give
an unbiased estimate of direct treatment effect. The US Food and Drug Administration
suggested that this design should not be used, unless unequivocal external evidence of
the absence of carryover was available. Brown (1930) took a similar unfavourable view
of this particular crossover design. There are many papers that react against this feeling
about crossover in general,, see Healy, in discussion of Lewis (1983), Barker et al. (1982),
Patel (1933) and Willan (1983). Moreover, although it is theoretically possible to test for
a caïryover effect, in most small experiments the test is not at a1l powerful. In addition
as Senn (1988) has argued "the significance tests for a ca ryover effect are a form of
self-delusion" and he agreed that the justification for a crossover design must depend on
medical opinion. In this regard he recommended using a washout period to achieve the
atm
The main purpose of a crossover design has been to devise designs that allow the
4
treatment effect to be estimated within-subject. If the subject effects are assumed to be
fixed, then any estimator of direct treatment effect must be within-subject. Some authors
assume that the subject effects are random; this assumption leads to a betrveen-subject
estimate for direct treatments as well, and this can be combined with the rvithin-subject
estimates, as described by Chi (1991).
1.5 Review of previous work on two-treatment de-
srgns
The analysis of crossover designs has been done in the literature using various methods.
In parametric methods, a linear model is set up with all the parameters of interest and
normal least squares techniques are used to obtain estimates of these parameters or
parameters of interest, thus allowing hypothesis tests to be performed. Despite many
variations of approach there is one point on which authors are in substantial agreement,
namely the desirability of a preliminary check for the presence of carryover effects in the
design. The development of the methods starts from the most elementary technique, given
by Gr\zzle (1965), reviewed recently by Senn (1991), and continues through the various
statistical methods proposed by Balaam (1968), who used four groups to compare just two
treatments. Chassan (1964), Ebbutt (1984), Federer & Atkinson (1964), Fletcher (1987)'
Freeman (1989), Jones & Kenward (1939) consider this design with various features of
analysis to get unbiased estimates for direct treatments. The fully Bayesian methods with
informative prior for the parameter of interest were initiated by Grieve (1985), Grieve
(1e86).
The analysis of this design was given by Grizzle (1965) (with a subsequent correction
it Grizzle (1974)), who focused on the simple two-period, two-treatment crossover design
under the model in which subject effects are random. In this paper, Gizzle (1965)
proposed a mixed model for univariate analysis of crossover design. In his model, the
hypothesis of equal carryover (I1o) effects for two treatments is tested from between-
subject variability and since it is regarded as a preliminary test, a relatively high levei of
significance is used. If /lo is not rejected the data from both periods are used for testing
the hypothesis of equal treatment effects. Otherwise, the use of the data from the first
period alone is justified for treatment comparison, resulting in a loss of information.
5
Based on the restrictions of Grizzle's mixed model, several researchers, e.8. Wallen-
stein & Fisher (1977), have been led to respond to the use of crossover designs. They
generally pointecl out the main disadvantages of this design, that is, (i) the loss of infor-
mation when the carryover is present and (ii) the low power of the preliminary test for
ca ryover, as it is based on between-subject variability, and (iii) the increased chance for
bias in the test for equal direct treatment effects derived from the data of both periods.
Zimmermann & Rahlfs (1978) proposed a bivariate normal model and analysed ctossover
,design using a multivariate analysis of variance approach to the repeated measutements
design. Their approach leads to tests identical to those in Gr\zzle's mixed model ap-
proach. They also proposed a method for testing equal camyover effect and direct effect
simuitaneously.
I(och (\g72) described non-parametric methods of analysis for the case of two-treatment,
two-period crossover designs. He proposed a number of non-parametric procedures for
performing various hypothesis tests in connection with the Grìzz\e (1965) model of
crossover trials. One of the various tests which he used was a rank test for direct treat-
ment effect in the presence of carryover effects. Koch's procedure consists of ranking
the period differences for all subjects in the design and then using the Wilcoxon test for
difierences between the two sequence groups. Cornell (1930) extended Koch's result but
only to a small extent.
Brown (19S0) compares the crossover and parallel group or completely randomised
one.period designs in terms of the number of replicates required to achieve a given power
and concluded that the crossover design can yield great savings in cost if the assumption
of no ca ryover effect is valid, but the design should not be used if this assumption is in
doubt. He also showed that the lack of power of the pretest leads with a relatively high
probability to the non-rejection of the hypothesis of no carryover effects when they exist
and the following analysis of the treatment effect in crossover designs is based on a test
biased by carryover effect.
Hills & Armitage (1979) describe particularly clearly the usual method of analysis
for two period crossover design with one measurement in each period per sub.ject for an
ordinal response. For more than two periods it would appear that these analyses are of
little use.
There a e some papers in the study of crossover design in which baselìne measurements
6
are considered. Hills & Armitage (1979), Armitage & Hills (1982), I(ershner & Federer
(1981) and Federer & Atkinson (196a) all applied the baseline measurements and have
limited their discussion to the tlvo-treatment case only. Patel (1983) proposed to consider
baseline measurements which might be obtained prior to each perìod in a two period
crossover design. He showed that these measurements can be used in a preliminary test
to determine the validity of a test for treatment comparison and also for testing the
hypothesis of equal treatment effects.
Wallenstein (1979) showed that, if baseline observations are available before each
period, the two-period two-treatment crossover design may be used for valid estimation
of direct effect even in the presence of carryover effects, under certain constraints about
period effects.
Kershner & Federer (1981) restricted their attention to only two treatments and
compared the variance of contrasts for many higher-order crossover designs in presence
of a mixed effect due to treatment sequences in the model.
We can find an excellent introduction to crossover designs in the book by Jones &
Kenward (1939). Kenward & Jones (i987a), when they introduced the treatment-by-
period-interaction, said that it is desirable that a check be made for it in the statistical
analysis.
Senn (1991) however completely rejected the significance test for carryover and he
said that despite many variations of approach for analysing the crossover design, there
is one common opinion upon which most authors are in substantial agreement, namely
the desirability of a preliminary check for the presence of carryover effects. He in his
paper has argued that the signifrcance test for carryover effect "is a form of self-delusion
and the justification for a carryover design must be dependent on medical opinion as to
whether the wash-out period can be regarded as achieving its aim."
Chi (1991) has shown that we can recover information on direct and carryover treat-
ment effects from a between-subject analysis as is done in an incomplete block design with
subject as blocks. He proved that by combining the within-subject and between-subject
estimates, we can obtain the generalised least square estimate (GLStr), such that the
GLStr estimation is a weighted combination of the within-subject and between-subject
estimates, where the weight depends on the ratio of the between-subject variance compo-
nent o! to the within- subject variance component o2. Laird et al. (1992) also consider
7
the combination of information from between and within subjects and apply their results
to a number of standard designs, such as those of Balaam (1968) and Koch et al. (1989).
Freeman (1989) considered the usual analysis of tlvo-period, tlvo-treatment crossover
design, that is, a two stage procedure in which (i) the presence of carryover is first
tested and then (ii), according as the preliminary test is or is not significant at some
pre-specified level of probability, we eìther use just data from the first period or use all
the data assuming there is no differential carryover effect. Because the preliminary test
for carryover is highly correlated with the analysis of data from the first period only' he
showed that actual significance levels are higher than the nominal level even rvhen there
is no differential carryover. In order to make inference about the difference in treatment
effects he compared three options:
o Procedure PAR, which uses the simple parallel group design, ignoring the second period
altogether.
o Procedure CROS, which uses the differences between first and second periodsl, but
assumes that no carryover effect is present.
o Procedure TS, that is, two-stage procedure for 2 x 2 crossover design proposed by
Grizzle (i965).
He examined the two-stage procedure in terms of mean square error of point estimate,
confldence intervals and actual significance level of hypothesis tests for the differences
between the effects of the two treatments. He showed that even when no carryover
effect exists, the actual significance level of the two-stage procedure of Grizzle (1965)
is substantially larger than the nominal level. The reason is that the pre-test against
carryover effect is highly correlated with the first period comparison test. Thus the
conditional first period test is biased.
One of the main advantages of the crossover design compared with a simple completely
randomised design is that, for a given precision, the study requires fewer subjects. In
order to keep this essential advantage, Lasserve (1991) determined the optimal design for
crossover design with fixed size of population. He paid special attention to the design
with various numbers of periods for comparing two treatments. Willan & Pater (1986)
and \Millan (19S8) develop approaches based on the assumption regarding the rtlationship
betlveen catryover and direct treatment effects.
Some authors in terms of solving the problems have chosen the Bayesian approach.
8
Sehvyn et a1. (1981) have presented a method of analysis for direct treatment effects
in 2 X 2 crossover design with an equal number of subjects in each sequence and an
un-informative prior on the variance component. The Bayesian approach leads to a
magnitude and precision of the experimental estimate, rather than the classical approach
impliecl by a preliminary test. Grieve (1935) has given a Bayesian analysis based on
the Bayes factor against unequai carryover effects which provide a mixture of the tlvo
moclels corresponding to t'absence of the carryover effect" and "presence of the carryover
efiect", in which the weights are both a functìon of the data and of the likelihood of a
carryover efiect. Grieve (1936) in a recent paper extended the Bayesian approach for
tlvo-period crossover when there are tlvo baseline measurements in the design. In his
paper he cleveloped the study of a two-period crossover for displaying the dependence of
posterior inferences concerning the treatment effect on unavoidable prior beliefs about
the correct model.
1.5.1 Optimality for crossover designs
With the restrictions on the number of periods, p, and the number of treatments, f , in a
crossover design, optimality is defined as finding a design that gives a minimum variance
treatment estimator
Just for two treatments, Laska et al. (1983) pointed out that for any number of periods
exceeding two, we can find optimal designs. They have shown that for even numbers of
periods, balanced uniform designs are optimal and for odd numbers of periods, an extra
period design can be used. These results were obtained as special case by Laska &
Meisner (igS5). Kershner & Federer (1981) calculated the variance of estimators for
direct treatment, carryover and total treatment effects for a number of two treatment
designs, using from two to four periods. The various optimality results pointed out are:
the most efficient three-period design is AB B, B AA and the most efficient four-period
design is AAB 8,, B B AA, AB B A, B AAB.
1-.6 Structure of the thesis
The first main purpose of this thesis is focused on the analysis of two-treatment crossover
and using the cohort design in this trial with one and two baseline measurements.
I
The second main purpose of this thesis is concerned with developing a Bayesian
approach to analysis of two-treatment crossover designs by using an informative Normal
density for the parameters of interest.
Chapter 2 develops a general analysis of crossover designs, with the aim of showing
rvhere the treatment information lies in different strata. The focus is on combining all
information in a1l strata to obtain appropriate estimates and showing that r,he estimate
is equivalent to the generalised least squares estimate. In Chapter 3 lve apply the results
from Chapter 2to the two-treatment, two-period crossover design without and with one
and two baseline measurements, Balaam's design and the two-treatment, three-period
crossover design.
Bayesian analysis of general crossover design with an informative prior density with
full rank and non-full rank variance-covariance matrix for the prior density is dealt with in
Chapter 4 and application of the results are shown in this chapter. For choosing the prior
distribution for the parameters of interest, we assume that since the effect of a treatment
dies away over time, we might expect that the carryover effect in the next period is some
proportion of the original treatment effect. It is assumed that the a priori information
is summarised by the fact that we expect À to be a proportion ,k of the treatment effect
r, where k is a small positive value less than one. In addition it is supposed that the r
has an un-informative uniform prior distribution. Analysis of two-treatment, two-period
design with a placebo is shown in Chapter 5.
In the classical analysis of crossover designs, even if baseline measurements are in-
cluded, we could not estimate the difference between the average treatment effect and
baseline, unless we make strong assumptions about the period effects, in terms of their
size or tikely variance. In Chapter 6 an approach is developed for situa,tions in which
some subjects enter the trial with a delayed start. If the situation justifies the twin as-
sumptions that period effects are related to calendar time (e.g. time of year), and that
there are no effects due to the length of the time in the trial, referred to here as 'age',
then such designs, which are referred to here as cohort designs, enable us to estimate
the average effect of treatment against baseline. We compare the results of the cohort
design with the standard design. This alternative design is like an age-period-cohort
design applied to the crosso\¡er trial with baseline in order to recover information about
average effects of treatment and ca ryover. By making some orthogonal contrasts for two
10
important strata lve get the information on the parameters of interest in each stratum.
This treatment information can then be combined to get estimates of the parameters of
interest for the two designs.
In Chapter 7, the block structure of the experiment is presented for the general class
of cohort designs. We use the work of James & Wilkinson (1971), who provide a way of
looking at the array of means and identifying the contrasts for cohorts after eliminating
the effects of periods, i.e. Cohorts(el. Periods), and their expected mean squa es. For
this purpose \rye need to split the Cohort(el. Periods) stratum into different 1 degree
of freedom contrasts each with a different expected mean square and then to identify
the projector matrix for projecting onto the vector space spanned by the columns of
Cohort(el. Periods). This is done in detail for cohort designs with two and three cohorts.
For these special cases we obtain the analysis of variance tables and by using a function
written in S-PLUS (Venables & Rezaei (1996)) we show their projector matrices and the
expected mean square for each contrast in the various strata.
In Chapter 8 we apply treatments to the non-orthogonal block designs of Chapter 7
and use the results of James and Wilkinson's theorem to estimate the treatment effects
in a cohort design and then compare it to a corresponding standard design. We want
generally to see where the treatment information goes and what treatment information
is available by splitting the observations into the several strata and using the idempotent
matrices. To get the estimates of parameters we combine the estimates of parameters
from those strata in which there is some information about parameters.
11
Chapter 2
Models for crossover designs
2.L Introduction
As we mentioned in Chapter I the crossover design can be considered as a repeated
measurements design, since each subject is used on more than one occasion.
The common way of analysing crossover design is to consider subject effects as fixed
effects. As Chi (1991) pointed out, Milliken & Johnson (1984) considered the subjects
as blocks in an incomplete block design and have given the within-subject analysis with
analysis of the averages across periods for all subjects. Gough (1989) has used the
REML approach to recover between-subject information. Chi (1991) also recovered
between-subject information and has shown that the combination of between-subject
and within-subject information is equivalent to a generalised least squares analysis of
crossover design.
In this chapter the methodology of repeated measurements and combining information
from different strata is used to obtain some information about parameters of interest in
the general crossover design with p periods and l/ subjects. The data can be presented
in a table as shown in Table 2.1. Our aim is to look at the information available about
treatment parameters and to see in which strata this treatment information resides.
We will get the estimates by combining information from the different strata and will
show that the estimate is equivalent to the generalised least squares estimate.
t2
2.2 The model and information matrices
Suppose lve arrange the data in a vector, y, reading in order across the rows' so that
U:(At,...,Urp,,...,Uijr...)ANe)tdenotesthevectorofallobservationsonthe.À/subjects
and p periods, such that g;¡ is an observation on the ith subject and in the jth period' as
shown in Table 2.1. Treatments,, yet to be defined, will then be applied to each subject in
each period. Some treatments may in fact be null treatments, corresponding to baseline
measurements.
Period
1
2
Subject
z
¡\i
I2 J p
yij
Table 2.1: Layout of the design
2.2.L Mixed linear model
The mixed linear model for this general case of the crossover design can be given as
y:pl*Pr+BplT0 Ie, (2.2.r)
13
u'here
v
p
7l
p
is the Np x 1 response vector from all subjects in the
design,
is the grand meanl
is the vector of period effects,
is the vector of subject effects, which can be considered
as random effects, normally distributed and independent
with mean zero and variance o!,
is a vector of fixed effects including direct treatment and
ca ryover effects and in some designs may include group
and second carryover effects,
is an lúp x 1 vector with all elements 1,
is the period design matrix of dimension Np x p,
is the subject design matrix of dimension ly'p x N,
is the treatment design matrix of dimension ly'p x ú'
where ú is the number of treatment parameters in the
design,
is an lúp x I error vector whose elements are assumed
normally distributed and independent with mean zero
and variance 02.
Hence the total number of observations is lfp.
For P and B we can put
P : liv E) Ip (2.2.2)
B : 1¡r81p
where I notes the Kronecker product for which the definition and properties are given
in Appendix A, 1 is the identity matrix, and the subscripts denote the length or size of
the matrices, as appropriate.
For example, the Kronecker product of 1¡¿ and I, is a matrix of dimension Np x p
0
t
P
B
T
€
I4
given by
1¡S1o:I v̂
IpNpxp
and the Kronecker product of {y and 1o is a matrix of dimension Np x l/ given by
Leo 0
/ru8lp:o1o 0
00 1pNpxN
where each 0 is a column of length P
In this formulation of the model
V ar(y) : 62lxn + olBB'
: 62lwn + p"?(Iu I /o)
: o'(Iy A /e) + poSUN I /r),
: o'(tu Ø Kp) * (o' + po?)(Iw Ø Jr),
(2.2.3)
where "/- will be used for the n'¿ x Tn idempotent matrix of rank 1, with all elements
equal to If m. The matrix I{^ : (I - "I-) will be an m x m idempotent matrix of rank
(m - 1). Thus, applied to any vector of the appropriate length, ,I replaces each element
by the mean, and If replaces each element by itself minus the mean.
2.2.2 Information matrices
If there were no terms in the model other than d, the estimate of 0 would be (T'T)-rT'y
and the variance matrix for the estimates would be o2(T'T)-1. The Fisher information,
given by the expected value of minus the second derivative of the log likelihood, is the
inverse (T,T) lo2. As a matter of convention,, we shall regard the information about
9 as being given by the matúx T'7. As further terms are included in the model, in
particular those whose design matrices are not orthogonal to the columns of the T, the
information available for estimating d is partially lost in estimating the additional terms.
For example, the columns of T may not be orthogonal to the vector 1, in which case
some of the treatment information in T'T will be confounded with the grand rnean. The
15
information so confounded is T'J¡¡rT. Then the available treatment information in the
space orthogonal to the grand mean is
Te : T'KT.
The information about the parameters of interest 0 can be thought of as residing in
4 different strata, corresponding to the following orthogonal idempotent matrices:
. Qo -- /¡¡ I Jo for grand mean stratum,
. Qt : /¡¿ I K, for period stratum,
. Qz:I{¡v I Jo f.ot between subject stratum,
. Qs :11¡¡ I K, for subject x period stratum,
where we have defined idempotent matrices and their properties in Appendix A.
Information about d is distributed into these 4 strata as
T'Q¿T (i : 0,1,2,3).
Since Q¿ are orthogonal idempotent matrices, we can split the data into four separate
component" Q ¿y and show the properties of each and the relation between them. For the
grand mean stratum, this component can be written as
QoU: p¿l * (lru Ø Jr)r * ("/¡n Øtr)0 * J¡¡oTî * Jxpe.
As noted before, the information about g here is T'JT and we can see that the single
value obtained for the grand mean estimates
u I (Ltn)lp+ Í'T0)l@P),
with a variance ç"' + n"?) l(¡fp). Then ?'("Iiv I Jr)T is the information about 0 contained
in the grand mean. This is completely confounded with ¡; and hence in the absence of
any knowledge about ¡-1, there is no recoverable information about 0, and for this reasonl
the only treatment information available to us is Tt KT.
In the period stratum, the component is
Qß: (1"ø Kr)o*QtTïtQÉ,
so that the treatment information is T'Q1T' andvar(Qg): o"Qt' we note' however'
that this treatment information is compromised by the presence of the period effects.
The information in Periods is only available if we assume that period differences are not
16
present or that they have some known variance. There are unlikely ever to be enough
periods to estimate the variance of an assumed random effect for periods.
Similarly, in the subject stratum lve have'
Qza : (It¡¿ I Lr)p + QrTï I Qze,
so that the treatment information is T'Q¡T, and Var(QzA) : @' + po?)Qz.
Finally for subject x period stratum we can write
esU:esTï*ese,
so that the treatment information isT'Q3T, andVar(QzU): o2. We note that, since
Q¿Q¡:0 (i I j),the components in different strata are uncorrelated. Further,
T,KT: D T,Q¿T,,i=l
so that the total treatment information is split between these strata, each with its own
precision and 3
Var(y):V :D6nQ,,i=O
so that the weighting for the orthogonal idempotent matrices corresponding to each
stratum is given by
6r: o2 + po?, (2.2.4)
2.3
In the general case \¡¡e can rewrite the mixed linear model in Q.a.l) in the following form
a:T0*t (2.3.1)
where all other terms are absorbed into the covariance matrix of (, so that ( - l/(0, V)
such that V is an N x l/ positive definite variance matrix. In doing this, we assume for
the moment that all other terms, even the period effects, can be represented by random
effects. Then
0: (T'V-tT)-LTtV-ta'
6s: o2
do
ôr
Estirnating the parameters of interest
17
If we can write
V :D6nQn,
lvhere Q¿ is the orthogonal idempotent matrix of ith stratum, then we can rvrite
v-t -Сn'Qn'
and the information about 0 canU" tnolrght of as coming from the diftèrent strata, as
T'Q¿T, and providing separate estimates
0¿: (T'Q¿T)-T'Q¿7.
where {-} notes the generalised inverse of a matrix. To get the overall estimate of 9, we
can combine all information in the strata with appropriate weights (ll6;), and hence we
can obtain the following estimate for parameters of interest
0 (Ð ¿,'r' 7nr)- D õo'T' Q oa (2.3.2)
(Ð ¿,' r' 8,r)- (Ð 6;r r' Q ¿T o ¿)
Var(0): (D 6itTtQiT)-
Care needs to be taken in deciding which parts of the information are included. For
example, we decided that the treatment information T'QsT is not useful. Simiiarly, if
we believe that there a e nonzero period effects, then the information T'Q1T will also
contain period effects and hence will not provide reliable unbiased estimates of 0. Hence,
rve would generally use only i :2,3 in recovering the information. This is equivalent to
allowing ôs, ð1 -+ oo, or regarding ¡-l and ¡' as random effects whose variances approach
The variance of this combined estimate is
zero
2.4 Building the model and information matrices us-
ing averages
In later chapters rve consider designs in lvhich subjects are allocated at random into one
of g groups, where all subjects in the same group get the same treatment regime. Suppose
norv that the /ú subjects are divided into g groups, with n¿ (i : l, 2,. . . ,9) in each group'
18
sirch that N : Ðf=r n¿. The data now needs to be indexed with three subscripts às U¿jt ,
where i refers to the group, 7 refers to the subject rvithin the group, and k refers to the
period. If we write the data values as a single vector in standard order, reading across
the rorvs of the table of data, rve have, analogously to equation (2.2.1),
A : Lr]- * Pr + Bp I T0 + e, (2.4.I)
rvhere y is now a vector of length 1/p. The matrix T can be conveniently partitioned into
a separate matrix for each of the g groups of subjects as
T_
1rr, I Tr
ln, Ø Tz
1- eT",'q - v
where 4 is a p x ú matrix defining the treatment assignment for each subject in group i,
for the p periods and the ú treatment parameters in 0.. We can then consider a reduced
table which contains means for each group and each period. In Table 2.2 we show the
group x period means.
Period
1
2Group
g
t2 p Average on Periods
Utt. An. Utp. Ut
Uzt Uzz. Azp. Uz.
Usr. Us2 U sp. Us
Average on Group A t. U.z U.p a
Table 2.2: Group x period means
The period means here are y.j.:Dr¿y¡¡.|N, and the grand mean is y... - Ð"¿y¿..1N.
To identify where the treatment information resides, we give the following G matrix as
the idempotent matrix which produces a vector Gy, where each value U;¡x in the original
data vector is replaced by its group x period mean y¿.¡,:
Jnt o
oJnz00
Jno
Ç-
0
19
8Ip:wØIp (2.4.2)
Thus, Gy has length Np and contains the means repeated so that in the ith group and
jth period, each mean occurs n¿ times. We note that W is an N x N idempotent matrix.
2.4.L Linear model for averages
If we apply the G matrix in the model in (2.4.1)' then rve have
Ga : Gtp, t GPr * GBþ + GTï + Ge,
where the matrix coefficient of ¡; is
ÇlNn: (W ø1o)(l,vo) : (W A 1o)(1,v810) : (Wlr,') S(10) : (1,v) 8(1r) :lryp,
because W is a diagonal matrix with the idempotent matrices as elements on diagonal.
Similarly, for the coefficient of n, we can write
GP : (W Ø/o)(1r I 1o) : (Wt¡v) Ø Ip : lrv I Ip.
Furthermore.'
GB : (W Ø h)QN I 1o) : (W I 1r) (2.4.3)
Let p : (8,r,. . . , p,), be the vector of subject effects, where B¿ is a vector of length
n¿(i:1,. . .,,g). Then for the subject effect we can define
0i : *r'^,þn,where Bi is a scalar, the average of the elements of 0¿. It follows that
GBp : (W ØtòP
Jn. Øle o o
0 JnrØLp 0
Jnn Ø to þn
0'
0,
0
(J",1t) Ø L,
(J",pz) Ø Le
þiL^, Ø L,
þiL"" Ø lp
0
0
(J"ops) Ø le 0[l"n Ø t,1 L",o
08lp
p
pi1
1
n2
00
nl
20
11ng ng p;
B* þ*,
say, where B* : diag(fiL^r,...,fiLnn) 81o.For the treatment coefficient matrix T we
have
Jn, Ø Ip
0
0
Jn, Ø Ip
1', I Tr
Ln, Ø Tz
0
0GT (2.4.4)
(2.4.5)
(2.4.6)
(2.4.7)
(2.4.8)
Lnn Ø Tn
Then the mixed linear model for this purpose can be given by
Gy : þLNp* (lru Ø Ip)n + B*P* + T0 + €"
where e* : Ge.
2,4.2 Variance matrix of the groupxperiod means
The variance matrix of B* B* can be given,, using 2.4.3, by
Var(B*B\ : o2"GBB'G'
: "3(W Ørr)(W S r;)
: p"?(W a /o)
and the variance matrix for e* can be expressed as
V ar(e*) : o2GG' : o2G : oz(W A /o)
Then the variance-covariance of Gy is
Var(Gy) : o'(Wa1o) tpo?(WØJr)
: o"(w Ø Kr) * (o, + po:)(W Ø Jò.
Jnn Ø I,0
T
(2.4.e)
Since Gg provides the vector of group x period means, the matrix (I - G) is the idempo-
tent matrix which projects onto the space orthogonal to this. From (2'4.4), we note that
(I - G)T: 0 and hence that this space has no treatment information, so all treatment
information is in Gy.
27
2.4.3 ldempotent matrices for strata
The information about parameters of interest d can now be thought of as residing in 6
different strata, corresponding to the follorving orthogonal idempotent matrices:
. Qo : /ru I Jo fot Grand mean stratum,
. Q, : /ru I K, for Periods stratum'
. Qz: (W - 4v) I J, for Groups stratum,
. Qs : (W - "I¡v) I I{, for Group x Period stratum,
. Qs: U -W) I "/, for Subject within Groups stratum,
t Q5 : U - W) @ 1lo for Period x (Subject within Groups) stratum'
Now for the analysis of the model lve will consider these 6 idempotent matrices' noting
that in the last two strata, that is, in Qa and Q5, there is no treatment information.
2.5 Analysis of variance
In the previous section some general structure \ryas provided for the general crossover
design. Now in this section we will provide the analysis of variance table for this de-
sign, which simply identifies the components in the block structure and ignores, for the
moment, the treatment structure.
Source Idempotent matrix d.f EMS
GM JxØJp 1
Period JxØKp p-l o2
Group (w-Jp)ØJ, s-I o' + pr?
Group x Period (w-J¡t)ØK, (g-txp-t) o t
Subject within Group (r-w)ØJ.p N-s o' + po?
Period x (subjects within Group ) (r-w)ØKe (¡r-g)(p-t) o2
Total Np
Table 2.3: ANOVA table for standard design, no treatment terms
The data vector y of length lúp, written in standard order, can be partitioned into
various components, corresponding to the terms that can be identified in the linear model
in (2.4.1), and the expected mean squares determined. The general ANOVA for this
22
design is shown in Table 2.3. We note again that the last two strata contain no treatment
information because T''{U -W)ØA}f :0, rvhere A: J ot A: K'
2.6 Analysis based on means
It will be convenient to work lvith the table of group x period means' shown in Table 2.2.
An analysis of this table will give the first four rows of the Table 2.3' but we need to see
how the sums of squa es should be determined from the vector g* of means.
2.6.L The vector of means
To arrange the data in a vector say, g* in terms of averages, the vector y should be
pre-multiplied by a matrix, G", of dimension gp x Np. This matrix is similar to G except
that it does not have repeated rows. Thus the G* matrix can be given as
hL'n,0
0
Ø Ip: (W. Ø h), (2.6.1)1'n"1
n2
0
0L7
0 ùt',0
so that the G* matrix produces the means in a vector y* of length gp and contains just
the means so that, in the ith group and jth period, each mean occurs once. We want to
reproduce the analysis of variance using y*, but with just grand mean,' period, group and
groupxperiod strata. Now in terms of G* we can write
A -<.4
In multiplying the model (2.4.I) by G*, we have
ç"lyn: (W* A 1o)(1¡v A 1r) : ls I Lr: lsp,
G* P : (W- Ø10)(1r I 1r) : (le I 1e),
G*B: (W"8Ir)(I¡u 8lp) :(W* 810).
and
23
G*T
1
nl 1 0
nI
0
ØIp
18nt ØTz
18 r,1
0
0
1
ng
o
0
1 1
fit'ø t, 0
0
0
0 fit'Ø t,18nL ØTz
L,ØIP 1s?n
Tt
T2
Then the mixed linear model for the analysis with means can be expressed as
y* : pLsp* (1, Ø Iòn + (W. Øt)þ + T*0 * G*e, (2.6.2)
such that
V ar(y") : Var(G"e)*Var(G.BB)
: or(w* Ø lr)(w.'a Ie) + "?(w. ØLr)(w*'s t;)
: o2çw*w*'a 1o) * po2,(w.w*' Ø Jo)
: oz1¡-t a 1o) + po"'(N-t Ø Jo)
: ozl¡-t I Ko) + (ot + pø3XN-t I /o)
where
00 ng
Ts
TLy
0 rù2
0
na0
0
0N : (I4l.W*,)-t :
0
We now take each term in Table 2.3 and write it in terms of y* rather then y. All Qis
can be written in the fonnW Ø A, J SAor I Ø A, where A-- J, or A:110. Thus, the
24
sum of squares of W I A is
and, since for W we can lvrite
#t,,0
S Sw.t a*'w Ø A)a*
0
1
I00
In2
0
0ng
0
0 0
0
Tt'¡ n1L,N, 0
0
Ing1ng
iL*, 0 rL2
0 ng lno 0
Tù1 0
w*l0 Tù2
W* : W"'*W*,
0
we can write
SSw¿, : y'(W*'S 1)(N Ø A)(W. ø I)y
: y.'(N g A)y*.
Thus, for A : Jp arrd A: Kp, the sum of squares are
SSwt : y.'(N8J)y*,
SSwp: : y.'(N8 K)y".
Similarly, the sum of squares for the grand mean and periods can be given as
SSt¿, : y'(JØA)y,
where A is replaced by J, and 11r,, respectively.
To get the above sum of squares in terms of U*, the vector of means, we note that
I4l.Nle : 1N, so that
JN : ft"t;1
= Fl4l'.'Nle1;Nl,Ír-
and lve can write1
SSts : ¡u'lw-'A 1)(N1N1'rN ø A)(W. Ø I)y
I *tts: ¡u-'(Nrr'N I A)y..
n2W
I
0
0
ns
25
If we put "/.: #(N11'N), then the sums of squares can be presented as Y*'Q¿y", whete
. Qå: ü Ø Je for the Grand Mean stratum,
. Qî: J; Ø I{p for the Period stratum,
. Q;: (N - 4) E ,Io for the GrouP stratum,
. Qä: (N - 4) Ø I{p for the Groupx Period stratum'
These matrices Qi are no longer orthogonal idempotent matrices because the elements
of y* have different accuracy. They are however orthogonal with respect to the lveight
matrix N-1 81, such that
gi(N-' ø r)Qi:
and
\ai: 1N-1 ø 1)'
The analysis of variance table corresponding to the first four lines in Table 2'3 can
be given in Table 2.4.
Source Sum of Squares d.f EMS
GM v*(4 Ø Jòv" I o' + po?
Period v"'(4 Ø Kr)a" p-l o2
Group y.,[(N -¡;)ØJr]y. s-I o2 + po!
Group x Period y.'[(N-JÐØKr]a* (g-tXp-t) o
Total gp
Table 2.4: ANOVA table for standard design, using means' no treatment terms'
2.6.2 combining information about treatment parameters
The total information about treatment parameters can be given as
T'T :ln¿TiT¿: T*'(N 611)T.,
and the treatment information in each stratum can be obtained by applying the methods
of Section 2.6.1 using the fact T" -- G"T' We then obtain
cT"'(Q Ø Jr)T* in Grand Mean stratum,
oT.t($ Ø I{òT. in Period stratum,
0
ai
i+ji:j
26
o?./[(N - JÐ Ø Jr]T" in Group stratum,
oT.'f(N - JÐ Ø I{elT* in GroupxPeriod stratum.
As mentioned before there is no treatment information in the last trvo strata in Table 2.3.
Nolv the estimation of treatment parameters in Grand mean' Period, Group and Group
x Period strata respectively can be given as
0¿ : (7"' QîT-)-T-' Qîy*, (2.6.3)
and the overall estimate of 0 can be obtained by combining the information from those
strata which have useful treatment information, then
á : (D 6iLr.'gîT.)-(Ð 6;tT.'Q;T.o¿), (2.6.4)
and the covariance matrix of this estimate can be given by
V ar(0) : (t 6i rT.' QiT.)- (2.6.5 )
If we regard ¡l and ¡' as fixed effects then the information in the Grand Mean and the
Period strata give one observation or mean for each of the unknown parameters in p and
n-. Hence, unless we know something about the values or distributions of ¡l and n, the
strata represented by the Grand Mean and the Periods provide no accessible information
about the treatment parameters d. We therefore only recover information from Qi and
Q$ with weights óz and ôs.
2.6.3 Summary of Chi's PaPer
Chi (1991) in his paper formalises the between- and within-subject analr'sis for a general
crossover design and shows that the combined estimates are equivalent to the gener-
alised least squares estimates. He defines the following mixed linear model for a general
crossover design
u:T0¡BBIe.,
where g is the vector of effects of parameters of interest includes the period eflect n' and
B is the vector of subject effects. He obtains the following within-subject estìmate of 0,
considering þ in the mixed linear model as fixed effects:
0, : 1r',¡I - B(B' B)-' g'lr\-r'll - B(B' B)-' B'ly.
27
For the estimate of variance o2 he obtains
ã2 : a'lI - B(B', B)-' n',l@ - T0')ll|iD- 1)(1v - 1) - 2(L - t)1,
with B : (8, B)-r B'(A - Tá3) rvhere lV is the total number of subjects in the design and
tr is the number of direct treatments. The covariance matrix of 0s is given as
i,, : "r1f'll - B(B'B)-r B'lTj-.
For the betlveen-subject analysis, he assumes that the subject effects,, B. formed a
random vector distributed as l/(0, ø11). Then the covariance structure for Y is:
Ð:Var(y1 -- olna'+ o2I,
and he obtains the between subject estimate of d as
e, : {r'a(B'B)-t B'r\- {r'a@'B)-r B'a} .
Chi then defines o!: o!l o2lp, and obtains the following estimate
i7:12'z - z'Qor)l(N - ¿.),
where Z: B(B'B)-'B'y,Q: B(B'B)-IB'7, and.L* is therank of Q. The covariance
matrix of gz is
i,2 : pã2ulT'B(B'B)- B'T)-t.
Finally he combines the two estimators of d and got the following generalised least
squares estimators for 0
â : (i; + r;)-'1r;4, + iio.): (T,r-17;-tT'Ð-, y,
where, i: a3An,¡ irI and âl : ;î - or¡p. U"also shows that the combination of the
two estimates of 0 is equivalent to generalised least squares.
2.7 Equal group sequence analYsis
If rve assume that n1 - 'rL2 : "' : Nlg : n, that is, if we assume that the number
of subjects is equally allocated to each group, then the analysis of crossover designs
simplifies. The linear mixed model for the means can be expressed as
y* : p\sp* (ln Ø lr)r + (Ie a lò0* +T*0 + (Ie a Ir),*.' (2'7'l)
28
where €* : G*€ has elements rvhich are l/(0, o'lr), and B* has elements which are
N(0,o! ln). Then we have
V ar(y-)
The Fisher information available in the space orthogonal to the grand mean is
"lriru¿and the data vector distributes into the following strata
.Qö: nJn Ø J, for Grand Mean stratum,,
.Qi: nJn Ø Ko for Period stratum,
.Qi: nKs Ø Jo for GrouP stratum,
.8ä : nl{s Ø K, for Group x Period stratum,
which are orthogonal with respect to the weight matrix (fI) and applied to the vector
of means y*, and
oQa: Kn Ø Is Ø Je for Subject within Group stratum,
.Qs: Kn Ø Is Ø I{p for Period x (Subject within Group) stratum',
which are applied to the data vector y. Then the ANOVA table for this case of design is
shown in Table 2.5.
Source Sum of Squares d.f BMS
GM na*'(Js Ø Jr)a. 1
Period na*'(Js Ø Kr)a* p-l o2
Group na*'(Kn Ø Jr)y" g-l o' + po?
Group x Period nU*'(Kn Ø l{r)y. (g-t)(p-t) o2
Subject within Group a'(K,Ø InØ Jr)y s(" - r) o' + po?
Period x (subjects within GrouP ) a'(I{,Ø InØ Ko)y s(n- tXp- t) o2
Total ngp
Table 2.5: ANOVA table for standard design, equal size groups, no treatment terms.
The treatment information in the ith stratum is the T*'QiT*, and the estimates and their
covariance matrices are as given in Section 2.6.2.
toþJ
: Varl(In 61 1r)..] -f Varl(In A 1r)É.]õ2 no?: î(,,I /e) + ";(ts Ø Jr)
: "l(,16, Ilr) * o2 +-po? (rs Ø Jò,
n"n
2.8 Conclusion
This chapter provides a framework for the analysis of design in which multiple mea-
surements are made on each subject and the variance structure is modelled by random
subject effects and random measurement errors at each period. Treatment regimes are
applied to groups of subjects, possibly with unequal numbers in each group. In the next
chapter, we will apply these methods to some particular designs.
30
Chapter 3
Analysis of two-treatment
two-period crossover designs
3.1 Introduction
This chapter is concerned with two-treatment,, two-period crossover design, in which each
of several groups of subjects receive a different sequence of treatments. In clinical trials
to compare the effects of several treatments, large between- subject variability often
reduces efñciency. To avoid that problem, tesorting to crossover designs has become
popular since they use each experimental subject repeatedly, and tests of treatment and
ca ryover effects can often be performed using within subject variation leading to more
powerful analysis.
In this chapter, we will analyse various aspects of two-treatment, two-period crossover
designs, and have a look at some modifications such as Balaam designs and the use of
baseline measurements, which give us new information about parameters in the model
and resolve the difficulty which we face in the simple 2 x 2 crossover design.
The simplest design is the one known as the 2x2 design. In this design' subjects
are divided into two groups at random. One group receives treatment A followed by
treatment B, and the other group receives the treatments in the reverse order. We will
analyse this design in Section 3.2 just to see the application of Chapter 2 to a weil-
known situation rvhich is analysed by Jones & Kenrvard (1989). In Section 3.3 we will
consider this clesign with one and two baseline measurements. The Balaam design will
31
be considered in Section 8.4 and in Section 3.5 we will give a modification which shows
the optimum design to solve some problems in the 2 x 2 crossover design'
g.2 Two-treatment, two-period crossover design
Suppose we have the 2 x 2 crossover design with two sequences of treatments ,48 and
BA. One measurement or observation is obtained per subject per period in the stan-
dard crossover design, although this measurement might itself be an average of several
measurements of responses taken during the period. Table 3.1 shows the layout for the
data.
Group Subject Period 1 Period 2
Treatment
A B
1
þ,,
0tn,
Uttt
Ultn,
Utzt
At2nt
Treatment
B A
2
þrt
þn"
Azn
Aztn2
Azzt
Azzr2
Table 3.1: Notation and layout for the simple crossover design
The subjects are divided into two groups of sizes n1 and ??2 such that the n1 subjects in
group 1 receive the treatments in the order AB and the n2 subjects in group 2 receive the
treatments in the order BA. Follolving the method in Chapter 2, we assume that Y¿¡¡ is a
random variable lvith the observed value U¿:¿ which follows the mixed linear model given
in (2.2.1). Thus, for the two-treatment, two-period crossover design the model terms for
,4th subject in group 1 would be:
Uttt : p' * þ* t rr * rr I erk, (3'2'1)
Urzt : p'* 7tnIrz*rzI ÀtI etzt.
32
and for group 2 lhey rvould be:
Uzt* p'Iþzn*nrtrzlê2ft.,
¡t * 0z* * ¡rz I rt * Àz I ezz¡.
(3.2.2)
We assume that the subject effects, B¿¡ are random variables from a Normal distribution
rvith mean zero and variance o2, and the errors' e¿¡¡ ale i.i.d random variable from N(0,
or). We also assume that the subject effects and errors are independent. In other
words, we consider o2 as the variance within subjects and ø"2 as the variance between
subjects. We clefine N : nr ! n2 as the total of subjects in the design. In order to
make the parameter values unique, we introduce the following constraints on treatment
parameters:
Ty : -72: T¡
)r : -Àz:À.
(3.2.3)
This section is dir ided into three parts. The 2 x 2 design is considered when À is present
and when it is not present with the subject effect as a random effect. We then discuss
the case with ) in the model and the subject effect as a fixed effect.
3.2.L Analysis of design when À is present
With the constraints in (3.2.3), the expectations of responses in this case can be written
in Table 3.2
Period
Group I
2
1 2
p1'rt*r p,Irz-,¡-+)
þltt-r p,Inz*r-À
Table 3.2: Expectation of responses in 2 x 2 crossover design when ) is in the model
As in Section 2.6, we can examine the treatment information and obtain treatment es-
timates by looking at the vector of means y*. This can be written in matrix notation
Uzzt
,1,)
AS
Utt.
Utz.
Azt.
Azz.
0
1
0
1
0
0
iI
1
1
0
0
1
0
I
0
1
0
1
1
I
I
10pi
p;+
1
-1
-lI
€tr.
€tz.
€zt.
€zz.
(3.2.4)
(3.2.7)
(3.2.8)
tl+l;;1
+ i;1.
where the dot notation in the formulas denotes averages and each mean is lormed by
averaging over the subjects in that group and that period. With the matrix notation of
Chapter 2, we can write out the above mixed linear model as
a* : lrLn * (1, Ø lr)n + B* P* ¡ T*0 ¡ e", (3.2.5)
where y* and 0 are known as data vector and treatment parameters respectively and
B* :W" Ø 10, where
W*: #r'n,0
The covariance matrix of Y* can be obtained by
Var(y.): o'(N-t 6 Kz) * (o'+ 2d3)(N-1 6 Jz) (3.2.6)
where
0
I tlnltn2
N : (14l'.IV*')-t :rL1 0
0nz
n TLTTIZ
By referring to Chapter 2 and incorporating random subject effects, \¡r'e can get the
information matrices about d in the four strata which we introduced in Chapter 2. Since
we reduced the responses to means of observations over the subjects in each groups and
period, we can put
rlJ;: i(Nt1'N) : - TLtTl2 n2,
21
Table 3.3 then shows the relevant projector matrices, which are orthogonal with respect
to the weight matrix N-l I 12, for the four strata identified in Table 2.3. Each stratum
corresponds to just one degree of freedom.
34
Matrix
Grand Mean Qó J;ØJ2 1
2N
n2, n2, TL1tz2
n2, lrfl2
TLtTùZ
TItTL2n21
nl
nl
).,
ta
lTLTù2
fttnz
TLITLZ
TL LTIZ
n
n
Period Qi Ji Ø I{, 1
2N
,,,
-nlTLTTLZ
- TLt TL2
-nl TùtTtz
nl -rùtTlz
-TùtTLz
Tùt'lTZ
n2,
-n"
nl
n2,
-TltTùZ
Tl tTlZ
Group Q; (N-/i)s/, ntn22N
I
i
-1
-1
I
1
1
1
-1
-11
1
1
1
I
1
Group x Period Qä (N-/;) ØKz
1
-1
-11
I
1
1
1
1
1
1
1
1
-1nl n22N
1
I
Table 3.3: Weighted orthogonal matrices for four strata in the design
3.2.2 Treatment information
This separates into a component in each of the three strata, given by
Now, to get the information about treatment parameters $re consider the treatment
information excluding Grand Mean stratum. This is given by
r.'(N81- qilr.-¡i I 4 _21 uS=it tlul_r ,l*TLo rj
12
2N
4-2T* QiT- (3.2.e)
T*,4nyn2
2N
Tl2
2N
q
[:
I
4nt
g;r. 0
1
1
214 2
T-,QäT- :
35
where .^ú : ??r I nz, and I - Tt1 n2. We note that the rank of each of these matrices
is 1. In particular., T-'QiT*(i:1,3) contains information only about (2, - À) and no
information about (r*2À), while T.'8;T* contains information only about À. [f rz1 : Tt2:
there is no information about treatment inT*'QiT*.
We will ignore, as discussed earlier, the information in Çi, the Period stratum. The
estimates of d in the other strata can then be obtained as
1
02 QO Q;T\_T*,Q;Y* n@n. + an.) - @n. -l vzz.)
0 0
t, Az
0": (r,8är-)-r*,Qäy*: å [ ,I,,,-_n),,:,rr*(r;, n:_r]] :
*4,r, -d,2.) [ :, ]
where dt. : ytt. - Ytz. and dz. : Azt. - Uzz..
Thus, á2 provides an estim",":t ) only,
întt"rr:,o.."t:"t an estimateof (2r - ))' since
lz -t I '. :
,(dr - dr.),
and this can be shown to have expected value (2r-)). On the other t und, I IL
so that no information is available for the parameter (z + 2)) in this stratum
In the câ,s€ ?-ù1 I nr, there is information about (2, - )) in the Period stratum, but
this is not recoverable unless ì¡/e are willing to assume that either there is no period effect
or that the period effect is random with a known variance. It can be seen from 3.2.9 that
the information about (2r - )) is split between the Period stratum and the Group x
Period stratum in the ratio 12 : (4np2), and hence that the proportion of the information
lost into Periods is t2f N2: (1 - 2q)2,where q: ft. Clearly, there is no information loss
to Periods when rlt : rL2.
The variance matrices for parameter estimates are
var(02): (o2 +2o?)(7.'8;T-)- : ttt(2'?+zo?)l o t I2n1n2 lo tl
var(03): .2(T*'QäT-)- : ##l : ilL' ^tThe estimates can then be combined using (2.6.4) to give
Q;rr-'q;r* + ótt?*'air")- @;rT-'Q;T*9, * 6;tT-'q|T"0r¡
,f ," 0
l;l36
1
tAn. - Azt.
-\'¿\at.. - az..)
r*p I+2pr -t 2p z(t + 2e)
(3.2.10)
(3.2.11)
(3.2.12)
and' if p : o? lo2, then the covariance matrix for the combined estimates is
( n+*,r*'Q;r* * jr-'qâ?.)-'var{ [;],
No2
4n1n2
we see that an unbiased estimator of r can be obtained as
i : (at. - yzt,) l2'
that is, â is based only on the first period data when À is present in the model. In
other words, if ) is in the model then the estimate of r is based on between-subject
information and is the estimator we would have obtained if the trial has been designed
as a parallel-group study in only one period'
To test the null hypothesis that r:0 when À is in the model, Jones & Kenward
(1989), p. 28, recommend using a two-sample t-test, estimating o2 I o? using only the
first period data. This is different from the estimated variances which would be obtained
by Chi (1991) who proposes estimating both ø2 and ø1, separately' from the full data
set' by equating the mean squares for 8¿ and Q5 to their expectation in Section2'6'
3.2.9 Analysis of the design when À is not present
A further stage of course is that if we do have not a carry-over effect in otrr model then
to get an unbiased estimator of z, we can write
T*:
T",QiT*
T*,8;T"
T*,9äT*
1 -1 -1 1
Then the treatment information in the three strata of interest can be given by
Ðnùf
212
N0
8n1n2
N
and since we have decided not to use the information in the Period stratum the estimate
of z just comes from the GroupxPeriod stratum,' namely
i : (Ytt. - an. - azt. I Yrr.) la
Now we have an unbiased estimator of r based on rvithin-subjects information, and
V ar(î\ : -^[-.o2.8n1n2
We note again that if nt # nzwe lose a proportion (1 - Zq)' of the treatment infor-
mation into the Period stratum. As noted by other authors, the assumption that À : 0
leads to higher precision for the treatment estimate. The efficiency of estimating r with
À in the model compared to estimating r without À in the model is the inverse ratio of
the two variances, namelYNo2 4n1n2
8"t"r^ No,11 ¡¿'which is 50% when p:0, that is there are no between-subject differences, and less the
50%for p>0.
3.2.4 Analysis of design when subject effect is fixed
Our model has subject effects random. This enables us to recover information from
differences between subjects. If the subject effects are fixed, the only information available
is in the differences d¿ : y¿t.-yor.(i : 1, 2) between the two measurements on each subject.
This implies that only the information from Qi and Qi is available. In order to derive a
set of estimates for the parameters of interest, we write the period differences as.
d¡: at.-an.:2¡r*2r-À+er, (3'2'13)
d2 : Yzt.- Yzz.:Ztr -2r* À*(z'
where 7T : ilt - -lt2. By matrix notation
[:; ]:l:l hl.i ; llt;l.l:lwhere Var((¿):2o2ln¿, for i: 1,2. It is clear that
(3.2.r4)
ì:(ú+dz)la
38
with varianceo2var(fr):
srvnlt - 4However we can not distinguish between ¡ and À since (dt- d2) estimates2(2r - )). If
À : 0, (d, - dr)la estimates r rvith a vatiance,
ltar(î\: no' ,.\'/-8Nq(1 -q)'
but as discussed earlier, if ^
+ 0, the only estimate available for r is based on between
subject differences. If the subjects are fixed effects,, we cannot obtain an estimate of r at
all.
3.2,6 A comparison of the three cases of 2 x 2 crossover design
In Table 3.4 we give the results of the analysis of the cÌossover design and have a com-
parison in terms of presence and absence of carryover effect in the model. Let us define
1 n1+n2 t o'
Case
I 2 ,)
Estimator p+0andq+0.5and À is present
p+0andq+0.5
and ) is not present
p:0andq+0.5and À is not present
T T@" Azt.) IlYrr. - an. - Yn.I Yzz.llr 1
ilYrt. - Utz. - Azt.l Uzz.l
À At- - Az None None
Var(î) o2 I
V ar(\) o2 (t+zp)2Nq(1-q) None None
Table 3.4: Estimates of parameters and their variances
We note that the total information about (r, À) is
¡t/2 1
1i
when ?ty : TL2, and hence the best we could hope to do for V ar(î) would be o2 f 2N in the
case rvhere À not in the model. This is achieved iÎ q : j above, but not otherwise. If )
is in the model, the smallest variance achievable for â is o2 (7 I p) I N which is larger by a
39
factor 2(I + p). Thus. even if p : 0, only 50% of the information about r is available when
) is in the model. To provide an illustration of the minimum variances of each estimator
of the parameters, we sholv in Figure 3.1 the graphs of variance of direct treatment and
of carryover efiect, in terms of p : 0,1',2,5' 10', 20 and q' In these graphs, we set o2 : f"
From these graphs we can see a clear minimum of variance of both parameters of interest
when p : 0 and q : 0.5. Furthermore, the minimum of these variances occurs at g : Q.5
for all p.
Figure 3.1: N times variance of (i) direct treatment effect and of (ii) carry-over effect
estimator for 2 x 2 design in terms of p - 0,7,2,5, 10,20 and q : ntl(nt -l nz)'
(i) Direct (ii) CarrYover
oo
oó
o@
o+
oAI
I
I
ì
oo
o@
o(o
o{oôt
o
ooc(ú.E(!
xz
\
\
ooc(ú'tr(ú
Xz
0.0 0.2 0.4 0.6 0.8 1.0
q
0.0 0.2 0.4 0.6 0.8 1.0
q
3.3 Baseline measurements in the 2 x 2 crossover de-
srgn
Up to no\¡/ we have restricted our discussion to the case of the simplest of two-treatment
designs, that is those with only two periods, just touching on the effect of adding baseline
measurements. The difficulty with these designs is,, as we have seen, the problem of
confounding carryover or treatment x period interaction effects with the treatment effect.
\Me now turn to consideration of more complex crossover designs . We shall, however,
continue to restrict our discussion to studies involving only two treatments.
As rve showed in the previous sections, in the analysis of 2 x 2 crossover design the
main difficulty is that unbiased estimates for direct treatment effects are based on first
period data only. In this section, we shall analyse this design by including baseline
\
40
measurements lvhich some authors such as Freeman (1989) have recommended. We show
that they can not solve our main difÊculty, but they provide a way to increase the power
of tests against the presence of carryover effect in the model.
Using the baseline in crossover trials has two potential advantages, in that we can
ove come two basic difficulties which we face in the analysis of two-treatment, two-period
crossover design without baseline measurements. The first difficulty is the low power of
the test against of carryover effect. Kenward & Jones (1987a) have shown that one way
to increase the power of this test is to use baseline measurements. The second difficulty,
as Freeman (1939) has suggested, is the over-parameterisation of the design without
baseline measurements. In addition, if there is baseline measurement in the model we
can estimate the average effect of either treatment or carryover, provided lve can make
an assumption about the absence of period effects.
In this work,, we consider baseline measurements in two sections. In the next section we
will take a single baseline measurement before the subjects are given the first treatment.
Then we shall analyse the consequences of the use of baseline measurements before the
subject is given the first treatment as well as the second treatment.
3.3.1 One baseline measurement in the design
In this design there are three periods, in which the first period provides the baseline
measurements. The layout of this design can be shown as in Table 3.5.
Period
gloup
1 2 .)
A B
B A
Table 3.5: Tlvo-treatment design with one baseline measurement
Here the hyphen sign (-) means that there is no treatment implemented in that period.
The model that we apply to the above situation is the same as used in Section 3.2. Now,
by employing the usual constraints on the parameters of interest, the expected values of
observations are given in Table 3.6.
4l
Period
Group 1
2
1 2 3
¡-r' * tt [t{'rz*r ¡t*ns-r-+À¡tltt L¿lnz-r P,*rs-l-r-À
Table 3.6: Expectation of responses in 2 x 2 crossover design when ) is in the model
and the matrix form for the six means ls
Utt.
Utz.
Ut's.
Uzt.
Uzz.
Uzs.
100 10
v: :p 1
0
0
1
0
0
1
0
0
0
1
0
0
+
I
1
1
1
1
1
10 0
0
1
I
1
1
1
0
0
0
0
0
1
0
0
i
+
0
1
-10
-11
€tt.
€tz.
€ tg.
€zt.
€zz.
€zz.
7f1
ff2
rl3[;].
pi
pi+
(3.3. i )
or
y* : þÃø -l- (12 A ft)r * (Iz Ø ls)þ" + 7"0 + e*. (3.3.2)
The variance-covariance of y* can be shown as
var(y*): a2(N-1 a 1(e) I (o' + 3øl)(N-t 8 Jr).
By using "/j defined in Section 3,2.1, we can show that the weighted orthogonal idempo-
tents for this case are as given in Table 3.7
42
i\{atrix
aö $ØJz
111 111n t
1 111 TltTùZ 111111 111
I3N
111111111
111T'l,t Tlz n 2
t 111111
a I JiØKs
2-\ -1
-12
2-l -i-12
n 21 t2 Tùl.TLZ t2
-1 1 -1 1
1
3N
2-7 -1
-12
2 -1 -1r2-11-1 2
TùtTùZ t2 nl
1 -1
a; (N-4) 8/3
1
1
I
-l-1
-i
1
1
I
1
1
1
1
I
1
-1
-1
-1
-1
-1
-1I
I
1
1
1
1
1
1
1
1
1
1
1
1
1
'lL1 'n23N
aä (N - /i) 8/rs n1n23N
2-t -1-2 1 1
r 2 -1 r-2 1
1-1 2 l-2 1
2 r I 2 -1-1r-2 1-i 2-71 1 -2 -i -1 2
Table 3.7: Weighted orthogonal matrices for four strata in the design
With this design, the total information available is
6-3 4n1n2
-32?.'(Na/-0ð)?.:{
3
43
3¡/ l:il
The treatment information in Period, Group and GroupxPeriod strata respectively is
given by
T*,giT"
T*,8;T*
T*'gåT*
12
3/V
6 -3-32
6-332
4n1n2
3¡ú
4n1n2
3¡r¡
0
0 il
The estimates in Group and Group x Period strata can be given by
In,
0o^,)0,:'2- Uz.
-Ut.*Un.*Yzt.-Yzz.
-2ytr. * an. * grs. l2an. - azz. - Yzz.
V ør(02) (o'+Jo!)(T.'Qir*)- : 3¡'/4nyn2
^10z: z
with covariance matrices
'")(o2 ¡3o l:ilV ar(ls) o2(7.'q;T'*)-: ffilZ:]
These estimates can then be combined using (2.6.4) to give
Q;r T.' Q;T* + ót t r*' 8år. )- (6; I T*' Q;T* 0, + 6; t T*' QäT. 0 r)lîl(o' + "?)(y'r. - azz.) * o?(yr'. - ytt.)
(ot + ")@tr. * yß. - uzz. - azz.) I2o!(grr. - vn.)
an. - azz.l #¡(Yrt. - Yn.)
an. * urc. - uzz. - uzs.l h@rt - an.)
and the covariance matrix for the combined estimates is
Ç|qr*' q;r* * jr.' eär*)-'
No2 lt+zp l*3p I4np2(r+r) ll*3p z(r+lp)
.l
1
2
i;l ÌVar{
44
In the case p -+ oo, the subject effects are regarded as providing no information and
the estimates reduce to the rvithin-subject d3. In case p 4 0, there no subject effects,
und,0r. â. u,r" combined lvith dz : ds. As p decreases, the extra information about ) from
the subjects stratum reduces the correlation betlveen the estimates of r and ). Horvever,
this correlation still remains, being 0'71 even when P : 0' If À is not in model' the
information about r in T*'Qi?* is 8n1rz zf N,, and the estimate is
(Yrr.- Us.s.- azz.I Yrs.)12,
with a variance of No2l(Snfl2). The strong covariance between î and  implies that
the variance of the within-subject estimate of r when ) is in the model is increased to
Noz f (2npz), implying an efficiency of only 25%. Even this is an improvement over the
case without baseline when there is no within-subject estimate available. Incorporation
of betlveen-subject information reduces the variance of the estimate of r to
No2(r + 2p)
4np2(t -l p)'
implying that the efficiency of the combined estimate of r when À is in the model is
No2 4np2(r ¡ p) (1 + p)
B"ru ^ NoT +2p): 44O'relative to the estimate of r when ) is not in the model. This varies from2STo (as p : 6s¡
to 50% as (p: [).
3.3.2 Two baseline measurements in the design
We shall norv examine the effect of considering two baseline measurements to lhe 2 x 2
crossover design, one before the first active treatment period and another before the
second treatment period. This case can be shown in Table 3.8 as follows.
Group
Periods
1 2 r) 4
A B
B A
Table 3.8: Two-treatment crossover design with two baseline measurements
45
Employing the usual constraints on parametets, lve have
Urt.
Utz.
An.
Au.
Uzt.
9zz.
Azs.
Uzs,.
tt'+
I
1
I
1
1
1
1
1
01001000
00100001
10 0
1
0
-10
-10
1
€t t.
€n.
€ 1s.
€t+.
€zt.
€zz.
€zs.
€za.
0
0
0
1
1
1
1
I
1
1
0
0
0
0
0
0
0
I
0
0
0
1000
7f1
7f2
?T3
7r4
+pi
p;+
[;].010000100001 1
or
y" : L¿le * (12 A In)n + (IzØLn)p" +T*0 + e*
and we have variance-covariance matrix,, v as given below
(3.3.3)
var(y.) - o,2(N-1 g K+) + (o'* 4ø"2)(N-t I "In)
where N is diøg(nt,nz), as before.
The treatment information for this design also comes from those strata that we con-
sidered before, and is given bY
T*,QåT*
where Qi arc shown in Table 3.9.
T*,QiT*12
4N
T*,8;T* :¡ú
¡r/
n
n
8-4-43
0
0
0n
n
a1
21
i
438-4
46
aå JiØJ+
1
1
1
1
1
1
I1
11 1
1
1
1
1
1
1
111111 1
1
1
1
1
1
1
111na
1'ltrtTL2
11 11111 111
1AF 11111
11 111ntrLZ n22
11 1111 1 1 1 1 1
n tI
3
-1
-1
-13
-1
-1
-1
-13
-1
-1
-13
-1
-1
-1
-13
-1
-1
-13
-1
-1-1
-13
-1
-1
-13
TLt-fù2
3
-1
-1
-13
-1
-1
-1
-13
-1
-1
-13
-1
-1
-1
-13
-1
-1
-13
-1
-1-1
-13
-1
-1
-13
1
4N
ntfl2 n22
a 1 JiØK4
ai (N-Jä) 8J4 n1TL24N
1
1
1
1
-1-1
-1
-1
1
1
1
1
-1-1
-1
-1
1
1
1
1
-1-1
-1
-1
1
1
1
1
-1
-1-1
-1
-1-1-1
-11
1
1
1
-1-1-1-1
1
1
1
1
-1-1-1
-11
1
1
1
-1-1
-1
-11
I1
1
aä (N-/ä) s1(4nt n24N
3
-1-1
-1
-3I
1
1
-13
-1
-11
-31
1
-1-1
3
-11
1
-31
-1
-11
3
1
1
1
-3
-31
1
1
3
-1
-1
-1
1
D-d
1
1
-13
-1
-1
1
1
-3I
-1
-13
-1
1
1
1
-3
-1
-1
-13
Table 3.9: weighted orthogonal matrices fol four strata in the design
47
The estimates in the tlvo strata, namely Group and GroupxPeriod, can be given by
Yza;I
;I
0,
0"
0
2(yt, - azz.) - (ytr.l yts. - Uzt. - yzs.)
[email protected] yu. - Utt. - yrs.) i (arr. * yzs. - 9zz. - yrn.)]
with covariance matrices
Var(02) :
Var(fu)
(o' + 4o!)(T"'QiT")- :
o2çT.'q¿T'*)-: #l
T*'Q;T* + T*'QäT* : n"::'¡/
N(oz + 4o?)
TltTùZ l:il3448
t;l
These estimates can then be combined using (2.6.4) to give
ç6;t r-' q;r* + 6lt T*' 8är.)- (6;t r-' Qir" 0, ¡ 6;r T.' QäT- êt")
Var{
f - ñ /- ''l
: ! I un. - azz.l #(grt" - an. - Uts.I vzs.) I
'l@,, lyt¿.- yzz. - az¿.) - hT,. * vrs. - vzt.- úrù )and the covariance matrix for the combined estimates is
11) : 17 unr",Q;T* * t*r-'gi?-)-t
No2 f r+sp r+4p: A;,nlT1p) l, * no ze + ap)
l;l
3.3.3 Conclusion and discussion
As we can see in the analysis of the two-treatment, two-period case, we cannot estimate r
and ) within-subject; the only combined estimates require between-subject information.
The total information in Group and GroupxPeriod strata for the two-treatment, two-
period design shown in Section 3.2 and both designs including baselines in Section 3.3
2 I
-1 I
IS
48
(3.3.4)
The only difference between the designs is the lvay in which the information about
) is lost into the Subject stratum. In the two-treatment, two-period design, 50% of
the information about À is lost into the Subject stratum, leaving a singular informa-
tion matrix. With one baseline, a third is lost into the Subject stratum,, while rvith
two baselines, only 25% is lost. Hence the baseline measurements allow us to keep
more information about ) in the Subject x Period stratum and hence improve the es-
timate of r. Thus, as move through these models, the variance of î changes from
#+t"ffifut"7effiøIf p :0, there is no change, since the total information remains the same in a1l these three
cases. Holvever, if p > 0, V ar(i) is reduced as we include more baseline measurements.
3.4 Balaamts design for two treatments
As Jones & Kenward (1939) on page 140 have pointed out, the main disadvantages of
the standard AB, BA design without baselines are that
1. the test for a carry-over effect or direct-by-period interaction lacks power because
it is based on between-subject comparisons, and
2. lhe carryover effect, the treatment xperiod interaction and the group difference are
compietely aliased with one another in the sense that only two of the three can be
estimated.
If designs with more sequences or more periods for two treatments are used, howevet,
\¡/e are able to obtain within-subject estimators of the carryover effects. The problem of
choosing the best estimates for direct treatment and carryover effects has been considered
by a number of researchers, and the designs which they have chosen provide minimum
variance unbiased estimators of the effects of interest.
One of the more efficient designs for comparing two treatments in the two period
crossover design is the design of Balaam (1968) which we consider in this section.
We will analyse the Balaam design when the subject effects are random variables with
mean zero and variance o"2 and we assume that the within subject errors are independent
random variables with mean zero and variance o2.In addition we can assumethat these
49
variables have a normal distribution. In Table 3.10 rve show the layout of Balaam's design
for tlvo treatments and the number of subjects in each sequence group.
group subjects treatments
1
2
3
4
Tù1
Tr1
Tr2
ft2
AB
BA
AA
BB
Table 3.10: The layout of Balaam's design with the number of subjects in each group.
3.4.L Balaam's design when subject effects a.re random and
unequal group sizes
We begin our analysis for Balaam's design by referring to the eight expectecl vaiues in
Table 3.11.
Period
1 2
Group
1
2
,)
4
p'*rt1.r ¡L,lnz-7+)þlrt-r ¡t*rz*r-)p'lr.t-l-'r p,lrz*r*)
þ*rr-r p'Irz-r-)
Table 3.11: Expected value of mean in Balaam's design
It will be noted that the all previous notation can be used here. By referring to
Table 3.11 the mixed linear model for this case can be written as
y" : ple* (1, Ø lz)t * (1¿ a lr)0. +T*0 + e* (3.4.1)
50
In this design the 7* matrix can be set as
t:
and by defining a weight matrix as
1
-1-1
1
1
1
-1
-1
0
1
0
-i0
I
0
-1
(3.4.2)
then we have
1
ü ¡r(N14liN)
N_
TLtTùZ
TLTTLZ
T7y 0
0nt0000
0000rù2 0
0nz
21
2I
TL
n
nl
nl
TLtTIZ
Tù tTù2
'lltTLZ
TLtTIZ
n2
TLyTIZ
TLITùZ
n"
nl
Ø Jz,1
¡\i
2
¡\i
n? fttTtz
Tlt TlZ n|nl
2
where N :2nt*Znz.Now we can get the following weighted orthogonal idempotent for
the Balaam's design in Table 3.12.
Matrix
a0 JiØJ2 tN
nl TùtTlZ
nlTùtTlZ
Ø JzØ Jz
Qi Ji Ø I{22
N
n I TLLTLz
TI¡TLZ n ) ØJ2ØI{2
a 2 (N-/;)s¿ TL1 0
0nzØ I{, ¡ ao#t<, Ø J,
) *,,
aä (N-4) Ør{zTù1 0
0nzØKz*øfl<rø1, ØKz
Table 3.12: Weighted orthogonal matrices for four strata in Balaam's design
51
The treatment information in the strata of interest can be given as
0T*,QiT"
0
T*,Q;T*2nz
ntlnz
-2n
oz(T*'QiT.r-: ç j
:i:t:t
0
0
4?IZ
T*'gäT"
2nz
4nt
-2nt nr*nz
As we can see, the rank of T*'Q|T* is two,, which is sufficient to estimate the two param-
eters of interest, so we have
0zUn. * Uzz. - Utr. - Un. * 9sr. * Uzz. - y'tt. - Yaz')
2(Ytt. * an. - uzt. - Yzz.)
V ar(îs)
Then the combined estimate for the parameters can be obtained as
T
)
T
1
4
0" : I I n"'- utz'- vzt'l vzz'- azt'* asz'* Y¿t'- a*')la L 2(v"r. - vzt. - usz' * aqt) l
and covarìance matrices of estimates are
v ar(02) : (o' + 2o?)(7.'8;T.)- : Ü#[ * "{ ]
N4ntn2 n2
48n2 n2
I : ç]*",*'Q;r* * i'.'¿är.r(;+2,?r.'8;r*02* #'.'qä7.0")
If l/ : 2(n, + nz):total number of subjects in the design, and g : ;f- as before, then
we have
{qlz(t - q)p(r + p) + (r + zp)l(an - yzt.)
- (1 - q)lzqp(r + p) - (r + zp))(úst. - yn.)
-zq(I - qX1 -f ù'@rr. - Uzz. - Uzz. + ùn )\¡zl+qçr- qxl * p)' + (1 + 2p)]
52
{q[(t - q) + (-2+ 3q)(1 + 2p)]@". - yzt.)
+ q[(1 - q) + (z - q)(t + zp)](yrr. - yzz.)
+ (1 - q)(1 + 2p)lzq(r - p) - ll(y.'. - uat)
+(1 - qX1 + 2p)l2q(r + p) + 1l(y.r. - an .)\
¡zl+qçr - qxt + p)' t (r + zfi) ,
and the covariance matrix which also depends on q and p car be obtained as
i
,.,,I i ], oz ¡ 2o2,
1l-1( T",Q;T* T*,QäT-)_
o2(LIp)(L+2p)Nfaq(l -qxr +p)2 +(1 +2p)]
Var(À)zo2fr+2qp](r+2p)
n[aq(l -qxt +p), +(1 +2p)]
\JOU\T1^) :
As we can see in Balaam's design when the subject effect is random, estimates of param-
eters of interest and variances of estimates are completely dependent on g and p. In term
of q and p there are several cases to consider.
1. When q : l, Balaam's design is just the same as the two-treatment, two-period
crossover design with n1 - Tt'2' In fact when Q : I we get the estimates of the
parameters same as given in (3.2.10) in Section 3.2.1.
2. If we let p -+ oo, then Balaam's design gives just the within-subjecf "stimates, á¡.
3. If À were not in the model, the information about r is
4n1t 4n2,
and hence the best possible variance for î couldbe o2f2N. In fact the estimate
based on within-subject differences would have a variance o2f 4n1 : 02l(2NØ), so
that this suggests we should use q close to I if we believe that no canyover exists.
4. When ) in the model, the rvithin-subject estimate of r has a variance N o2 f (8ntnr) :' o'zllQN(q(l - q)] which recovers at most 25To of the information about r, and then
ar(ì)v
o.)
only if q: Il2, \.e. ny - Tt.2. The use of between-subject information reduces this
variance too'(r-p)(r+2p)
Nfaq(l -q)(1 +p) + (1 +2p)]'
which gives significant gains in efficiency, particularly when q is near If2 and p\s
reasonably small, say less than 5. I1 p : 0 and q : I12, this achieves the minimum
varlance
3.5 Two-treatment, three-period crossover design
As Mathews (1938) pointed out, although the use of more periods causes some problems
in the experiment, there are several reasons why one may wish to extend the number of
periods in a crossover design:
1. to allow the carryover effect to be estimated (within-subject estimate),
2. to make greater use of the experiment material,
3. to make it easier to compare more treatments in the one design.
Kershner & Federer (1981) calculate the variance of estimators for direct, carryover and
total treatment effects for a number of two-treatment designs, using from two to four
periods. The various optimality results were:
o the most efficient three-period design is AB B , B AA, and
o the most efficient four-period design is AABB, BBAA, ABBA, BAAB.
Table 3.13 shows the layout of the three-period design.
Periods
Group
1 2 3
A B B
B A A
Table 3.13: Two-treatment, three-period crossover design
The model is the same as that shown in the previous sections,, but there are three active
treatment periods. Thus, by using the usual constraints on the parameters, then the
54
linear model in terms of matrix form can be shown as below.
Utt.
An.
Uts.
Azt.
Uzz.
Azs.
1
(3.5.1)
OI
y*: pta*(1r 8I3)P*(1241.)8. +7"0+e*. (3.5.2)
If there are ni subjects in the ith group (i : 1,2), the variance-covariance of the design
can be written as following.
Var(y.): ø'(N-1 A 1(3) -l (o'* 3o"2)(N-t 6 -Ir).
The weighted orthogonal idempotent matrices are the same as those given earlier in
Table 3.7.
Then the treatment information for this design can be written as
T*,QiT*2'.,2
3¡\i
1
1
1
1
1
1
1
0
0
1
0
0
0
1
0
0
1
0
0
0
1
0
0
1
I
I
1
0
0
0
0
0
0
1
I
I
-1
0
1
-10
-11
tt' +
7iy
7f2
7l'3
pi
p;
:l
I
1 [;]..1
1
T.,g;T* : w
4
0
'ol,0 0l
4 ol,0 3l
T*'gåT"
and the variance of estimates a e
and the estimates in the two strata are
f ,,
0, : I I l(a''"+azz'lvzt')-(an'+un'rttt')l I2l o l
0, : I I ttzl" - utz'- arc') - (2v"'- uzz'- t"')l I8 L 2@rr. - uts. - azz. * azs.) j
8nyn2
3¡ú
Var(02) : (o' + 3a3)(7.'Q;7.)- :"# l:: l
bb
V ar(0s) o'çT-'q;T-¡:#l; ;]Combining estimates from the trvo strata we have the following estimates for parameters
of interest by
t;l ( rrh"r*'Qir* * ir.'ïår"l-( rr +-Lu:7.'q;T.0r * ir-'Qä7.0")
Then the estimates of two parameters can be obtained as
1
2(3 + 8p) l(r + 4p)(gn. - an.) + (1 + 2p)(-vrr. - arc. * azz.* vzs.)l
^1¡ :,lytr. - Uts. - Uzz. I Yzs.),4
and the covariance matrix for estimates is
v.,(l I l,: !!f r# 'l'-''LÂl' Bn1n2 l o rlAs we can see the estimate of direct treatment is obtained from all data but it depends on
variances and the variance of estimate not only is less than the corresponding variances
in the standard design, that is, AB vs B Abú also the correlation between them is zero,
that is, î and i ir itrd"p"ndent. In addition as we can see in this design the information
in Subjects stratum is now about r rather than À. With this design the proportion of
information on r and À lost into the Periods stratum is
2/2 2(q-n2)2 rt o \2
5"t", +2P :
4r, ¡ -, - \L - zq)
The proportion of treatment information lost into the Subjects stratum is $ : |, so
there is not much treatment information to recover. This can be demonstrated by an
equivalent formula which shows how little the estimate of r is changed:
î : â",',.- r+"- {(zvrr. * vn.* vrs.) - (2arr. * vzz.+ vzs.)}. (3'5'3)ð(r -t- öp)
and the variance of â is smaller than that of the within-subject estimate by a factor
s(1+sp)/{s(s*8p)},
which varies from 8/9, when p is small and we recover all the between-subject variation,
to 1, when p is large and there is no useful information to be recovered from between
subjects.
T
bb
Chapter 4
A Bayesian analysis for the general
crossover design
4.L Introduction
By sharing information among components of a statistical assessment and possibly in-
cluding external objective evidence and personal opinion, Bayesian methocls have the
potential to produce more efficient and informative statistical analyses than those based
on traditional approaches. In the Bayesian approach, the likelihood function is multiplied
by another function due to prior belief in the values of parameters under consideration.
This product gives us a new function which we refer to as the posterior distribution.
Then in a Bayesian analysis lve should consider the following concepts
1. an observed random variable or vector y,
2. a parameter vector 0,
3. the conditional density f @lg) of y given 0, and
4. the marginal density p(d) which is called the prior distribution of d.
In the two treatment, two-period crossover design, there are a series of papers whìch
have used the Bayesian approach. Racine et al. (1986) and Grieve (1985), for example,,
presented a Bayesian analysis of the two-treatment, two-period design and Grjeve (1986)
used Bayesian analysis for this design when there are two baseline measurements in the
5(
model. Recently Grieve (1994) considered missing values in the Bayesian approach to
this design.
This chapter presents the Bayesian analysis of the crossover design. Emphasis is on
analysis of the normal linear model together with normality of prior distribution for 0.
Most previous studies use un-informative prior distributions. It is argued here that
lve would generally have some expectation about the size of carryover as a proportion of
the direct effect. If so this should be reflected in the prior distribution.
In Section 4.6 we analyse the Bayesian approach of tlvo-treatment, two-period crossover
design and in Section 4.7 we consider this design with one baseline measurement. Bayesian
analysis of two-treatment with two baseline measurements is discussed in Section 4.8.
Bayesian analysis of Balaam's design is considered in Section 4.9. In Section 4.10 we
discuss Bayesian analysis of the two-treatment, three-period crossover design.
Suppose in a crossover design that treatments are to be compared over p periods and
that there are gn subjects randomised to g equal-size groups, such that in each group
a sequence of treatments is applied, and measurements made at intervals of time. The
treatments may include absence of treatment and measurements may be taken in these
periods. To see this more clearly, consider the following layout for general crossover
designs shown in Table 4.1.
Period
1 2 p
Group
I
2
g
T¿j
Table 4.1: Layout of general crossover design
The treatment T¿¡ is applied to all subjects in group i and period j; where ?,¡ may be a
null treatment or a combination of direct and carryover effects.
58
4.2 The statistical linear model
We assume that the model of responses follows the mixed linear model in (2.2.1) in
Chapter 2, but lvith observations denoted by y¿jt , corresponding to the ¿th group, the
j period, and the kth subject within the ith group (k : 1, . .. ,n). Note that the order
of these subscripts differs from that in Chapter 3, in order to faciliiate the l(ronecker
product notation required in this chapter.
The model (2.3.1) can then be written
y:70'f€ (4.2.\)
where a : (U1t,...,a1p,...,U'¡r...rU'no)' is an gpn x 1 vector of observation, such that
Aij : (A¿jr,ynjr,...)A¿jn)t for (i : I,2.,-..,9)i :1,2,...,p), T is a gpn x ú matrix of
known coefficients, 0 is a f x 1 vector of parameters and { is a gpn x 1 vector of random
effects. Again, for the moment, we subsume all other effects into the covariance matrix.
We also assume that the elements of ( are jointly normally distributed. 'lhe model
says simply tirat the conditional distribution of y given parameters d and the variances
of the random effects is the multivariate normal distribution N(70,Q). From Section
2.2.2, the matrices which appear in the covariance matrix Q are mutually orthogonal
idempotent matrices so we can write Q as follows
Q:Ð6nQ'
where Q; arc mutually orthogonal idempotent matrices of rank ri, as given in Table 4.2.
We note that in this formulation, the dimension of the first matrix in the Kronecker
product is g, the second is p and the third is n. \{here it is obvious from the context, we
drop the subscripts from 1, J and K.
? Stratum 6¿ T¿ Q¿
0 Grand Mean o, + po,, 1 JØJØJ1 Period o t p-l JØKØJ2 Subject o' + po? ng -r (IØJs1)-(JØJØJ)3 Period x Subject o2 (p-1)(ns-1) (181(S1) -(/s1(S/)
Table 4.2: Idempotent matrices of each stratum
59
As before, an analysis of this model to determine the estimates of d in the various
strata, lvould not include i : 0 since there is no useful information about treatment
parameters in the Grand l\,{ean stratum. Throughout this chapter, therefore, lve will omit
the term for ôs. We may, however, include i : I if there lvere information available, in
the form of ol, which would allolv recovery of the treatment information in the Periods
stratum. Generally, however, we would set o2n : ñ, so that ár : 0 and the Periods
stratum would not contribute to the estimation of d.
The fact that all n observations in a group have the same treatment regime implies
that
E(y):ry81:T*0Ø1 (4.2.2)
and
Var(y): q,
where q i. {ry¿¡} in standard order and 7 : T* I 1,r. In order to work on means let us
define the average of responses by
1
n(1s181)'y (4.2.3)
4.3 Likelihood and parameter estimation
The likelihood function becomes
1L p@10,Q): l}aÈerp{ fu-rÐ'Q-'fu-re)j
2(4.3.1)
fr.uo î"
"*, - ;å*å,, - ro)' e¿@ - rÐj
slnce
tat det(Q) :3
II ¿I.i=O
IQ-r t Q¿
i=1 6¿
To simplify the exponent term in the likelihood function we define
a:Gy+(I-G)y;
60
Maximising L can be achieved by maximising the logarithm of .[ which shall be
denoted by, /, namely
Sz Ss(4.3.4)
and the maximum likelihood estimate of 0 is obtained by differentiating I and equating
it to zeroat
(4.3.5)i=l
f rolog 6,-t Ðti=l lrn - r-i)'Qî@ - r"o)j - Z(o2 * no2") 2o2'
a0 Ð lr.'orrn - r.o) -- o.
\ : yr.. - uz..- t (^, t!+r),rvhere m: Nl(U n2), and an estimate of the variance is provided by
sz - o2(l +Zp)xlrr^_r¡.
,1I: -,
If we define Q* :ÐiQi, then the solution is obtained as
0 : (7"'Q*T*)-tT*'Q*y, (4.3.6)
and the distribution of d given d is
Ole - N {0, (T"' Q*?*)-t},
provided o2 and o2 + po! are known. Using the marginal likelihood for ^92 and 53, we
obtain
ur+pã?: I S, ,1.
'g(, - \)t'
;z: ' St
tLs@-t)(n-1)"
providing unbiased estimates of ø2 and o2 + po2, respectively. S, has g(n - 1) and ^93 has
g(p - 1X" - 1) degrees of freedom. If in fact the rank of T* is less than gp, there will
be sorne degrees of freedom available which may provide further estimates of o2 andf or
o' +po?. In general, they will have relatively few degrees of freedom and hence we choose,
for simplicity, to ignore this information.
If o2 and o' + po? are unknown and have to be replaced by estimates, clearly the
multivariate Normal distribution now becomes similar to a multivariate ú-distribution. In
literature, in this regard, little has been done in general. However, in the two-treatment
two-period case the combined estimate of ) has the distribution
62
Hence()-À)
- tz@-t\.mSzla{@ - 1)}
Holvevet,^ l, / m.ø2lt*p)\.î : ,furt. - an.) - t trt' -o )
and rve do not have aX2-distribution to estimate o2(\*p).The estimateof this relies on a
combination of 52 and S¡. The result of this, as Grieve (1985) shows in Bayesian context,
is that the joint distribution of (î, i¡ ir u combination of two independent multivariate
f-distributions, which provides a Behrens-Fisher distribution. Patil (1965) shows that
this is well approximated by a multivariate f-distribution.
4.4 Bayesian analysis
Suppose that the prior of 0 is 0 - l{(do, X6), where X6 ma} be singular. Now we discuss
the Bayesian approach for our model in two sections,, one when Ðs is a nonsingular
covariance matrix and another section when it is a singular matrix.
4.4.t !s is a nonsingular matrix
In this case of a design with g groups and p periods, when X6 is full rank,, by following
Lindley & Smith (1972) and using Bayes theorem, the posterior distribution for d is given
by
010 - N(ïe,Ðò (4'4.r)
where
1Var(010):E, (t T"'Qir* + r;t)-t
6¿
and
E(010):0e : tÐf,r.'qîr- +tt')-' ,\lr.'Oîa +>;'r,o¡. (4.4.2)
The posterior mean is a vector-weighted average of the prior mean d¡ and the maximum
likelihood estimate, á. The weights are respectively the prior information matrix X;1
and the inverse of the covariance matrix of the maximum likelihood d. The posterior
63
covariance matrix is the inverse of the sum of these weights, and the posterior cova lance
matrix is smaller than either the prior covariance E¡ or
1
6¿(t T-'gîT-)-t
in the sense that
(Ðlr-'qir.)-'- (: lr.'orr. + xl')-',
is non-negative definite. It follows that
Var(d'010) 3 Var(d'l), (4.4.3)
for all vectors d.
In general,l suppose we are interested in some contrast d'0. For example, if d :
llr,rr,Tz;Àt¡À2]'and rrye are interested in the posterior distribution for (1 - 12), then
Var(r1_ "2ll) : {(T*'Q*?. * Xõt)-td,
where d: 01-1 00 . Then the posterior distribution of (rt - rr) would be
(r1 - r)lâ - N (d' 0p, d'Epd) (4.4.4)
4.4.2 Es is a singular covariance matrix
If the covariance matrix Ðo is not full rank, this implies that certain directions in the
/-dimensional parameter space have no prior information. This can be expressed by
regarding the information matrix Xo as having zero eigenvalues in these directions. Thus
we write k
Xo: DJ.im¿mtn- MIM',, (4.4.5)
where k : rank(Eo),m'¿m¡:6¿j,-n";- 6¿¡ is the Kronecker delta. Then the irrformation
matrix or g-inverse with minimum rank is
kti=l
(t
iTTItàn'¿\--uO
and the posterior distribution for d is Normal with
T,,|
1Var(010) : t',
64
6¿T"'QiT* + Ð0)-t
and
E@p¡ : e, (D T"'?ir* + Et)-'(Ð T-'Qiv + Ð;oo). (4.4'6)
On all occasions we will use the minimum rank symmetric A-inverse of Xs.
If o2 and o' + po? are unknown and have to be replaced by estimates, then O¡O it u
mixture of two non-central multivariate ú-distributions. There is little on this in the liter-
ature. However, Grieve (1985) considers the two-treatment, two-period crossover design
with l/ subjects and, possibly, unequal group sizes of n1 and D,2, and from the classical
analysis approach of this design, he summarises the following independent distributions
for all parameters in the model.
1--ð;
1
ò¿
\lt, ^) ry
\Trr ) ry
SzN
SsN
¡/i(p, )), ÐrÌ;
N{(r, r - À12),Es};
(o' + 2o?)x?u-r;
o'x'¡v-,
N-12
where
Ez : (o, + ,"?ll * l' 'ln f ,
I tl+ *12 )
xs : o'l *l'
'/t I ,
I tls *14 )
where N : nt * nz, m: Nl(u n2) and l: (u - n2)f (npr). By appiying the following
un-informative prior for al1 parameters in the model
p(lt, ¡r, r, À, ..2, o3) o ñ+tð,Grieve (1985) shows approximately that the marginai distribution of Ór: , - \12 and
ó2 : ^12
can be obtained as
p(ót) o(
8(ó, - ^12)'
Jv-1--Tp(óz) o( Sz1
n'r
By referring to the result of Patil (1965) who showed that, if /1 and þ2 arc two param-
eters lvhich have independent shifted and scaled ú distributions, then the sum of two
65
parameters has Behrens-Fisher distribution, Grieve (1985) then shorvs that r : ót I Óz
is approximated byt+l
2
p(r) x
rvhere
(Ss*s2)'?(N-6)r 4
s3+s3(/-z)(s, 1s,)
Law (1987) in his thesis develops a Bayesian analysis of higher-order crossover designs.
For a subset of the parameters, d, of length r, he use a prior distribution lvhich is
multivariate Normal distribution, with mean 0s and covariance matrix o2Ðo. He also
uses a Gamma Distribution for the prior of ø2' Thus
hN -4
p@2) o(
p(01"\ o(
0e
x
t:l
r:lwhere d, 0,00 and Xs are assumed known. Then the prior density of 0 can be derived as
r i .2aÐ^r l-tt+")p(0) x
lr + firo - vù''qr; (d - d.)l u
-," lro,
\r",r,f , Ø.4.7)
a multivariatel-distribution, where r is dimension of 0. He then uses the within-subject
estimate of d as 0lg - N {0,o2(T'T)-t} and shows that the marginal posterior density
for 0 is also the multivariate ú-distribution
ïly - t,l|r,Ep,u I r ! 2al,
where
p
: (T'T + xt')-'(?'rA + z;r0o)
: a(T'T + xtt)-t,
and
lzp + ¿'ol,t0o + a,y - (T,Têt+ t;1p0),(T,T + Ðtt)-t (T'T0 + xo'do)] ,*- (r*r*2QLand u is the degrees of freedom of .9s. The only reason this method works is that Law
(1937) makes the somewhat unusual assumption that the prior for 0 involves the unknown
ø2. Without this, the multivariate f-distribution is not obtained. Similarly, in our case,
to include the betrveen-subject estimates as well, the complication is that there is a factor
o2(t + pp) and,, unless p is known, the terms do not combine conveniently.
66
4.5 Choice of prior distribution
We have four strata in the crossover design which may contain treatment information,
wìth weights ð¿(i - 0,...,3). As before we a e generally not interested in using i : 0,1,
although we might use i : 1 if we had a prior distribution for the period effects n.
In addition in practice the 6z : o2 + po? and á3 : 02 ate not known and we should
replace these terms by their estimates or use prior distributions and integrate them out
if necessary.
In the following sections we will show the Bayesian analysis in certain crossover de-
signs. But first, we need to consider the type of prior distributions which might be used
for (r, )).
As we showed in the previous chapter, the expected values of responses in the two-
treatment, two-period crossover design when the period effect is random can be repre-
sented in Table 4.4.
LL+r p,-r*Àp,-r p*r-À
Table 4.4: Expected vaiues of responses in two-treatment, two-period design.
In our models, the difference between the two treatments A and B is equal to 2r, and
the difference in their carryover effects is equal to 2À. Suppose that treatment B is a
standard on-going treatment, like a baseline, for which the true value is ¡lo and for which
no carryover effect is expected. If we now look at the way treatment A departs from this
baseline, we can model the four means by the revised Table 4.5.
þo*2r ¡ts!2À*rl-to L¿o l2r I r
Table 4.5: Expected values of responses when treatment B is a standard treatment
This table provides a model in which the active drug A has a treatment effect relative
to baseline and in which we might expect any ca ryover effect to be a percentage of that
treatment effect. Such a relationship of treatment and carryover effect can be shown in
67
Figure 4.1, where our prior distribution for the carryover effect is that it is a proportion
k of the treatment effect, as expressed by
P()lt) - N(k''ozò' (4'5'1)
where k is a smal1 value, probably in the range (0 - 0.5) and øfr expresses our degree of
prior certainty about that relationship. We note that the case k : 0 corresponds to the
situation considered by Sehvyn et al. (1981). If we assume an un-informative prior on r,
the joint distribution of À and r is
p(r, À) : p(r)p()l r) x erp{-t^ - kr)'zlzol) . Ø.5.2)
Figure 4.1: Conditional distribution of carryover effect given direct effect
À
ÀÃ.2t
"c
Now we can write
-k T(À - kr)21 (4.5.3)
1 À
so that the prior density can be represented by a singular bivariate Normal with infor-
mation matrix
\]-uoIc2 -k-k1
202o
I,02o [' ^rli; ti
Following (4.4.5), this can be represented as a singular covariance matrix
,,:r5l!'rïl68
4.6 Two-treatment, two-period design
In Chapter 3, we obtained betrveen-subject and within-subject estimates for d in the
Group and the PeriodxGroup strata as
/V
2
0,
0,
l:, 1 #l_: il)110
0
Ut.. - Uz.
1(dr. - dr.) (2r - À)
We noted that, in combining these estimates, we needed to weight d2 uttd d, by th"
inverse of those covariance matrices, using a minimum rank g-inverse in cases where the
covariance matrix is singular.
When this is extended to include a prior distribution for d, the same principle applies,
so that ifd - l/(do, Eo), (4.6.1)
where Xs ma1, be singular, the posterior distribution for d, given (o',,p), is given by
(010,o2, P) - N(00,E), (4'6'2)
where
\-l-4OEr: (T*tQ*T* + -1)
and
0e : Ee(T"'8"7.d+E;do)
: Eo{(T.'Q*?. + Ð;)d + t;(8. - rî)}
: l- to>;Q-eù, (4.6.3)
so the mean of the posterior is shifted from á towards ds depending on the distance
between d utrd 0s and the strength of the prior information.
4.6.I Posterior estimates
Combining the two estimates appropriately with the prior for (r, À) we obtain
r, lr,,,,l n -zl 2n1n2 f o ol ,-rlk'-*ll-'Ep: t"t'L-, ,]*ñ@¡ralo,l*al-r ,ll69
k2 -k l.[:i]]
k
-1
l.,l4-2
-k1 -2 1
1
where b: (1 +2p), and ¿ : No2(1 l2p)lQn1n2o2). Then, after simplification,
Ð (4.6.4)
where
A': alb(2 - k)' + l*2) + +u.
If we assume that do:0, the posterior distribution for 0 \s N(îr.,Xo) where Xo is given
above and (4.6.3) implies that
p: {.[; å].,[;;].1;:]]
Ir-:llt
; I . å
I ; ] t-t +b(z-k)(zi-irr
0
lri-*ork)
k)2b
1.,[; ')+ 1 0p
0 -1
2
If  : kî then the estimate is unchanged, as would be expected since the data is consistent
with the prior distribution.
In other cases, however, the maximum likelihood estimate á is shifted towarcls the
line ) : lcr.
lL, - ak{k - b(2- k)}lâ r a{k - b(2- k)}i2abk(2 - k)î+ {A - 2ab(2- k)}i
(4.6.5)
(4.6.6)
Now, k will typically be smal1, certainly less than about 0.5, and ó is at least one. Thus,
the last term is dominated by the fact that most of our information in the data is about
(2ì - i) u"d there is relatively little information about ).
The two particular cases of interest here are (i) when a : 0 in which case there is
no prior information and the estimate reverts to d, and (ii) when c -+ oo, for then we
know that ) : kr and the estimate has to lie along that line. The above formula then
70
simplifies to
0p1
l.l:îl)'
(4.6.7)k
so that the posterior estimate for ) is k times the posterior estimate for r. As ao2 varies
between these two extremes, the posterior estimate (4.6.5) moves along a straight line
joining d and the point (4.6.7) on the line ) : kr.
4.7 Two-treatment, two-period with one baseline mea-
surement
This section presents the Bayesian analysis of the 2 x 2 crossover design with one baseline
measurement taken before a treatment is given to the subject. We consider that the
experiment yields complete data on n1 subjects in the first group and n2 subjects in the
second group.
Frcl-i Chapter 3 for this design we obtained the following between-subject and within-
subject estimates from Group and GroupxPeriod strata respectively
0,
03
1,,
,1,
'{l
0
Uz )
An. - An.l Yzt. - Azz.
n. * ytz. - 2yrr. - Uzz. - Azs. I ZAzt
0
t)No2T
)
2336' 4n1n2
If we apply the joint prior density in(4.5.2), then these estimates can be combinedwith
the prior for (r, À) to obtain
\-f"pl 4n1n2
3No2
6-3 1
r-]) '
l:'ro2o+
-32
lcz -k].,1
6-3-k1 -32
77
where b: I * 3p and a : 3No2(l + 3fl1(anp2o2o). Then
10rup00
rvhere tr : 2ab(k2 - 3k + 3) + 3b(ó + 2) I ak2 . Using (4.6.3), rve obtain the posterior
mean as given by
r##{.[; ;].,[: :].I
0 k
0
l)
0-E,hl:'- ,-1,
â-x{.1;;1.,i::].1
0e
{, l;-:l.l:l}l**ek-s¡1r+spt IL (s¿-6X1 +3p) j
¡CLo+^
a+Ã
(4.7.r)
illi k 1 0
-1
i( ki )
:0 (À - kâ)
As we can see, if  : frâ, the estimate of d is unchanged. Otherwise the mean of the
posterior for d is given above. We can re-express this in terms of a weighted mean between
d on the one hand, and a vector which iies on the line À-_ kr, as follows:
0o : + I to - ak{k- ó(3 - zk)}lî + o{k- ó(3 -^ zr'l1i l' a L ¡o bk(z - /c)î + {a - 3øó(2 - k)}) j
:3b(ó+zllrl n ftltf Lr ]-tà L* lttut'
-k)(2î -)) + k(b+2)))], (4'7'2)
which again highlights, since k is typically very much less than 1, that there is strong
information about (2, - )) but rather weaker information about À.
As an example of how the parameter estimates change with ø and ó, consider the case
k : 0.2. Then, ignoring the relatively small alcz termin A, (4'7.2) reduces to
t^t ^ l-^^llîl o(À-0.2î) 12.6 I
4ssc+B(b+2) | ILil rsalFigure 4.2 shows what happens according to the strength of the prior information.
I1 of, : oo, rve have ¿ : 0 and we get the maximum likelihood estimates (î, i), an
arbitrary point on the plane. The vertical distance from this point to the line ) : 0.2r
is then 1i - O.Z;;. If this is positive, as in the figure, the posterior mean 0o will lie along
72
Figule 4.2: Relation between posterior estimates lvhen k :0.2
À
?'=0.2r
(r,À)
T
2(yt . - azz.) - (ytr. * an. - uzt. - Yzs.)
Z(Etr. I ytt. - at. - aß.) * @rr.l yzs. - Uzz. - az+
'c( À)
À-0.2r
'c
the steep angled line, and will be somewhere between d and the point denoted by (i, i)in Figure 4.2. This point is the one obtained as ø -+ oo.
4.8 Two-treatment, two-period lvith two baseline mea-
surements
This section presents the Bayesian analysis of the 2 x 2 crossover design with two baseline
measurements which have been taken before each treatment is given to the subject. We
consider that the experiment yields complete data on n1 subjects in the first group and
n2 subjects in the second group.
From Chapter 3 for this design we obtained the following estimates
0z
T
)
3448
No2
'8n1n2 l)If we apply the joint prior density in (4.5.2), then the combination of these densities using
Bayes theorem can be given by combining the two estimates appropriately with a prior
I ,')
for (r, À) to obtain
\-up
0e -{x
8-4 1 ll ïl)'ol+-43
-1N(l + 4p)o' k2 -k l.l:il)TltllZ -k1 -43{,1 ]-,1
8-4
where b : 1 * 4p ar'd a: No2(l * afil@p2øfr). Then
Ðe: ryy{, l;;1.,1; :].1; :l}rvhere [: ab(3k' - 8lr + A) + 8ó(ó + t) * ak2. The posterior distribution is N(do, Xr),
lvhere
0_1 I k2 -kp 0
ofi -ki
k
k+
À
)
u-x{"
,.å{,
,.ål_
4
8 l.l; :lllll rk-1 ê
)T
+ (3k - 4X1 + ap)
(4k-8)(r+ap))
We note that if \ : kî, the estimate of 0 is unchanged. However, in general, for different
values of a, the posterior mean for d will lie along a line between g and a point on the
line ) : kr:
^ I I t¡- ak{k-b(4-3k)}lâ +o{k- b( -r¿lli IaP : ; I A^ALrr) L\êrJ,A, I. Al A"bk(z-k)i+{A-4ab(2-k)}) I
: qilr I ; I
- å I ; l "o"
- k)(zî- i) + k(b +1)i)i' (4.8.1)
4.9 Bayesian analysis of Balaam's design
We have seen in Chapter 3 that Balaam's design is an optimum design to compare two
treatments. In this section rve consider the Bayesian analysis of Balaam's design when
we will restrict ourselves to n subjects in each sequence group.
74
From Chapter 3 for this design rve obtained the estimates as
1i: AL
ru ,,{
0z
Azt. * Uzz. - Utt. - Atz. * 9sr. * Usz. - y¿t. - y¿2.)
2(Ytt. * atz. - Uzr. - Yzz.)
i'l o'+zo?l t -1 lìL^l'',;L_,,11'Utt. - Utz. - Un.I Uzz. - Av. * Usz. * yat. - y+2.)
2(Y"r. - Uzt. - U+2. * Yq.)
I'l "'lt'llL^l'z"lt r)l
If we apply the joint prior density in (4.5.2), then the combinations of these densities
using Bayes theorem can be given by combining the two estimates appropriately with
the prior for (r, À) to obtain
1
2n
"'
2-l +l; l)'i].
ll)',']2
1
4
+
le2 -kFap
-1 -k1
k2 -k].,1
2
-k1 1 1
Ðe:4#{. ll-;i ]-,ll ;l-where L,: abl& - t)'+ 1l + ø[(k + 1)2 + r]+ (2b2 + 6b11;.
The mode of the posterior distribution when go : 0 is
where b: I ! 2p and a : o2(l | 2p)lQnoo). Then
0p : 0-Ðo 0
(1 + k) + (k - r)(1 +2p)
-(2+ k) +(k -2)(1 +2p)
1
-1
tO,0-l
â**
rO,o+^
;l,ll
Â
1 -1 k-t ]
,î
]r-1
+2 -1
kì' )
(5
1i-m;
Norv if À : kî, the estimate of d is unchanged. But again we can shorv that, as ø changes,
the posterior mean moves along a straight line joining (â, i) and a point (ã, i) on the
line À : kr.
[A - øk{l + k - ó(1 - k)}]î + ø{1 + k - b(1 - k)}
ak{2 -r k - b(2- k)}f + [a - a{2 * k + b(2- k)i]0e: * ìl
l;1.*l1
kl{2 + k + b(2 - k)}î + o{1 + k -ó(1 - k)iÀ)1,
4.tO Three-period designs with two groups
From Chapter 3 for this design we obtained the estimates as
ê" : ilUn,' * vzz' r v,*')
; ,n" t vn'+ t" )f
]
'{[;] "s#dl;:l]â, :
å | "n' ' -
î','n,,-:,] --?,n,'*
n,t: *''
]
'{l .i^,,] **l; ;]}If we apply the joint prior density in (4.5.2), then the combination of these densities using
Bayes theorem can be given by combining the two estimates appropriately with the prior
for (r, )) to obtain
1 -kl)'
l)
il)
0 1
-1t{ -k].,1;:] +
1000-k1
where b:2(l+3p) and a: No2(I*3fi1@n1rro'ò. Then
\-P
0
(l)
0
where L^ : ab(3le' + 4) + I2b2 i 3b * a. The posterior distribution is then N(|p,Eò,
where
î, _ zo0p
tCL0-T
â*lt
ls o
L' 4
I.'-
+00 k
01 1 '].âk
3/ú(1 + 3p)
-4(1 +3p) -1
Now if \ : lcî,, the estimate of 0 is unchanged. But again we can show that, as o
changes, the posterior mean moves along a straight line joining (î, i) and a point (i, i)
on the line À : lcr.
kî.
1
^0e
(L,-labk')î+Sabk\ak( b+ 1)â + {A - a(4b+ 1)}i
3ó(4b + 1)(4.10.1)
A
In general, in all these examples, the case 03 : 0 will produce an estimate satisfying
À: kr by moving along a straight line from (â, i) to the line ) : ler. lf ol10, then the
posterior mean is only part of the way along that straight line, as shown in Figure 4.2.
I ; ] . å [ ; ]
*',"r1)kr+3ablci)r,
(t
Chapter 5
Analysis of a two-period crossover
design for the comparison of two
active treatments and placebo
5.1 Introduction
This chapter considers the analysis to the design in Koch et a1. (1989) involving three
treatments in two periods. Those authors were considering a situation involving a chronic
health disorder, in which two treatments were active agents (labelled A and B), with the
third treatment being a placebo, labelled P. In their paper, the authors supposed that
there weÍern subjects in each of the groups 1 and 2, which receivedthe activetreatments
in the order AB and BA respectively, for every one subject in each of the groups 3,4,5,
and 6 which received the treatments in the order AP, PA, BP and PB respectively. In
this chapter, we will suppose that m: l.
The analysis given by Koch et a1. (19S9) considers several models which have either
no carryover effects, equal carryover effects for the two active treatments, or distinct
caryover effects for the two treatments. We note particularly that they only consider
between-subject estimates and, as we show ìn this chapter, there is considerable advantage
to be gained by recovering the betlveen-subject information.
This design is considered further by Laird et al. (1992) who consider two-period
crossover designs in general and who argue for the combination of the two components of
78
the variation. While they consider the variances of the estimates in the different strata,
they do not look closely at the covariance matrices of the estimates in the different
strata, nor do they present either the combined estimates or the covariance matrix of the
combined estimates.
5.2 The linear model
Following the method in Chapter 2, \rye assume that Y;¡¿ is a random variable with the
observed value g¿¡r which follows the mixed linear model given there. The layout is shown
in Table 5.1. We shall suppose that the groups each have n subjects, thus there N : 6n
subjects.
Table 5.1: Layout of design with two active treatments and placebo
Following the model in Section 2.3 of Koch et al. (1989), we shall use a model in
which
¡ there is no residual effect for the placebo,
r the direct effects of the placebo, A and B are respectively,0, Tr and 12,
o the residual effect ofplacebo, A and B are respectively,0, )r and )z
We shall see shortly that the information in the design separates conveniently into a
component corresponding to the average treatment effect lor A and B and another cor-
responding to the difference between the effects of A and B. Accordingly, we define:
Group Period
i 2
1
2
J
4
5
f)
A B
B A
A P
P A
B P
P B
79
o rs : (rt + 12)12 and À6 : ()1 + ^2)12
as the average effects of treatment and
canyover respectively, and
o rp : (r1-r2)12 and )¡ : ()r -Àr)12 as the half difference betrveen direct treatment
and carryover effects of the two treatments respectively.
From the twelve cells which are the combination of six groups and two periods, the
expected values corresponding to the cells in Table 5.1 are as shown in Table 5.2.
Period
Group
I 2
p,*rr*ro*rn p,Irz*zo*)o-r¡r*À¡p,Int*ro-ro p,*nz*ro*)o*ro-À¿p,*rrlrolro p,*nz*Ào*Ào
¡L' I rr p,Irz*roIrop,*rt*ro-ro p,*rzf)o-)¿
p,I nt p'Irz*ro-rn
Table 5.2: Expected values of responses in comparison of two active treatments and
placebo.
10
These equations lead to the following form for the Group x Period means
At.
Utz.
Uzt.
Uzz.
Av.
Usz.
Ast
U¿2.
Ust.
Asz.
Uet.
Uø2.
0110
1
1
1
I
1
I
1
1
1
1
1
1
lt'+
10011001100110
01
01
þ-+
0
I
0
-10
1
0
0
0
-10
0
'tf 1
7f2
1
1
0
0
0
0
0
0
0
0
0
0
0
0
1
i0
0
0
0
0
0
0
0
0
0
0
0
1
1
0
0
0
0
0
0
000000000000000000100100010010001001
10 1
1 1 -110 -111 1
10 1
01 0
00 0
10 1
10 -101 0
00 0
10-1
Tg
Ào
TD
)¡
II e
80
(5.2.1)
OT
a : LÃn* (lu E) 1z)zr' * (10 A Ir)þ. + T*0 ¡ e", (5.2.2)
where d : Ts )s rD Àp . The covariance matrix of y can be obtained b¡,
(5.2.3)
We want to examine the treatment information and obtain treatment estimates by looking
at the vector of means y. By referring to Chapter 2 and incorporating random subject
effects, \¡/e can get the information matrices about d in the four strata which we introduced
in Chapter 2. Table 5.3 shows the relevant projector matrices for the four strata Grand
Mean, Periods, Groups and GroupxPeriod.
Stratum Matrix
Grand Mean aå JeØJz
Period ai JaØ Kz
Group Q; Ke Ø Jz
Group x Period Qå KaØKz
Table 5.3: Idempotent matrices for four strata in the design including placebo.
5.3 TYeatment information
To get the information about treatment parameters as before we consider all strata except
the Grand Mean stratum. The treatment information on the four parameters is then
identified as:
0000
Tl gîT. 0400000000002t 0 0
r20000630036
v ar(s) : *rt" Ø r{2) - t#4(;o a rz)
n:.)
n
.)T*,8;T"
81
TT QäT"
The Period stratum has some information about the parameter )s, corresponding to half
of the available information, but, since it is estimated by the difference between the two
period means, and we suspect that there may be period differences anyway, we do not
attempt to recover this information, concentrating instead on the information in the last
two strata. We see that 75To of the information about the treatment parameters is in
the Group x Period stratum, and 25% in the Groups stratum, while the information
available, i.e. excluding that lost to the Period stratum,, about the carryover parameters
is split equally between these two strata. We note that Koch et al. (1989) only consider
the within-subject estimates available from the Group x Period stratum.
Now the between-subject and within-subject estimates and their covariance matrices
of parameters of interest respectively are given by
A, : Q"'Q;f.)-7^-'Q;y
at..*U2..-Ys..-Ys..
Us.. - Ua.. -l Ys.. - Ya..
(-yt.. r vz.. -f as.. -f 2y¿,.. - us.. - zyu..) l3(zar.. - 2y2.. -f uz.. - u+.. - as.. t yu..) l3
â, Q-'gär")-ro Qia
Gart. t yn. - an. * azr.* ysr. - asz. * arr. - asz.)12
lz(-ytt.I yn. - Uzt.I yzz.) * ysr. - asz.l yq.. - 9+z.I Yur. - asz.l yur. - yur.)l I
[(yrr. - Utz.- Un.lazz.- Uv.* Usz.I ast.- asz.) -Z(ynt.- y+2.- Uat.+aaz.)) lt
?a"r. * ysz. - U+r. * asz. * ys. - asz. * aat. - yur.) l2
n
.)
6
-.)0
0
0
0
18
-9
J
2
0
0
0
0
I6
o2(t + zp)3n
00002-rI2
6
-30
0
.)
6
0
0
V ar(92)
82
V ar(îs)o2
3n
6 900918000 0 230 036
The variance matrices for the parameter estimates reveal some important properties.
The within-subject estimates given by d. show very high positive correlation between
the estimates of r-s and )s, and also between the estimates of r¡ and )¡. We note that
the corresponding correlations in the between-subject estimates are positive, suggesting
that the combination of estimates across the two strata may be advantageous not just
for bringing together the information in the two strata, but to some extent cancelling out
the correlations between the estimates.
5.4 Combination of the estimates
The between and within subject estimates may then be combined by the appropriate
weighting as
0 : {T"'Q;T. + (1 + 2p)7.'QiT"}-'{7.'8îT.0r+ (1 + 2p)T.'QiT.fu}
The estimates of the four parameters may then be given, using the same layout as
Table 5.1, by:
1
2(7 + t4p -13p2)Tg
2-l p-3p' 3(1 * p),
2* p-3p' 3(1 + p)'?
2-l7p+3p' _¡(1 + p),
-4(r + 2p) 0
2-l7pl3p' -g(1 + p),
-aQ + 2p) 0
83
Ào1
2(7 +t4pt3e2)
1TD
2(7 + r4p -t 3p2)
io: 1
2(7 +\4pt3p2)
The variance matrix for these combined estimates d is
V ar(0) (; *r"r*' q;r* * ir.' qär.) -'
oz(t + 2p)
2n(7 *r4pt3p2)
6(t + p)
3(1 + 3p)
0
0
3(1 + 3p)
6(2 + 3p)
0
0
0
0
zçt + e)
l*3p
0
0
7t3p2(2 + 3p)
5.5 Conclusions
The estimates obtained in this way vary according to the value of p : o? lo', which
needs to be estimated from the data. Even with only 5 subjects per group, the lvithin
and between subject variances are estimated on 24 degrees of freedom and will give a
stability to the estimate of p rvhich rvill provide estimates largely unaffected by the errors
S+13p+6p'r-p-6p'5*i3p+6p2l-p-6p"
I*2p+3p2 2I p-3p'
-(2+ p-3p') -(7 + I4p I3p2)
).*2p+3p2 2* p-3p'
-(2+ p-3p') -(7 +t4p*3p2)
_(r + p),2+5p+p'(r + p),-(2 + 5p -f p')
_(r + p),2+3p- p2
_2(r + p) -2(r + p)2
-(2+3p-p') (r + p),
2(r + p) (\ + p)'
3*5pl-l p
_(1 + p) -(3 + 5p)
l-3p' 4I9p * 3p'
lI4p+3p'-p(t + 3p)
-(r - 3p') -(4 + 9p I3p2)
p(r + 3p) -(1 +4pt3p2)
84
in that estimate. The estimates of o2 ¡ 2o2, and o2 are the mean squares W2 arrd. W1,
the Subjects within Groups and the Period x Subjects lvithin Groups mea,n squares,
respectively. The estimate of p is then given by equating the observed variance ratio to
its expectation, so that
þ: (#- EF)IQEF)',
where EF : 24122 is the expectation of an F-distribution with (24,24) degrees of freedom.
The standard deviation of p then follows from the variance of the same F-distribution
and is given by
SD(P) : 46
2x24x20:0.22
(Johnson k Kotz, 1970, Chapter 26). Except when the estimate of p is close to zeto'
changes in p of the order of one standard deviation will have a small impact on both the
form and the standard deviations of the estimates.
As p changes, the estimates move between the estimates at p : 0 which ignore the
grouping and treat all observations as independent, and the estimates at p : æ which
regards the subject differences as so large that only the within-subject estimates are
useful. In the latter case, it can be seen that the estimates revert to á3; while in the
former case they are not simply the averageof 0z and ds due to the fact that the variance
matrices are not proportional. For example, the estimates of 7s and Às in the case p : Q
may be written as
1:-t4
Tg
io
5îoz*9îo¡*4(io2-Âæ)
9Âor*5i0.*6(io2-îm)
Figure 5.1 shows a plot of the standard deviation of î¿ as p changes. It ignores the
factor o2 fn introduced by having n subjects in each group. The horizontal axis here is
ptl" in order to show the change more clearly. The upper limit of ,Fø: 0.816 is the
standard deviation achieved using the rvithin-subject estimates, which are the estimates
used as p)æ,whilethelowerlimitof ,Ftf :0.378isachievedif thereisnobetrveen-
subject variance and p : g.
85
Chapter 6
Cohort designs for two-treatment
crossover trials with one baseline
measurement
6.1 Introduction
In previous chapters, we showed that the average effect of treatment against baseline was
not estimable unless strong assumptions were made about the absence of period effects.
or the existence of a random effect for periods with known variance. In this chapter
we apply an alternative design which under some conditions enables us to estimate the
average effect of treatment against baseline. This alternative design is like an age-period-
cohort design, in that it supposes that periods correspond to the times of the year and
that some groups of subjects, referred to here as cohorts, entet the trial i','ith a delayed
start. In particular, we suppose that each successive cohort is delayed by exactly one
period. Provided we can assume that period effects are related to the time of year rather
than the length of time in the trial, referred to here as the 'age', this design will allow the
estimation of the average effect of treatment against baseline. Section 6.2 summarises
Age-Period-Cohort designs as they currently exist in the literature and illustrates the
association between these and our proposed 'cohort designs'. In Section 6.3 we build the
linear model for both the standard design used earlier and the new 'cohort design'. The
standard design with one baseline measurement taken before the first period treatment
is analysed in Section 6.4 and the corresponding cohort design with just two cohorts is
87
considered in Section 6.5. In Section 6.6 we discuss the treatment information for both
designs and in Section 6.7 we combine the treatment information in each stratum to
get estimates of the parameters of interest for the tlvo designs. The limiting cases ofo?rp : 3 -+ oo and zero are given in Section 6.8 and finally in Section 6.9 we make some
general conclusions about using cohort designs rather than standard designs.
6.2 Age-period-cohort design
The aim of this section is to give a general overvielv of age-period-cohort designs. Up
to now there are a lot of publications in which they attempted to show all features of
this design and to expiain all sorts of things on the basis of three variables, namely:
age, period and cohort which occrlpy a central position in cohort design. Based on our
literature review on some articles in this regard we can claim that the precise meaning
of the variables age, period and cohort, and thus also what is meant by age, period and
cohort effects, varies considerably from one study to another.
I(upper et al. (1983) discuss the specific problem of age-period-cohort analysis within
the general framework and give the following definition for age-period-cohort design:
"age-period-cohort analysis concerns methods for statistically analysing of such data
gathered on human population followed over time, the purpose of such analysis being
to quantify accurately patterns in the separate effects of age, period of time (e.g., as
measured by the year of occurrence), and cohort membership (".g., as measured by year
of birth)." Hagenaars (1978) in his paper gives the following definitions for these three
variables. He says that "in general it may be said that age is regarded as an indicator
for all the possible changes which are related to becoming older. Age can for example
be taken as an indicator for phenomena which are mainly of a biological nature, for
example becoming sexually mature, becoming tired more easily, getting ill more often."
For definition of period he says that "Period refers to all events which have taken place at
or between the moments of observation and which have influenced the phenomenon being
studies. Here too it will often be difficult to indicate exactly which event is responsible for
changes observed." Finally he defines cohort as "A collection of people born within thc
same period or a collection of people who have experienced a fundamental event during
the same period."
88
Therefore in our study we rvould like to draw attention to the suitability of the cohort
design for the analysis of crossover trials rvith the follorving definitions for the age, period
and cohort variables:
(i) Age can be defined as the length of time in trial and geometrically can be represented
as each diagonal in Figure 6.1,
(ii) Perìod in this situation is the same as defined generally for each crossover trial in which
experimenters take observations separately on that duration of time and is represented by
the columns in Figure 6.1. Period may be of short duration (e.g. hours) or long duration
(e.9. months).
(iii) Cohort is a group of subjects which start together in the trial at the same time is
represented by the rows in Figure 6.1.
To clearly show this we represent it schematically in Figure 6.1 emphasising the
definitions of age(A), period(P) and cohort(C) for our so-called cohort design.
Figure 6.1: Diagram of cohort design and its mP4
ain concepts in crossover trial
---------------- cl
+C2
-
c3
+c4
The main and interaction effects of age, period, and cohort are not a1l estimable. This
difficulty, usually referred to as the identification problem, is caused by the fact that a
certain fixed, direct relationship exists between age, cohort and period. We can express
algebraically that age (A), is the difference between the moment(P) of observation and
the start time (C), that is A: P - C. In this framework, the rows, C arc orthogonal
to the diagonals, A . Hence, the P can be thought of as part of the A x C interaction.
Thus, fitting the linear effects lor C, the linear trend for A and the linear trend for P
provide 3 contrasts, only two of which are estimable.
This identification problem can perhaps be shown even more clearly by rneans of a
more formal demonstration. To show this lve refer to the general development model of
89
Schaie (i965). He gives an additive model in rvhich each observation is seen as a rveighted
sum of the linear effects of the three independent variables
Y :boA+b"C IbrP
Since A: P - C it is also true that
Y : b"(P - C) + b"C + brP - (b" ¡ br)P + (b"- b,)C,
this can be reformulated as
y:eppIq"C.
In other words the additive model with three linear terms can be reduced to a model rvith
two linear terms which predicts exactly the same Y value as the three variable model.
Only two independent effect parameters are found to exist, that is, qo and q", from lvhich
it is no longer possible to deduce the three original coefficients bo,b",bp.
In our situation, we will be assuming that 'age' effects are not present, but that
'period' effects may be. This will only be appropriate in particular cases, such as asthma
trials where a subject's condition may be related quite strongly to the time of year in
which a treatment is applied.
The potential advantage of the baseline measurement in crossover trials is that it
enables estimation of the absolute effect of direct treatment by comparison rvith the
baseline. This advantage is lost if there are period effects present. The use of the cohort
design as described here is that the absolute effect of treatment may again be estimable,
even when period effects are present, provided age effects, as described, are not present.
For this to be successful, we require the following :
(i) Subjects enter according to a protocol so that no essential differences exist between
those subjects entering as the first or second cohort.
(ii) Length of time or experience of being in the trial does not influence the results. This
might not be satisfied, for example, if blood pressure is related to anxiety and the first
cohort is less anxious in period two than the second cohort. Different blood pressure
betlveen first and second cohorts at period two may then be partly due to anxiety level,
and hence related to what we have called here 'age' or time in trial.
The potential advantages of this design in crossover design are:
o it is useful when we suspect period effects (e.g. time of year), and
90
o we can estimate absolute effects of treatment and ca ryover
6.3 Building the model for standard and cohort de-
srgns
To shorv the efñciency of the cohort design lve compare it to the standard design of
crossover trial with baseline shorvn in Table 6.1. Here we use tn'o groups of subjects,
where each row represents a group of subjects, and each column represents one period.
In this situation, periods refer to the time a subject has been in the trial and to calendar
time.
Period
I 2 ,f
Group 1
2
A B
B A
Table 6.1: Two-treatment three-period crossover design with baseline measurements
Suppose, however, that subjects enter the trial at different times, as the trial pro-
gresses. It may be that we anticipate an effect related to the time of year in which the
measurement is made, but that we do not expect an effect related to the length of time
a subject has been in the trial.
To illustrate the idea, suppose that a study is undertaken as above, but with half
the subjects having a delayed entry at period 2 as shown in Table 6.2. ll there are 4n
subjects, this design would appear as shown in Table 6.2, with 2n subjects in each cohort,
and the subjects in each cohort being randomly allocated n to each of the two treatment
regimes. We note, for example, that a direct between-group comparison of treatment
with baseline is available in period 2 under certain conditions which we will explore in
the next section.
91
Period
1 2 ,f 4
1
2
3
4
Group
A B
B A
A B
B A
Table 6.2: Cohort design in two-treatment three-period crossover design with one baseline
measurement
In order to compare this with the standard design, it is convenient to present the
standard design in a similar way, with 4 groups of n subjects, as shown in Table 6.3.
Table 6.3: Two-treatment three-period crossover design with one baseline measurement
repeated as 4 groups of n.
Our purpose in this chapter is to compare the cohort design in Table 6.2 with the
corresponding standard design in Table 6.3. We assume that subject effects are random;
and that we have fixed effects for the period and mean ¡-1. The treatment parameters are
defined as:
o 16, the average effect of treatment; i.e. $¿ minus baseline;
. Ào,theaverageeffectof carryover; i.e. #.,fo, carryovereffects,minusbaseline;
o Zrp, the differencebetlveen A and B; i.e. rpfor A and -r¡ for B;
o 2)p, the difference in carryover between A and B; i.e. À¿ for ca ryover of A and
-À¡r for camyover of B.
Period
1 2 3
1
2
3
4
Group
A B
B A
A B
B A
92
To consider what treatment information is available, we can write out as in Table 6.4
the 12 means in terms of the treatment effects in the following table which could be
applied to either the standard or the cohort design.
p lt+ro+rD p,*ro*Ào-rnlÀn
11 p,lro-ro þt lro*)o*rn-Ànp p+ro+rD p,*ro*Ào-ro*Ào
11 p,Iro-rn p,*ro*)o*rn-Ào
Table 6.4: Treatment effects for the 12 means of observations for two-treatment three-
period crossover design
If we write y : (Art.,. . . ,A¿s.)', corresponding to the 12 means written in order with
periods changing quickest, then groups, and cohorts changing slowest, then we have
a:To0o*e,
where
?o: 1z I
10101111i111
0000010-11 -111
0
0
0
0
1
1
(6.3.1)
and 06 : lpr10,Ào, rnrÀo]', and e represents the random components, to be defined. If
e - l/(0, Ql"), then the overall information matrix is
nTlQ-rTs.
In many cases, Q can be expressed as ff=o ô¿Q¿, where the Q¿ are orthogonal idempotent
matrices, such that Ðl=oQ¿: .[. Then the total information matrix will be
k
nl6;tr[e¿ro, (6.3.2)i=O
where the 6, are estimated from data. For some values of i, for example for the grand
mean Qo : J , the term may be omitted if we feel that it contributes no useful information
93
on the parameters of interest' Since DQ¿:1, it is useful to consider
642 0 0
0
0
2
2
nlf[qofo: nTåTo :2ni=0
442 0
222 0
000 4
000-2
as the overall information matrix which rvill be partitioned into components and then
recombined with different weights as in (6.3.2) in the final analysis.
Information about the parameters of interest is only contained in the space orthogonal
to the grand mean. Thus, while 4To gives the overall information matrix, the term
T¿JT¡, corresponding to the grand mean) represents information which is not available
to us since it just represents an average level of response. The information available about
the parameters d : (ro, Ào,rD,)¡)' of interest is obtained by using the component of Tq
which is orthogonal to the mean vector p,I. ln particular, this reduces our matrix Ts to
the matrix
Kro - år, *
1T: -1, R6 --
0-40-402020202
000060
-60-666-6
r)
-2-2-2
4
4
from which we can discard the first column of zeros to obtain
-4-4
2
2
2
2
2
2
2
2
4
4
0
0
6
6
6
6
0
0
0
0
6
6
(6.3.3)
94
Hence the available information about 0 : (ro, Ào,rD, À¡)' is contained in the information
matrix
.nT: n,T'T : -oJ
84 0
48 0
00 24
00 -12
0
0
-t212
(6.3.4)
This is the same for both standard and cohort designs. We want to look at T|Q;T,the
information contained in the ith stratum. This will be different for the standard and
cohort design.
However, we want to see where the treatment information goes, and holv much is lost
when we take out periods. We could do that by putting a term for periods in the model
and comparing joint information matrices for d with and without periods.
An alternative is to consider the projection matrix which takes out period effects and
see which part of Z ends up in that subspace. Thus, we are seeing Q as not just a variance
matrix, but rather as a way of splitting up t € ,R12 into separate components based on
the strata in the experiment, to see how much treatment information is lost into each
subspace. Later on we can make the choice as to which components we recombine to get
treatment estimates.
6.4 Standard design
We first look at describing the treatments effects for the 12 means contained in both
standard and cohort design defined in Section 6.3.
6.4.L Information matrices and parameter estimates
In the crossover design with baseline measurements described in Section 6.3 we assume
that we have means with expected values as shown in Table 6.4.
(I-J)y:T0te, (6.4.1)
(6.4.2)3
Q:Ð6oQ.i=l
rvhere e - l/(0, Qln) and
95
where Qt,: J 81(8,/ represents the projector matrix for Periods, Qr: (1S/A 1- "/8J Ø J) represents the projector matrix for Groups, and Q3 : (18 I{ Ø I - "/ 8If S /)represents the projector matrix for Periods x Groups, and the matrices in the l(ronecker
products are of dimension 2, 3 and 2, corresponding to Cohorts, Periods, and Groups
withinCohorts,respectively. Thecoefficients 6¿in(6.4.2) areô2:o2*3o!,fi:63:02,although d1 would also include a component relating to the variation between Periods.
Since it is unlikely that we would regard the periods as a random rather than a fixed
effect, we would not use ð1 in our recovery of treatment information.
We can think of the information on d as coming from the three different strata. Thus,
since ff=, Q¿: U - J), and since T'JT :0,,
r'r :f,''on''t=l
Hence, the information matrix for the ith stratum is T'Q¿T. These will be given in the
first column of Table 6.8.
The estimate of 0 from the ith stratum is
0¿: (T'Q¿T)-tr'Qny.
and these can be combined using
" - (P 6irT'8¿T)-'(Ð 6;IT'Q¿T0¿), (6.4.3)
provided suitable estimates of ô¿ are available. The cases i :0,1 correspond to the grand
mean and periods, respectively, and we will not attempt to recover information from
these strata. Hence the summation in (6.4.3) will be for i : 2,3 only.
6.4.2 Analysis of va-riance for standard design
For the standard design from equations (6.4.1) and (6.4.2), we can write out the table
of the analysis of variance for the standard design. We can write
vE(YY'):{Q¡nT00'T'},
where we have ignored terms involving the periods, from which lve can get the expected
sum of squares for each stratum as
n E {Y' Q ¿Y}
:'i,.-")!å r- -"ïrT r?;Tl}'
96
Since trace(Q¿) : rank(Q¿), then the expected mean squares are
EIVISI : ô; * nq'T'Q¿T0lrank(Q¿),
as shorvn in Table 6.5.
SSSource EMSDF'
Table 6.5: Analysis of variance for standard design
6.5 Cohort design
In the cohort design we have the same model (6.4.1) for the 12 means, and the matrix
Q still has the same covariance structure in terms of the between and within subject
effects. However, the projector matrix for Periods is now changed, because the 12 means
are now assigned to five different periods, rather than the original three. Furthermore,
Groups and Periods are not orthogonal. This implies that we cannot ìust use the Groups
stratum as it stands because it has some period effects in it. We can explore the different
strata by considering:
o Periods (ignoring Groups), with 3 degrees of freedom,
o Groups(ignoring Periods), with 3 degrees of freedom, and
o the Group x Period interaction, with 5 degrees of freedom.
The first two of these can be examined by considering the following contrasts. Contrasts
on Groups can be expressed as the row contrasts 91, 92 and !3:
LlQØJØJ)gGrand Mean
Periods
Groups
Period x Group
Subject within Groups
(Subject within Group) x Period
na'Qta
na'QzY
na'QsY
o2 ¡ Jo! j no'T'Q2To lJo2 + no'T'q3ro 16
o2 + 3ol
o2
2
J
6
4(rz - 1)
8(n - 1)
lZnTotal
97
1 -1 -1
1 -1 1
I 1 1
1 I 1
1 1 1
1 1 1
-1 1 1
1 1 1
-i i 1
1 I I
1 1 1
i -1 -1
9z
from which a projector matrix for Groups (ignoring Periods) can be formed as
3
ec : Ð gngil@'ngo)i=l
Similarly, contrasts for Periods are the column contrasts pt, pz and p3
9z9t
0 -1 1
0 -1 1
-1 1 0
-1 1 0
-2 1 1
-2 1 1
1 1.t
1 1 ,
(6.5.1)
(6.5.2)
(6.5.3)
Pz
and the projector matrix for Periods (ignoring Groups) can be given as
3
ep:løn;l@,;vò.i=l
PzPt
Due to the fact that groups and periods here are not orthogonal, QçQ, t 0. We can
identify the relationships between the six contrasts by the following matrix of cross-
products. If we let W : (gt,,gz,gs¡Pt,,Pz,ps), then
W,W
t2
0
0
0
0
4
0
t2
0
0
0
0
0
0
0
0
24
0
4
0
0
0
0
4
0000t20080000
which shows the lack of orthogonality between groups and periods because g'Lh + 0.
This selection of contrasts is such that:
. (gz,gs) capture differences between groups within cohorts and are orthogonal to periods,
. (pr,pz) capture differences between periods which are orthogonal to groups, and
-1 0 0
1 0 0
0 0 1
0 0 1
98
o 91, the difference betrveen cohorts,, is not orthogonal to p3, the contrast between the
first and last period.
The implication is that 2 degrees of freedom for groups (ignoring periods), correspond-
ing to !2 and !s, a,re the same whether we fit groups before periods or periods before
groups. Similarly, 2 degrees of freedom for periods (ignoring groups), corresponding to p1
and pz, are the same whether we fit periods first or second. The only difference between
the two analyses arises from the order of fitting g¡ and p3.
If we fit groups and periods together, the projector matrix is
Qcp : w(w'w)-rw', (6.5.4)
and, since all these columns are contrasts and hence l'W :0, the projector matrix for
the interaction stratum is
Qnt:K-Qcr.
Figure 6.2 shows the relationship between the vectors produced using these projector
matrices. Thus, if we consider a vector Qcpy, as the projection of the vector y onto the
space of Groups and Periods, in a case when the Groups and Periods are not orthogonai
to one another, then the vector can be decomposed in two distinct \¡/ays. The first of
these has a component in the Groups space, Qçy, and a remainder, QA7A, in the Periods
(eliminating Groups) space. The second has a component in the Periods space, Qpy, and
a remainder, QcpU, in the Groups (eliminating Periods) space.
Figure 6.2: The relationship of projector matrices in cohort design
". Q
",rY
We can obtain Qçp directly, or we can obtain it by orthogonalizing the columns of
I4l. This is conveniently done in one of two ways:
acpi
a prcY
Q^yU
Groups
99
(i) rve can rewrite W, equivalently, as
in which all columns are orthogonal then
W*: 9t 9z 9s Pt 'Pz (3Pt - gr)
Qcp: Qc -l Qeg,
where
Q rp : rrnrr\ + )nrn, + f,n;,nr-
gr )(3ps - gr)'
represents the projector matrix for Periods eliminating Groups.
(ii) We can rewrite W, equivalently, as
¡y+ (gt - p") 9z 9s Pt Pz Ps
in which, again,, all columns are orthogonal, so that
Qcp:QpiQclp
and
Qqe: å,n, - pz)(gt - p")' + |krsl+ grgL)
represent the projector matrix for Groups eliminating Periods. From (6.5.3) we can
derive Qcp directly. We need the inverse consisting of the first and last row and column
Itz4l lr -rIsubmatrixin (6.5.3) thatis,theinverseof | - l,namelyl I l,so that
lt 4)' "'l-r 3l'
ecp : |ø,n;+gssL)+fn,ri +fin,n*ål* -]ll, ;tll;; llr 1 I l'
¡rezn'z+ g"gL) + intri + finrnL+ ,kts't - etP's- pssi + 3pspå)
For the cohort design we can show the projector matrices in two different orders that is,
in one case we fit periods first and in the second case we fit groups first. Table 6.6 shows
these projector matrices for the cohort design. The matrices 41, Az, Br, Br,CrrC2, D1, D2
are shown in Table 6.7.
100
Cohort Design (Groups first)Projector Matrix Cohort Design (Periods first)
)ø*t1
B2
B
,4,
Ai
A2
AiØJzQp: äPeriods
tI
aGPIC Cr,
1
24
Ct CzQc: I{4Ø J3
l.,t\bGroups
Qn:frInteraction
(A 1 represents A1 transposed about the reverse diagonal; that is, about the diagonal
running from bottom left to top right of the matrix.)
Table 6.6: Projector matrices for cohort design for two orders of fitting
A I
5
-1
-1
1
2
1
1
1
2
Az:-1
2
-1
I
I
2
I
1
1
Bt:8
4
4
-4 4
I
5
b
-1
B
0 003-30
-3 304
-44
-44
-4
-4 4
4
,7I
1
I
1
-44
-1I
-1I
4
-47
-1I
-1
-44
4
4
4
4
4
1
7
1
I
CI
0
0
-ð
-3-3-.t
0
0
3
3
rt
ù
0
0
,)
3
.)
.)
0
0
-3-3-3-,)
0
0
0
0
0
0
0
0
0
0
0
0
Cz:
Dt:
8
-8-4
4
-44
-88
4
-44
-4
-44
11
-5-I
1
4
-4-511
1
-7
-44
-7I
11
-5
4
-41
-7-511
Dz:
0
0
3
3
11
.)
0
0
t)
r1
3
rtr)
0
0
,f
3
3
ó
0
0
t)
,f
3
,)
0
0
0
0
0
0
0
0
0
0
0
0
Table 6.7: Matrices introduced in the table of projector matrices for cohort design
101
Thus for the cohort design lve have four strata i.e Groups with 2 degrees of freedom,
Periods with 2 degrees of freedom, the rest of the Group{Period stratum with 2 degrees
of freedom and the GroupxPeriod interaction with 5 degrees of freedom. The partially
confounded 2 degrees of freedom in Groupf Period can be presented as the two contrasts
Gps(ig. Per) Pers(elim. Gps)
a,1 1
o 1 I
-1 1 2
-1 1 2
9t 3pz - gz
or as the two contrasts:
Per(ig. Gps) Gps(elim. Per)
Pz 9t-Pz
Period effects may be large and hence we would prefer to remove them first and base
our treatment estimates solely on the information in the Group (elim. Periods) and
interaction strata for which the projector matrices are Qclr and Qm.
6.6 Tleatment information
The matrix in (6.4.2) gives the total treatment information available after removal of
the grand mean for both the standard and cohort designs. We now consider where
that information resides in the different strata. For the standard design, this involves
calculating T']iT(i:1,2,3), while for the cohort design, we need T'QpT,T'QclpT,
T'QnT. In each case, our intention is to discard the information contained in Periods
(ig. Groups) on the grounds that there are few periods and that the effects of periods
cannot reasonably be separated from the treatment effects contained therein. Table 6.8
shows where the treatment information occurs in the different strata.
1 1 1
-1 1 1
1 1 I
1 1 1
1 0 0
1 0 0
0 0 I
0 0 1
0 i -1
0 1 -1
1 1 0
1 1 0
r02
Cohort Design
(Groups first)
000000000008
n6
0
0
0
0
6
ù
-t)
0
0
-ù3
0
0
0
0
48
24
0
0
-2416
n6
16
8
0
0
8
16
0
0
0000
48. -2424 24
Table 6.8: Information matrix in each stratum for both designs
Table 6.8 shows that the information about rp arrd )p is distributed between the
strata in the same way for both standard and cohort designs. However,, for rs and Às,
the only information in the standard design is contained in the Period stratum. Thus
information is unavailable unless we make strong assumptions about the period effects.
For the cohort design, and in particular when we take out Period (ig. Groups) first, some
of the information about (to, Ào) is available in both the Groups (elim. Periods) and
Group x Periods interaction strata. The matrices in the middle column of Table 6.8 shows
that the information matricesfor (16,À¡) in Groups (elim. Periods) and GroupxPeriods
are each of rank 1, the first corresponding to the sum and the second to the difference of
the two parameters. This is formalised in the next section.
Cohort Design
(Periods first)
Standard DesignStratum
10
8
0
0
8001000000000
b
13
11
0
0
11 0 0
1300000000
n6
16
8
0
0
8001600000000
6Period
J
ù
0
0
6
300300000008
Group
0
0
0
0
n6
000000000008
ù
t)
0
0
ù
,f
0
0
0
0
48
24
0
0
24
16
Int
0
0
0
0
00000480 -24
0
0
24
16
6
16
8
0
0
8
16
0
0
0
0
48
24
0
0
24
24
n6Total
16
8
0
0
801600480 -24
0
0
24
24
n6
103
6.6.1 Information matrices on þt - ro * Ào and B2: ro - Ào
By referring to the cohort design information matrices in Table 6.8, we can see that the
information matrices of Groups (elim. Periods) and Interaction strata about rs and )s
parameters are of rank 1 and have information on (rs * )o) and (16 - )o) respectively.
Hence to get information matrices for our parameters of interest that is,' rs and Àe, lve
consider two new parameters, that is B1 and B2, where
rs : (0, + 0r) 12
)o : (þ' - þr) 12.
The information matrix for the parameters þ :
the information matrix for the parameters ds :0, can be obtained in terms of
)o by using the formula
p
Tg
l
t'
Ip: GIeoG',
where
G:ffi:1,,,,7
":)Then the information matrix about Ér and 0z for the Period (ig. Groups) is
1
rl[':,:]ll1 n
6
9001
GHG' : +n 1 i
where 11 is the top 2 x 2 of matrix in Table 6.8 corresponding to (rs, À¡) from Period (ig.
Groups) stratum. If this is done for the other information matrices in the table, we a e
led to the information matrices for B given in Table 6.9.
104
(t)
+ H 5 (t)
H Ê-
stj n a- t o (t oq H
l\o !i äo4
o N\J
9(h ;oc
).-
: Ô
!l ^
cn=
+U
H
H =o
-=Ø
óoe H
FÚ n ô a- o)lJ
o r.
9
ÀO
\\/
ollJ
O
olJ
O
O¡.
o
H o O Ó)l¡ O
H c+ o 4 P c+ H O ôl¡
C¡:
O
o)lÈ
C/9
O
ll -+ ôls
O
ì.O
ÈO o)
lË
O
t.9
o)15
ot9
È p o i^ H n H H o H p c+ n X Þ.) H $ N Ø H p -l t- H 4 <t o a) oc H Ø
H Ð Èa
F< + 4 ol + o ol I
b.9
CJro E a- (t ûc !r _
,¿
H Ø -¡ cJr
ts (t o H H tu H ê- Ø b.9
CJr O
H o tst o -l- { crr
ôrl
HO o-;
i;O oc
o H o o c+ p ûc o +) n H + H H - o p o cn H 9.)
c+ H p ct ol _l_ I p H p- ol I o * o' 4 c+
OJ o) ie { o r.t o Ø + o el-
r-i o p + t{ o c+ Ð E p el- o a-
,
C,I
two parts of the parameters of interest lies. By referring to Section 6.5 we can say that
Groups (elim. Periods) stratum can be d.ecomposed into two substrata as Groups within
Cohorts vr'ith the projector
(fis,d,+ l nú),
with 2 degrees of freedom and ô2 : o2 I3o,2 and the one degree of freedom for Cohorts
(elim. Periods) with the projector hdd', where d: (gt -ps). We need to establish the
appropriate variance to use for the contrast d. The projector matrix corresponding to
this contrast is d(d'd)-td' to that the sum of squares for this stratum is
S Sc"p : a, d(d, d)-, d,, y : fi@, r),
: r1{a, ù)r,
and the expected sum of squares ignoring the treatment effects
E (S S c "ù : ltror" E (d'' yy',1).
Now we know that
var(y) : o'(t Ø K) + (o" + 3o!)(K Ø J) (6.6.1)
: 02I +zo!(I Ø J) - o2J
: o'(I - J) +Jo!(I I "/),
and we can show that d'(I - J)d,: ô : 8 and d'(I Ø J)d, :16/3. It follows that, ignoring
treatments for the moment,
E(S Sc"P) : o2 ¡ 2o2" '
The contrasts 92 and 93 form the Group within Cohort stratum with two degrees of
freedom and they provide the sum of squares
1
S Sc,c : fitrace{g'2yy'gz
+ gLay'gt},
for which the expected mean square can be expressed, using (6.6.2), as
E(M SG.c) : h*r*rÐlr'g'n| - J)go + 3o'?,si| Ø J)g¿lj.
Now we have
g'¿(I - J)gn : 12, g'i] Ø J)g¿ : L2
for i : 2,3,, so that, ignoring the treatment effects again, we obtain
E(M Sc-c) : o' -f 3o!'
So to get 0, we have the following Table 6.11.
106
Stratum Parameter estimable degree of freedom IVISE
Cohort (elim. Periods) (to, Ào) I o2 +2o?
Group within Cohort (rr, Àn) 2 o2 ¡ 3o2,
Interaction (to, À0, ,o, Àn) 5 o 2
Table 6.11: Those strata which contribute to estimate some parameters of interest
6.7 Combining information
The parameter estimates in (6.4.3) can now be obtained by combining the information
from these three strata. The pair (â0, io) and the pair (â¡, ¿) ut" uncorrelated with
each other in all strata, and hence lve can look at their estimates separately. Information
about ("o, )o) is available in two strata, with weights o2 and (o2 +2o2,), while information
about (rn,Àn) is available with weights o2 and (o'+ 3øl). trstimates of o2 and o! arc
obtained from the appropriate residual lines in the analysis of variance in Table 6.5, these
being calculated in exactly the same way for both the standard and cohort design.
6.7.L Estimation of ("0, Ào) in cohort design only
For the average effect of treatments, that is (16, )s), we can obtain estimates from each
of the Group (elim. Periods) and the interaction strata, as shown in the middle columns
of Table 6.12, and then combine them with appropriate weights using:
âo
io I ]) '
ri{Q"t'+ (1 + 2p)Q'^'}v'{l:3
o.)
(1+rJ
.)
ri(Q.tr+(1 +2p)Qm,)a,(r + 2p)
r*pp
1
where Zr is a submatrix of matrix 7 given by the first two columns in (6.3.3), corre-
sponding to the pair of parameters (rs, Às). This leads to the contrasts represented in
the last column of Table 6.12. The covariance matrix of these combined estimates is
fr o2 7*pp
p
1*pn
r07
(6.7.1)
Cohort design
Estimator Periods
(ig. Groups)
Groups
(elim. Periods)
Interaction Combined
7g1
-4r2-4t2
t2-2t2-2
36
0
0
II
II
-9-90-9-90
0 9-90 9-9
-9 90-9 90
It
010010
-1 00100
)o
2
2
2-l-2 I
-2 -1 4
q t4
1
099099
-9 -9 0
-9 -9 0
0-9 I0-9 I
9-909-90
1t
00100 i
0
0
-1 0
-1 0
Table 6.i2: Contrasts for estimating parameters in cohort design
Table 6.13: Contrasts for estimating parameters in standard design
Estimator Period
Standard Design
Group Interaction Combined
Tg4
i I
I
0
0-1
-1 10110
None None None
)o None None
0
0
-1
-1
1
1
0-1 1
0 I 1
None
108
Nolv lve can have a look in each stratum at the contrasts for the tlvo designs and
compare them. First, for the standard design, we have the results shown in Table 6.13.
Holvever, these results are not useful, because all information about the tlvo parameters
are in the Period stratum and in the presence of strong or unknolvn period effects, we
cannot estimate the average effects of treatments. In fact this information in the Period
stratum oî 16 and )s in Table 6.13 is not available when we suspect period effects. The
cohort design offers a way to get more useful information for average effects of treatments.
6.7.2 Estimation of (ro,Ào) in both designs
Table 6.8 shows that the information about (ro,Ào) is in Groups (elim. Periods) and
the Interaction strata, for both standard and cohort designs. If we put p: o?1o2, then
combined estimates can then be found as
ï ] I' "{Q'1'+ (r + rp)Q'^'\ u
)r,ro"o+ (1 + rp)e,^,) y,
0
8+(1+3p)
48
-24
1
4(r + p)
(r+2p)10+3p) 1
I2
where T2is a submatrix of matrix ? given by the last two columns of (6.3.3) corresponding
to the pair of parameters (r¡, )¡). This leads to two contrasts which should be applied
to the vector y, and these may be represented by:
1
4(1 + p)
1
îp
_p (1 +p)
p _(1 +p)
0
0
p (1 +p)
p _(1 +p)
0
0
-2p (1 + p)
2p -(1 + p)
(1 +p)_(t + p)
-2p (1 + p)
2p -(1 + p)
(1 +p)_(1 + p)
)r:4(1 + p)
109
o, rlr+ze r*rp I rc.7.2)E,:4n(t+p)|,*,o 2(1 +rol l
Except for the change in the definition of n, these estimates and their covariance matrix
agree rvith the results of Section 3.3.1.
The covariance matrix of these estimates is
for which the estimates are:
6.7.3 Estimates of treatment effect minus baseline
We might want to look at estimates of r¡ - p and rn - þ where ¡-r, is the baseline
measurement. The estimates of (to, Ào) and (rp, Àp) are uncorrelated, and hence the
covariance matrices of 0 canbe found in (6.7.1) and (6.7.2). As we defined our parameters
in Section 6.3 we have
re- þ
rn-F
Àt-tl
Àp-þ
(6.7.3)
/^^\If we put f' : ( O, -p în - tt \¡ - tt \a - tt ), then to get the estimates and
covariance matrix of new vector ri from (6.7.3) we can write
: ro*rn
: To-TD
: ,Ào*À¿
: )o-Ào
0100-1 0
1011 0-1
1
1
0
0
îo
io
'tD
iD
0F
1
-p 3(1 + p) 0
p r-l p 0
-(2 + 3p) l*p 0
_(2 + p) _(1 + p) 0
rt- þ 4(1 + p)
110
TB - l-t
1
4(t + p)
\o- t'1
1
4(r + p)
Àn-tt4(r + p)
These four contrasts should be applied to the vector of means y to estimate the effects
The covariance matrix of 17 can be obtained by using the fact that
Var(f¡) - Ði: FÐ,7',
which gives
5+10p+4pt
3*6p+4p2
1-l7p+4p2
-1 + p*4p2
3*6p+4p2
5*10p+4p"
-1 + p l4p'1*7p+4p'
r+7p+4p'
-l+ p+4p2
6*I4p+4p'2+2p+4p2
-lI p+4p'
l-l7p+4p2
2*2p+4p'6 -f l4p + 4p2
6.8 Limiting estimates in terms of p
In this section we consider the estimates of the parameter of interest when p tends to
zero and infinity.
6.8.1 The estimates and covarÍance matrix when p -+ æ
I1 p -+ oo, that is rvhen o? -+ oo, the subjects are considered as fixed effects and we
obtain the follorving estimates of r¡ and )¡:
0p r* p
-p 3(1 + p) 0
_(2 + p) _(1 + p) 0
rlp 0-(2 + 3p)
-2p r* p 3(1 + p)
r-lp2p _(1 + p)
-2p _(1 + p) Lrp_3(1 + p) _(1 + p)2p
2p _(1 + p) r*p-2p r-lp 3(1 + p)
2p -B(1 + p) _(1 + p)
-2p _(1 + p) l-lp
111
I-1 1 0
1 10-1 1 0
1-1 0
-2 112 1 -1
-2i12-l-1
î'o
Ào
These estimates are orthogonal to both Groups and Periods, and are those estimates
obtained solely from the Interaction stratum in Table 6.8.
Because the estimates of rs and Às are independent of p, then those estimates do
not change when p changes. The covariance matrix of estimators however, tends to
infinity when p -+ æ because the contrasts involve comparisons between as well as
within subjects. For 17, the cohort design gives the estimates:
î,q - tt
rn-þ
1
4
1
4
1
4
À,q- pI2
o 1 3
2 1 I
-2 -i 1
2 -3 -1
1 ù 0
1 1 0
-3 1 0
1 -1 0
1 1 0
1 ,) 0
-1 I 0
-.) I 0
ll2
i,
All elements of the covariance matrix for 17 in this case become large as p -+ oo
because the estimates are not orthogonal to subjects.
6.8.2 Estimates and their covariance l¡vhen p + 0
In this subsection we consider the estimates and their covariance when p 4 0, or in other
words when there is no difference between the subjects. Since the estimate of the pair
(ro,Ào) isindependentof pthenitremainsthesame,butforthepair (rn,Ào) wehave
tt1
2
2 I 1
o 1 .)
2 -3 -1q
-1 1
TD
i,
I
1
4
0
0
1 1
1 1
0
0
111 -1
The covariance matrix of d is
02limXr: -p-+o ' 4n,
4 0 0 0
040000110 012
The result in this case for 4 is given below and shows that the estimates are orthogonal
to Periods, but not to subjects:
0
0
10-1 0
0
0
10-1 0
113
î,a, - p
rn-þ
\-.un
I4
1
4
À¿, - tt
\"-t'
The covariance matrix for 17 in this case is
1
2
0 1O,)
0 1 1
0 1 1
0 -r) -1
5 3 I -13 5 -1 1
1-1 62-1 126
1
oolimp-+o 4n
6.9 Conclusion
By using the cohort design in crossover design, we can recover some information about
(to, )o) and obtain estimates for them within and between subjects. This method opens
the way for a more detailed examination of the topic.
It has been shown that the block effects corresponding to Cohorts and Periods are not
orthogonal to one another. This lvill also be true in more general cases. The resolution of
this in general requires the application of the results of James & Wilkinson (1971) which
0 .) 0
0 1 0
-2 1 0
-2 1 0
0 1 0
0 3 0
-2 -1 0
-2 I 0
0 -1 1
0 I J
0 -3 -10 1 1
t74
allorvs the appropriate partitioning of these spaces. Although the original worl< of James
& Wilkinson (1971) is concerned rvith Blocks and Treatments, rve apply it to a situation
rvith tlvo nonorthogonal blocking factors.
115
Chapter 7
Block structure of cohort desrgns
7.L Introduction
In the previous Chapter, we considered a particular example of a cohort design in which
a second cohort was delayed by one period. It was shown that treatment information
was available in the Cohort (elim. Periods) and Interaction strata.
We will now compare a variety of such designs with the corresponding standard design
in which subsequent cohorts do not have delayed entry.
In cohort designs two basic types of structures are the block structure of the experi-
ment and the treatment structure. The purpose of this chapter is to present an analysis
of the block structure of the general class of cohort designs. We will consider the general
case where each of c cohorts are observed for p periods, but with each successive cohort
entering the study after a delay of one period. Thus, the experiment will occur over a
total of (p + " - 1) periods. Within each cohort we will assume there are g groups of
n subjects, where each subject within a group receives the same treatment regime. We
will show the ANOVA tables for the standard and the cohort designs with a data vector
of length cpgn in terms of the block structure, where g is the number of groups in each
cohort and n is the number of subjects in each group. We show that the differences
between them are only due to differing idempotent matrices for cohort,, periods and the
interaction.
We use the work of James & Wilkinson (1971), because James and Wilkinson's ge-
ometrical formulation provides a way of looking at the , , (p t c - 1) array of means
a
116
and identifying the contrasts for Cohort(elim. Periods) and their expected mean squares.
For this purpose we need to split the contrasts for Cohort(elim. Periods) into different
1 degree of freedom contrasts each lvith a possibly different expected mean square and
then to identify the projector matrix for projecting onto the vector space spanned by the
columns of Cohort(elim. Periods). We then consider several examples in detail. In a
special study we have got the orthogonal contrasts for the three strata for two and three
cohort designs with up to 7 periods in each cohort. For these special cases we obtain the
analysis of variance table in Section 7.3 and by using a function which we have written in
S-PLUS (Venables & Rezaei (1996)) we show their projector matrices and the expected
mean square for each stratum.
7.2 Cohort design in general
The design and analysis of cohort designs for the two-treatment three-period design with
one baseline measurement is carried out in the previous chapter. In that design, it was
found that there was nonorthogonality between periods and cohorts, and this led to using
the Cohort (elim. Periods) sum of squares to provide some of the treatment information.
In this section, we shall describe the block structure of a general cohort clesign and
our response will be supposed a linear function of parameters with a covariance matrix
whose spectral form is governed by the blocking arrangement. The model has c cohorts,
p periods in each cohort where each successive cohort starts one period later than the
previous one, such that the design has a total of pl c- 1 periods. Within each cohort we
have g groups and within each group there are n subjects. These groups will eventually
each be assigned a different treatment regime, but the analysis in this Section will rely
on the fact that the ng subjects in a cohort are randomly allocated to these groups. The
design is shown in the following layout in Figure 7.1.
I17
Figure 7.1: Fu1l cohort design length cpgn
I
p
I
1
c
2o
c
Ic-1
g
c
g
p+c-1
In this and subsequent tables, we suppose that the data are written in standard
order, running over the last subscript first, where the data values are given as U¿jtt.,
for the ith cohort (ó: I,2,...,c), jth period (j :I,2,...,p), kth group in ith cohort
(k : 1,2,.. . ,g), and /th subject in kth group (l : I,2,. .. ,n). Data from the experiment
will be denoted by y, an cpgnx 1 column vector, with mean E(y) and variance-covariance
matrix Q
We consider a model which can be represented as
U¿it t : tt * l¿ * r¡ I þ;m I r7jkq + eiikt (7.2.r)
where þ, 1¿ and zrj are the grand mean, ith cohort, jth period effect respectively, and B¿¡,¡
is subject effect from the /th subject in kth group in ith cohort and rç¡nt¡ at this stage
is used to describe the set of treatment effects applied to the /th subject in kth group in
jth period in ith cohort. In this model, the þ¿m are independent random variables which
are normally distributed with mean 0 and variance o! and are independent of the e¿¡¡¡
which are independent random normal variables with mean 0 and variance o2.
118
7.3 Analysis of variance
In the previous section some general structure was laid dolvn for the general cohort design.
Now in this section rve lvill provide the analysis of variance tables for both standard
and cohort designs which simply identifies the components in the block structure and
ignores, for the moment, the treatment structure. This same model could also be used
for the standard design in which there were no delays from one cohort to the next. The
data vector y of length cpgn, written in standard order, can be partitioned into various
components, corresponding to the terms that can be identified in the linear model in
(7.2.1), and the expected mean squares determined. The general ANOVA for the standard
design is shown in Table 7.1.
Table 7.1: ANOVA table for standard design, no treatment terms
If we apply the model to the cohort design shown in Figure 7.1, the matrix for periods
becomes more complex, since there are now (p+ "- 1) periods. As noted in the previous
chapter, periods and cohorts are no longer orthogonal to one another. However, groups
within cohorts and subjects within groups are still orthogonal to periods and hence we
can rvrite Table 7.2 lor the cohort design.
Source Idempotent matrix Degrees of Freedom trMS
GM J.ØJpE)JsØJ" 1
Period J"ØKpØJsØJ" p-1.
Cohort
Group within Cohort
Subject within Group
K"ØJpØJsØJ"
I"ØJPE)KgØJ"
I"ØJpE)IsØK"
c-l
"(g - 1)
cs(n - 7)
o'+ po?
o'+ po?
o, + po?
Subjects cgn-I
Period x Cohort
Period x (Group within Cohort)
Periods x (Subject within Group)
I("ØKeØJsØJ"
I"Ø I{e Ø Ks Ø J"
I"Ø KpØ IsØ K"
(c-1)(p-1)c(p-lxg-t)c(p-1)e(n-1)
)o
02
o2
Periods x Subjects ("gr- tXp- t)
Total cpgn
119
Source Idempotent matrix Degrees of Freedom EMS
GIVT J"ØJpØJsØJ" 1
Periods (ig. Cohort) StØJgØJ" P*c-2Cohorts (elim. periods)
Group rvithin Cohort
Subject within Group
s,&J^ñ¿I"ØJpgI(eg"I"I"ØJpØIsØK"
c-Ic(g - 1)
cs(n - I)o
o
'+po?
'+po?Subjects cgn-l
Period x Cohort
Period x (Group within Cohort)
Periods x (Subject within Group)
S¡8Js8"f"I"Ø KeØ I(s Ø J"
IcØI(eØIsØI("
(r-1)(p-z)c(p-r)(s-t)c(p-1)s(n-I)
o t
o2
o2
Periodsx Subjects ("sn- t)(p- 1) - (c- 1)
Total cpgn
Table 7.2: ANOVA table for general cohort design, no treatments term
In Table 7.2 we consider the projector matrix which takes out Period effects from the
mcdel and then consider a stratum named Cohort (elim. Periods) which is orthogonal
to Periods (ig. Cohort) stratum. As we can see, many lines in Tables 7.1 and 7.2 are
the same. The differences occur only in Periods (ig. Cohort), Cohort (elim. Periods)
and Periods x Cohort interaction. For these three lines, the idempotent matrices always
end in Js Ø Jn, indicating that we can average over all subjects within a cohort. Thus,
to get a block structure for the general cohort design we need only consider the vector of
(Cohort x Period) meansof length cp. InTableT.2,Sl,,92andSsareprojectormatrices
for Period (ig. Cohort), Cohort (elim. Periods) and PeriodxCohort interaction applied
in each case to the vector of Cohort x Period means of length cp. These throe projector
matrices add together to giveidempotent matrix (I - J) of dimension (cp- 1). Wewill
sholv how these can be presented in the next section.
7.4 Fitting periods and cohorts
We nolv consider just the table of Cohort x Period means with length cp, as shown in
Figure 7.2. Considering this in standard order across the rows, we get the model
y : [rl * Pn -f Cl * Tr I e, (7.4.1)
t20
where Zr considers lvhatever treatments are applied and e indicates the random effects
due to subject and error. In this case, each mean will nolv have a variance of (o2 -f o!) I gn,
with observations in the same cohort having covariance "?lg".We rvill ignore the factor
gn in this section, for convenience, and regard each value as a single observation rather
than as a mean of gn values. The variance-covariance matrix of e can then be lvritten
Var(e): o2(1"Ø Kp) * (o'+ p"?)U.Ø Jr).
Figure 7.2: Full cohort design, length cp
+ P----------------
1p
2
c
c-1
c
plc-1
For the Cohort x Period table of means we can identify the components in the block
structure. This will provide five strata whose orthogonai projector matrices can be shown
in the following;
. ./" 8 Jr lor gland mean;
o ,9s : K"Ø Jp for Cohorts ignoring periods;
r 51 : P(P'P)- P' - J for Periods ignoring Cohorts;
o 52 : X(X'X)- Xt - P(P'P)-P' for Cohort eliminating periods;
. Ss : I - X(X'X)- X' for Interaction stratum;
rlwhere r: lP : C ). Notethat C:1"81o, and (X'X) is singularwithrankof
(p+2"- 3). In this formulation, the .9; determine the block structure of the cohort design
1.21
experiment and with respect to the usual Euclidean norm divide R"p into subspaces
called strata. We can think of the columns of C and P as generating subspa,ces of R"p
withdimensionc-1andp+c-2respectively,andcorrespondingidempotentmatrices
So : I{"8,,I0 and Sr, after removing the grand mean. These subspaces are not orthogonal
to each other. To explore the relationship between these subspaces, we turn to the results
of James & Wilkinson (1971).
7.5 James and Wilkinson theorem
In the following section we summarise a geometrical approach to the analysis of nonorthog-
onal designs generated by projection matrices which was initially worked out by James &
Wilkinson (1971). Such a geometrical approach has quite general application to problems
in experimental design. In our cohort design, due to the nonorthogonality of two strata
in the design, we need to use the result of the above mentioned paper.
James and Wilkinson show that, if the projector matrices onto the spaces U and V.
with dimension (q,*),such that e ) nù, are U and V then there exist spaces
UtrUzr.. . rUkrUk+t,
Vt,Vzr., ,,Vk,
and the projector matrices
Ut,Uz, . . . ,UnrUt+,
VtrVzr... rVk,
such that rank(tJ¿) : rank(V) : r¿, D!=tri : rn and lfjl ri : et such Lirat,
U¡V¡ : 0 (i+i), (7'5'1)
U¿U¡ : 0 (i+i),V¿V¡ : 0 (i+i),
U¿VU¿ : p;Ur, (7.5.2)
VU¿V
rvhere 7t pr >...and if a¿ is such that cos2 aà: pi,t then a¿ is the angle between the projectionU¿y of any
r22
vector into U¿ and the projection l/¿y of that same vector into V¿. Nolv in the next section
lve show holv lve can get these pi's as the canonical correlation coefficients betlveen the
columns of C and the columns of P.
7.6.L Canonical variables
In the analysis of cohort designs rve use the concept of the canonical correlation of two
canonical variables. I(rzanowski & Marriott (1994) develop this idea as follows. Suppose
P and C are N x q and ,Ä/ x m respectively, and suppose the variance-covariance V of
the rorvs of X : lPicl is partitioned as
l,/ -Vt
V,,
Vz
V,,(7.5.3)
so that V11 and V22 are the variance-covariance matrices of the columns of P and C
respectively, while Vn contains the covariances between the columns in P and those in
C. In particular, we can show that 7rr : þrP'Q - J)P,Vtz: ;\P'Q - J)C, and
Vzz : ;+Ct (I - J)C , where (I - J) in each case is N x N. Now consider the correlation
between any linear combination Pa of the rows of P and any linear combination Cb of
the rows of C , namely
,:-L.'ffi' (7'5'4)
Next, choose a and b to maximise this correlation. This is equivalent to maximising
1lF -- a'Vpb - uÀ@'Vn
a - 1) - |ub'Vzrb - l)
where À and LL are Lagrange multipliers, sincewe can choose a and b to make a/V11a and
b'V22b: 1. Diflerentiating with respect to a and b gives
AFAo
: l/1rb - ÀVø : 0, (7.5.5)
AFAb:Vzta-þVzzb
:0.
Substituting for b gives
V12V22rVva - À¡L,V;1a: 0. (7.5.6)
This equation has non-trivial roots only if
lV12V;tVr - pVtl: o
123
\(.Ð.( )
rvhere p : ÀF,. Let the values of p, the roots of the determinantal equation (7.5.7),
be denoted pr,. . . , pq, where some of the roots may be repeated, and let the vectors
corresponding to these be a1 r...râqr since we can scale these vectors so that alV11a: 1.
It follows from (7.5.6) that
b¿ : (ll¡1")VrrtV21a¿,
for i : 1,. . . , q, and that, by the properties of eigenvectors of symmetric matrices,
al¿Vta¡ :0,
for all i + j. Again, from (7.5.6),
a'¿I/rzb¡: lalVrra:' : 0,
and, similarly,b'¿V22bj : 0, for i I j.
The vectors Pa1 and Cb1, corresponding to the largest eigenvalue pt are called the
first canonical variates and, from the definition (7.5.4), pr represents the correlatìon
between these two vectors. Subsequent roots will correspond to other pairs of vectors a
and b which have the property that they maximise the correlation subject to Pa and Cb
being orthogonal to each of the earlier canonical vectors as demonstrated above.
Suppose now that we choose P to have (p+ "- 2) linearly independent columns and C
have (c - 1) linearly independent columns, where in each case the columns are contrasts
in the sense that (1 - J)P : 0, (1 - J)C : 0. Then the matrices 7rr and V22 will bç
positive definite and the roots of lVzVnrVn - pVttl : 0 at" non-negative. If we assume,
without loss of generality, that e ) m, there are exactly rn : (c - 1) non-zero roots,
which may well have repeated roots among them. The remaining q - nL : p - 1 roots
willbezeroandcorrespondtoasetofvectorsã.¿,i:c,...,p*c-2whichspanapart
of the Cohorts space which is orthogonal to Periods.
The vectors Pa¿ and Cb¿ now each form a set of orthonormal vectors in the space of
contrasts, and they provide a partition into orthogonal one degree of freedom contrasts
of the Periods(ig. Cohorts) and Cohorts(ig. Periods) strata, respectively, fulfilling the
conditions of the James and Wilkinson model. The only difference is that we have not
collapsed repeated eigenvalues into subspaces of higher dimensionality. The projector
matrices for these one dimensional spaces are then Pa¿al¿Pt and Cb¿biC', respectively.
124
To obtain the vectors rvhich span the Cohorts(elim. Periods) space, we need to look;
for each i, at the linear combination of Pa¿ and Cb¿ which is orthogonal to Periods, i.e.
to Pa¡. This can easily be shon'n to be the vector
I(Cbn - p¿Pa.i),
1-p?
which has been normalised to length one. This will be established more formally in the
next section.
7.6 Expected mean squares of each block structure
We choose to use the Cohort (elim. Periods) to capture treatment information, so we
need the expected mean squa e of Cohort (elim. Periods) stratum and we need this for
each degree of freedom contrast. In this situation we refer to equation (7.4.I), and we
know that the projector matrices, that is Si(i : 0,1,2,3), will change as p and c change.
We also know that
v ar(y) : o'1", + po:(I" Ø Jr) (2.6.1)
We have now a situation analogous to that of James and Wilkinson. In the space r?"p,
we have two subspacesU, for Periods(ig. Cohorts), and )/, for Cohorts(ig. Periods), such
that these two subspaces are not orthogonal to one another. As we know the respective
projector matrices for these subspaces are 51 and .90 : 1(" I Jp, of rank (p * c - 2) and
("- 1) respectively such that pf c-2 > c- 1 for p > 1. Based on the result of James and
Wilkinson's paper it is possible to find projector matrices Ut,Uz,...Un+, in Periods(ig.
Cohorts) and I{ ,Vz, . . .V* in Cohorts(ig. Periods) satisfying the results \n (7 .5.2) and with
ranks rr,r2,,...rrk,r¿11, such that, in our case Ð!=rrt -- (c- 1), r¡+r - p- 1. r/úe also
can get the subspaces and the quantities p¿ by orming the canonical correlations between
sets of columns spanning the two subspaces. Let P be any set of linearly independent
columns spanning U and C be any set of linearly independent columns spanning )/. The
canonical correlations pi can be obtained by finding a¿ and b¿ such that
Pt maxat,bl
maxa2,b2
Corr{Pa1, Cbt},
Corr{Pa2,,Cbr},Pz
725
Pc-r max Corr{Pa.-r, Cb"-r},ac-l rbc-1
subject to theorthogonality conditions all - bl : 0,a¿P'P'aj:6¿t andb'rC'Cb, : ôuj,
rvhere ô¿¡ is the Kronecker delta. There is also an additional set of vectors âc, . . . ,,àc*p-2,,
satisfying the same conditions.
If P and C are the Helmert contrasts on the Periods(ig. Cohorts) and Cohorts(ig.
Periods) respectively in the general cohort design, and if we make each such column
orthogonal to the vector of ones, we can obtain the canonical variables for Cohorts(ig.
Period) and Periods(ig. Cohorts). \Me can then write
xe : P x [a1, ã2¡...,,ar¡.-z)
: I pr, Pz., ,Pp+"-zf ,
: C x [b1, bz,. . ., b"-r]
tlI Clt Czs ,t Cc-l l¡
: lpr, pz, ,p.;f,,
(7.6.2)
x"
( 7.6.3)
where, from the earlier results, c'¿cj : p'¿p¡ :6¿j, C¿pj: 0 for i + i, arrd c'¡pt¿: pi, Note
that p1 , Pz, . . . , P"-L may include repeated values.
To obtain the projector matrix onto Cohort (elim. Periods), we can use the combined
(Cohort, Periods) space and the projector matrix ^91 for Period (ig. Cohort). Now the
projector matrix onto the (Cohort, Period) space,,S12, is given by
s,, : I x, " ] i';,:,';,:l' |
-:,)
Since XiXo : Ip¡.-z ar'd XlX" : Ic-t and if we put XLX, : fdiag(ptt. . .,, pc-r),01"-r¡"10-r)] :D, say, then the projector matrix for Cohort (elim. Periods) is
p
X;
Xt"xpSz: Sn- tt :
Ix"
-1Ip+"-z D'
D I.-t - xrx'r,
Now, using Rao (1973, p.29), rve have
IDl I + D'E-t D
-E_ID-D'E_T
E-rDI
-1
726
where
ø : (I - nD')ç-r)x(c-r) : diag I-p 2It t-p7, | - p7_,
Then, it follorvs that
Sz Xp X" ll' *_o,,:, : -';:,-'
] I ; )-',*,ll":_,: -':,-' I lî:lx.
x"
lx" - XrD'
xp
xe
where
Pt0 .0.0, P.-r
.0
Pz
xp Pt¡ ...r Pp*c-2 PtPt ¡ Pc-rPc-r
0
Thus
c-l I
Sz : Ð r-ìf c¿ - p;p¿)(c¿ - p¿p)'.i=tr-Pi
We note that (c¿ - p¿p¿)'p¡ : 0, for all j, so that this space is indeed orthogonal tó
Periods. Then Cohort (elim. periods) has (c - 1) degrees of freedom and they can be
given individually by the rank 1 projector matrices
Cn: -\(.¡ - p¿p¿)(.¿ - p¿p¿)'.| - pí' '
The sum of squares for the ith of these contrasts is just y'C¿A. The expected value of this
will typically include treatment components,, so we obtain
E(y'coù : tr{c¿(o2 1", + po?I"8 /o)} { rtT'c¿Tr
: o2tr(C¿) ¡ po!tr{C¿(/" ø "rr¡1 I r'T,C¿Tr
: o2 ¡ poltr{c¿(I.g Jo)} ¡ r'T'c¿Tr.
t27
D'
0
0
0
0
To further simplify this, we note that the orthogonal projector matrix for Cohort(ig.
Periods) is c_r
X"(X:X.)-tX'":Ð"n"'ni=l
and we also know that this orthogonal projector matrix can be given as
{(1 - J)"8 Jr} : I"Ø Jp - J"Ø Jp
Therefore, it follolvs thatc-l
Ð"n"'n: 1" I J, - J"Ø Jr,
OI
i=I
c-1
i:r
Since C¿l :0,
Now on the basis that
where ô¿¡ is Kronecker delta given by
1tr{C¿(1.8 Jo)} : trl c¿ - p¿p¿)(c¿ - p¿p¿)tl c¡C¡).r-p7 j=r
I"Ø Jp: t c¿c'¿-f J"Ø Jp.c-1
i=l
clc¡ : 6¿¡, p'¿c¡ : p¿6¿r,
c-1
6¿t:1 i:j0 i+i
lve have
tr{C¿(I"S /r)i : t]-t ("; - p¿p¿)'"n}' : (t - p?),r-píso that
E(v'c;v) : o2 rp(1 - p?)"? I r'T'c¿Tr.
Thus the expected mean square for each contrast may be different, according to the
degree of correlation between that contrast and the appropriate contrast in the period
stratum. Overall, the expected sum of squares in the Cohort (elim. Periods) stratum is
c-1E(S Sc"P) D[o'+ p(l - pÐ"3] +lr'T'C;Tr
(c - t)o2 + po? Df t - p?) +lr'r'c;rrc-l
i=1
i:1c-l
i=1
(c - t)(o2 + po) - po?D p? +lr'T'C¿Trc-7 c-I
t28
i=l i=l
For the expected sum of squa es for the Period (ig. Cohort) stratum, we know that the
projector matrix is 51 : L',Ji-' p¿p'i, so we can lvrite
E(S SPic) : Eltr(S1yy'))
: tr['1{o21", + po?U" E) /o)}] ¡ r'T'SrTr ¡ n'P'S1Pr
: o2tr(st) ¡ poltrlsl (1" a .re)] ¡ rtTtslTr I r'P'stPr
: (p + " - 2)o' + po?trlSt(I" 6l Je)] + r'T'StTr ¡ r' P'SyPr,
since trace and rank of any idempotent matrix is same. It follorvs that the expected mean
squa e has a terrn o2 but now has a component involving ø"2, indicating that some of the
cohort differences have'leaked'into the period stratum.
Finally for Cohort (ig. Periods) we know that the projector matrix is the same as
projector matrix for standard design that is, lve have So: I{" I ./0. Then we can write
E(S SciP) : E(a'Soy) : E[tr(Soyy')]
: trlss{o21" + po?U" s /r)}] ¡ r'T'soTr I r'P'ssPr
: o2tr(so) ¡ poltrlso(1" ø "lo¡1 ¡ r'T'soTr ¡ r'P'ssPr
: (c - t)oz + po2,tr[(I{.Ø Jò(1. S Jo)] | rtTtSsTr ! rtptsspTt
: (c - l)o2 + po?trl(It" Ø Jr)l ¡ r'T'SoTr * r'P'SoPn
: (c - l)(o2 + po2) + r'T'SoTr ! r' p'Sspr,
Since there are (c- 1) degrees of freedom. the mean square for Cohort (ig. Periods) has
expected value o'+ po?, plus whatever treatment and period effects are present. Any
contrast of length 1in the Cohort (ig. Periods) stratumhas varianceo2¡po!, but cannot
generally be used for estimation since it contains the (unknown) period effects.
7.7 Description of a function cc in S-PLUS to get
the result of James and Wilkinson's theorem
To see the result of applying James and Wilkinson's theorem,, we use a function cc written
in the S-PLUS program and listed in Appendix B. The function requires as input the
value of p, the number of measurements on each subject, and c, the number of cohorts.
Columns P and C are formed giving the levels of periods and cohorts coffesponding to
129
the vector of cp means y written in standard order. From these, design matrices Xo and
X" are created. In the second stage,, for any cohort design, the Helmert contrasts for
p*c-2 and c- I degrees of freedom forboth the Period(ig. Cohorts) and Cohorts(ig.
Periods) strata respectively are created. Initially, due to the unequal replication of the
periods, the Helmert contrasts are not orthogonal to the grand mean. The use of the
function cc produces columns which are true contrasts. In the third stage, the function
cc gives us the two sets of linearly independent columns labelled P" and C" spanning the
spaces due to periods and cohorts respectively and satisfying the James and Wilkinson
theorem.
To show these stages we refer to the special case of a cohort design with c: 3,p:3,which is shown in Figure 7.3 and for which observations are made over a total of 5 periods.
Figure 7.3: Cohort design for three periods in each cohort3
4
As expected in this case, we have four Helmert contrasts on the period stratum, say
Pt,,. . ., Pa, and two Helmert contrasts on the cohort stratum, say C1 and C2. These are
shown in Tables 7.3 and 7.4 respectively.
Pz
1 -1 2
-1 2 0
2 0 0
P4
-1 -1 -1
1 1 1
1 1 4
Table 7.3: Helmert contrasts for Periods
Pz
1 I 1
1 -1 ,)
-1 r) 0
2
5
Pt
-1 I 0
I 0 0
0 0 0
130
Ct
-1 -1 1
1 1 1
0 0 0
Cz
1 I -1
1 1 -1
2 2 2
Table 7.4: Helmert contrasts for Cohorts
The Helmert contrasts given here for Periods are not, as they stand, orthogonal to
the grand mean and hence an additional step is required to make this so. Norv in the
next stage the function cc gives us orthogonal contrasts for Cohort (ig. Periods) and
Periods (ig. Cohort) and since these contrasts are not orthogonal to each other we
use the result of James and Wilkinson's paper. Function cc considers C as any linear
independent columns spanning the space due to Cohort (ig. Periods) stratum by vectors
of C1,, Cz and.P also as any linear independent columns spanning the space due to Periods ,(ig. Cohort) stratum by vectors of Pyr. . . , P¿. Now, we consider a new stratum named
Cohort (elim. Periods) which is orthogonal to the Periods (ig. Cohort) stratum. To reach
this aim function cc gives us the canonical correlation coefficients and a set of vectors a
and b. For example, the vectors of a and b and the corresponding canonical correlation
coefficients, p, associated with them are in this case:
$a:
[, 1] l,2f
[t,] o .204 -0.354
lz,f 0.204 0.118
$¡:
[, t] l,z) [,3] [,+]
[t,] 0.r44 -0.433 -0.401 o.o7T
12,f 0 .744 -0.048 0.045 0.231
[9,] o .r44 -0.096 0.089 -o .orT
[+,] 0.144 o.11s -0.107 -o. o1s
$rho:
t1l 0.707 0.408
When these columns are applied to the matrices X" and Xo, respectively, we obtain the
contrasts (.o,p) as in James and Wilkinson theorem. These span the Cohort (ig. Periods)
and Period (ig. Cohorts) strata respectively. For the Cohorts (elim. Periods) stratum, it
is necessary to calculate (c¿ - p;p¿). All these contrasts are shown in Table 7.5, although
they are not shown standardised to length 1.
131
c p t - p? Cohort(ig. Periods) Periods(ig. Cohort) Cohorts (elim. Periods)
3 3 rl21
0
1
I
0
1
-10
1
-2-1
0
1
0
1
0
1
2
0
1
2
1
0
1
2
1
0
516
1
2
1
1
2
1
1
2
1
2
1
0
I
0
1
0
1
2
03-3 -423
2
-t)
0
4
-51
5
1
4
1
4
-5
-2-1
tt
1
3
2
,l
2
1
Table 7.5: Contrasts for three cohorts and three periods
For convenience, we write the CohortxPeriods aiiay as a rectangle in which cohorts are
represented by rows and successive periods are represented by successive reverse diagonals
of the aray; that is, the sets of value running along diagonals from top right to bottom
left. It can then be seen that the period contrasts are in fact contrasts between periods,
while the Cohort (elim. Periods) contrasts are orthogonal to periods.
7.7 .I Projector matrices
For this design we have
r ,9s : Ils I ../s for Cohort (ig. Periods)
r ^91 : Ð|=tp¿p'¿ for Periods (ig. Cohort)
o 52 : D?=t +(ci - p¿)(ci - p¿p¿)' for Cohort (elim. Periods)
r ,93 : (Is A 1r) - (J,6 "f.) - (S, 1Sr) for PeriodxCohort interaction,
where it is assumed that both q and p¿ are standardized to length 1.
t32
7.8 To get a pattern on p and the Cohort (elirn. Pe-
riods) contrasts
To get a pattern on p and the contrasts for the Cohort (elim. Periods) stratum lve have
written the results in Table 7.6 for c -- 2 and Table 7.7 for c : 3. For c : 2, there is one
contrast for Cohorts (elim. Periods), and it is obtained by deleting the values in the first
and last periods and taking the average difference between the other means. This has
(1- p'): (p-l)f p, so that the expected mean squa e for this contrast is o2 +(p- l)"?,
plus lvhatever treatment terms it estimates
c p t-p? Cohort(elim. Periods)
2 2 0.5000 1
1 0
2 3 0.6670 -1 1
1 1 0
2 4 0.7500 -1 1 1
1 1 1 0
2 5 0.8000 1 -1 1 1
1 I 1 1 0
Table 7.6: Canonical correlation coefficients and orthogonal contrasts for two cohorts and
various periods
When c : 3, there are two such contrasts, with (t - p?) equal to 1 - 3l(2p) and
t - t lQe). The two contrasts have a consistent pattern, representing, respectively, a
linear and quadratic contrast of the cohorts through the middle p - 2 periods, withslight adjustments at the ends, and having expected mean squares of o2 + (p - t)o! and
o' + (p - I)"?, respectively.
2 6 0.8330 -1 -1 1 1 1
1 1 1 I 1 0
2 7 0.8570 1 1 1 -1 1 -1
I 1 1 1 1 1 0
2 8 0.8750 1 -1 -1 1 1 -1 -1
1 1 1 1 1 1 1 0
133
0 -1 0 1
1 1 i 13 2 0.250 0.750
1 0 I 0
0 -1 o 0 ó 2
1 0 -1 Ð -4 -33 3 0.500 0.833
2 1 0 2 3 0
0 -1 2 2 0qd 2 2
1 0 0 1 -.) -4 -4 -ó3 4 0.625 0.875
2 2 I 0 2 2 3 0
0 1 , Ð Ð 0 ð 2 2 2
1 0 0 0 I -.) -4 -4 -4 -ù3 5 0.700 0.900
2 2 2 I 0 2 2 , .) 0
0 3 2 2 2 2
-3 -4 -4 -4 -4 -3
0 2 2 -2
0 0 0
2 2 2
-1
1 1
1
3 6 0.750 0.917
Ð
2
0
2 2 2 2 e) 0
2
-4 3
0 2 .)0 3 2 2 2 2
0 0 1 3 -4 -4 -4 -4
2 2 1 0 2 2 2 .)
1
I
-2 -2 -2
3 7 0.786 0.929 0
2
0
2
0
2 2 3 0
cp r- p? t- p3 Cohort(elim. Periods) Cohort(elim. Periods)
Table 7.7: Canonical correlation coefficients and orthogonal contrasts for three cohorts
and various periods
7.9 Conclusions
The above analysis provides a set of contrasts among the Cohorts, orthogonal to Periods,
which will enable estimation of treatment effects. Each contrast will have a different
expected mean square, so that the treatment information from each normalised contrast
will need to be combined with a different weight. Furthermore treatment information
will be available in the CohortxPeriod interaction, and each normalised contrast in that
space will have variance o2. In the next chapter, we consider some particular designs and
demonstrate how much information is available and how it can be recovered.
134
Chapter 8
Treatment structure of cohort
designs
8.1 Introduction
Having examined the block structure of cohort designs in the previous chapter, we now
turn our attention to treatment structure. We consider the specification of linear treat-
ment models when treatments are applied to the experimental subjects, and we get
formulae for the analysis of cohort designs which have a linear treatment model and a
non-orthogonal block structure. To extend our approach to analyse crossover trials effi-
ciently as we did in Chapter 6, we apply the cohort design methodology to a number of
different crossover trials . In Section 8.2, we describe how to obtain treatment estimates
from cohort designs, and in Section 8.3 we extend the results of Chapter 7 to the case
where there are g groups of subjects in each of the cohorts, the idea being that each group
within a cohort is given a different treatment regime. Subsequent sections then consider
a number of possible designs and compare the results with those obtained earlier in this
thesis for the corresponding standard design.
135
8.2 Treatment structure for cohort and standard de-
signs in general
To shorv the efficiency of the cohort design and the results of James and Wilkinson's
theorem in a nonorthogonal block structure, we estimate the treatment effects in a cohort
design and then compa e it to a corresponding standard design. To refer to previous
chapter we now consider the general cohort and standard designs with length cpgn, for
both designs, where there are c cohorts, g groups within each cohort, and each of the rz
subjects rvithin each (cohort, group) combination is observed for p periods. Since the rz
subjects within a group all receive the same treatment regime, we can consider just the
vector of length cpg which consists of the group means at each time. In terms of seeing
where treatment information resides, we can without further loss of generality consider
the case where n -- l.
We assume that the design has s treatment parameters and the linear model for
treatment and grand mean for both designs is expressed in Chapter 6 in equation (6.3.1).I l/
Suppose ds - | U 0' I represents the parameters other than the group and period
parameters, and suppose that the vector of treatment effects d has s elements. Let
?s represent the corresponding design matrix and suppose that its (s * 1) columns are
linearly independent.
The vector á might include ro¡ rD, )6, and À¿, which we introduced in Chapter 6.
However, we want to consider at this stage a general set of treatments which might
even include factorial contrasts, for example. To consider what treatment information
is available, we take the matrix ?6 for treatments in a model with length cpg. Then
the information matrix overall it \fo when the grand mean is considered. Sone of the
columns of 7s corresponding to particular parameters may be orthogonal to the 1 vector.
Since the estimate of the grand mean p does not provide any useful information about
treatments, the only usable information is contained in the bottom right-hand s x s
submatrix of f[Q - J)To. If we let f. : I L fö ], u.r'a define
T : (r _ J)T;,
then the required information matrix for 0 is TtT.
If we split the observation vector into several strata, using orthogonal idempotent
136
matrices Qo., Qr, . . .Q x where D!:o Q n: 1, then we can rvrite
u:(Qo-fQtI...+Q¡)v,
where Qog.,QtU,. ..Qny are components of the variation among different strata, and the
treatment information for the ith stratum isT'Q¿T.In the case where Var(e) - o2I,
0: çT'T¡-|T'y,
but if each stratum has variance matrix 6¿Q¿ then the combined estimate is
k
B : {D 6¿rT,giT}-lt 6ntT,gny
andk
V ar(O) : iD 6;LT' q¿Tj-r,
where, as in previous chapters, the summation is made over those values of i for which
we believe we have useful information. As before, we would exclude components such as
the grand mean (i : 0), and strata involving period efiects since they can not generally
be well estimated or a variance coÍrponent determined. In the case of a cohort design,
we would generally not try to use information from Period(ig Cohort) stratum, but the
information from the Cohorts(elim. Period) stratum would be used.
8.3 Projector matrices for cohort and standard de-
signs in general
We know that the projector matrices for the standard design, in the case ?? : 1, are as
shown in Table 8.1.
And for the cohort design we have the projector matrices shown in Table 8.2 where
S¿ are cp x cp matrices defined in the previous chapter.
t37
Stratum Projector matrices for Standard Design
Periods J"Øl(eØJs
Cohorts
Groups within Cohort I"Ø Jp Ø I(s
Period x Cohort I{"ØI(eØJs
Periods x (Groups within Cohort) I"Ø KpØ I{s
Table 8.1: Projector matrices for general standard design
Stratum Projector matrices for Cohort Design
Periods (ig. Cohorts) SrØJs
Cohorts (elim. Periods) SzØJs
Groups within Cohort I"Ø JpØ Ks
Periods x Cohort SsØJs
Periods x (Groups within Cohorts) I"Ø Ke Ø I{s
Table 8.2: Projector matrices for general cohort design
\Me note that in Cohort (elim. Periods) in the general case the information is contained
in (c - 1) separate l-dimensional subspaces and is given by
T'(CuØ Js)T (i:1,...,c- 1)
where C¿ can be interpreted as d¿d'¿ such that d,¿ is a contrast of length 1. Then the
projector matrix for d¿ is
d' ¿(d''¿d'¿)- |
d,'¿ : d¿ d¿,
where
d,i: -+k¿ - p¿p¿),
lr-Piand c¿,p¿ and pi are as defined in Chapter 7.
T'(Co Ø Js)T T'(didi Ø Js)T
!T'@,0d,',8 1r.')?gI
ir'@,8 1)(d; I 1)'7
(8.3.1)
1
?,'g-uiw
138
It follorvs that the information in the ith l-dimensional subspace of Cohort (elim. Periods)
is given by jw;wi, rvhere LD¿: T'(d¿Ø 1). Correspondingly, the projector matrix for
Periods (ig. Cohorts) stratum i. Ðf]'-'p¿pt¿, and since the length of p¡'s are I then the
information matrix for this stratum can be lvritten as
P*c-2T'{5, Ø Js}T : r'{ Ð P¿P'¿ Ø Jn}T (8.3.2)
z'{ Ð (ro ø t;1oo 81)'}"1
g
1
i=1p*c-2
i=lP*c-2
D q¿q'¿,
i=lg
where q,i : Tt(pi Ø I).
If we have n ) 1. subjects in each group, then the ANOVA's will have two more lines,
corresponding to Subjects within Groups and Periodsx(Subjects within Groups) strata.
As before, these will contain no treatment information, and will provide estimates of o2
and o2 + paz, respectively.
8.3.1 Estimates of parameters and covariance matrix of esti-
mates
To get the estimates of parameters we combine the estimates of parameters from those
strata in which there are some information about parameters. Thus we can write
â : {:r l;''(c¡ I ")')- å [ä''(8' ø
")"]] (8 3 3)
{= [;' ,(c¡ Ør,r] * Ð-lir,,o,*
r")] ],,
and covariance matix of d is
var(g): {Ë lï,''"'s /,)r] - å i;t '(Q¿ Ø",t]} (8 3 4)
where \j : o2 + p"?(l - p3) (j : 1,2,...,"- 1) and ds - du : o2 and 6s: o2 + po?.
In the rest of this chapter we examine the result of James and Wilkinson theorem in
some special cases of standard and cohort designs.
139
8.4 Treatrnent inforrnation for two-treatrnent crossover
design with two baseline measurements and a
corresponding cohort design
We nolv examine the treatment information for the two-treatment, crossover with two
baseline measurements, one before the first treatment period and one after this period.
As Kenward & Jones (1987b) concluded, one reason to use two baseline measurements in
the 2 x 2 crossover designs is to increase the power of test for the presence of carryover
effects by splitting the sum of squares between-subjects into various components which
can be used to analyse these designs. Freeman (1989) gives us another reason to take the
baseline measurements in the 2 x 2 crossover design. He and others have suggested that
use of baseline measurements, leads to a unique resolution of the estimation difficulties
which we face in the simple crossover designs. In this section, we consider the consequence
of the use of baseline measurements and in the next subsection we look at what happens
if we withdraw the canyover effect in this design. These extra periods give the design
which is shown in Table 8.3.
Period
1 2 ù 4
Group 1
2
A B
B A
Table 8.3: Two-treatment crossover design with two baseline measurements
Now in this study we consider a cohort design with c : 2., p: 4 and g : 2 but n : 1,
then the layout of design is shown in Table 8.4 with a delayed entry at period 2 for
the second cohort. Subjects within a cohort would be randomly allocated in equal-sized
groups to the treatment regimes.
140
Period
1 2 ù 4 5
I
2
3
4
Group
A B
B A
A B
B A
Table 8.4: Cohort design in two-treatment, two-period crossover design with tlvo baseline
measurements.
By referring to Table 8.4, we first write out the 16 expected values in the following
table which could be applied to either the standard or the cohort design. Using the
notation of Chapter 5, the expected values in each cell are as shown in Table 8.5.
ll IL+ro+rD ¡;*Ào*)o p,lro-rnp p,*rro-ro p'*Ào-Ào p,+ro+rD
p p+ro+rD ¡r*Ào*)p p'Iro-rop p'lro-rn p*)o-)o p,+ro+rD
Table 8.5: Expected values of two-treatment, two-period crossover with two baseline
measurements when we consider the cohort design or double standard design.
Our purpose in the two following subsections is to get the treatment information for
this case. We now first consider the model which includes the first-order carryover effect
and in the next subsection we analysis the model without the first-order carryover effect.
t4r
8.4.L Treatment information when the first-order carryover is
present
If lve 1et y be a vector of observations corresponding to the 16 expected values written in
order across the rows, then by referring to the linear model expressed in 8.5 we have
To: Iz Ø
T,T :
00 0
00 0
10 1
10-101 0
01 0
10-110 1
1 0
0
0
0
1
1
0
0
1
1
1
1
1
1
-4-4
4
4
-4-4
4
4
-2-2-2
q
6
6
-2-2
1
where 12 arises from the fact that there are two cohorts, both using the same treatment
structure and ds : þ, Tot )0, TD¡ Àn . The matrix ?, obtained by eliminating
the effect of the 1 vector from the columns of 7s is then given by
T:!1,ø8'-
0
0
8
8
0
0
8
8
0
0
0
0
8
8
0
0
(8.4.1)
so that the overall treatment information is
4
-20
0
00008004
2
.)
0
0
Table 8.6 shows the information matrices for the different strata in each design obtained
by calculating T|Q¿T for the appropriate idempotent matrices Q¡. In the case of the
standard design, we use the Kronecker product matrices sholvn in Table 8.1. For the
r42
cohort design, we refer to Table 7.6 in the previous chapter, lvhich shows that lvhen c:2and p : 4, we have one degree of freedom for Cohort (elim. Periods) with p1 : 0.50, and
d1 given by the contrast in Table 8.7.
Table 8.6: Information matrices for standard and cohort designs in two-treatment design
lvith two baseline measurements, when first-order canyover effect is present
Stratum
Standard Design Cohort Design
o2 o2" Information Matrix p 02 o 2 Information Matrixt
Periods(ig. Cohorts)
4
-20
0
2
3
0
0
0
0
0
0
0
0
0
0
1
0
0
0
000i00000000
Cohorts(elim. Periods) t4 None
1
0
0
0
000
1
3
0000.50 1 3
000000
Group within Cohorts I4
0000000000000001
t4
0000000000000001
Periodsx Cohorts 10 None 10 n
3
4
-30
0
-300300000000
Periods x (Grp.w.Coh.) 10
0000000000800003
10
0000000000800003
4
2
0
0
2
3
0
0
0
0
8
0
0
0
0
4
4
-20
0
2
3
0
0
0
0
8
0
0
0Total
0
4
t43
0 -1 1 1
0 -1 1 -1
1 1 1 0
I i 1 0
Table 8.7: Orthogonal contrast on Cohort (elim. Periods)
In comparing the results in Table 8.6, in terms of treatment information in each design
and in each stratum,, the follolving points can be noted:
o The treatment information in Group within Cohorts and Periods x (Grp.w.Coh.)
strata for both designs is the same, because the projector matrix in each stratum
is the same for both designs.
o In the standard design, all information about (ro, )o) is in the Periods stratum,, so
that the possible presence of period effects would imply that we cannot estimate
(ro, )o) 'vith this design.
¡ In the cohort design, most of the information about (ro, Ào) is available in Period x Cohort
stratum with a variance of ø2 and some information (11%) about 16 is in the Cohorts
(elim. Periods) stratum with a variance of o2 +3o?.
8.4.2 Tleatment information when carryover efÏect is not present
Now we consider this crossover design when there is no carryover effect in the model.
We refer to Table 8.5 and do not consider the two parameter Ào and )¡. Then the total
treatment information for direct treatment only can be written as
4008
The treatment information in five different strata for direct treatment effect for standard
and the cohort designs is given in Table 8.8.
T,T
t44
Standard Design
Stratum o2 o! Information Matrix
Periods(ig. Cohorts)0 0
40 1000
Cohorts(elim. Periods) NoneI4
Group within Cohorts Nonet4
Periods x Cohorts None10 1
310
0 0
80
Periods x(Grp.w. Coh.) 10 0008
10 0008
Total4008
4008
Cohort I)esign
p o2 o! Information Matrix
0.50 1 3 -L3
0
0
1
0
T4 None
Table 8.8: Information matrices for two-treatment, two-period standard and cohort de-
signs when first-order carryover effect is not present.
Like the situation in the previous subsection with this same design but with first-order
carryover effect,, cohort design is more efficient than the standard design. The parameter
re is estimable in all strata except the Groups within Cohort stratum and for rp ,the
difference between direct treatments, Periodsx (Groups within Cohorts) in both designs
has all the information.
8.5 TYeatment information for two-treatment, extra-
period crossover design and its corresponding co-
hort design with one baseline measurement
In this section we consider the extra-period crossover design with two treatments and
one extra period for baseline measurement. This design with three treatment periods
with two groups is known as a dual balanced design, because this design is made up
of equally replicated pairs of dual groups. The reason to choose this design is that the
additional treatment period decreases many of the problems associated with analysis of
t45
trvo-period crossover design. For example, as Ebbutt (1984) has shorvn, an appropriate
extra-period design allolvs us to use all the data to estimate and test direct treatment
effects u'hen first order ca ryover effects are present. I(ershner & Federer (1981) discuss
the estimation of contrasts for direct and carryover treatment effects for a wide variety
of two-treatment crossover designs lvith two, three and four periods. They also compare
the variances of these contrasts for many alternative designs and evaluate the impact of
including baseline measurements as suggested by Wallenstein (1979). Hafner & Kocþ
(1988) give some alternatives to the analysis of extra-period crossover designs with two
treatments. In their paper for these designs, they have emphasised the formulation of
appropriate within-subject linear functions to analyse the design with pararnetric and
non-parametric methods. The reason to choose this linear function in non-parametric
analysis is that one only needs the independence and common distribution assumptions
on the resulting within-subject linear functions. In the parametric approach one also
requires normality assumptions on the resulting within-subject linear function for small
sample cases. Now in this section we attempt to compare the treatment information
for both the standard and the cohort designs for this crossover trial with one baseline
measurement. This design with one baseline measurement is given in Table 8.9.
Period
1 2 3 4
Group 1
2
A B B
B A A
Tabie 8.9: Two-treatment, extra-period crossover design with one baseline measurement
The Cohort design corresponding to this standard design is shown in T.Lle 8.10
Table 8.10: Cohort design for two-treatment,, extra-period crossover design with one
baseline measurement.
Period
1 2 3 4 5
I
2
o,)
4
Group
A B B
B A A
A B B
B A A
t46
The expected values of the design for this model are given in Table 8.11
I,r lr+ro+rD þiro*)o-rolÀn p,lro*Ào-ro-Àop p.*ro-rn p'lro*)o*rn-Ào þlro*)o*ro*Àop lt+ro+rD L¿*ro*)o-rnI\n ¡.r,I ro * )o - ro - Àn
ll p,*ro-r¡t p,*ro*)o*rn-Àn ¡t"Iro*)o*rnIÀnTable 8.11: Expected values for design of two-treatment, extra-period crossover trial
Now, as in the previous section, we deal with this design in two subsections in terms
of presence and absence of first-order ca ryover effect in the model.
8.5.1 Tleatment information when first-order carryover effect
is present
The total treatment information for both the standard and the cohort designs after taking
o':t the vector 1 due to the grand mean can be obtained as
7"7 :
The output from SPLUS and Maple software provides the information matrices for var-
ious strata for both designs as given in Table 8.12. Treatment information about the
differences rp and )¡r remain the same for the two designs, with a small amount of in-
formation in the Groups within Cohorts stratum. The situation, however, is different fro
the treatment information which looks at how the average of the two treatments differs
from baseline. In the standard design, this is totally lost to the Period stratum. For the
cohort design, not much is recovered, with only a third of the information about r¡ and
a quarter of the information about Às being recovered, the rest being lost into the Period
stratum.
The Cohort(elim. Periods) stratum has the information about (ro, )o), and this is
contained in the one degree of freedom contrast
32 0 0
24 000012000 08
747
0 -1 1 -1
0 -1 1 1
I 1 I 0
1 1 1 0
which in this case has expectation
2(2rç ! )o) - 2(3ro -l2)o) : -2(ro * Ào),
and variancel2(o2 +3o!). The PeriodxCohort stratum has just 2 degrees of freedom
These may be represented by
which have expectations of 2(D,s - rs) and -2ro, and variances of 24o2 arrd 8a2,, respec-
tively. It follows that the information matrix for these two contrasts is given by
1
3
r12 ]+ z
1
6
1 2 1
1 l;lr -1 02
as given in Table 8.12.
l2
0 1 2 -l0 1 2 -1
I -2 1 0
1 -2 1 0
0 I 0 1
0 1 0 I
1 0 -1 0
1 0 1 0
r48
Stratum
Standard Design Cohort Design
o2 o , Information Matrix,9
p 02 o! Information Matrix
Periods(ig. Cohorts)
32 0 0
240000000000
220023 0 0
00000000
Cohorts(elim. Periods) t4 None
1
1
0
0
100
11000.50 1 3
000000
Group within Cohorts 74
0000000000100000
74
0000000000100000
Periodsx Cohorts 10 None 10 I
2
-10
0
-1 0 0
200000û00
Periods x (Grp.w. Coh.) 10
00 0000 000 0 11 0
00 08
10
00 0000 000 0 11 0
0 0 0832 00
Total2
0
0
4
0
0
0012008
3 2
4
0
0
0
0
0
0
0
8
2
0 t2
0 0
Table 8.12: Information matrices for standard and cohort two-treatment, extra-period
designs with one baseline measurement.
This information can ed to form treatment estimates by rveighting these
contrasts by their inform ces, i.e.
l;l : {;ll rl ', ;l}
r49
x {lll lll;: l.+'l ; ;ll;; l.='l; :ll;. 1}in which îot, îoz, âe3 represent estimates obtained from the three contrasts, one from
Cohorts(elim. Periods) and the other two from PeriodsxCohorts. Now lve can write
-1t6pTg
io ['
{t
3p
-ôp3*6p
1
1
X (î0, * io,) * 1*3p2
0 rrp 0 -p0 l*p 0 -p
_1r + n) 0 p 0
_1r + n) 0 p 0
0 0 1r+n) -p0 0 1r+n) -p
0 _1r + r) p 0
0 _1r + r) p 0
I ;l(î"-zi") +le+iø l;l-')
i;l
r [r+p p .l
3(r+pxl +spL p ,*rr)
{ll](î.'*i.') -F'j# t ;] (î.,-z\.,)+
: --+;l{III(îor*io') +tl-,1 ,, I(ìo'-2io')
+;[';"]"Then the estimate of rs can be obtained as the following linear combination of the 16
treatment means:
X3(1 + 3p)
los2
I,o - z(r+p)
and the estimate of )s is similarly given by
\-1no - zÍ+p)
The variance of these estimates is given by
Tg | -l2p
p
p
1*2p)oVar
150
\A,¡e note again that this refers to the case n - 1 subject in each group in each cohort,
and an increase in the value of n lvould simply reduce this variance by a factor of n.
8.5.2 Treatment information when the first-order ca.rryover is
not present
We here again consider the design in the previous subsection, but we discarcl the first-
order carryover effect from the model. Just for the direct treatment effect rve have the
total treatment information
,'r:u 300 ).2
and subsequently the treatment information separately for various strata is ol¡tained for
both standard and the cohort designs as in Table 8.13.
Cohort Design
p 02 o! Information Matrix
2000
2
0
010 !3
0
Table 8.13: Treatment information for both standard and cohort two-treatment, extra-
period design with one baseline when the carryover effect is not present.
Standard Design
o! Information Matrixo2Stratum
3000
Periods(ig. Cohorts)
Nonet4 0.50 I 31
3
1000
Cohorts(elim. Periods)
lolo
0
1
14 I 40001
Group within Cohorts
None10Periods x Cohorts
0
11
0
010 0
11
0
0Periods x(Grp.w. Coh.) 1 0
0
12
r)
0
0
72
ot)
0Total
151
8.6 Treatment information for three-treatment and
three-period crossover design
In designs with more than two treatments, unlike the two treatment design. the main
focus in planning is that we must decide which contrasts of treatments are to be estimated
with the irìghest precision. This idea may lead us to want a design such that all pairwise
differences between the treatments are estimated with the same precision. A design that
possesses this property is known as variance balanced. Thus, our design comparing three
treatments A, B and C, lvill be variance balanced if it satisfies the following property.
V ar(î¿ - îa) : Var(ît - îc) : V ar(î'p - îc) : Lto2,
where rA,,rB, and r¿ are the direct treatment effects of treatment A, B and C respectively
and u is a constant for all treatments. A layout which achieves this is shown in Table 8.14.
Group 1
2
Period
1 2 3
A B C
B C A
Table 8.14: Three-treatment, three-period crossover design
A balanced incomplete block (BIB) design is one in which each treatment is replicated
the same number of times and each pair of treatments occur together in the same block
rc times, where rc is a constant integer. Then as can be seen the design in Table 8.14 is
a balanced incomplete block design with periods as blocks with ú : 3 treatments and
k : 2 units in each of the three blocks, and the efficiency of this (BIB) design is
n ¿(k-1) 3x1 3"-k(t-1)-zx2-4'
This implies that 25% ol the information on treatment differences is lost into the Period
stratum. The above design is thus balanced if periods are regarded as blocks. We note
that treatments are orthogonal to groups. The presence of carryover effects here may,
horvever, destroy this balanced property if they are included in the model. The cohort
design corresponding to this design is shorvn in Table 8.15.
752
Period
1 2 ô 4
I
2
.)
4
Group
A B C
B C A
A B C
B C A
Table 8.15: Cohort design in three-treatment and three-period crossover design
If the usual constraints
r¡*rn*rc:0, À¿ *À¡ *)c:0,
are applied to the direct treatment and first-order carryover effects, r¡/e can put 1 and 12
for direct treatment effects of A and B respectively, )1 and À2 for first-order carryover
effects for A and B respectively, and have -(1 f 12) and -()r * À2) for direct treatment
and carryover effects respectively for treatment C. Then the expected values of the cohort
design of this model is given in Table 8.16.
p,*rr p,l rz I Àt H-rt-rzlÀzp'Irz F-rt-rzlÀz p,+ rt-Àr-)zp,lrr p,*rzl\r þ-rt-rz*Àz
¡L' I rz þ-\-rzl\z p,*rr-)r-)z
Table 8.16: Expected values for three-treatment, three-period crossover and its corre-
sponding cohort design
8.6.1 Treatment information when carryover effect is present
By following the expected values in Table 8.16 the design matrices ?s, and T are written
AS
1110101 -11 -111
0-10-16-105056-7
000100110
-1 0 1
-1 0 1
0-1-1
6
0
0
-6-6
6
0
f)
6
6
6
0
Zo:128
153
T: !1, m6--
where for T matrix we have taken out the vector 1 due to the grand mean from matrix
To. Then the total information on (21 ,r2,,Àt,)2) is gìven by:
T,T
and treatment information for these parameters is given in Table 8.17
Table 8.17: Information matrices for standard and cohort designs for three-treatment,
three-period.
By referring to Table 8.17 for ?'1 and 12,75% of the information is in the Periods x (Group
within Cohort) stratum. However, half of what is left for 11 is recovered in the Periods x Cohorts
stratum lvith a variance o2 and a small amount more in the Cohorts(elim. Periods) stra-
tum. The Periods x Cohort stratum actually provides information on only one treatment
I:-6
24
I2
-6-18
12
24
6
-12
-66
12
rt
-18
-t26
T7
Stratum
Standard Design Cohort Design
o2 o2, fnformation Matrix p o' ot" Information Matrix
Periods(ig. Cohorts)
6336ÐÐ
-30
-33
6
0
,
1
12
9
9
0
Il8
Io
0-39093
Cohorts(elim. Periods) 13 None
1
t
Io
|,1 0
0
0
0
4 t0.577 I 2
21o0
Group within Cohorts 13
0
0
0
0
0
0
0
0
0
0
4
2
0
0
1
I 3 1
0
0
0
0
0
0
0
4
Periodsx Cohorts lo None I 0
90oo
-90-60
-9 -6o09664
Periods x(Grp.w. Coh.) 1 0 I
18
I
-3-15
I18
-t2
-3
2
1
-15-12
1
l4
1 0
18
I
-3-15
9
18
-t2
-33
J
1
-15
-121
l4
Total
24
I2
-6-18
L2
24
6
-72
-66
t2
6
-18-12
6
l7
754
combination, namely 3rr - 3)r - 2À2, rvhile Cohorts(elim. Periods) has information only
on the linear combination rr * 2^2 + ^1.
8.6.2 Tleatment information when carryover effect is not present
The total information for just direct treatment effect after taking out the grand mean
from the matrix design of the model is
T'T : l;:land treatment information for both standard and cohort design is given in Table 8.18.
It shows that, of the25% of the information lost to Periods in the standard design, about
half of this is recovered for 11 from the Periods x Cohorts stratum,, and a quarter of it
for 12 is recovered from the Cohorts(elim. Periods) stratum. It may be possible to do
better than this by astute choice of design or by extending the design to three cohorts.
Cohort Design
p o2 o! Information Matrir
13 None
Table 8.18: Treatment information for both standard and the cohort designs in three-
treatment, three-period when the carryover effect is not present.
Standard Design
Stratum Information Matrix2 ,o o I
Periods(ig. Cohorts)2l12 Z
1
2
1
1
None13Cohorts(elim. Periods) 0,577 1 2 1
8
t224
None13Group within Cohorts
Periods x Cohorts None10 I8
10 9000
Periods x(Grp.w. Coh.) où10 27
t2I 0 ,l
2l72
Total 42TI2
4I 2
2t
155
8.7 TYeatment information for two-treatment with
one baseline measurement with more than two
cohorts
In Chapter 6 we analysed this design with two cohorts, now we see what happens when
we analyse with more than two cohorts. The layout for this case is shown in Table 8.19.
Table 8.19: Two-treatment crossover design with one baseline measurement when c : 3
The means corresponding to these 18 groups can be written as in Table 8.20
ll p,lro*rn p,lro*)o-rplÀop p'lro-ro p,*ro*Ào*rn-Ào
ll y,+'ro*ro p,*ro*Ào-ro*Àop p,lro-ro p,*ro*)o*rn-Ànp p,*roIro p,*ro*Ào-rnlÀop p,lro-ro p,lro*)o*rn-Ào
Table 8.20: The 18 means for two-treatment with one baseline measurement lvhen c: 3
As for the previous designs, we consider this case in two stages, with and without first-
order carryover effect.
Period
1 2 3 4 5
1
2
or)
4
b
6
Group
A B
B A
A B
B A
A B
B A
156
8.7.1 Tleatment information when c - 3 and first-order carry-
over effect is present
By referring to Chapter 7 and considering the means repeated three times due to c : 3,
and also the different ordering to put the design matrix, then the design matrix Ts can
be presented as belolv
?o:ls8
00 0 0
00 0 0
10 1 0
10-1 0
I 1 -1 1
1 1 1 -1
r:fr,ø
0
0
0
0
6
6
1
1
1
1
I
1
-4-4
2
2
2
2
2
2
2
2
4
4
0
0
6
6
6
6
and the total information for this case is
T,T
42 0
24 0
00 t2
00-6
0
0
6
6
This information is now partitioned among the different strata as shown in Table 8.21.
We see that, in the cohort design, about 60% of the information about ro and Ào is
recovered in the various strata, with some appearing in the Cohorts (elim. Periods) and
some in the Cohort x Period interaction.
r57
Standard Design
Stratum Information IVIatrix2o o2
4200240000000000
Periods(ig. Cohorts)
Cohorts(elim. Periods) 1 3 None
1 3 None
Group within Cohorts
0000000000000 0 02
13
1 0 NonePeriods x Cohorts
0
0
0
0
00000 1.2
0-6
0
0
6
4
10Periods x(Grp.w. Coh.)
Total
42 0
24 0
001200-6
0
0
6
6
Cohort Design
p o2 o! Information Matri>
0.707 1 1.5 I2
0.408 | 2.5 #
540045 0 0
0000000033.) .)
0000
I
-10
0
00000000-1
1
0
0
I
0
0
0
c
0
0
0
0
13
10î
10
0000000000000 0 02
1-1 00-1 1 0 0
0 0000 000
00 0
00 0
00 t2
00-642 0
24 0
001200-6
0
0
6
4
0
0
6
6
Table 8.21: Information matrices for standard and cohort designs for two-treatment with
abaselinerc:3
158
8.8 Concluslon
As we showed, classical analysis of crossover design even if baseline measurements are
included and in the presence of strong or unknown period effects is unable to recover
information about the difference between the average treatment effect and baseline. Using
the cohort design in crossover design, we can recover some information about (16, )s) and
obtain estimates for them from within and between subjects. This method opens the
way for a more detailed examination of the topic and in this chapter lve have examined
various crossover designs.
In comparing the cohort and standard designs and to show the abilities of cohort
designs to recover some information about the parameters of interest, consider the infor-
mation matrices in Table 8.22 in three cohort designs in the analysis of two-treatment,
two-period crossover design with 1,, 2 and 3 cohorts, or equivalently, c : l, c: 2 and
c: 3 respectively. We note that c : I is equivalent to the standard design.
Information matrix
Strata C:I C:2 C:3
Periods(ig. Cohorts) I6
8448
1
6
108810
1
6
10 8l810
I
Cohort(elim. Periods) ,l: :l *lII ;l
ål0.2
-0.2
-0.2
0.2
Periods x Cohorts 1tt
3-3
-3316
4.8
-4.8
-4.84.8
Total !b
8448
1
6
168816
I6
24 t212 24
Table 8.22: Information on rs and Às in three cases of design
As can be seen,, the total information will increase when we increase the number of
cohorts in the model. In addition in the standard design (c : 1), all information about
the parameters of interest is in the Periods(ig. Cohorts) stratum, although in the cohort
designs (c : 2,3) the information is distributed in all strata, and the proportion of the
159
APPENDICES
A Some useful concepts from linear algebra
In this appendix, we give some basic matrix results which are need at various stages in
this thesis.
,4'.1 The Kronecker product
We begin with a brief review of the definition of a right l(ronecker product and some of
its properties. Thus, if A and B denote matricesof dimensionp x q and mxn) then the
right Kronecker product of A and B is defined as
atB oroB
AØ B:aprB apqB
Observe that A I B is a matrix of dimensioî pn'¿ x qn made up of pq submatrices with
each submatrix equal to the scalar multiplication of an element of A with the matrix B.
Some useful properties of the above Kronecker product include
(1)
(2)
(3)
(4)
(5)
(6)
\r/
(8)
(AgB)':A'Ø8,(AØ B)(C 8 r) : AC ø BD;
rank(A$ B) : rank(A)rank(B)
(A+B)8(C+ D): (AsC) +(48 D)+(B sC) +(BøD);
(ABØCD):(AØC)(B8D);
(aør)-1 : A-r 8B-l(AØ B)- - A- Ø B-
lA Ø Bl: lAlelBl^,
where we assume that the matrices involved in regular matrix multiplication are of appro-
priate sizes and in property numbers 6-8 the two matrices, A and B, ate square matrices.
These properties are established , for example, in (Rao (1973), p. 29)
161
L.2 Idempotent matrices
A matrix A of order m xÌrL1is said to be idempotent if
AA: A
We note that if A is idempotent, each of its eigenvalues is order 0 or 1. If A is idempotent,
then the matrix I - A is also idempotent, since we have
(r - A)(r - A): r - A- A+ AA-- I - A.,
since ,4.,4 : A. The rank of an idempotent matrix is equal to its trace
4.3 Positive definite quadratic forms and matrices
Q is said to be a quadratic form in the n variables (*r,. .. , r,,) if
8:Ðla¿¡r¿r¡: r'Atá=I j=l
We refer to A : (a¿¡) as the matrix of quadratic form and we will assume that A is
symmctri,i. Ä matrix A is said to be positive definite if the quadratic form Q : rtAr ) 0
for a1l r + 0. A matrix A is said to be positive semidefiniteif Q : rt Ar ) 0 for all r I 0.
^.4 Contrast and orthogonal contrasts
As Jones & Kenward (1939) pointed out a contrast on a vector 0 : (rr,rr,.. ., r¿) is a
linear combinationclrl lczrz+...+ qr¡,in which e*cz +...+c¿:0. Forexample
with a vector with two dimensions there is only one such contrast and that is ry - 12.
The coefficients of this simple contrast âr€ c1 : 1 and cz : -1. With more than two
dimensions,, however, a number of different sets of contrasts can written. One contrast
h\l czrz*...1ctrt is orthogonal to another dtrt* dzrz*...* d{t if and only if
c(L * czdz I . . . * qdt : 0. In general we can always write down a set which includes up
to I - 1 orthogonal contrasts on a vector with dimension of f .
4.5 ANOVA sum of squares as quadratic forms
In the following discussion rve denote the k x k identity matrix by In and the k x k of
ones by E¡. In terms of matrices like 1¡ and Ek, a neat expression may be given of the
762
various quadratic forms in terms of matrices ,./¿ and Il¡ denoted by
J* : k-r Et and Kt : In - Jn,
where,I and K arc matrices of rank 1 and (k - 1), respectively.
Suppose X is an r x c matrix with X¿¡ as (i,j)th element and Y is a rcxl vector
obtained by stacking the rows of X starting with first row, i.e, Y : (X'r,. . . , X',)' where
X¡: (X¿r,...rX¿")'for i : 1,...,r. To avoid confusion, the reader should note that
subscripts in this note play a dual role, namely, to indicate the dimension of a matrix as
a means for referencing an element or a row of a matrix. The first basic sum of squa es
as a matrix quadratic form is
Ð D x?¡ : Y'lr, s 1"1 y. (A.1)i=l j=L
This follows from the definition of the inner product of Y with itself and the fact that
[1" S 1"] : I,".Next, we have
rc
rc
DDx.¡ : Y'lE, Ø E.lY,i=l j=l
(A.2)
(A.4)
because the left side of (4.2) is equal to
ll',.Y\' - Y'1,"1',"Y : Y' lEr"]Y : Y' lE, Ø E"]Y,
where 1"" is the rc x 1 vector of ones. The third basic sum of squares as a quadratic form
is given by
ifi x¿¡l' : Y'II, Ø E.IY, (A'3)i=r j-l
since the left side of (4.3) may be written as
r f
Ð ttlXl' : D xlu"x; : Y' lI, Ø E"lY,i=I i=l
The final basic sum of squares as a matrix quadratic form is
c
få ",] : Y' tE, Ø I"tY,D
j=l
To see this, we expand the left side (4.4) to obtain
c r c r
tt x?¡ + ÐtÐxo¡xr¡,j-t i-t k+iijI i=I
163
In terms of vectors representing the ith and kth rolvs of matrix X, the preceding expres-
sion may be rewritten as
f c rI. I.
I" I.tt x?¡ + DtÐXn¡xr¡: X xj=I i=l j=t i:I k+;
11 1
I
4.6 Summing vectors, and ,Ð-matrices
Vectors having every element equal to unity are called summing vectors and are denoted
by 1, using a subscript to present order when necessa y. They are called summing vectors
because, with x' : lrtrtz¡...,r,r], for example l'rrx: ET=rr¿. In particular, the inner
product of 1,, with itself is n: 111,, : n. A product of a summing vector with a matrix
yields a vector of either column totals or row totals, of the matrix involved: for B having
elements b¡¡,the product 1'B is a row vector of column totals ó.r', and 81 is a column
vector of row totals 6¿.. Outer products of summing vectors with each other are matrices
having every elernent unity. They are denoted by E. For example,
i1 1
-E'
11 1
For E square and of size n, E2 :nF', and Elr, - nLo and úr(Er,) : n. Two useful
variants of E' are
1.Jr'-! E,,
2. K': Ir, - J'r,
which are idempotent matrices of rank 1 and (, - 1), respectively.
1r1; :
164
B Bayest theorem
Suppose that y : (yt,Uz,. . . ,!Jn)' \s a vector of n observations whose probability distri-
bution p(Vlï) depends on the values of k parameters d : (0r,...,0n)', Suppose also that
d itself has a probability distribution p(0). Then,
p@10)p(0) : p(u,o): p(îla)p@) (8.1)
Given the observed data, y, the conditional distribution of d is
p(,,lù:n@lor)n(o), (8.2)
where p(v) : nelp',11)l: c-r : I t pful|)e4)d'| if 0 is continuous
\r t" tr t Ee@1l)eQ) il 0 is d,iscrete '
where the sum or the integral is taken over the admissible range of 0, and where E6[f] is
the mathematical expectation of / with respect to the distribution p(0). Thus we may
write (8.1) alternatively as
p(olv) : cp(vlo)p(o)
or
p(ïly) x e(ylï)e(0)
where the proportionality depends only on y, not 0
8.1 Normal prior for multinormal sample
Suppose that (r1, t2.t...,r,,) are independently and identically distributed as lú(d,X).
The likelihood function is
ÌL
f (*t,rr,. . .,,rrlo,Ð) o( fl lrl-'¡'. rp{-(r¿ - 0)'Ð-t(r¿ - 0) l2} (8.3)i=l
Now expanding the quadratic forms we have
lÐl-"/' "*p{-ÐT=t@ ¿ - 0)' E-t (rn - 0) I 2}
fL
Ð("0-î)'E-t(r¿-0) n|'E-r0 - nr'Ð-r0 - nÎ'E-rr tÐ_rr'tÐ-rrt
"(0-ù'Ð-t(0-r)+r,i=1
165
where r : n-r f ¿ r¿ is the sample mean vector, and
i=l\ x'rE-r r¿ - nrtE-r r
and ,S : n-LÐi@¿ - *)(r, - ø)/ is the sample covariance matrix. Therefore
D(to - r)'¡-t @¿ - *)i=1
t7
I traceE-t (*o - r)(r¿ - r)ti=1
ntraceÐ-r S
f (rr,r2,t. . .,,rn10,Ð) x lEl-"/2.rp{-r(g - n)'E-t(0 - r)12 - ntrace(Ð-t Sl2)} (8.4)
Now if X is known, two terms can be dropped from (8.4) leaving
l(*r, rr,..., rnlg,E) x eæp{-"(0 - ù'E-rQ - ù l2}. (8.5)
If we let the prior distribution of 0 be ,Ä/(ds, Ðq) we find
lQl"tsr2,,. . . ,rn) o( f (0)f (rt,,t2t. . .,r"10)
o( erp{-(0 - do)'Eo t(0 - 0o)12 - n(0 - r)t2-t(0 - r)lz}.
Now complete the square by
(0 - lo)'E;'Q - 0o) ¡ n(0 - r)'2-t (0 - ¡) g'(¡;t +nÐ-r)o - (dåx;t +nr'E-t)o
o' (E;r oo r nE-r r) + (dix;ld s I nr'Ð-r r)
(o-op)'Q,ò-'(o-oò+R,
where
Ee: (t;t+rrX-t)-t
oe : re(Ðtldo ¡nÐ-rn),
and ,R is a constant. The posterior distributionol0 is therefore N(ïr,Xo). The posterior
mean is a matrix-weighted average of the prior mean d6 and the sample mean r . The
weights are respectively the prior information matrix Xo 1 and the analogous data infor-
mation nÐ-L (being the inverse of the covariance matrix of z ). The posterior information
matrix is the sum of X;l and nE-I, and the posterior covariance matrix is smaller than
either the prior covariance matrix X6 or the data analogue n-rD, ìn the sense that both
Io - Ðo and n-lE -E, are positive definite. Then the distribution of d is N(0e,Ðò.
166
C The S-PLUS program for cohort designs
Chapters 6, 7 and 8 require the calculation of the canonical correlations and the canon-
ical variables for the columns of two design matrices P and C representing Periods (ig.
Cohorts) and Cohorts(ig. Periods), respectively. The following functions in S-PLUS re-
quire that the user specify only the number of cohorts nc, the number of periods np, and
the number of groups within each cohort ng. It returns the canonical correlations in the
vector 'rho' and the contrasts required for the various subspaces defined in the text.
C.L S-PLUS functions
Firstly, we need a function which will form the Kronecker products. This is provided in
the S-PLUS library design:
kron(-function(. . . )
{
1 <- list(...)a (- as.natrix(l ttlll )
f or(b in 1[-1] ) {b (- as.natrix(b)
a (- matrix(apern(a|o'/,a, c(3 , L, 4, 2)), nrow=nrow(a)*
nrow(b), ncol = ncol(a)*nco1(b))
)
The function cc then forms the required projector matrices, as defined in Table 7.2, where
Vq: I"Ø JeØ Kn andVs: I"Ø KpØ Ksi
cc(-function(nc, np, ng)
{
# Set up Period and Cohort factors
P (- factor(outer(l:np, (1:nc) - 1,
C (- factor(rep(l:nc, rep(np, nc)))
a
Ì
r67
rrlrr ) )
design (- data. f rarne (C, P)
# Set up Helnert contrasts in P and C
Xp <- nodel.natrix( - P, design)[, -t]Xc (- nodel.matrix( - C, design)[, -t]
# Make orthogonal to grand mean
Xp <- scale(Xp, scale = F)
Xc (- scale(Xc, scale = F)
# Forn canonical correlations
ccor <- cancor(Xc, Xp)
# Forn contrasts for PiC and CiP
Pc (- Xp 'i"*L ccor$ycoef
Cc (- Xc '/n*L ccor$xcoef
# Forn contrasts for CeP
rho (- ccor$cor
n (- 1:Iength(rho)
if(length(rho) > 1)
{cepc <- (cc [, n] -Pc [, n]'/.*'/,aíag(rho ) ) %*%diag ( 1/sqrt ( 1 -rho ^ 2) ) ]if(tength(rho) -= 1)
{Cepc <- (ccI rn] -Pc[,n]*rho) /sqrt(r-rrro^z)]# Find projection rnatrices
S1<-Pcz.*7,t (Pc)
S2<-Cepc%*%t (Cepc)
one(-rep ( 1 ,np*ns¡
53<-diag(1,np*¡ç¡-one i/,*%t (one) / (np*nc)
s3<-s3-s1-s2
# Now include the ng groups within each cohort
Jg <- natrix(t/ng, ng, ng)
Ig <- diag(l, ng, ng)
Kg<-Ig-JgV1 <- kron(S1, Jg)
V2 <- kron(S2, Jg)
V3 <- kron(S3, Jg)
Ic (- diag(l, nc, nc)
168
Jp <- natrix(t/np, np, np)
Kp <- aiag(1, np, np) - Jp
# Groups within Cohort
54 <- kron(Ic, Jp)
V4 <- kron(S4, Kg)
# Periods x (Groups within Cohort)
SS <- kron(Ic, Kp)
V5 <- kron(S5, Kg)
# Periods(e1in. Groups)
56 <- SL+52-Cc'A*i/.t (cc)
V6 <- kron(S6, Jg)
list(Xc = round(Xc, 0), Xp = round(Xp,
Cc = round(Cc, 6), Pc = round(Pc,
vl = round(v1, 6), v2 = round(v2,
v4 = round(v4, 6), v5 = round(V5,
)
6), rho = round(rho, 6),
6), Cepc = round(Cepc, 6),
6), V3 = round(V3, 6),
6), V6 = round(V6, 6))
The above function provides the projector matrices, but to get the informatiorr matrices,
we need a matrix T whose columns describe the treatment parameters and are orthogonal
to the vector of ones:
inf n(-function (tn, VI ,V2, V3,V4,V5, V6 )
{ tn<-scaIe(tn, scale=F)
onec( -natrix (rep ( 1 , nc) , nrow=nc)
t(-kron(onec,tm)
# Now the treatnent infornation natrices are obtained using the
# projector matrices obtained fron cco
I1<-t (t) l"*L Vt '1,*'A tr2<-t(t) 'i"*'/" v2 '/.*'A tI3<-t (t) '/,,*',/, V3 ',/"*l/, tI4<-t (t) L*',l" v4 l,*L tIs<-t (t) '/"*'/, v5 '/,*'1" tI6<-t (t) '/'^*1" v6 l"*',A t
list(I1 = round(Il, 6), 12 = round(I2, 6), 13 = round(I3, 6),
169
)
14 = round(I4,6), 15 = round(I5,6), 16 = round(I6,6))
C.2 Use of S-PLUS functions for Chapter 6
We now illustrate the use of these functions in the analysis of the design given in Chapter
6. Firstly, we find the projector matrices using cc:
cc2321-cc(2,3 ,2)
# Projector natrices
6*S1
123456
1 5 -1 -1 -1 -1 -1
2-7 2-r 2-L-73-1-1 2-t 2-L4-7 2-7 2-L-75-1-1 2-L 2-L6 -1 -1 -1 -1 -1 5
SPLUS > round(2+*(V2+V4))
[,1] l,Z) [,3] [,4]
[t,]4-44-4lz,f -4 4 -4 4
[9,] 4 -4 7 -1
[+,] -4 4 -1 7
[s,]4-47-1[e,] -4 4 -1 7
l7,l o o -3 -3
[e,]oo-3-3[9,] o o -3 -3
[10,] o o -3 -3
hl,l o o o o
lL2,f o o o o
SPLUS > round(24*(v3+V5))
[,t] l,zl [,3] l,+f
al [,9] [, 10] [, 1 1] l,tzl0000000000
-3-3-300-3-3-300-3-3-300-3-3-300-17-I4-47 -1 7 -4 4
-17-74-47-17-44
-44-44-44-44-44
[,5] [,6] t
4-4-447 -1
-t77 -1
-1 7
-3 -3
-3 -3
-3 -3
-3 -3
0000
7)t0
0
-3
-3
-3
-3
7
-1
7
-1
4
-4
tt sl ,71 [, A][,0]
170
[,9] [,to] [,tt] l,r2f
It,]lz,f[9,]
[+,]
[s,][0,]
17,f
[4,]
[9,]
[10,]
[11,]
Itz,lSPLUS
[t,]12,f
[9,]
[4,]
[5,]
[0,]
17,f
[e,]
[9,]
[10,]
[rt,]U2,l
4-4-2-442-2252-2-3
-22-42-200 0 -1
0 0 -1
002002000000
) round(Z+*VA)
[ , 1] l,Zf [, a]
88-488-4
-4-45-4-45-4 -4 -1
-4 -4 -1
00300300-300-3000000
2
-2
-3
5
0
-4
-1
-1
2
2
0
0
[ ,4]
-4
-4
5
5
-1
-1
3
5
-3
-3
0
0
-2
2
-4
0
6
-3
2
2
-1
0
0
[,5]-4
-4
-1
-1
tr
5
-3
-3
3
3
0
0
2
-2
0
-4
-3
6
2
2
-1
-1
0
0
0
0
-1
-1
2
2
5
-3
-4
0
-2
2
0
0
-1
-1
2
2
-3
5
0
-4
2
-2
[,9]0
0
3
3
-3
-3
5
5
-1
-1
-4
-4
0
0
2
2
-1
-1
-4
0
6
-3
-2
2
[, g]
0
0
-3
-3
3
3
-1
-1
5
5
-4
-4
[,6] l,Tl-40-40-1 3
-1 3
5-35-3
-35-353 -1
3 -1
0-40-4
0
0
2
2
-1
-1
0
-4
-3
6
2
-2
[, 1o]
0
0
-3
-3
3
3
-1
-1
5
5
-4
-4
0
0
0
0
0
0
-2
2
-2
2
4
-4
[, 11]
0
0
0
0
0
0
-4
-4
-4
-4
8
I
0
0
0
0
0
0
2
-2
2
-2
-4
4
l,L2f0
0
0
0
0
0
-4
-4
-4
-4
8
I
Then, we need to calculate the information matrices given in Table 6.8, using the treat-
ment design matrix of given by the last four columns of the matrix in (6.3.1).
#This is the T as given in Chapter 6: Coh x Per x Grp
tm(-natrix(c(0,0,1,1,1,1, 0,0,0,0,1,1,
0,0,1, -1,-1,1, 0 r 0,0,0,1, -1) ,nrow=6)
inf (-inf n (tm,cc232$V1 , cc232$1,12,cc232$VS, cc2326V4,cc232gV5, cc232$V6)
t77
# ïnfornation matrices
SPLUS > 6*11
[, 1] l,2l [ , g] [,4]
h,l 10 8 o o
l2,l 8 10 0 0
[9,] o o o o
[+,] o o o o
SPLUS > 6*T2
[,1] l,2f [,9] [,+]
[1,] 3 3 o o
l2,l 3 3 o o
[3,] o o o o
[+,] o o o o
SPLUS > 6*13
[, t] l,2l [ , 3] [ ,4]
[r,] g -3 o o
l2,f -3 3 o o
[9,] o o o o
[4,]0000SPLUS > 6*14
[,1] l,Zl [,3] [,4]h,l o o o o
Lz,f o o o o
[9,] o o o o
[+,]oooBSPLUS > 6*15
[,t] l,zf [,8] l,+fh,l o o o o
l2,f o o o o
[9,]oo48-24[4,] o o -24 16
SPLUS > 6*16
[,r] l,z7 [,s] [,4]
172
[t,]lz,l[3,][+, ]
13
11
0
0
11
13
0
0
0
0
0
0
0
0
0
0
Finally, for the standard design, rve can obtain the information matrices using the same
design matrix and applying the idempotent matrices given after (6.4.2):
# Set up basic matrices
12(-natrix(c(f , O,O, 1),nrow=2)
J2(-natrix(c(1, 1, 1, t),nrow=)) /2
K2<-r2-J2
r3(-natrix(c(1, 0, 0,0, 1,0,0,0, 1),nrow=3)
J3(-natrix (rep ( 1, 9), nrow=3) /3
K3<-I3-J3
# Set up idenpotents
Q1(-kron (lZ,Xg, JZ)
Q2(-kron (lz,.lg,I2) -kron (J2,J3 ,J2)
Q3(-kron (Iz, xg, I2) -kron (Jz,l<s, lz)# Fix up design natrixtm(-scal e (tn, scale=F)
onec( -matrix (rep ( 1 , nc) , nrow=nc)
# Calculate infornation matrices
t(-kron(onec,tm)
rsl<-t (t) '/^*'/, QI '1"*'A tIs2<-t (t) '/^*'/, q2 '/,*'A tIs3<-t (t) L*'A Q3 '/"*'1, tSPLUS > round(6'*Is1)
[t,] 16 8 o o
lz,) I 16 o o
[9,] o o o o
l+,1 o o o o
SPLUS > round(6*Is2)
[, 1] l,2l [, s] [, +]
173
[t,] o o o
lz,f o o o
[¡,] o o o
[+,] o o o
SPLUS > round(6xIs3)
[,1] l,Zf [,S]
[t,] o o o
lz,l o o o
[¡,] o o 48
[4,] o o -24
0
0
0
I
[,4]0
0
-24
16
174
Bibliography
AenyasnxnnR, S. & CunNow, R. N. (1984). The desirability of adjusting for residual
effect in a crossover designs. Biometrics 4O,1071-1078.
Anultaco, A. P. & HIlls, M. (1982). The two-period crossover trials. Statistician
31,118-131.
BRLRRM, L. N. (1968). A two-period design with f2 experimentalunits. Biornetrics24,
61 73.
BRRxnn, N., Hows, R. J., & PoIoNIECKI, J. (1982). The two-period crossover trials.
Bull. in Appl. Statist.9, 67-116.
Bnowu, B. W. (1980). The crossover experiment for clinical trials. Biometrics 36,
69-79.
Cunssnru, J. B. (1964). On the analysis of simple cross-over with unequal numbers of
replicates. Biometrics 20, 206-208.
CHt, E. M. (1991). Recovery of inter-block information in cross-over trial. Statistics in
Medi,cine LO, ILLS-II??.
CocsRRN, W. G. (1939). Long-term agriculturalexperiments (with discussion) . Journal
of the Royal Statistical Society, B, 6,104-48.
CocHRnN, W. G., AutRov, K. M., & CauNoN, C. Y. (1941). A double change-over
design for cattle feeding experiments. Journal of Dairy Science. 24,937-5I.
CoRlrnll, R. G. (1980). Evaluation of bioavailability data using non-parametric statis-
tics, in Drug Absorption and Disposition. Statistical Considerations (ed. K. S. Albert),
American Pharmaceutica Association pages pp. 51-7.
Essull, A. F. (1984). Three-period crossover designs for two treatments. Biometrics
40,219-224.
FRRBr,vpl,L, V. T. (1985). Some remarks on the analysis of cross-over trials with a binary
response. Applied Statistics 34, I2l-I28.
FnonRnR, W. T. & ArrlxsoN, G. F. (1964). Tied-double-change-over designs. Bio-
1,75
metrics 20, 168-181.
FInNnv, D. J. (1978). Statistical Method in Biological Assay. Griffin, London.
FrnrcHoR, D. J. (1987). A nerv class of change-over designs for factorial experiments.
Biometrika 74, 649 654.
FRBBlrnN, P. R. (1989). The performance of the two-stage analysis of trvo-treatment,
trvo.period crossover trials. Statistics in Medicine 8,1421-39.
GRRT, J. J. (1969). An exact test for comparing matched proportions in crossover
designs. Biometrikø 56, 75-80.
GoucH, K. (1989). Leiter to the editor. Statistics in Medicine 8,891-892.
Gnlnvn, A. P. (1985). A Bayesian analysis of the two-period crossover designs for
clinical trials. Biometrics 41, 979-990.
GRtnvo, A. P. (1986). Corrigenda to Grieve(1985). Biometrics 42,459.
Gnlevo, A. P. (1994). Extending a Bayesian analysis of the two-period crossover to
allow for baseline measurements. Statistics in Medicine 13, 905-929.
Gntzzrø, J. E. (1965). The two-period change-over design and its use in clinical trials.
Biometrics 2L, 467-80.
Gntzzrø, J. E. (1974). Corrigenda to Grizzle (1965). Biometrics 30,727.
HRrunR, K. B. & Kocs, G. G. (1988). Some analysis strategies for three-period
changeover designs with two treatments. Statistics in Medicine 7,477-487.
HRcBI¡RRRS, J. A. (1978). Ag., cohort and period: A general model for the analysis of
social change. The Netherlands' Journal oJ Soci,ology 14, 59-91.
Heonvnt, A. & Ap'snRtxpJAD, K. (1978). Repeated measurement designs,,II. Annals
of Statistics 6, 619-28.
Hllls, M. & ARulrncn, P. (1979). The two-period cross-over clinical trial. British
Journal of Clinical Phurmacology 8,7-20.
Jnutns, A. T. & WIIxINSoN, G. N. (1971). Factorization of the residual operatonr
and canonical decomposition of nonorthogonal factors in the analysis of variance.
Biom etrika 68, 27 9-294.
JouNsoN, N.L. k Korz, S. (1970). Continuous Uniuariate Distributions - 2. Houghton
Miffiin, London.
JoNps, B. & KouwRRo, M. G. (1987). Modelling binary data from a three-period
crossover trial. Statistics in Medicine 6, 555-564.
JoNns, B. & KsNwnRo, M. G. (1989). Design and Analysis of Cross-ouer Trials.
776
Chapman and Hall, London, Nerv York.
Kexr,vRRo, M. G. & JoNBs, B. (1987a). The analysis of data from 2 x2 cross-over
trials with baselines measurements. Statistics in ù[edicine 10, 911-26.
KnxwnRo, M. G. & JoNes, B. (1987b). A log-linear model for binary cross-over data.
Applied Statistics. 36,792 204.
KnRssuBR, R.P. & FnnnnnR, W. T. (1981). Two-treatment crossover designs for
estimating a variety of effects. Journal of the American Statistical Association 76,
612-18.
Koctt, G. G. (1'972). The use of non-parametric methods in the statistical analysis of
the two-period change-over design. Biometrics 28, 901-914.
Kocu, G. G., AMARA, I.4., BRowN, B. W., Colrox,T., & GrlllNcs, D. B.
(1989). A two-period crossover design for the comparing of tlvo active treatments and
placebo. Statistics in Medicine 8, 487-504.
KRzn¡¡owsKI, W. J. & M,qnRlorr, F. H. C. (1994). Multiuariate Analysis. Part 1:
Distribution, ordering and inJerence. Edward Arnold, New York - Toronto.
KueenR, L. L., JnNls, J. M., SalRun, C, N., GRnoNnERG, y. G., & GRnoNnnRc,
B. G. (1983). Age-period-cohort analysis: An illustration of the problem in assessing
in one observation per cell data. Communication in Statistics -Theory and Method
L2,2779-2807.
Lnlnn, N. M., SxtilNnR., J., & KBNwRRI, M. G.(1992). An analysis of two-period
crossover designs with carryover effects. Statistics in Medicine LL,1967-1979.
Llsxl, E. & MntsNeR, M. (1935). A variational approach to optimal two-treatment
crossover designs: application to carryover effect models. Journal oJ the American
Statistical Associati,on 80, 704-710.
LRsxR, E., MEISNER, M., & KusuNER, H. B. (lgs3). optimal crossover designs in
the presence of carryover effect. Biometrics 39, 1087-91.
LRssnRvp, B. W. (1991). Determination of optimal designs using linear models crossover
trials. Statistics In Medici,ne 36,909-924.
LRtv, M. G. (1987). A Bayesian analysis of higher-order cross-over designs. Unpublished
MSc dissertation University of Kent at Canterbury, 18-25.
LevnRo, M. W. & AnvBsEN, J. N. (1978). Analysis of Poisson data in crossover
experimental designs . Biometrics 34, 42I-428.
Lpr,vts, J. A. (1983). Clinical trials:statistical developments of practical benefit to the
177
pharmaceutical ìndustry (with discussion) . Journal of the Royal Statistical Society. A
146, 362-393.
LINotev, D. V. & SptlrH, A. F. \,I. (1972). Bayes Estimates for the Linear Nfodel(with
discussion). Journal of the Royal Statistical Society B 34,1-41.
Matuer,vs, J. N. S. (1988). Recent Ddevelopments in Crossover Designs. International
Statistical Reuiew 56, 777 727.
MItr,rxBN, G,A.& JoHNsoN, D. E. (1984). Analysis of Messy Data, Volume 1:
Designed Erperimenús. Van Nostrand Reinhold, New York.
PRRxns, K. R. (1982). Occupational stress among student nurses: a natural experiment.
Journal of Applied Psychology 67, 784-796.
PnrBL, H. L (1983). Use of of baseline measurements in the two-period cross-over design.
Communication in Statistics - Theory and Methods t2,269-2712.
Pl:rtl-, V. H. (1965). Approximation to the Behrens-Fisher distributions. Biometrika
62,267-271.
PRBscott, R. J. (1981). The comparison of success rates in cross-over trials in the
presence of an order effect. Applied Statistics 30, 9-15.
Rncrrue, 4., GRrevE, A. P., Fluur.ER, H., & Srr,lrru, A. F. M. (1986). Bayesian
methods in practice; experiences in the pharmaceutical industry. Applied Statistics
35, 93-150.
RncgRvRRAo, D. (1989). Crossouer designs in industry, in Design and Analysis of
Erperirnents with Applications to Engineering and Physical Sciences (ed. Ghosh),.
Marcel Dekker, New York.
Rno, C. R. (1973). Linear Statistical Inference andits Application.Ùnd edition. Wiley,
New York.
ScHRte, K. K. (1965). A general model for the study of development problern. Psycho-
logical Bulletin 64, 92-707.
SelwvN, M. R., DnutlstER, A. R., & HILL, N. R. (1981). A Bayesian approach to
bioequivalence for the 2 x 2 change-over design. Biometrics 37, 11-21.
SBNN, S. J. (1988). Letter to the editor: Cross-over trials, carry-over effects and the art
of self-delusion. Statistics in Medicine 7.
SpN¡{, S. J. (1991). Problem with the two stage analysis of crossover trials. British
Journal of Clinical Pharmacology 32,I33.
VnNnslps, W. N. & RnzRel, S. (1996). Splus Program for James and Wilkinson's
178
Theorem.
Wnllp¡,¡srElN, S. (1979). Inclusionof baselinevaluesintheanalysisof crossoverdesigns
(abstract). Biornetrics 35' 894.
WnllnNsrprN, S, & FlsnER, A. C. (1977). Analysis of the two-period repeated mea-
surements crossover design with application to clinical trials. Biometrics 33,261-269.
Wrllnx, A. R. (1988). Using the maximum test statistic in the two-period crossover
clinical tr\al. Biometrics 44, 2ll-78.
\MrllaN, A. R. & PntnR, J. L. (1986). Using baselinemeasurementsin the two-period
crossover clinical ftial. British Journal of Clini,cal PharmacologyT,232-289.
Zrrr,rrr.,lBRlr¿ANN, H. & RRHlrs, W. (1978). Model building and testing for the change-
over design. Biornetrical Journal 22.' 197-220.
179