This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

An optimal multi-period crossover design for an application in paediatric nephrology

J.N.S. Matthews
School of Mathematics & Statistics
Newcastle University
Newcastle upon Tyne NE1 7RU, UK
john.matthews@ncl.ac.uk
Abstract

Crossover clinical trials can provide substantial benefits by eliminating inter-patient variation from treatment comparisons and by allowing multiple observations of each patient. They are particularly useful when sample sizes are necessarily small. These advantages proved particularly valuable in an assessment of clot prevention in children undergoing haemodialysis. Only small numbers of children are treated at any given time in any single unit, but each patient is obliged to attend two or three times each week, suggesting the use of a crossover trial with many periods. Standard crossover trials described in the literature a) typically have fewer than 10 periods and b) are based on a model of questionable applicability to this study. This paper describes the derivation of an optimal crossover trial with 30 periods which was used to compare the treatments using nine patients.

Key words: Crossover trial; Crossover design; optimal design; paediatric nephrology.

1 Introduction

Children undergoing regular haemodialysis for renal failure need to attend the dialysis unit (DU) two or three times per week. On each visit the dialyser is connected to the patient’s circulation. To achieve this the patient usually has a surgically inserted venous central line which remains in place permanently and is accessed on each visit to the DU. During the interdialytic period the blood in the lumen of the line may clot. Unclotting a line is a time-consuming problem and if it happens regularly then the central line will have to be re-sited. Re-siting is both unpleasant for the patient and undesirable, as it is a procedure that cannot be repeated indefinitely because of the limited number of sites which can be used for this purpose. For these patients failure to achieve reliable venous access has dire consequences. Consequently, a small amount of anticoagulant (a “lock”) is injected into the lumen of the central line at the end of each dialysis session in order to prevent clotting. The usual anticoagulant is heparin (H). This article is concerned with design issues that arose in a trial to compare heparin with alteplase (A), an alternative anticoagulant.

The dialysis treatment of choice for children with renal failure is peritoneal dialysis, but there are a few patients for whom this is unsuitable, or for whom peritoneal dialysis is no longer viable. Consequently, in a paediatric population there are only a few children attending for haemodialysis at any centre: fewer than ten patients in a UK regional centre at any given time is typical. Moreover, differences in treatment protocols between centres mean that a multicentre study would be difficult to run. However, each patient who does attend the DU is obliged to do so three (or, for occasional patients, two) times a week. Therefore information on the comparison between A and H can best be obtained using a crossover design, as this allows each patient to provide numerous observations, and at the same time improve the precision of the estimated comparison between A and H by eliminating inter-patient differences. The trial will measure the weight of clot in the blood withdrawn from the central line at the start of each dialysis session. As this withdrawal of blood is part of the usual clinical procedure to flush the line prior to dialysis, it imposes no burden on the patient and as such can repeated many times.

Numerous crossover designs are available in the literature (Jones and Kenward, 2003, chapters 4 & 5) but designs with more than ten periods are unusual in most areas of application and almost unheard of in clinical settings. Moreover, what optimality properties these designs possess are usually based on one of a handful of rather general models which may not be the most appropriate choice for the current trial. The next section considers the technical issues which arise in developing a design for this study, in particular the choice of a realistic statistical model for this application and its implications for design selection. In Section 3 an optimal design is derived for this model and in Section 4 some practical issues are discussed.

2 Choice of design and statistical model

To avoid causing practical difficulties on the DU it was decided to run the trial for ww weeks, so for most patients a 3w3w-period design is required but for those patients who need to attend only twice per week, a 2w2w-period design is needed. This is the first non-standard feature of the design problem: almost all “off-the-shelf” designs assume that each experimental subject is observed equally often.

The second issue is that most available designs assume that the response observed on subject ii in period jj, yijy_{ij}, follows the model:

yij=μ+ξi+πj+τd(i,j)+ϵij,y_{ij}=\mu+\xi_{i}+\pi_{j}+\tau_{d}(i,j)+\epsilon_{ij}, (1)

where 1in1\leq i\leq n and 1jp1\leq j\leq p. Here μ\mu is a general mean; ξi\xi_{i} is some form of subject effect, either fixed or random; πj\pi_{j} is the effect of period jj; τt\tau_{t} is the effect of treatment tt and d(i,j)d(i,j) is the treatment allocated to subject ii in period jj, and ϵij\epsilon_{ij} are independent residuals with a common variance. More specialised designs specific to the crossover setting have been derived for a model which explicitly acknowledges the possibility of the effect of a treatment carrying over to the following period, namely

yij=μ+ξi+πj+τd(i,j)+γd(i,j1)+ϵij.y_{ij}=\mu+\xi_{i}+\pi_{j}+\tau_{d}(i,j)+\gamma_{d(i,j-1)}+\epsilon_{ij}. (2)

where γt\gamma_{t} is the carryover effect of tt and we set γd(i,0)=0\gamma_{d}(i,0)=0: see (Jones and Kenward, 2003, chapters 4 and 5) for details.

For our purposes the former model (1) is more relevant because the effects of the anticoagulants will not carry over to subsequent periods. This is because only a small quantity of anticoagulant is used at the end of each dialysis session. The volume is just sufficient to fill the lumen of the central line and it largely stays in situ until the following session. Moreover, at the start of the next session the line is aspirated, thereby removing most of the anticoagulant; any small amount left in the line will be flushed out in the course of the dialysis. Systemic effects due to small quantities of anticoagulant diffusing from the lumen into the circulation will be negligible because i) the volume of anticoagulant involved will be very small compared with the volume of the circulation and ii) the agents will be metabolised long before the subsequent dialysis session.

Optimal designs for model (1) are row-column designs such as Latin square and Youden designs, in which each treatment occurs equally often on each subject and in each period (Shah and Sinha, 1989, Chapter 4). These conditions are rather restrictive in that they impose conditions on the divisibility of pp and nn by the number of treatments. Moreover, they arise from the very general form of period effect allowed by (1), where a separate parameter is specified for each period.

In the present application the condition that each treatment, i.e. A and H, should be applied equally often to each patient is unlikely to be troublesome. However, having A and H occur equally often in each period is potentially more troublesome to arrange. The model in (1) goes back at least to animal feeding trials (Cochran et al., 1941) where the appropriateness of the period effect would have been much clearer. Its widespread adoption in the theory of crossover designs has often had a precautionary flavour - analysts feeling that designs needed to be protected again the possible untoward influences of non-specific period effects (Matthews, 1994a, b), rather than that there was any clear-cut reason for the form adopted in (1).

In fact, the study of aspects of dialysis is an area where the inclusion of period effects in the statistical model can be justified. As pointed out by Matthews and Hoenich (1994), many variables which can be measured during or following dialysis will be affected by the point in the dialysis cycle when the observation is made. Most DUs operate a cycle of dialysis sessions whereby a patient attends on Mondays, Wednesdays and Fridays (or on Tuesdays, Thursdays and Saturdays but for definiteness we will assume the former, as the alternatives are equivalent). On Mondays, variables such as the size of clots, or the amount of urea available for removal from the blood will have accumulated over the three days since attending the previous Friday, so can be expected to be different from the corresponding quantity on a Wednesday or Friday, as this will have accumulated over only two days.

We therefore decided to adopt the following model for the expected weight of clot aspirated from a patient ii who attends the DU \ell times per week. In period j=1,,wj=1,\ldots,\ell w.

𝖤(yij)=τd(i,j)+π(i,j)+ξi,\mathsf{E}(y_{ij})=\tau d(i,j)+\pi_{\ell}(i,j)+\xi_{i}, (3)

where d(i,j)=1d(i,j)=1 if H is allocated to patient ii in period jj and -1 if A is allocated. For patients attending three times per week

π3(i,j)\displaystyle\pi_{3}(i,j) =\displaystyle= π1,if j is a Monday\displaystyle\pi_{1},\text{if }j\text{ is a Monday}
=\displaystyle= π2,if j is a Wednesday\displaystyle\pi_{2},\text{if }j\text{ is a Wednesday}
=\displaystyle= π3,if j is a Friday.\displaystyle\pi_{3},\text{if }j\text{ is a Friday.}

Note that the observation yijy_{ij} is the weight of clot aspirated following allocation of treatment d(i,j)d(i,j) but this is only obtained when the lumen is aspirated at the following dialysis session. Therefore, for example, the observation yi,Fridayy_{i,\text{Friday}} which relates to the treatment allocated on the Friday, is not obtained until the following Monday. Given that the interval between Wednesday and Friday is the same as that between Monday and Wednesday, a possible simplification of (2) is to set π1=π2\pi_{1}=\pi_{2}. We do not adopt this slightly more prescriptive form as it transpires that this extra restriction provides no important practical advantage in the specification of the design.

For those patients required to attend only on Mondays and Fridays we replace π3(i,j)\pi_{3}(i,j) by π2(i,j)\pi_{2}(i,j) to accommodate the different attendance schedule, namely

π2(i,j)\displaystyle\pi_{2}(i,j) =\displaystyle= π4,if j is a Monday\displaystyle\pi_{4},\text{if }j\text{ is a Monday}
=\displaystyle= π3,if j is a Friday.\displaystyle\pi_{3},\text{if }j\text{ is a Friday.}

The parameter π3\pi_{3} is common between π2(i,j)\pi_{2}(i,j) and π3(i,j)\pi_{3}(i,j) as both groups of patients attend on Monday having attended on the preceding Friday.

The final aspect of the model which needs consideration is the residual term ϵij=yij𝖤(yij)\epsilon_{ij}=y_{ij}-\mathsf{E}(y_{ij}). This represents the within-patient variation and these terms will be assumed to be independent with common variance σ2\sigma^{2}. The assumption of independence is questionable when each individual is observed serially over time: we will briefly consider the possible effect of dependence on the design calculations in the next section. It is also possible that the assumption of homoscedasticity will prove unfounded, although we would aim to address such issues by suitable transformation of the yijy_{ij}.

3 Optimality calculations

We suppose that N3N_{3} patients dialysed three times per week and that N2N_{2} patients dialysed twice weekly will be recruited, providing a total of m=3wN3+2wN2m=3wN_{3}+2wN_{2} observations. The aim is to determine the best treatment sequences to use to define the d(i,j)d(i,j). A good review of this area can be found in Stufken (1996). There are a variety of optimality criteria which can be used but because we are comparing only two treatments these all coincide and optimal designs are chosen to minimise the variance of the estimator of τ\tau.

3.1 General conditions

In order to obtain an optimal design suppose that the design matrix, XX, implied by (3) is written as (AB1B2)(A\mid B_{1}\mid B_{2}) where B1B_{1} corresponds to the period parameters, B2B_{2} corresponds to the patient parameters ξi\xi_{i} and AA is the m×1m\times 1 matrix which describes the treatment allocation. In the following we use ‘information matrix’ to denote the inverse of the dispersion matrix, even though no likelihood is defined by the model in (1). If we write 𝒫(X)=X(XTX)XT\mathscr{P}(X)=X(X^{T}X)^{-}X^{T} for the orthogonal projection onto the column space of XX and 𝒫(X)=I𝒫(X)\mathscr{P}^{\perp}(X)=I-\mathscr{P}(X), then the information matrix for τ\tau can be written as

1=σ2AT𝒫([B1B2])A\mathscr{I}_{1}=\sigma^{-2}A^{T}\mathscr{P}^{\perp}([B_{1}\mid B_{2}])A (6)

where [B1B2][B_{1}\mid B_{2}] is XX with the column for AA omitted. Direct evaluation of this matrix is cumbersome because the number of columns in B2B_{2} increases with the number of patients. An alternative approach uses an identity for projection matrices

𝒫([B1B2])=𝒫(B1)𝒫(𝒫(B1)B2),\mathscr{P}^{\perp}([B_{1}\mid B_{2}])=\mathscr{P}^{\perp}(B_{1})-\mathscr{P}(\mathscr{P}^{\perp}(B_{1})B_{2}), (7)

which was first applied to crossover designs by Kunert (1983). The information matrix for τ\tau in the model which omits patient effects from (3) is

2=σ2AT𝒫(B1)A.\mathscr{I}_{2}=\sigma^{-2}A^{T}\mathscr{P}^{\perp}(B_{1})A. (8)

Using (7) it can be seen that 210\mathscr{I}_{2}-\mathscr{I}_{1}\geq 0 unless AT𝒫(𝒫(B1)B2)A=0A^{T}\mathscr{P}(\mathscr{P}^{\perp}(B_{1})B_{2})A=0, in which case they coincide. Here AB0A-B\geq 0 would in general mean that ABA-B was positive semi-definite, but in the present application it is a simple inequality as 1,2\mathscr{I}_{1},\mathscr{I}_{2} are scalars. This equation requires that AA should be orthogonal to the columns of 𝒫(B1)B2\mathscr{P}^{\perp}(B_{1})B_{2}, which amounts to:

ATB2=AT𝒫(B1)B2.A^{T}B_{2}=A^{T}\mathscr{P}(B_{1})B_{2}. (9)

Therefore, if we find a design which maximises the information for τ\tau in the model with patient effects omitted and if this design also obeys (9), then the design will be optimal for the full model (3).

3.2 Detailed calculations

In the model (3) AA is an m×1m\times 1 vector, B1B_{1} is an m×4m\times 4 matrix and B2B_{2} is an m×(N2+N3)m\times(N_{2}+N_{3}) matrix. In order to use the results outlined above we need expressions for ATB2A^{T}B_{2}, AT𝒫(B1)B2A^{T}\mathscr{P}(B_{1})B_{2} and AT𝒫(B1)AA^{T}\mathscr{P}^{\perp}(B_{1})A.

ATB2A^{T}B_{2} is a 1×(N2+N3)1\times(N_{2}+N_{3}) vector, where the iith element refers to patient ii and is the number of times patient ii receives H minus the number of times they receive A. The matrix AT𝒫(B1)B2A^{T}\mathscr{P}(B_{1})B_{2} is a 1×(N2+N3)1\times(N_{2}+N_{3}) matrix where each element has one of two values: qTRP3q^{T}RP_{3} for patients treated three times per week and qTRP2q^{T}RP_{2} for patients treated twice a week. Here

qT=(qM3qW3qF3+qF2qM2),q^{T}=\begin{pmatrix}q^{3}_{M}&q^{3}_{W}&q^{3}_{F}+q^{2}_{F}&q^{2}_{M}\end{pmatrix},

where, e.g., qMq^{\ell}_{M} is the number of times H was allocated on a Monday minus the number of times A was allocated on that day among patients dialysed \ell times per week: corresponding differences for Wednesdays and Fridays have the subscripts WW and FF respectively. Further definitions and derivations are in the Appendix.

The results in the Appendix, specifically equations (10) and (11), can be combined to show that the information on τ\tau in the reduced model (8) is

σ2AT𝒫(B1)A\displaystyle\sigma^{-2}A^{T}\mathscr{P}^{\perp}(B_{1})A =\displaystyle= σ2(mqTRq)\displaystyle\sigma^{-2}\left(m-q^{T}Rq\right)
=\displaystyle= σ2[m1w((qM3)2N3+(qW3)2N3+(qF3+qF2)2N3+N2+(qM2)2N2)].\displaystyle\sigma^{-2}\left[m-\frac{1}{w}\left(\frac{(q^{3}_{M})^{2}}{N_{3}}+\frac{(q^{3}_{W})^{2}}{N_{3}}+\frac{(q^{3}_{F}+q^{2}_{F})^{2}}{N_{3}+N_{2}}+\frac{(q^{2}_{M})^{2}}{N_{2}}\right)\right].

A design with q=0q=0, i.e. with equal replication of A and H on Mondays and Wednesdays within the thrice-weekly patients and equal replication on Mondays within the twice-weekly patients and, in addition, with qF3+qF2=0q^{3}_{F}+q^{2}_{F}=0, will maximise AT𝒫(B1)AA^{T}\mathscr{P}^{\perp}(B_{1})A. Such a design would automatically mean that AT𝒫(B1)B2=0A^{T}\mathscr{P}(B_{1})B_{2}=0. Therefore any such design for which ATB2A^{T}B_{2} also vanishes, i.e. each patient receives A and H the same number of times, will be optimal for the model (3). Although such designs could arise from cases where neither qF2q_{F}^{2} nor qF3q_{F}^{3} was zero, in practice it will be easier to ensure qF2+qF3=0q_{F}^{2}+q_{F}^{3}=0 by looking for designs where qF3=qF2=0q_{F}^{3}=q^{2}_{F}=0.

3.3 Design Construction

An optimal design will provide an estimator of τ\tau that has variance σ2/(w[3N3+2N2])\sigma^{2}/(w[3N_{3}+2N_{2}]) and this can be used to determine the length of the trial. The number of patients of each type (i.e. attending two- or three-times weekly) available for the study will be known at the outset. If the usual requirements of a sample size calculation, namely an estimate of σ2\sigma^{2}, and of the treatment difference to be detected, 2τ02\tau_{0}, then the number of weeks, ww, needed to give a specified power and size for the associated hypothesis test can be calculated. In the present application the precise length of the trial is not critical, so the construction of suitable designs will be expedited if ww is rounded up so that ww is even.

A clinically important difference between H and A was thought to be 10mg, giving τ0=5\tau_{0}=5, and some pilot data suggested a planning estimate for σ\sigma of 22mg. For a two-sided 5% test with 80% power this gives (τ0/σ)m=1.96+0.84=2.8(\tau_{0}/\sigma)\sqrt{m}=1.96+0.84=2.8, leading to m152m\approx 152. At the planning stage there were four patients being treated three times per week and two twice-weekly, so m=16wm=16w and hence a trial with w=10w=10, i.e. lasting 10 weeks, is indicated.

The task is to determine appropriate sequences so that the resulting design obeys the conditions that ensure the design is optimal. An optimal design for patients who are treated three times per week can be constructed as follows. Consider the set of four 3-sequences S3={AAA,AAH,AHH,AHA}S_{3}=\{\text{AAA},\text{AAH},\text{AHH},\text{AHA}\}: these are all the sequences of As and Hs of length three starting with an A. The dual of any element of S3S_{3} is the sequence with As and Hs interchanged. Any of these sequences could be used to specify the treatments to be allocated in a particular week. For a trial with ten weeks we need to select ten of the sequences in an appropriate way. We proceed in two stages. Suppose the sequences are written as in Table 1 with A-E allocated to the first five weeks and the corresponding lower-case labels to the last five weeks. We randomly choose an element of S3S_{3} and assign this to be sequence AA and then assign the dual of AA to aa. Repeat for the remaining B-E. In the second stage, the sequences AA to ee are randomly permuted - e.g. as in the second row of table 1.

This construction ensures i) each patient receives each of A and H 3×(12w)3\times(\tfrac{1}{2}w) times and that each of A and H is used equally often on each of the days Monday, Wednesday, Friday. Clearly this method could be used for any even value of ww. Design sequences for patients treated twice-weekly can be constructed analogously, starting from the sequences S2={AA,AH}S_{2}=\{AA,AH\}. A design comprising arbitrary numbers of patients treated two- and three-times weekly constructed according to this algorithm will give q=0q=0 and equal replication within each patient and hence will be an optimal design.

AA BB CC DD EE aa bb cc dd ee
\downarrow
Randomly permute
\downarrow
CC BB dd AA bb EE cc DD aa ee
Table 1: Weekly sequences for the trial

3.4 Dependent errors

The error terms ϵij\epsilon_{ij} are assumed to be independent, even within patients. The nature of the process under investigation in the trial, namely properties of a small volume of fluid in the lumen of an intravenous line that is flushed with many times of its own volume between each observation, makes this assumption less questionable than might often be the case with serial observations. However, if a simple adaptation can make the design robust to dependence in the error term, then it would be prudent to use it.

The optimal design of two-treatment crossover designs with autocorrelated error was addressed by Matthews (1987), who found that for a model with no carryover term γij\gamma_{ij}, designs with rapidly changing allocations were optimal if the autocorrelation in the residual term were positive. For positive autocorrelation, and in the present application, this would suggest that the sequence AHA and its dual are slightly preferable to AHH and AAH, and certainly to be preferred to AAA, and its dual. The algorithm given in Section 3.3 did not specify the probabilities with which elements of S3S_{3} are chosen. We therefore decided to choose the elements of {AAA,AAH,AHH,AHA}\{\text{AAA},\text{AAH},\text{AHH},\text{AHA}\} with respective probabilities {110,15,15,12}\{\tfrac{1}{10},\tfrac{1}{5},\tfrac{1}{5},\tfrac{1}{2}\}. This gives a preponderance of rapidly alternating treatments in the design, while maintaining a range of sequences. For patients treated twice per week the elements of S2S_{2} can be chosen with probabilities {15,45}\{\tfrac{1}{5},\tfrac{4}{5}\}. A reasonable range of sequences is desirable if the investigator wishes to leave open the possibility of performing a randomization analysis of the data, in addition to a model-based analysis.

4 Data analysis and some practicalities

Although planning for the study assumed four patients treated three times per week and two twice-weekly, by the time the trial started, seven patients treated thrice-weekly, i.e. N3=7N_{3}=7 and two treated twice weekly, N2=2N_{2}=2, were available. Full details of the study are reported in Gittins et al. (2007).

There were some difficulties in the conduct of the trial. The most important of these was that although the allocation to the treatment for the final period was done correctly, confusion on the DU meant that the corresponding observation of clot weight was missed for all thrice weekly patients. That this observation was to have been made at the start of August, when the junior medical staff change over, may have been relevant. In addition there are some isolated discrepancies: patient 8, one of the twice-weekly patients, missed their final four observations (three H, one A) due the availability of a kidney transplant. Patients 1, 2 and 7 fell short of the revised total of 29 observations: patient 2 missed three observations due to holidays, while patients 1 and 7 missed one each due to errors on the DU. Although any deviation from the planned trial is undesirable, given the complexity of the design this level is probably no more than should be expected. While the omission of a few observations will reduce the power of the study, the reasons for the missing observations should not induce any noticeable bias. Moreover, the study was planned on the basis of N3=4,N2=2N_{3}=4,N_{2}=2, but in fact N3=7,N2=2N_{3}=7,N_{2}=2 were recruited, so the power would in fact have been higher than planned.

Fitting the model (3) gave an estimate of the mean semi-treatment difference, τ^\hat{\tau}=2.79g, P=0.13, 95% confidence interval (-0.79,6.37), indicating a higher mean weight of clot on heparin compared with alteplase. A normal probability plot of residuals shown in Figure 1(a) indicates that the model assumptions were questionable. A randomization analysis based on 2000 re-randomizations of the treatment allocation using the scheme outlined in Section 3.3 gave P=0.123, confirming the model-based P-value.

Refer to caption
(a) Weight
Refer to caption
(b) log(Weight+0.1)
Figure 1: Normal probability plots of the residuals from model (3)

The probability plot in Figure 1(a) indicates that the data are skewed. This is shown, separately for each treatment, in Figure 2 where it is apparent that in a high proportion of occasions (30% H, 50% A) no clot was retrieved from the central line. A natural response is to skewed data is to take logs of the observations but with so many zeroes we need to use a transformation of the form log(yij+k)\log(y_{ij}+k). Inspection of the data showed that the range of non-zero clot weights retrieved was 0.6g to 146g (H) and 0.7g to 114g (A). Rather than use a formal approach to estimating kk, we chose k=0.1gk=0.1g. More formal approaches using a shifted Box-Cox transformation suggested a log transformation with k0.001gk\approx 0.001g. The estimate of the treatment effect (2τ2\tau, on log scale) becomes 0.830, P=0.013P=0.013, 95% (0.175,1.485) (k=0.1k=0.1) and 1.440, P=0.018P=0.018, 95% (0.250,2.631) (k=0.001k=0.001). This sensitivity to kk, while typical of data with many zeroes, is clearly undesirable.

Refer to caption
(a) Heparin
Refer to caption
(b) Alteplase
Figure 2: Histograms of clot weight by treatment

An alternative approach is to take the analysis in two stages. First the probability of a clot-free sample is assessed, and then the mean weights when clot is present are analysed. In both cases a generalized estimating equation (GEE) was used, with patient used to define the clustering variable. The first analysis gave an odds ratio of clotting as 2.34 times larger on H than A, P<0.001P<0.001, 95% confidence interval (1.24,4.42). The second analysis addressed the considerable remaining skewness in the non-zero clot weights by fitting a GEE with gamma variances and a log link. This indicated that mean clots weights were 1.86 times heavier using H than A (P<0.001P<0.001), 95% confidence interval (1.45,2.39).

Heparin Alteplase
Patient No. of observations % no clot No. of observations % no clot
1 13 46 15 67
2 14 29 12 58
3 14 7 15 40
4 14 43 15 93
5 14 43 15 20
6 14 21 15 33
7 14 7 14 36
8 7 43 9 67
9 10 20 10 20
Table 2: Data by patient: patients 1 to 7 treated three times per week, patients 8 and 9 treated twice per week

5 Discussion

This application required a crossover design with many more periods than is usually the case. While multi-period designs are available from families of designs already in the literature, these require balance across patients: for example by ensuring equal replication of treatments at each period. They also required that each patient be treated an equal number of times. In the present application the second of these requirements would have needed the trial to be extended for the twice-weekly patients which, while possible, would have been very inconvenient for the DU. Practical difficulties could arise with the requirement to balance over periods. Moreover, this would arise from the use of a model for period effects that is unnecessarily general. The use of the model (3) tailored to the application means that an optimal design can be constructed by ensuring that each patient follows a particular form of sequence, with no requirement to balance across patients. This is a helpful feature when implementing the design.

The optimality calculations made no distributional assumptions although it did make second-order assumptions, namely that the residual terms were uncorrelated and had constant variance. Even if these assumptions are untrue, randomization tests can provide a valid test of the hypothesis of no treatment effect. In the event, the data proved to be rather different from that anticipated, with many instances where no clot was formed, especially when alteplase was the used as the anticoagulant. A model-based analysis assuming Normal errors gave a PP-value very similar to that obtained from a randomization test. However, the conclusion of the analysis was substantially different if log(yij+k)\log(y_{ij}+k), rather that the clot weights yijy_{ij} were analysed. This suggests that there was also substantial heteroscedasticity in the data. Moreover, the analyses were sensitive to the value of kk, a parameter poorly estimated by the data. Analyses of data of this form will seldom be adequately summarised by a single paramter such as a mean or median, so something similar to the two-stage method used will usually be necessary.

A more difficult question to address is the extent to which the properties of the design depart from optimal now that it is clear that the original assumptions are untenable. Optimal design theory for crossover trials started with the model (2) and has developed by considering departures from this model in terms of the form of the carryover treatment term and the dependence structure of the residuals. There is no work on design theory for crossover trials with other forms of outcome, such as a binary variable. Designs that are optimal for the mixed binary and continuous outcome that we have pursued here would be extremely challenging to determine.

Moreover, the real issue is not how far the optimal design for the known outcomes differs from that used at the outset but could we have chosen a design that would have had good, but not necessarily optimal properties for a range of assumptions about the outcome. There are many papers going back over many years, which derive designs which have good properties over a range of assumptions (Cook and Nachtsheim, 1982; DuMouchel and Jones, 1994), including recent contributions which address the problems posed by generalized linear models (Woods et al., 2006; Dror and Steinberg, 2006). However, most of these contributions address robustness to mis-specification of the linear predictor or of the link function, rather than the distribution of the the outcome variable.

Although derived using optimal design theory, the present design might have been arrived at by more general considerations of orthogonality. The design used in this study is optimal because it ensures the treatment effect is orthogonal to the period and patient effects. Such a property can be shown to have formal advantages when the outcomes are uncorrelated with constance variance. However, the notion that each treatment occurs equally often on each patient and on each treatment day, has more general appeal. However, rigorous confirmation of the value of this design for the type of data that actually arose in this trial is elusive.

Acknowledgement

I am grateful to Drs Malcolm Coulthard and Nicola Gittins for allowing me to use their data in this paper.

References

  • Cochran et al. (1941) Cochran, W. G., K. M. Autrey, and C. Y. Cannon (1941). A double change-over design for dairy cattle feeding experiments. Journal of Dairy Science 24, 937–951.
  • Cook and Nachtsheim (1982) Cook, R. D. and C. J. Nachtsheim (1982). Model robust, linear-optimal designs. Technometrics 24, 49–54.
  • Dror and Steinberg (2006) Dror, H. A. and D. M. Steinberg (2006). Robust experimental design for multivariate generlized linear models. Technometrics 48, 520–529.
  • DuMouchel and Jones (1994) DuMouchel, W. and B. Jones (1994). A simple Bayesian modification of DD-optimal designs to reduce dependence on an assumed model. Technometrics 36, 37–47.
  • Gittins et al. (2007) Gittins, N. S., Y. L. Hunter-Blair, J. N. S. Matthews, and M. G. Coulthard (2007). Comparison of alteplase and heparin in maintaining the patency of paediatric central venous haemodialysis lines: a randomised controlled trial. Archives of Disease in Childhood 92, 499–501.
  • Jones and Kenward (2003) Jones, B. and M. G. Kenward (2003). Design and Analysis of Cross-Over Trials (2nd ed.). Boca-Raton: Chapman and Hall/CRCPress.
  • Kunert (1983) Kunert, J. (1983). Optimal design and refinement of the linear model with applications to repeated measurements designs. Annals of Statistics 11, 247–257.
  • Matthews (1994a) Matthews, J. N. (1994a). Modelling and optimality in the design of crossover studies for medical applications. Journal of Statistical Planning and Inference 42, 89–108.
  • Matthews (1994b) Matthews, J. N. (1994b). Multi-period crossover trials. Statistical Methods in Medical Research 3, 383–405.
  • Matthews (1987) Matthews, J. N. S. (1987). Optimal crossover designs for the comparison of two treatments in the presence of carryover effects and autocorrelated errors. Biometrika 74, 311–320: correction 75, 396.
  • Matthews and Hoenich (1994) Matthews, J. N. S. and N. A. Hoenich (1994). Statistical aspects of the design and analysis of studies to compare haemodialysis membranes. Nephrology, Dialysis, Transplantation 9S2, 176–183.
  • Shah and Sinha (1989) Shah, K. R. and B. K. Sinha (1989). Theory of Optimal Designs. Berlin: Springer-Verlag.
  • Stufken (1996) Stufken, J. (1996). Optimal crossover designs. In S. Ghosh and C. R. Rao (Eds.), Handbook of Statistics 13: Design and Analysis of Experiments, pp.  63–90. Amsterdam: North-Holland.
  • Woods et al. (2006) Woods, D. C., S. M. Lewis, J. A. Eccleston, and K. G. Russell (2006). Designs for generalized linear models with several variables and uncertainty. Technometrics 48, 284–292.

Appendix

The matrices B2B_{2} and B1B_{1} are

B2=(IN313w0(2wN3,N2)0(2wN2,N3)IN212w)=(IN313w00IN212w)B_{2}=\begin{pmatrix}I_{N_{3}}\otimes 1_{3w}&0(2wN_{3},N_{2})\\ 0(2wN_{2},N_{3})&I_{N_{2}}\otimes 1_{2w}\end{pmatrix}\qquad=\begin{pmatrix}I_{N_{3}}\otimes 1_{3w}&0\\ 0&I_{N_{2}}\otimes 1_{2w}\end{pmatrix}

where InI_{n} is the n×nn\times n identity matrix and 1n1_{n} is an n×1n\times 1 vector of ones, and 0(a,b)0(a,b) denotes an a×ba\times b matrix of zeroes. Also

B1=(PQ)=(1wN3I30(3wN3,1)0(2wN2,3)IwN2K)B_{1}=\begin{pmatrix}P\\ Q\end{pmatrix}=\begin{pmatrix}1_{wN_{3}}\otimes I_{3}&0(3wN_{3},1)\\ 0(2wN_{2},3)&I_{wN_{2}}\otimes K\end{pmatrix}

where

K=(0110)K=\begin{pmatrix}0&1\\ 1&0\end{pmatrix}

Therefore B1TB1=PTP+QTQB^{T}_{1}B_{1}=P^{T}P+Q^{T}Q can be written

(wN3I3000)+(000wN2KTK)=w(N3N3N3+N2N2)\begin{pmatrix}wN_{3}\otimes I_{3}&0\\ 0&0\end{pmatrix}+\begin{pmatrix}0&0\\ 0&wN_{2}\otimes K^{T}K\end{pmatrix}=w\begin{pmatrix}N_{3}&&&\\ &N_{3}&&\\ &&N_{3}+N_{2}&\\ &&&N_{2}\end{pmatrix} (10)

noting that KTK=I2K^{T}K=I_{2}.

The product B1TB2B_{1}^{T}B_{2} is

(PTQT)(IN313w00IN212w)=(PT(IN313w)QT(IN212w))\begin{pmatrix}P^{T}&Q^{T}\end{pmatrix}\begin{pmatrix}I_{N_{3}}\otimes 1_{3w}&0\\ 0&I_{N_{2}}\otimes 1_{2w}\end{pmatrix}=\begin{pmatrix}P^{T}(I_{N_{3}}\otimes 1_{3w})&Q^{T}(I_{N_{2}}\otimes 1_{2w})\end{pmatrix}

Evaluating this matrix requires calculation of products such as (1wN3TI3)(IN313w)(1^{T}_{wN_{3}}\otimes I_{3})(I_{N_{3}}\otimes 1_{3w}) and this is made easier if we note that, e.g., 13w=1w131_{3w}=1_{w}\otimes 1_{3}. We then obtain

B1TB2=w(1N3T(130)1N2T(0012))B_{1}^{T}B_{2}=w\begin{pmatrix}1^{T}_{N_{3}}\otimes\begin{pmatrix}1_{3}\\ 0\end{pmatrix}&1^{T}_{N_{2}}\otimes\begin{pmatrix}0\\ 0\\ 1_{2}\end{pmatrix}\end{pmatrix}

To evaluate ATB1A^{T}B_{1} let AT=(A3TA2T)A^{T}=(A^{T}_{3}\;\;A^{T}_{2}) where AA_{\ell} is the part of AA relating to patients undergoing dialysis \ell times per week. Then ATB1=A3TP+A2TQA^{T}B_{1}=A^{T}_{3}P+A^{T}_{2}Q and we then obtain

A3TP=(qM3qW3qF30)A^{T}_{3}P=\begin{pmatrix}q^{3}_{M}&q^{3}_{W}&q^{3}_{F}&0\end{pmatrix}

and

A2TQ=(00qF2qM2).A^{T}_{2}Q=\begin{pmatrix}0&0&q^{2}_{F}&q^{2}_{M}\end{pmatrix}.

Here

qM3=qM3(H)qM3(A)q^{3}_{M}=q^{3}_{M}(H)-q^{3}_{M}(A)

where qM3(H)q^{3}_{M}(H) is the number of times H is allocated on a Monday to a thrice-weekly patient, and qM3(A)q^{3}_{M}(A) is the corresponding count for alteplase: qW3()q^{3}_{W}(\cdot) and qF3()q^{3}_{F}(\cdot) refer to Wednesdays and Fridays, while qX2(Y)q^{2}_{X}(Y) are the corresponding quantities for patients who are dialysed twice-weekly. It follows that

ATB1=(qM3qW3qF3+qF2qM2)=qT, say.A^{T}B_{1}=\begin{pmatrix}q^{3}_{M}&q^{3}_{W}&q^{3}_{F}+q^{2}_{F}&q^{2}_{M}\end{pmatrix}=q^{T}\text{, say}. (11)

If we write R=w1diag(N31N31(N3+N2)1N21)R=w^{-1}\text{diag}\begin{pmatrix}N_{3}^{-1}&N_{3}^{-1}&(N_{3}+N_{2})^{-1}&N_{2}^{-1}\end{pmatrix}, then it follows that AT𝒫(B1)B2A^{T}\mathscr{P}(B_{1})B_{2} can be expressed as qTR(B1TB2)q^{T}R(B^{T}_{1}B_{2}). Each element of AT𝒫(B1)B2A^{T}\mathscr{P}(B_{1})B_{2} corresponds to a patient and each element is either qTRP3q^{T}RP_{3} for thrice-weekly patients or qTRP2q^{T}RP_{2} for twice weekly patients, where

P3=(1110)P2=(0011)P_{3}=\begin{pmatrix}1\\ 1\\ 1\\ 0\end{pmatrix}\qquad P_{2}=\begin{pmatrix}0\\ 0\\ 1\\ 1\end{pmatrix}