59
Interim Analyses of Clinical Trials A Requirement

Interim Analyses of Clinical Trials

  • Upload
    haile

  • View
    63

  • Download
    0

Embed Size (px)

DESCRIPTION

Interim Analyses of Clinical Trials. A Requirement. Outline. Background and how DSMBs arose and function Group sequential methods Examples. References . Ellenberg SS, Fleming TR, DeMets DL, Data Monitoring Committees in Clinical Trials, Wiley, 2002. - PowerPoint PPT Presentation

Citation preview

Page 1: Interim Analyses of Clinical Trials

Interim Analyses ofClinical Trials

A Requirement

Page 2: Interim Analyses of Clinical Trials

Outline

• Background and how DSMBs arose and function

• Group sequential methods

• Examples

Page 3: Interim Analyses of Clinical Trials

References • Ellenberg SS, Fleming TR, DeMets DL, Data Monitoring

Committees in Clinical Trials, Wiley, 2002.

• DeMets DL, Furberg CD, Friedman LM. Data Monitoring in Clinical Trials. A Case Studies Approach, Springer, 2006.

• Jennison C and Turnbull BW, Group Sequential Methods with Applications to Clinical Trials, Chapman and Hall, 2000.

• Proschan MA, Lan KKG, Wittes J, Statistical Monitoring of Trials. A Unified Approach, 2006, Springer.

• http://www.biostat.wisc.edu/landemets

Page 4: Interim Analyses of Clinical Trials

Structure for Cooperative Studies (Greenberg Report)

Policy Boardor

Advisory Committee

National Advisory Heart Council

InitialreviewgroupInstitute

staff

Executive Committeeor

Steering Committee

CoordinatingCenter

Participating Units

Cont Clinical Trials 9:137-48, 1988.

Page 5: Interim Analyses of Clinical Trials

Monitoring Committee Acronyms

• PAB = Policy advisory board

• DSMB = Data and Safety Monitoring Board

• DMC = Data Monitoring Committee

• ESMB = Efficacy and safety monitoring board

• OSMB = Observational study monitoring board

Page 6: Interim Analyses of Clinical Trials

Responsibilities

• Steering/Executive Committee/Protocol Team– Study design– Patient recruitment and follow-up– Data collection– Quality assurance– Review of external data– Study reports

• DMC or DSMB– Safety of patients– Protection of integrity of study– Review of blinded data on safety and efficacy of treatments– Review of trial conduct, amendments and external data

DMCs are responsible to patients, investigators IRBs, regulatory agencies and sponsor.

Page 7: Interim Analyses of Clinical Trials

Data Monitoring Rationale

• Accumulating data needs to be monitored for risk/benefit (Safety is best assured by comparing the rate of adverse events with a control group)

• Reasons:– Ethical : do not expose participants to an inferior

intervention longer than needed to test hypothesis– Scientific: assessment of relevance of question (e.g.,

external data), design assumptions, logistical problems.

– Economic: do not waste financial or human resources for a futile trial.

Page 8: Interim Analyses of Clinical Trials

Reasons for Early Terminationof Clinical Trials

• Based on accumulated data from the trial:– Unequivocal evidence of treatment benefit or harm– Unexpected, unacceptable side effects– No emerging trends and no reasonable chance of

demonstrating benefit

• Based on overall progress of the trial:– Failure to include enough patients at a sufficient rate– Lack of compliance in a large number of patients– Poor follow-up– Poor data quality

Page 9: Interim Analyses of Clinical Trials

Today

• All NIH sponsored clinical trials are required to have a data monitoring plan

• NIH-sponsored trials with clinical endpoints have a DSMB

• Many industry sponsored studies have a DSMB

• The FDA has prepared a guidance document (Establishment and Operation of Clinical Trial Data Monitoring Committees) http://www.fda.gov/RegulatoryInformation/Guidances/ucm127069.htm

• There is variation in operating procedures for DSMBs

Page 10: Interim Analyses of Clinical Trials

When is an Independent DSMB Needed

• Early phase studies– Monitoring usually at local level; independent DMC not

usually needed.

• Phase III & IV studies with morbidity/mortality outcomes; pivotal phase III trials

• Frail populations, e.g., children, elderly

• Trial with substantial uncertainty about safety, e.g., gene therapy

See FDA Guidance and ICH/E9, section 4.5.

Page 11: Interim Analyses of Clinical Trials

DSMBComposition: Multidisciplinary

• Clinical experts in the subject matter area

• Biostatisticians with expertise in clinical trials and preferably in the subject matter area

• Others depending on the nature of the study, e.g., ethicist, pharmacologist, patient advocate

Senior investigators without significant conflicts of interest

Page 12: Interim Analyses of Clinical Trials

Independence of DSMB:

• Voting members should not be part of the investigative team or work for the sponsor

• There should be a clear “need to know” policy for non-DSMB members, e.g., the statistician preparing interim summaries needs to know and may be an employee of the sponsor or member of the investigative team

• Members should state potential conflicts

This view is not shared by all. See Meinert CL and discussion, Cont Clin Trials, 1998

Page 13: Interim Analyses of Clinical Trials

Typical DSMB Meeting Format• Open Session

– Progress report using open data (no outcome data by treatment group)

– Sponsor, e.g., NIH, Executive Committee, Protocol Chairs, DSMB and unblinded statisticians

• Closed Session– Outcome data by treatment group (usually coded)– DSMB and unblinded statisticians only

• Executive Session (DSMB only)

• Debriefing Session– DSMB, Sponsor, Executive Committee, Protocol Chairs,

and unblinded statisticians

Page 14: Interim Analyses of Clinical Trials

DSMB Confidentiality

• Interim data reviewed by the DSMB must remain confidential

• Members must not share interim data with anyone outside DSMB

• Leaks can affect– Patient recruitment– Protocol compliance– Outcome assessment– Trial integrity and support

Page 15: Interim Analyses of Clinical Trials

DMC Recommendations

• Continue the study unmodified

• Modify the study protocol

• Terminate the study–Serious toxicity–Clear benefit–Futility–Design/logistical problems

Page 16: Interim Analyses of Clinical Trials

Outline

• Background and how DSMBs function

• Group sequential methods

• Examples

Page 17: Interim Analyses of Clinical Trials

DSMB Decision Making Can Be Complex

• Internal consistency• Benefit/Risk• External consistency• Current versus future patients• Clinical and public health impact• Statistical issues – monitoring guidelines

Page 18: Interim Analyses of Clinical Trials

Overall Probability of Achieving a Result with Given Nominal Significance of 0.05

After N Repeated Tests Under Ho

1 .052 .0833 .1074 .1265 .14210 .19325 .266

No. of Tests (N) Probability

Ref: McPherson, NEJM, 1974.

Page 19: Interim Analyses of Clinical Trials

Value of Nominal Significance Level Necessary to Achieve a True Level of 0.05

After N Repeated Tests

1 .052 .02963 .02214 .01835 .015910 .0107

No. of Tests (N)Significance Level

Which Should be Used

Ref: McPherson, NEJM, 1974.

Page 20: Interim Analyses of Clinical Trials

Early Work

• Acceptance sampling

• Wald (1947) sequential probability ratio test

Manufacturing problems, continuous monitoring of the data, no upper bound on sample size

Page 21: Interim Analyses of Clinical Trials

Group Sequential Methods

• Calculate a summary statistics (e.g., Z for logrank test) on each additional new group of participants (events)

• Compare the test statistic to a critical value that preserves overall type 1 error (e.g., 0.05).

Page 22: Interim Analyses of Clinical Trials

Critical Values (z) for 2-sided Group Sequential Design with .05 Overall

Significance and 7 Looks

Interim O-Brien/ Haybittle/Analysis Pocock Fleming Peto

1 2.49 5.46 3.02 2.49 3.85 3.03 2.49 3.15 3.04 2.49 2.73 3.05 2.49 2.44 3.06 2.49 2.23 3.07 2.49 2.06 1.96 (2.00)

Page 23: Interim Analyses of Clinical Trials
Page 24: Interim Analyses of Clinical Trials

Critical ValuesNo. of Looks

Look Pocock O’Brien-Fleming PetoZ P Z P Z P

2 1 2.178 .029 2.797 .005 3.290 .0012 2.178 .029 1.977 .048 1.962 .050

3 1 2.289 .022 3.471 .0005 3.290 .0012 2.289 .022 2.454 .014 3.290 .0013 2.289 .022 2.004 .045 1.964 .050

4 1 2.361 .018 4.049 .0001 3.290 .0012 2.361 .018 2.863 .004 3.290 .0013 2.361 .018 2.338 .019 3.290 .0014 2.361 .018 2.024 .043 1.967 .049

5 1 2.413 .016 4.562 .00001 3.290 .0012 2.413 .016 3.226 .0013 3.290 .0013 2.413 .016 2.634 .008 3.290 .0014 2.413 .016 2.281 .023 3.290 .0015 2.413 .016 2.040 .041 1.967 .049

Page 25: Interim Analyses of Clinical Trials

Choosing Critical Values

kcSkKcZ

H

H

O

kcScZ

H

cZcZcZP

cZcZP

BkBk

pkpk

kk

kk

k

ly equivalentor /

whenfirst time theReject testsample fixed a similar to is test final The

study in theearly reject tohard isIt looksearlier at aluesboundary vlarger Use

(1979) Fleming andBrien '

ly equivalentor

whenfirst time theReject lookeach at alueboundary v same the Use

(1977)Pocock

0;or ...or , r

or 10;, . . . , r

: thatso c , . . . , c values theChoose

0

0

0

2211

11

1

Page 26: Interim Analyses of Clinical Trials
Page 27: Interim Analyses of Clinical Trials

General Approach• Compute sample size as if a single look (fixed sample

approach)• Specify number of interim analyses and stopping boundary

(usually OBF).• Inflate sample size to preserve assumed power using

constants in table (not always done as adjustment is minor).• Compute the standardized statistic Zk at each analysis and

compare with critical values corresponding to monitoring boundary chosen.

• At the end or upon early termination determine P-values and confidence intervals in the usual manner.

Page 28: Interim Analyses of Clinical Trials

Problems with Initial Approach

• Difficult to specify number of analyses in advance

• Logistically difficult to organize reviews after equal increments of information.

Solutions: Slud and Wei and Lan-DeMets

Page 29: Interim Analyses of Clinical Trials

Flexible Approaches

• Slud and Wei (JASA, 1982) – specify exit probabilities for each look (stage) such that they sum to , e.g., the prob of exiting the kth stage is the joint prob of not exiting the 1st k-1 stages and exiting the kth one.

• Lan-DeMets (Biometrika, 1983) – specify a use function or type I error spending function, e.g., at time zero, used = 0 and with full information used = 0.05 (or nominal level)

Page 30: Interim Analyses of Clinical Trials

Spending Function (t)

(number of events observed at monitoring)(total number of anticipated events)

t =

.0 t1 t2 1

(t1 )

(t )(t2 )

}

Alpha.05

spending function plotted over fraction of total information to be obtained in the study, evaluated at two arbitrary points, t1 and t2 in the study

Information Fraction

Cont Clin Trial 2000;21:190-207

Page 31: Interim Analyses of Clinical Trials

Critical ValuesNo. of Looks

Look Pocock O’Brien-Fleming PetoZ P Z P Z P

2 1 2.178 .029 2.797 .005 3.290 .0012 2.178 .029 1.977 .048 1.962 .050

3 1 2.289 .022 3.471 .0005 3.290 .0012 2.289 .022 2.454 .014 3.290 .0013 2.289 .022 2.004 .045 1.964 .050

4 1 2.361 .018 4.049 .0001 3.290 .0012 2.361 .018 2.863 .004 3.290 .0013 2.361 .018 2.338 .019 3.290 .0014 2.361 .018 2.024 .043 1.967 .049

5 1 2.413 .016 4.562 .00001 3.290 .0012 2.413 .016 3.226 .0013 3.290 .0013 2.413 .016 2.634 .008 3.290 .0014 2.413 .016 2.281 .023 3.290 .0015 2.413 .016 2.040 .041 1.967 .049

Page 32: Interim Analyses of Clinical Trials

Plots of Pocock-type and O’Brien Fleming-type spending functions for a one-sided 0.025 significance level,

for four analyses at 25%, 50%, 75% and 100% of the expected information.

Spending Functions

0

0.005

0.01

0.015

0.02

0.025

0 0.25 0.5 0.5 1

Information Fraction

Alph

a

Pocock

OBF

Page 33: Interim Analyses of Clinical Trials

Approximate O’Brien Fleming Boundaries Using Lan-DeMets Spending Function Approach: Overall

Significance =0.05 and 4 Looks

Interim O-Brien OBFAnalysis Fleming Lan-DeMets

1 4.05 4.332 2.86 2.963 2.34 2.364 2.02 2.01

Page 34: Interim Analyses of Clinical Trials

Usual Choices for Information

• Planned number of events in event-driven trial with common closing date chosen to achieve event target.

• Follow-up time, e.g., percent of participants attending final follow-up visit in trial with fixed follow-up for each participant.

• Calendar time, e.g., trial with common calendar closing date (e.g., to ensure some minimum follow-up for each participant) but not event-driven.

Page 35: Interim Analyses of Clinical Trials

Beta-Blocker Heart Attack Trial (BHAT)

• Placebo-controlled trial of propranolol in patients with a recent MI

• Recruitment began in June 1978; planned termination June 1982; average of 3 years of follow-up and maximum of 4

• Primary endpoint – all-cause mortality

• Event target - 629 deaths

• Stopped early in October 1981

JAMA 1982; 247:1707-1714.

Page 36: Interim Analyses of Clinical Trials

Interim Monitoring of BHAT Study

1 May 1979 11 (.23) 56 (.09) 1.682 Oct 1979 16 (.33) 77 (.12) 2.243 Mar 1980 21 (.44) 126 (.20) 2.374 Oct 1980 28 (.58) 177 (.28) 2.305 Apr 1981 34 (.71) 247 (.39) 2.346 Oct 1981 40 (.83) 318 (.51) 2.82

LookNumber

MonitoringDate

MonthsSince Start

CumulativeDeaths

LogrankStatistic

Page 37: Interim Analyses of Clinical Trials

Critical Values (z) for 2-sided Group Sequential Design with .05 Overall Significance and 7 Looks

(BHAT)

Interim Lan-DeMets (OBF)Analysis OBF Events Calendar

1 5.46 8.00 4.532 3.85 8.00 3.733 3.15 4.86 3.204 2.73 4.08 2.755 2.44 3.41 2.476 2.23 2.95 2.287 2.06 1.97 2.05

Logrank Z=2.82

Page 38: Interim Analyses of Clinical Trials

Flexible Number of Looks • Another advantage of the Lan-DeMets spending

function approach is the flexibility with the number of looks.

• Suppose BHAT was not stopped and there were 3 more looks before the end (10 total).

• Looks 7-10 correspond to information fractions considering the number of events of 0.65, 0.75, 0.85 and 1.0.

• Stopping boundaries can be calculated conditioned upon the previous tests

Page 39: Interim Analyses of Clinical Trials

Critical Values (z) for 2-sided Group Sequential Design with .05 Overall Significance and 7 Looks

(BHAT)

Interim Lan-DeMets (OBF)Analysis 7 Looks 10 Looks

1 8.00 8.002 8.00 8.003 4.86 4.864 4.08 4.085 3.41 3.416 2.95 2.957 1.97 2.588 2.419 2.26

10 2.06

Page 40: Interim Analyses of Clinical Trials

Suppose We Get To the 6th Analysis by A Different Route

• Information fractions are .05, .20, .30, .40, .45

• Instead of .09, .12, .20, .28, and .39

Page 41: Interim Analyses of Clinical Trials

Critical Values (z) for 2-sided Group Sequential Design with .05 Overall Significance and 7 Looks

(BHAT)

Interim Lan-DeMets (OBF)Analysis 7 Looks 7 Looks

1 8.00 8.002 8.00 4.893 4.86 3.934 4.08 3.335 3.41 3.196 2.95 2.98

Page 42: Interim Analyses of Clinical Trials

Variations of the Theme• Asymmetric boundaries (e.g., non-significant harmful

effect of new treatment)– Use upper boundary for superiority and less conservative

boundary for harm (Z= -1.5 or –2.0, or OBF for efficacy and Pocock for harm)

– Appropriate for an investigational product but probably not for a product already approved and used as part of standard of

care

• Multiple outcomes, e.g., efficacy and safety, and composites

• Multiple trials (CHARM heart failure, Cox-2 chemo-prevention)

• Futility and curtailed sampling procedures (conditional and unconditional power)

• Repeated confidence intervals (e.g., use OBF critical values to compute interim CIs)

Page 43: Interim Analyses of Clinical Trials

Asymmetric Monitoring Boundary for Harm

Harm

Benefit

Pocock2.4

1.5

Z

Page 44: Interim Analyses of Clinical Trials

SMART Study Design

Drug Conservation (DC) Strategy

[Stop or defer ART until CD4+ < 250; then episodic ART

based on CD4+ cell count to increase counts to > 350]

Virologic Suppression (VS) Strategy

[Use of ART to maintain viral load as low as possible throughout follow-up]

CD4+ cell count >350 cells/mm3

n = 2752 n = 2720

Plan: 910 primary endpoints; 8 years average follow-up.Intervention interrupted on 11 January 2005.

N Engl J Med 2006.

Page 45: Interim Analyses of Clinical Trials

SMART Guideline

“…it is recommended that the DSMB consider early termination or protocol modification only when the O’Brien-Fleming boundary is crossed for the primary endpoint and the findings for the primary and the composite cardiovascular, metabolic endpoint are consistent...”

Page 46: Interim Analyses of Clinical Trials

Interim Monitoring: O’Brien Fleming Boundaries for the Primary Endpoint, by DSMB Date

Page 47: Interim Analyses of Clinical Trials

Interim Monitoring: O’Brien Fleming Boundaries for the Primary Endpoint, by Cut Date

Page 48: Interim Analyses of Clinical Trials

SMART Primary and Supportive Endpoint Results

DC Group VS Group HR (DC/VS)P-valueN Rate N Rate [95% CI]

• OD or death • (primary endpoint) 122 3.4 50 1.4 2.5 [1.8, 3.5] <0.001

• CVD, Renal, Liver 65 1.8 39 1.1 1.7 [1.1, 2.5] 0.009• • - CVD 48 1.3 31 0.8 1.6 [1.0, 2.5] 0.05• • - Renal 9 0.2 2 0.1 4.5 [1.0, 20.9] 0.05

• - Liver 10 0.3 7 0.2 1.4 [0.6, 3.8] 0.46

Page 49: Interim Analyses of Clinical Trials

Futility

• Usual definition - convincing evidence exists that the new treatment is not beneficial.

• If this is the case, minimizing exposure to an ineffective treatment with potential toxicities and saving resources should lead to a consideration to stop the trial.

• What is convincing?

• Futility, more generally, can also be impacted by low event rate or slow enrollment (e.g., CVD mortality outcome in the Physician’s Health Study).

Page 50: Interim Analyses of Clinical Trials

Conditional Power (or Stochastic Curtailment) to Assess Futility

• What is the probability of rejecting the null hypothesis (i.e., getting a significant result) given the data to date and my best guess about the future, e.g.,

– will look like the past

– no difference

– like assumed in the design

Lan KKG, Wittes J, Biometrics, 1988.

Page 51: Interim Analyses of Clinical Trials

Example of Curtailment from Proschan’s Book

Control

Event No Event

75 116 191

Planned sample size = 400

Treatment 75 118 193

384

Even if all 9 remaining controls had events and all 7 treatmentgroup patients did not, Z=0.92. Why continue?

234150

Page 52: Interim Analyses of Clinical Trials

Example from Proschan’s Book (cont.)

Control

Event No Event

71 100 171

Planned sample size = 400

Treatment 71 100 171

342

If all 20 remaining controls had events and all 20 treatmentgroup patients did not, the result would be significant. But how

likely is that? Answer = almost zero.

200142

Page 53: Interim Analyses of Clinical Trials

Conditional Power: Usual Implementation

• Guidelines in protocol (pre-specified)

• Typically compute conditional power after you have a fair amount of data (e.g., 50% of information)

• Compute conditional power under a number of scenarios for assumed intervention effect (observed effect to date, alternative assumed in design, null effect, others effect sizes in between).

• Can graph boundaries of conditional power versus information accrued to facilitate decision making.

Page 54: Interim Analyses of Clinical Trials

Unconditional Power

• What is the probability of rejecting the null hypothesis (i.e., getting a significant result) based on the original design assumptions for the treatment effect, but considering:

– revised estimate of control group event rate

– duration of follow-up accounting for recruitment period and minimum follow-up originally planned for each participant

Is a null result still meaningful?

Page 55: Interim Analyses of Clinical Trials

Guideline for HIV Early Treatment Trial (START)

• 1st consider unconditional power. If < 70%, consider conditional power.

• If conditional power is < 20%, consider stopping for futility.

Rationale: Unconditional power could be low in the presence of a large treatment effect.

Page 56: Interim Analyses of Clinical Trials

Summary (1)

• Many studies require a DSMB

– Trials with morbidity and mortality outcomes

– Trials of treatments that may be associated with serious toxicities (need to have a group look a controlled comparisons)

– Trials of novel, high risk treatments (e.g., gene therapy)

– Trials involving frail populations (elderly, infants)

Page 57: Interim Analyses of Clinical Trials

Summary (2)

• A DSMB can be most effective in its role of protecting the interests of patients if it is independent of the sponsor and trial investigators – peer review works!

• Operating procedures should be agreed upon in advance

• An informed statistician who performs interim analyses is important

• To carry out interim analyses data must be collected in a timely way

• Reports should focus on comparisons of clinical outcomes and their validity

Page 58: Interim Analyses of Clinical Trials

Summary (3)

• Monitoring guidelines should be pre-specified

• Guidelines need to be accompanied with common sense, a careful assessment of risks and benefits, and and opinions from experts from different backgrounds.

• This is a fruitful area for research.

Page 59: Interim Analyses of Clinical Trials

Recommendation from Paul Canner based on his experiences in Coronary Drug

Project

“…no single statistical decision rule or procedure can take the place of the well-reasoned

consideration of all aspects of the data by a group of concerned, competent, and experienced

persons with a wide range of scientific backgrounds and points of view.”

Cont Clin Trials 1981; 1:363-376.