The Oncologist, Vol. 11, No. 6, 624-629, June 2006; doi:10.1634/theoncologist.11-6-624 © 2006 AlphaMed Press
End Points for Adjuvant Therapy Trials: Has the Time Come to Accept Disease-Free Survival as a Surrogate End Point for Overall Survival?a British Columbia Cancer Agency, Vancouver, British Columbia, Canada; b Mayo Clinic Cancer Center, Rochester, Minnesota, USA Key Words. Adjuvant therapy • Overall survival • Disease-free survival Correspondence: Daniel J. Sargent, Ph.D., Cancer Center Statistics, Mayo Clinic Cancer Center, 200 First Street SW, Rochester, Minnesota 55905, USA. Telephone: 507-284-5380; Fax: 507-266-2477; e-mail: sargent.daniel{at}mayo.edu Received January 31, 2006; accepted for publication April 13, 2006.
Access and take the CME test online and receive 1 AMA PRA category 1 credit at CME.TheOncologist.com
The intent of adjuvant therapy is to eradicate micro-metastatic residual disease following curative resection with the goal of preventing or delaying recurrence. The time-honored standard for demonstrating efficacy of new adjuvant therapies is an improvement in overall survival (OS). This typically requires phase III trials of large sample size with lengthy follow-up. With the intent of reducing the cost and time of completing such trials, there is considerable interest in developing alternative or surrogate end points. A surrogate end point may be employed as a substitute to directly assess the effects of an intervention on an already accepted clinical end point such as mortality. When used judiciously, surrogate end points can accelerate the evaluation of new therapies, resulting in the more timely dissemination of effective therapies to patients. The current review provides a perspective on the suitability and validity of disease-free survival (DFS) as an alternative end point for OS. Criteria for establishing surrogacy and the advantages and limitations associated with the use of DFS as a primary end point in adjuvant clinical trials and as the basis for approval of new adjuvant therapies are discussed.
A surrogate end point has been defined as an alternative end point (such as a biological marker, physical sign, or precursor event) that can be used as a substitute for a clinically meaningful end point that measures directly how a patient feels, functions, or survives [1]. A surrogate end point is selected based on a biologic rationale and may be employed when the primary end point of interest is difficult or expensive to measure and when an alternative, more accessible end point is sufficiently well correlated with the primary to justify its use as a substitute. In cancer trials, potential candidates as surrogate end points for OS include tumor response and progression-free survival (in the presence of measurable disease), or DFS (following curative-intent primary therapy) because these events occur earlier and are typically unaffected by the use of secondary therapies. The potential advantages to the use of surrogate end points are inherent: the effect of an intervention can be evaluated faster, at less cost, and with fewer trial subjects. Despite the enthusiasm for surrogate end points and the biologic and practical rationale for their use, there are well known examples in clinical medicine of failed surrogate measures, including the use of arrhythmia suppression as a surrogate for cardiovascular mortality [2] and the use of bone mineral density as a surrogate for fracture risk [3]. Fleming and DeMets have described several reasons why a surrogate endpoint may fail [4] : (a) the surrogate is not in the causal pathway of the disease process, (b) the intervention affects only the pathway mediated through the surrogate but does not affect other important causal pathways, (c) the surrogate is insensitive to the treatment effect or is not in the pathway of the treatments effect, and/or (d) the treatment has mechanisms of action independent of the disease process. Correlation of a surrogate with the clinical outcome of interest is not solely sufficient to establish the validity of a surrogate end point; proper validation requires an understanding of the causal pathways of the disease process and the mechanism of action of the treatment of interest. Providing a framework for studying and validating surrogate end points has been the subject of considerable study. Prentice [5] proposed a formal definition of conditions that a surrogate marker be required to satisfy. According to the Prentice criteria, a surrogate marker must be both correlated with the true end point and fully capture the net effect of treatment on the true end point. This approach was advanced by Freedman et al. [6] with the examination of the so-called "proportion-explained;" that is, the proportion of treatment (or exposure) effect explained by the surrogate end point. Buyse and Molenberghs [7] subsequently proposed that a surrogate should be required to fulfill measures of agreement with the true end point at both the individual level and at the population level. Accordingly, a surrogate is said to be "perfect" at the individual level when there is a perfect association between the surrogate and the true end point after adjustment for treatment. At the population level, a surrogate is said to be "perfect" when the effect of treatment on the true end point is equal to the effect of treatment on the surrogate end point. These measures are evaluated on a 01 scale, with values approaching 1 indicating effective surrogacy. As noted by Begg and Leung [8], an important conceptual limitation of these criteria is that a surrogate end point applied in the context of an actual clinical trial cannot be validated unless the true end points are also evaluated; that is, a criterion specific to comparing the treatment effect on both the surrogate and true end point cannot be satisfied unless the true end points are actually observed. This would effectively negate the intent of using a surrogate end point. They, therefore, proposed that the validity of a surrogate end point should be judged by the probability that the trial results based on the surrogate end point alone are concordant with the trial results that would be obtained if the true end point were observed and used for the analysis. Based on these criteria, it is clear that proper validation of a surrogate end point requires data from multiple trials, optimally through a pooled or meta-analysis examining both positive and negative trials to validate both the sensitivity and specificity of the proposed surrogate. The greater the heterogeneity present in the included trials, the stronger the case for the surrogate marker becomes; if an end point is valid in many different heterogeneous settings, it makes it more likely the relationship will be valid in future, yet unspecified trials.
DFS (or recurrence-free survival) is defined as the time from randomization to the first of either recurrence or relapse, second cancer, or death. Its application as a surrogate end point is most appropriate in settings where it is expected that recurrence of disease presents a major component of mortality in the treated population. This would be the case for most solid tumors, for which secondary therapies at the time of recurrence may prolong survival but ultimately are unlikely to result in a cure. DFS as a surrogate would particularly be advantageous when a sufficiently lengthy interval exists between recurrence and death, thus requiring a longer follow-up for a primary trial end point of OS. The use of DFS as an alternative end point for OS for adjuvant trials has been a topic of much discussion and its application is reviewed below in the context of resected high-risk colon cancer, early breast cancer, and non-small cell lung cancer (NSCLC).
Adjuvant Therapy for Colon Cancer This analysis, known as the ACCENT (Adjuvant Colon Cancer Endpoints) analysis, pooled data from 18 randomized phase III clinical trials, consisting of 43 treatment arms and a total pooled sample size of 20,898 patients. The primary analysis was a comparison of hazard ratios (HRs) for 3-year DFS and 5-year OS within patient, within treatment arm, and within trial. During the 8-year follow-up period, 80% of recurrences were documented within the first 3 years. Comparisons of the 3-year DFS rate and 5-year OS rate by treatment arm, using weighted linear regression, were very highly correlated (R2 = 0.85; Spearman rank correlation coefficient = 0.88). When examined by trial, to determine whether comparison of the experimental and control arms using DFS reaches the same conclusion as using OS, a high level of correlation was again observed (R2 = 0.90; Spearman rank correlation coefficient = 0.94), with a slight attenuation of the treatment effect HR between the two end points. The regression model estimating the relationship between DFS and OS using within-trial hazard ratios was tested using leave-one-out cross-validation; excellent model calibration was demonstrated, with 24 of 25 observed HRs for OS within the 95% prediction interval. Recognizing that no single method of surrogacy validation is agreed upon, multiple formal analytic approaches were used, including the proportion explained method of Prentice [5] and Freedman et al. [6], trial-level and individual-level measures of agreement [13], and the probability of concordance [8]. The proportion of treatment effect of adjuvant 5-FU-based chemotherapy on OS explained by DFS was >100%. This seemingly counterintuitive finding was the result of a change in the coefficient for treatment in the Cox model from negative to positive when recurrence, which is the dominant predictor of death, was included, and demonstrates a limitation of the proportion explored approach. Trial-level and individual-level correlation were 0.78 (95% confidence interval [CI], 0.600.96) and 0.873 (95% CI, 0.8690.877), respectively. Twenty three of 25 within-trial comparisons (control vs. treatment) obtained the same conclusion, regardless of end point. In summary, strong evidence for surrogacy was demonstrated. The slight attenuation of treatment effect observed when using DFS versus OS merits additional discussion. While the majority of deaths are preceded by recurrence, and while recurrent disease is typically not salvageable for a curative outcome, deaths from competing causes and the use of palliative yet survival-prolonging therapies may attenuate the treatment effect comparison for OS. Admittedly, the survival impact of palliative interventions in the era of the trials included in the ACCENT analysis (19781996) was modest. When extrapolating the DFSOS treatment effect association to modern-day trials, however, one may expect this attenuation to be magnified in an era of more rigorous surveillance and superior diagnostics leading to earlier detection of recurrent disease, and the availability of more efficacious "palliative" therapies, which not only prolong survival in the setting of recurrent disease but also increase the possibility of salvage for cure [14]. Recently, the efficacy of oxaliplatin in combination with 5-FU/leucovorin for the adjuvant treatment of stage II and III colon cancer was reported using a primary end point of DFS. In the Multicenter International Study of Oxaliplatin/5-FU/Leucovorin in the Adjuvant Treatment of Colon Cancer (MOSAIC) trial, the 3-year DFS rate was significantly superior in the oxaliplatin-containing arm (78.2% vs. 72.9%; p = .002) [15]. A similar magnitude of efficacy was subsequently reported in the National Surgical Adjuvant Breast and Bowel Project (NSABP) C-07 trial (3-year DFS rate, 76.5% vs. 71.6%; p < .004) [16]. Final analysis for the secondary end point of OS is pending for both these trials; however, in an update of the MOSAIC trial results after a median follow-up of 56.2 months, the OS rates at 4 years remain similar (84.9% vs. 82.8%; HR, 0.91; 95% CI, 0.751.11; p = .91) [17]. Notable proportions of patients remain alive with recurrence (7.4% and 11.9%) on the oxaliplatin plus 5-FU/leucovorin and 5-FU/leucovorin arms, respectively. While the majority of these recurrences will not be salvageable for cure, this does illustrate the potential for attenuation of the DFSOS correlation when median survival is prolonged for metastatic recurrence in the current era of superior palliative therapies. Rather than questioning 3-year DFS as an appropriate end point, these considerations may rather indicate that 5 years may be too rapid to truly assess long-term outcome in the modern treatment of colon cancer, further emphasizing the need for a surrogate end point in this setting.
Breast Cancer More recently, DFS was employed as a primary end point in assessing the efficacy of trastuzumab, a monoclonal antibody that targets human epidermal growth factor receptor 2 (HER-2) in HER-2-positive women with early breast cancer. In a joint analysis of the NSABP B-31 and North Central Cancer Treatment Group (NCCTG)-N9831 trials, trastuzumab in combination with doxorubicin and cyclophosphamide followed by paclitaxel reduced the risk for recurrence by 52% after 2 years of median follow-up (p < .0001). The OS rate at 4 years was also significantly better (91% vs. 87%; p = .015) [22]. Presented at the same 2005 Annual Session of the American Society of Clinical Oncology, the first interim efficacy analysis of the Herceptin® Adjuvant (HERA) trial reported a 46% reduction in recurrence at a 1-year median follow-up (p < .0001) with trastuzumab following adjuvant chemotherapy [23]. The OS rates were similar at the time of this early analysis. The findings from both these trials are significant not only for the magnitude of benefit observed with trastuzumab therapy but also for the proof of concept that DFS may be an acceptable primary end point for adjuvant trials employing targeted therapies. Further follow-up from each of these trials, and likely a formal pooled analysis, will be required to validate DFS as a true surrogate end point in this setting.
NSCLC
New therapies need to be effectively and efficiently evaluated in the adjuvant setting, where the treatment goal is increased cure. While OS represents the gold standard metric for establishing efficacy, it requires extended follow-up, which may prevent the timely dissemination of results and consequent implementation of effective curative-intent therapy. For the purposes of regular FDA drug approval, the demonstration of either clinical benefit or an effect on an established surrogate for clinical benefit is required [27]. For an end point that is not sufficiently established to support regular approval, accelerated approval can be granted on the basis that the surrogate is reasonably likely to predict clinical benefit with the requirement that postmarketing studies must verify the clinical benefit. It is acknowledged that DFS may serve as the basis for FDA approval if a high proportion of symptomatic recurrences exist or if a strong correlation with survival exists [28]. Beyond the aforementioned cases for colon, breast, and lung cancer, these conditions may also apply to other settings with expanding adjuvant indications including resected gastric, pancreatic, and bladder cancers. Can DFS now be routinely applied as an alternative end point to OS for future adjuvant clinical trials? On the one hand, trials would be smaller, faster, and cheaper. By including survival as a component of the composite end point, DFS does account for any long-term effects of a treatment that may ultimately adversely impact survival, such as end-organ toxicities (e.g., cardiotoxicity with trastuzumab) or secondary malignancies. On the other hand, DFS may be compromised in its ability to predict the net treatment benefit on survival as a result of improvements in postrecurrence therapy. In addition, it is unclear whether DFS can adequately reflect the impact of targeted therapies in the adjuvant setting because such therapies are more typically associated with a cytostatic effect; that is, such a therapy may delay but not prevent an eventual recurrence. This may not be a major limitation based upon the adjuvant trastuzumab experience. Ongoing and future adjuvant trials employing targeted therapies will further elucidate this issue. Long-term, simple follow-up for OS will continue to be necessary for adjuvant trials, as OS remains an end point of key clinical and patient interest. One must also consider the merit of DFS as an end point unto itself. From a physician and patient perspective, it is difficult to question the clinical benefit of preventing or delaying a recurrence independent of its ultimate impact on survival. Alive on therapy is not equivalent to alive free of disease; the impact of symptomatic recurrent disease is significant both in terms of patient morbidity and costs to society. From another perspective, in settings where postrecurrence therapy significantly affects long-term survival, DFS may change as the "preferred" end point because of its unambiguous measure of a new treatments impact on the disease process. As an end point less affected by crossover or subsequent therapy, it can be argued that DFS provides a more biologically relevant measure of a new treatments impact on the disease process. In summary, while improving patient OS will always be the goal of adjuvant therapy, in selected settings, DFS has become accepted as a surrogate end point of sufficient merit to satisfy regulatory approval when assessing new treatments in randomized trials. The paths to this outcome were varied: the greatest level of validation is in the setting of high-risk resected colon cancer, where a large international pooled analysis was required, whereas in breast cancer, the extended survival of patients required the use of DFS as a practical necessity with subsequent data supporting surrogacy. In other adjuvant settings, such as early-stage NSCLC, pooled analyses similar to the ACCENT analysis will likely be necessary. The benefit of these efforts can be substantial. Given the high incidence rates of many of these malignancies, the ability to establish and deliver improved therapies by a margin of two or more years earlier, through the use of a surrogate end point, may translate into better outcomes for thousands of patients.
The authors indicate no potential conflicts of interest.
This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||