- Open Access
Defining a COPD composite safety endpoint for demonstrating efficacy in clinical trials: results from the randomized, placebo-controlled UPLIFT® trial
Respiratory Research volume 17, Article number: 48 (2016)
Chronic obstructive pulmonary disease (COPD) clinical trials evaluating hard endpoints (mortality, hospitalized exacerbations) require a large number of subjects and prolonged observational periods. We hypothesized that a composite endpoint of respiratory outcomes (CERO) can help evaluate safety and benefit in COPD trials.
Retrospective analysis of 5992 patients enrolled in the 4-year UPLIFT® trial, a randomized trial of tiotropium versus placebo in patients with moderate-to-severe COPD. Patients were permitted to continue using their usual COPD medications except for other anticholinergics. The CERO included deaths, respiratory failure, hospitalized exacerbations, and trial dropout due to COPD worsening. The incidence rates (IRs) per 100 patient-years and risk ratios (RRs and 95 % CI) were determined at years 1 to 4. The effect of treatments on CERO was similarly assessed. A power analysis helped calculate the sample size needed to achieve outcome differences between treatments.
The CERO IRs at years 1 to 4 for tiotropium versus placebo were 16, 13, 11, and 11, and 21, 16, 14, and 13, respectively. The RRs of CERO between tiotropium and placebo at the same time points were: RR-year 0.76 (0.67, 0.86), 0.80 (0.72, 0.88), 0.81 (0.74, 0.89), and 0.84 (0.77, 0.92). Using the IRs and RRs, the sample size (alpha = 0.05 two-sided, 90 % power) for studies of 1, 2, 3, and 4 years would be 1546, 1392, 1216, and 1504 per treatment group, respectively, with 575, 810, 930, 1383 required events, respectively, for hypothetical, event-driven studies.
A composite endpoint incorporating relatively infrequent serious or significant COPD-related safety outcomes could be useful in clinical trials. In UPLIFT®, CERO events were significantly reduced in patients receiving tiotropium compared with placebo.
Chronic obstructive pulmonary disease (COPD) clinical trials evaluating relatively infrequent endpoints such as mortality, respiratory failure, and/or hospitalized exacerbations are, of necessity, often large and require follow-up of patients over prolonged periods of time [1–3]. Indeed, recent trials evaluating mortality as the outcome, such as the Tiotropium Safety and Performance in Respimat® (TIOSPIR®) study  or the Study to Understand Mortality and Morbidity in COPD (SUMMIT) , recruited more than 16,000 patients in order to have sufficient power to determine the significance of the outcomes.
Facing a similar situation, studies in other medically relevant fields have integrated infrequently occurring events into composite endpoints that have proven relevant to the planning of clinical trials and the likely benefit of treatment to patients [5–9]. This is particularly relevant in the field of cardiology, where a composite measure of major adverse cardiac events (MACE) has become a frequent endpoint for therapeutic trials [5, 6, 10]. Composite endpoints have also been used in infectious diseases (including tuberculosis ), in stroke , collagen vascular disease , multiple sclerosis , diabetes , and renal failure . In COPD, a composite endpoint for safety in an endobronchial valve trial for emphysema was evaluated .
Using data from a large-scale clinical trial in patients with moderate to very severe COPD , we explored three hypotheses. First, that four important individual outcomes in COPD—death, hospitalized exacerbations, respiratory failure, and withdrawal from the study due to worsening COPD—could be integrated into a composite endpoint. Second, that there are measurable differences in the composite endpoint incidence between an active and a control arm. And third, that based on these results, fewer patients would need to be recruited to achieve sufficient power to explore significant effects of interventions on these outcomes. These hypotheses were tested using data from the UPLIFT® (Understanding Potential Long-term Impacts on Function with Tiotropium) trial, which included patients with COPD who were permitted to continue using their usual COPD medications, except for other inhaled anticholinergics .
UPLIFT® (ClinicalTrials.gov number, NCT00144339) was an international, multicenter, 4-year, randomized, placebo-controlled trial of tiotropium in patients with COPD, the details of which were previously published [2, 14, 15]. Patients were recruited from 490 centers in 37 countries. In brief, patients were aged ≥40 years, with a smoking history of ≥10 pack-years, postbronchodilator forced expiratory volume in 1 s (FEV1) of ≤70 % of predicted, and FEV1 to forced vital capacity (FVC) ratio of <0.70 of predicted. Exclusion criteria included a history of asthma, COPD exacerbation, or respiratory infection within 4 weeks of screening, prior pulmonary resection, and use of supplemental oxygen for >12 h/day. Using centralized randomization by country and by site, patients were randomized to 18 μg tiotropium or placebo (control) (once daily). All respiratory medications, with the exception of other inhaled anticholinergics, were permitted throughout the trial. Postrandomization clinic visits occurred at 1 month, at 3 months, and every 3 months throughout the 4-year treatment period. All patients gave written informed consent and the study was approved by local ethical review boards and conducted in accordance with the Declaration of Helsinki.
Composite Endpoint of Respiratory Outcomes (CERO) and statistical analysis
As part of the study, safety endpoints were carefully registered; causes of death were assessed by the mortality adjudication committee . The composite outcomes endpoint included fatal events (all-cause mortality); severe ventilatory failure, defined as patients requiring noninvasive or invasive mechanical ventilation and/or hypercapnia as documented in arterial blood gases; severe exacerbations, defined as episodes requiring hospitalization from COPD; and discontinuations from the trial due to worsening COPD. The onset of the first event from the composite endpoint is used to calculate the incidence rate. All patients recruited into the study were included in the analysis for the events of mortality, severe respiratory failure, hospitalizations for COPD, and discontinuation of the study for worsening COPD while on treatment within the study period (within 30 days of last intake of study medication). The analysis of the composite endpoint in this study only included on-treatment fatal events. Although fatal events were recorded beyond the treatment period in UPLIFT®, nonfatal events were not, so that vital status for this period was not included. Incidence rates (IRs) per 100 patient-years and risk ratios (RRs) were determined at 1, 2, 3, and 4 years. Incidence rates were calculated using the ratio of number of patients with the event and time at risk. Under the conditions of an event-driven trial, sample size was calculated assuming exponential survival function as the total number of events needed to achieve a given statistical power for an expected RR (or hazard ratio [HR]) and type I error. Under the conditions of a trial with fixed treatment duration, such as UPLIFT®, number of events was first calculated for a given RR by log-rank test. Number of patients was derived based on the number of events, IR, and treatment duration. For both designs, it was assumed that patients will be followed up for CERO events after discontinuation of the study according to the intent-to-treat principle. Therefore, sample size was not adjusted for possible dropouts. Statistical software nQuery was used to calculate the sample sizes.
Altogether, 5992 patients were randomized and received study medication. The baseline characteristics were balanced between treatment groups (Table 1); there were no differences in comorbidity data between the two treatment groups. Mean age was 65 years, 75 % were men, 30 % were current smokers, and mean postbronchodilator FEV1 was 1.32 L (47.6 % predicted). The patients had a wide range of airflow limitation and on average had significant impairment in their health status scores.
Composite Endpoint of Respiratory Outcomes (CERO)
The survival event curve (Kaplan–Meier) for the CERO endpoint shows a separation throughout the study in favor of tiotropium (Fig. 1). The IRs and RRs of the CERO endpoints were consistently and significantly lower with tiotropium vs placebo for all 4 years (Table 2). The same was observed for the individual events of respiratory failure, hospitalizations for exacerbations, and discontinuation from the trial due to COPD, but not for death, for which the RRs were significant for years 3 and 4, but not significant for years 1 and 2 (Table 3).
The effects of tiotropium versus placebo on the incidence of CERO were significant in most patients, including the 2007 Global Initiative for Chronic Obstructive Lung Disease (GOLD) stage  III and IV patients (year 1 − 4) (Table 4).
Calculation of sample size
Using the RRs at 1, 2, 3, and 4 years as the expected HRs in a survival analysis, in a hypothetical event-driven study, the sample size (alpha = 0.05 two-sided, 90 % power) as represented in number of events would be 575, 810, 930, and 1383 for the HRs of 0.76, 0.80, 0.81, and 0.84, respectively, per treatment group (Table 5). Sample sizes were calculated based on the estimated RRs and the IRs provided in Tables 2, 3 and 4. The number of patients to enroll varies depending on the length of the study, including enrollment period. For example, for a HR of 0.76, if the enrollment is 1 year and the maximum length of follow-up is 2 years, then the number of patients would be 1205 per group to achieve 575 events, assuming all patients will be followed up for the composite safety endpoint until study completion. With the same HR, 1-year enrollment, and maximum follow-up of 3 years, the number of patients will reduce to 786 per group to achieve the expected 575 events. The sample sizes with the composite endpoint (GOLD stages II, III, IV, and total group) at 80 % power, 85 % power, and 90 % power showed differences depending on the GOLD stage (Table 5). Sample sizes were calculated based on the estimated RRs and the IRs provided in Tables 2, 3 and 4. Using the IRs and RRs as the expected outcome, the sample size (alpha = 0.05 two-sided, 90 % power) at 1, 2, 3, and 4 years would then be 1546, 1392, 1216, and 1504 per treatment group, respectively.
This analysis of data obtained in UPLIFT® yielded three important findings. First, a COPD composite endpoint incorporating infrequent serious or significant COPD-related safety outcomes can be useful to demonstrate efficacy and safety in clinical trials. Second, in the UPLIFT® trial, the IRs of CERO events were significantly reduced in patients receiving tiotropium compared with placebo . Third, this approach may help plan studies incorporating efficacy and safety evaluation with a reasonable sample size and shorter time.
Randomized clinical trials represent the gold standard to evaluate the comparative efficacy and safety of novel therapies, especially pharmacological drugs. For highly prevalent and deadly diseases, potential therapies should be evaluated by their impact on important clinical endpoints. An example is COPD, a disease that constitutes one of the top four causes of death in the world and that in its more severe cases results in hospital admissions, which increase personal and societal costs. However, although COPD causes many deaths because of its high prevalence, its long natural course, and the relatively low incidence of deaths or hospitalized exacerbations has resulted in the need to complete studies with very large numbers of patients and over a long period. Thus, the TORCH (Towards a Revolution in COPD Health) and the UPLIFT® studies, which recruited ~6000 patients each, lasted 3 and 4 years, respectively; and the mortality and hospitalization rate signals were relatively weak [1, 2]. Based on those results, two other trials (TIOSPIR® and SUMMIT) evaluating mortality as an outcome needed to recruit more than 16,000 patients each to have sufficient power to address the impact of these interventions on mortality and hospitalized exacerbations [3, 4]. An alternative approach used in other fields with a high burden of disease that may be divided into several outcomes that by themselves are relatively rare has been integrating important clinical outcomes into a composite endpoint that can then represent a more comprehensive measure of the overall response to therapy from a safety perspective. Although most studies have been done in the general field of cardiology [10, 13, 15, 18, 19], there have been many reports of trials using composite endpoints in infectious diseases (including tuberculosis ) as well as in stroke , diabetes , rheumatoid arthritis , and renal failure , among others. Only one study used a composite endpoint to evaluate the safety of endobronchial valves . Interestingly, some COPD studies used the composite endpoint MACE as a safety composite endpoint, but this was done to evaluate cardiac safety in COPD patients [2, 3, 21–23].
We selected four hard endpoints because of their clinical validity and registered accuracy in this regard; we not only included all-cause mortality (an accepted patient-centered outcome), but also hospitalizations for exacerbations of COPD and respiratory failure. We considered that these latter two outcomes were appropriately associated with all-cause mortality to add strength to the observed results. Similarly, we included discontinuation from the study due to COPD worsening because this event in long clinical trials represents an important signal of decompensation of the target organ and relates to the patients’ own feelings regarding their deteriorating health . We integrated all of the outcomes into the CERO and explored its value in the UPLIFT® study database.
The results of the study indicate a significant effect on the composite index that was of only borderline significance for the individual outcome of mortality in the original report . Indeed, in this regard, neither UPLIFT® nor the TORCH trial has been considered as having shown a significant impact on mortality using mortality including vital status follow-up of discontinued patients [1, 2]. Several authors have criticized the use of composite endpoints when there are contradictions among the results for the individual components and when they are not clearly defined. This is not the case in the current study. As seen in Tables 2 and 3, the tiotropium arm showed a significant impact on the incident rate ratio for CERO compared with placebo, and the RRs for each of the individual components supported the beneficial effect. Further, the results were applicable across a wide range of airflow obstruction (Table 4), supporting general application of the results, at least for patients meeting the criteria for inclusion in UPLIFT®.
Perhaps the most important information provided by the current study is the power calculation of the number of subjects and the duration of the trial needed to obtain a significant result for a therapeutic intervention. Again, using the results from UPLIFT® and the composite measure of CERO, significant reduction in the social and economic cost of completing such trials is theoretically possible.
Using the IRs and RRs as the expected outcome, the sample sizes are certainly much smaller than the SUMMIT trial (16,000 patients) and the completed TIOSPIR® (17,135 patients) study, in which all-cause mortality was the primary endpoint. As the number of medications approved for the treatment of COPD and the benefits that are reported to support their routine use increase, ethical reasons will preclude the completion of long-term trials with a placebo-control arm. Thus, comparison trials may prove to be the way of the future, and the larger signals from composite endpoints a potentially useful tool to evaluate the results of those trials.
We recognize limitations to this study. First, it is a retrospective analysis of data collected for a study in which the primary endpoint was rate of decline of lung function, and the composite endpoint could only be derived from endpoints reported in the study. However, mortality (carefully reviewed by a clinical endpoint committee) was a predetermined endpoint of the study. In addition, respiratory failure and severe exacerbation used in this report were collected as adverse events and the cause for discontinuation of the study carefully recorded. Indeed, we believe careful accounting for the differential dropout from clinical trials represents the strength of this analysis because those patients who discontinue the trial prematurely due to worsening disease  are characterized by worse patient-related outcomes. Furthermore, patients dropping out from the placebo group have worse patient-related outcomes than those dropping out from the active treatment group. Second, the elements used to define CERO were chosen after the study was completed. However, these were selected because of their clinical validity, careful recording, and value as reflecting the impact of COPD on the patients themselves. Third, the analysis was an on-treatment analysis based on data collected during the treatment period. The intent-to-treat analysis was not feasible because two components of the CERO, severe respiratory failure and severe exacerbation, were not collected after treatment discontinuation. This likely caused an unfavorable bias against tiotropium because more patients discontinued from placebo than from tiotropium; thus, more events in the placebo arm may otherwise have been recorded had the discontinued patients been followed up. Nevertheless, the exposure was adjusted in the calculation of IR and RR in an attempt to address such bias. If the composite endpoint is used as the primary endpoint in a future study, we recommend that all components are followed up after treatment discontinuation in a true intent-to-treat analysis.
This study shows that a composite endpoint incorporating infrequent serious or significant COPD-related safety outcomes could be useful in clinical trials. This clinically meaningful approach may help in the design of studies incorporating safety with a feasible sample size; however, further validation of the composite endpoint will be required. In the UPLIFT® trial, the IRs of CERO events were significantly reduced in patients receiving tiotropium.
Ethics approval and consent to participate
The trial was performed in accordance with the provisions of the Declaration of Helsinki, and the study protocol and procedures were approved by relevant ethics committees at each center. All the patients provided written informed consent.
composite endpoint of respiratory outcomes
chronic obstructive pulmonary disease
- FEV1 :
forced expiratory volume in 1 s
global initiative for chronic obstructive lung disease
major adverse cardiac events
study to understand mortality and morbidity in COPD
tiotropium safety and performance in respimat®
towards a revolution in COPD health
understanding potential long-term impacts on function with tiotropium
Calverley PM, Anderson JA, Celli B, Ferguson GT, Jenkins C, Jones PW, et al. Salmeterol and fluticasone propionate and survival in chronic obstructive pulmonary disease. N Engl J Med. 2007;356:775–89.
Tashkin DP, Celli B, Senn S, Burkhart D, Kesten S, Menjoge S, et al. A 4-year trial of tiotropium in chronic obstructive pulmonary disease. N Engl J Med. 2008;359:1543–54.
Wise RA, Anzueto A, Calverley P, Dahl R, Dusser D, Pledger G, et al. The Tiotropium Safety and Performance in Respimat Trial (TIOSPIR), a large scale, randomized, controlled, parallel-group trial-design and rationale. Respir Res. 2013;14:40.
Vestbo J, Anderson J, Brook RD, Calverley PM, Celli BR, Crim C, et al. The Study to Understand Mortality and Morbidity in COPD (SUMMIT) study protocol. Eur Respir J. 2013;41:1017–22.
Gomez G, Gomez-Mateu M, Dafni U. Informed choice of composite end points in cardiovascular trials. Circ Cardiovasc Qual Outcomes. 2014;7:170–8.
Loncar G, Barthelemy O, Berman E, Kerneis M, Petroni T, Payot L, et al. Impact of renal failure on all-cause mortality and other outcomes in patients treated by percutaneous coronary intervention. Arch Cardiovasc Dis. 2015;108(11):554–62.
Merle CS, Sismanidis C, Sow OB, Gninafon M, Horton J, Lapujade O, et al. A pivotal registration phase III, multicenter, randomized tuberculosis controlled trial: design issues and lessons learnt from the gatifloxacin for TB (OFLOTUB) project. Trials. 2012;13:61.
Einarson TR, Garg M, Kaur V, Hemels ME. Composite endpoints in trials of type-2 diabetes. Diabetes Obes Metab. 2014;16:492–9.
Cutter GR, Baier ML, Rudick RA, Cookfair DL, Fischer JS, Petkau J, et al. Development of a multiple sclerosis functional composite as a clinical trial outcome measure. Brain. 1999;122(Pt 5):871–82.
Naber CK, Pyxaras SA, Nef H, IJsselmuiden AJ, Briguori C, Schlundt C, et al. Final results of a self-apposing paclitaxel-eluting stent fOr the PErcutaNeous treatment of de novo lesions in native bifurcated coronary arteries study. EuroIntervention. 2015;11(2):20141127–06. doi:10.4244/EIJY15M06_02.
Chi C, Tai C, Wang J, Protogerou A, Blacher J, Safar ME, et al. 2A.04: Whether ACE inhibitor/ARB combined with CCB is superior to other combinations? A meta-analysis in 20,669 hypertensives. J Hypertens. 2015;33 Suppl 1:e19–20.
Lung Health Study Research Group. Effect of inhaled triamcinolone on the decline in pulmonary function in chronic obstructive pulmonary disease. N Engl J Med. 2000;343:1902–9.
Sciurba FC, Ernst A, Herth FJ, Strange C, Criner GJ, Marquette CH, et al. A randomized study of endobronchial valves for advanced emphysema. N Engl J Med. 2010;363:1233–44.
Decramer M, Celli B, Kesten S, Lystig T, Mehra S, Tashkin DP. Effect of tiotropium on outcomes in patients with moderate chronic obstructive pulmonary disease (UPLIFT): a prespecified subgroup analysis of a randomised controlled trial. Lancet. 2009;374:1171–8.
Tashkin DP, Leimer I, Metzdorf N, Decramer M. Cardiac safety of tiotropium in patients with cardiac events: a retrospective analysis of the UPLIFT® trial. Respir Res. 2015;16:65.
McGarvey LP, Magder S, Burkhart D, Kesten S, Liu D, Manuel RC, et al. Cause-specific mortality adjudication in the UPLIFT® COPD trial: findings and recommendations. Respir Med. 2012;106:515–21.
Rabe KF, Hurd S, Anzueto A, Barnes PJ, Buist SA, Calverley P, et al. Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease: GOLD executive summary. Am J Respir Crit Care Med. 2007;176:532–55.
Gershlick AH, Banning AP, Myat A, Verheugt FW, Gersh BJ. Reperfusion therapy for STEMI: is there still a role for thrombolysis in the era of primary percutaneous coronary intervention? Lancet. 2013;382:624–32.
Gershlick AH, Khan JN, Kelly DJ, Greenwood JP, Sasikaran T, Curzen N, et al. Reply: Complete revascularization in patients undergoing primary percutaneous coronary intervention for STEMI: is it really what we should be doing? J Am Coll Cardiol. 2015;66:332–3.
Smolen JS, Emery P, Fleischmann R, van Vollenhoven RF, Pavelka K, Durez P, et al. Adjustment of therapy in rheumatoid arthritis on the basis of achievement of stable low disease activity with adalimumab plus methotrexate or methotrexate alone: the randomised controlled OPTIMA trial. Lancet. 2014;383:321–32.
Halpin DM, Dahl R, Hallmann C, Mueller A, Tashkin D. Tiotropium HandiHaler® and Respimat® in COPD: a pooled safety analysis. Int J Chron Obstruct Pulmon Dis. 2015;10:239–59.
Wedzicha JA, Dahl R, Buhl R, Schubert-Tennigkeit A, Chen H, D'Andrea P, et al. Pooled safety analysis of the fixed-dose combination of indacaterol and glycopyrronium (QVA149), its monocomponents, and tiotropium versus placebo in COPD patients. Respir Med. 2014;108:1498–507.
White WB, Cooke GE, Kowey PR, Calverley PM, Bredenbroker D, Goehring UM, et al. Cardiovascular safety in patients receiving roflumilast for the treatment of COPD. Chest. 2013;144:758–65.
Vestbo J, Anderson JA, Calverley PM, Celli B, Ferguson GT, Jenkins C, et al. Bias due to withdrawal in long-term randomised trials in COPD: evidence from the TORCH study. Clin Respir J. 2011;5:44–9.
The authors acknowledge the medical writing and editorial assistance from Godfrey Lisk, PhD, scientific specialist at PAREXEL International, in the form of preparation and revision of the draft manuscript. Data have been reported in part in the primary data publication .
The UPLIFT® trial and this subsequent post hoc analysis were funded by Boehringer Ingelheim.
Bartolomé R. Celli, MD, has received research grants to his academic division from AstraZeneca, advisory board or consultancy payments from GlaxoSmithKline, Boehringer Ingelheim, Almirall, AstraZeneca, Takeda, and Novartis. Neither he nor any family members have shares or commercial interests in any company. He has not received payments or had any relationship with the tobacco industry.
Donald P. Tashkin, MD, has received research grants to his university from Boehringer Ingelheim, AstraZeneca, Novartis, Sunovion, and GlaxoSmithKline, consultancy payments from AstraZeneca, Novartis, Sunovion, and Theravance, and speaker fees from Boehringer Ingelheim, AstraZeneca, Sunovion, and Forest. Neither he nor a member of his family has shares in any company or any relationship with the tobacco industry.
Marc Decramer, MD, has received speaker fees from AstraZeneca, GlaxoSmithKline, Boehringer Ingelheim, Pfizer, and Novartis, consulting fees from AstraZeneca, Boehringer Ingelheim, Pfizer, Dompe, GlaxoSmithKline, Novartis, Nycomed, and Vectura, and grant support of AstraZeneca, Boehringer Ingelheim, Pfizer, GlaxoSmithKline, and Chiesi.
Dacheng Liu, PhD, Guus M. Asijee, PhD, and Norbert Metzdorf, PhD, are employees of Boehringer Ingelheim.
BRC, MD, DL, NM, GMA and DPT have all made substantial contributions to study design, data acquisition, analysis or interpretation, drafting the article, critically revising the content, and final approval of the submitted version. DL performed the statistical analyses. Agreements with the study sponsor did not restrict the academic authors in the publication of these data.
BRC: MD; MD: MD; DL: PhD; NM: PhD; GMA: PhD; DPT: MD.
About this article
Cite this article
Celli, B.R., Decramer, M., Liu, D. et al. Defining a COPD composite safety endpoint for demonstrating efficacy in clinical trials: results from the randomized, placebo-controlled UPLIFT® trial. Respir Res 17, 48 (2016). https://doi.org/10.1186/s12931-016-0361-4
- Composite endpoint