Abstract
Nowadays there is an increased medical interest in personalized medicine and tailoring decision making to the needs of individual patients. Within this context our developments are motivated from a Dutch study at the Cardio-Thoracic Surgery Department of the Erasmus Medical Center, consisting of patients who received a human tissue valve in aortic position and who were thereafter monitored echocardiographically. Our aim is to utilize the available follow-up measurements of the current patients to produce dynamically updated predictions of both survival and freedom from re-intervention for future patients. In this paper, we propose to jointly model multiple longitudinal measurements combined with competing risk survival outcomes and derive the dynamically updated cumulative incidence functions. Moreover, we investigate whether different features of the longitudinal processes would change significantly the prediction for the events of interest by considering different types of association structures, such as time-dependent trajectory slopes and time-dependent cumulative effects. Our final contribution focuses on optimizing the quality of the derived predictions. In particular, instead of choosing one final model over a list of candidate models which ignores model uncertainty, we propose to suitably combine predictions from all considered models using Bayesian model averaging.
Keywords
1. Introduction
Motivated by the current increased medical interest in personalized medicine, we focus in this work on subject-specific survival predictions.1–5 For example, in the field of cardio-thoracic surgery and especially after a heart valve replacement, the main disadvantage of human tissue valve allografts is their limited durability, due to calcification with tissue damage resulting in degeneration and dysfunction. Thus, it may be of interest for the treating physicians to develop a prognostic tool that could inform them about a future re-intervention to their patients using all available repeated measurements. Specifically, the motivation comes from a study that was conducted in the Erasmus Medical Center, Rotterdam, The Netherlands. This study includes 270 patients who received a human tissue valve allograft in aortic position in the Department of Cardio-Thoracic Surgery in a period of 21 years. Patients were followed prospectively over time and measurements of aortic gradient and aortic regurgitation were obtained at six months and one year postoperatively and biennially thereafter.
6
The continuous variable aortic gradient measures whether the opening of the aortic valve is narrowed, while the ordinal variable aortic regurgitation measures leakage of the aortic valve and consists of five categories. Furthermore, at the end of follow-up, 57 (20.1%) patients had died and 74 (26.1%) patients required a reoperation on the allograft. Since aortic gradient and aortic regurgitation are both measuring aortic heart valve abnormalities and therefore, the presence of one disease will have an influence on the other, it is of great interest from the clinical point of view to analyze them together. In Figure 1, we present the profiles of aortic gradient for males and females, while in Table 1 we present some extra information regarding the data.
Profile plots of aortic gradient for males and females. The three patients that are highlighted will be used for prediction. Descriptive statistics.
To evaluate the predictive value of the heart valve data on mortality and reoperation and to derive the subject-specific predictions, we rely on joint models for longitudinal and time-to-event data. Joint modeling is an active area of statistics research that has received a lot of attention in recent years.7–9 Moreover, these models can be used to objectively extract information from multiple markers and to employ them to dynamically update risk estimates. An advantage is that the predictions are updated as more measurements become available. Thus, the statistical predictions can be combined with the physician’s expertise to yield improved health outcomes eventually.
In this paper, we extend the work presented in Andrinopoulou et al. 10 that focuses on fitting the data using a joint model where the survival outcomes are associated with the random effects of the longitudinal outcomes. In particular, our contribution is two-fold: first, we postulate joint models assuming different functional forms to underlie the relationship between multiple longitudinal and survival outcomes and second, we focus on prediction models in the presence of competing risks. Since we are more interested in predicting future patients than simply assessing the degree of association between the trend of the repeated outcomes and time-to-events, it is important to accurately determine the estimate of the underlying process of the heart disease. Thus, we go beyond the standard joint model that utilizes only the latent value of the biomarkers and investigate whether the risk of an event could be affected also by the slope or a summary of the whole history of the longitudinal outcomes.
Finally, it is common practice that a model upon which the clinical predictions are based is selected from a set of different possible models. However, such an approach neglects model uncertainty. In addition, different models may produce more accurate predictions for different types of patients. To explicitly account for these issues we rely on a Bayesian model averaging (BMA) approach.11,12 More specifically, we propose to suitably combine predictions from joint models that assume different covariates and association structures between the longitudinal and event time processes. Previous research using that approach 12 has focused on BMA risk predictions for one event based on one continuous longitudinal outcome. In this paper, we extend this idea to the competing risk setting while also allowing for multiple longitudinal outcomes. Specifically, motivated by the heart valve study where treating physicians are more interested in risk predictions separately for reoperation and death, we derive a BMA version of the cumulative incidence functions of the two events.
The rest of the paper is organized as follows: Section 2 describes the joint submodels and the Bayesian estimation procedure, Section 3 provides the individualized prediction mechanism and the BMA approach, Section 4 presents the results of the valve data, while Section 5 a simulation study. Finally, in Section 6 we close with a discussion.
2. Methods
2.1 Submodels
Let
For ordinal outcomes, we propose to use the continuation ratio (CR) mixed-effects model, postulated as
For the survival process we assume that the risk for each of the K competing events depends on the true but unobserved value of the markers at time t. Specifically, we have
An issue that is often overlooked when building joint models is the functional form that describes the longitudinal outcomes that are associated with the risk for each event. Due to the fact that different structures may provide us with different inferences and predictions, it is important to study this component of the model. Following, Brown,
14
Rizopoulos and Ghosh,
15
and Rizopoulos,
9
we postulate different functional forms to assess the predictive ability of the biomarkers. Specifically, we assume the following association structures:
Model M1 postulates that the risk for an event at time t depends on the mean level of the markers at the same time point t. Model M2 is an extension of model M1 in which not only the current value but also the slopes of the longitudinal trajectories at time t are related to the hazard. Yet another option is to relate the survival outcomes with a summary of the whole history of the markers, e.g. the area under the longitudinal profiles (model M3).
Combinations of these parameterizations are possible, where in the case of
2.2 Bayesian estimation and prior specification
For the estimation of our joint model’s parameters, we adopt a Bayesian formulation and derive posterior inferences using a Markov Chain Monte Carlo (MCMC) algorithm. The likelihood of the model is derived under the assumption that the random effects account for all independencies between the observed outcomes. Specifically, given the random effects, the longitudinal and survival processes are assumed independent and moreover, the longitudinal responses of each subject are assumed independent. In particular,
We use standard prior distributions for the parameters. In particular, for the regression coefficients
All computations have been performed in R (version 3.1.2) and JAGS (version 3.3.0) and are available upon request from the first author.
3. Dynamic predictions of cumulative incidence functions
3.1 Predictions from a single model
Based on the joint models presented in Section 2.1, we focus on the derivation of the predictions of the survival outcomes. More specifically, we would like to predict cumulative incidence probabilities for a new patient l that has provided us with a set of longitudinal measurements
Under the Bayesian formulation of the joint model, the estimation of
The second term of the integrant (3),
An estimate of 1. Draw 2. Draw 3. Compute
We, then, repeat steps 1–3 H times and derive the estimates of the
3.2 Combined predictions using BMA
As it was seen in Section 2.1, there are several ways to link the longitudinal and the survival outcomes. Moreover, we could even postulate additional joint models with different assumptions for each submodel. For instance, in some of the mixed models the subject-specific evolutions over time may be non-linear, or we could control for different sets of baseline covariates either in the mixed-effects or relative-risk submodels. In this complex setting, a common practice is to choose a single model based on information criteria and obtain predictions from that selected model. However, this approach ignores model uncertainty. In addition, there may be different models that provide more accurate predictions for different types of subjects. An alternative solution to this problem is BMA, which proceeds by estimating a number of models and constructing a weighted average of predictions.11,12
More formally, following the notation in Section 3.1, we would like to obtain predictions of the conditional probabilities
4. Analysis of the valve dataset
In this section we present the analysis of the cardio data introduced in Section 1. Our interest is to derive subject-specific risk predictions using all available information for a patient. We first started our analysis by fitting a set of joint models with different association structures and different baseline covariates. More specifically, for the linear mixed-effects model many patients showed non-linear longitudinal trajectories and therefore, we assumed natural cubic spline for time with two internal knots (λ) at 2.1 and 5.5 year (corresponding to 33.3% and 66.7% of the observed follow-up times) in both fixed and random part in the mixed-effects model of aortic gradient. Furthermore, we corrected for age (after we standardized it) and gender. Since there are more clinically relevant baseline covariates that have an effect on aortic gradient, we fitted a second mixed-effects model including also: marfan, left ventricular function, standardized donor age, and standardized diameter of valve. Particularly, the linear mixed-effects models take the form
For the survival models we used proportional hazards models expressing the baseline hazard function with a B-splines function as described in Section 2.1. Standardized age and type of operation are important clinical factors and thus were included as confounders. Moreover, to not further complicate the analysis we assumed the same functional form for all longitudinal outcomes and the same model for all survival outcomes. Specifically, the joint models that we fitted take the form
DIC and LPML measurements for the six joint models.
We continued by calculating dynamic predictions based on the models described above. Under the fitted joint models, the conditional probabilities of death and reoperation were estimated using the Monte Carlo procedure, as described in Section 3.1, with H = 300. Specifically, we derived the predictions of two patients, patient 123 (47-year-old male) and patient 286 (52-year-old male), that were excluded from the dataset when fitting the joint models. In Figures 2 and 3, we present the longitudinal trajectories of these patients, which are also highlighted in Figure 1. As it can be seen, both patients show an increasing aortic gradient profile over time, but, patient 286 has higher values than patient 123. For aortic regurgitation, only patient 286 seems to change category at the second follow-up. We show the predictions of every joint model of death and reoperation for patients 123 and 286 in Figures 4 and 5, as more longitudinal measurements are available. For patient 123, when using the underlying value of aortic gradient and aortic regurgitation in the survival models (for both the model with less M11 and more covariates M12 as confounder in the longitudinal part), the conditional probabilities of death are much smaller than reoperation. The same can be seen when assuming the area under the curve model with the extra confounder M32. However, when assuming the slope parameterization and the area under the curve parameterization (for the model without the extra clinical relevant covariates), the difference between the survival and free intervention probabilities is smaller (M21, M22, and M31). Specifically, the probabilities of death and reoperation seem to cross in model M21 while, in model M22 both are really small. For patient 286, a bigger difference between the death and survival probabilities is observed in model M31 compared to patient 123. Furthermore, in model M21 it is clear that the probability of death is higher in the next five years at the first two visits.
Profile plots of aortic gradient for patients 123 and 286. Profile plots of aortic regurgitation for patients 123 and 286. Prediction plots for patient 123 using each joint model proposed. Solid line represents the reoperation predictions and dashed line the death predictions. Prediction plots for patient 286 using each joint model proposed. Solid line represents the reoperation predictions and dashed line the death predictions.



Bayesian model averaging posterior weights for the six proposed joint models for patient 123 and 286 for each measurement.

Prediction plots for patient 123 using Bayesian model averaging - CIF = Cumulative Incidence Function.

Prediction plots for patient 286 using Bayesian model averaging - CIF = Cumulative Incidence Function.
5. Simulation
We performed a simulation study in order to evaluate the performance of the proposed approach. Simulations showing that BMA predictions perform well in comparison with predictions from the true model have been already presented in the literature. 12 Therefore, in this paper we focus on the comparison between the BMA with the DIC approach in a setting where patients are simulated from different scenarios. As mentioned before, it is common practice that a single model is selected using standard criteria, such as DIC, and then, for every new patient predictions are based on this specific model. However, this raises the question whether it is appropriate to assume the same prediction model for patients with different features in their longitudinal profile.
Motivated by this, we simulated 200 datasets that consist of 140 subjects per scenario. In particular, we considered three scenarios corresponding to the three parameterizations: (1) value, (2) slope, and (3) area. Under each scenario and for each simulated dataset, we randomly selected 20 subjects and calculated the BMA weights. Since patients within a dataset could have different characteristics, we merged the remaining subjects and fitted the three joint models assuming the parameterizations (1), (2), (3) in order to obtain the DIC values. For simplicity we assumed one longitudinal and one survival outcome. A detailed description of the design of this simulation study is presented in the supplementary material in Section 3. In Figure 8 we show on the X-axis the models selected by the DIC approach and on the Y-axis the mean weights of the not selected from the DIC models obtained from the BMA approach. Assuming that the DIC selects the appropriate model and assuming that the BMA has a similar behavior, we would expect a value close to 0 for the weights. In the opposite case where the approaches are not consistent we would expect the weights to be higher than zero depending on the degree of disagreement, where one indicates a strong disagreement. As can be seen, when the value model is selected by DIC, the mean weights of the other models (slope and area) are mostly between 0.6 and 0.85, whereas in the case where the area model is selected by the DIC, these are close to 1.
Mean weights for the models that were not selected by the DIC.
6. Discussion
Time-dependent adjustment of risk prediction models for patients with severe heart valve diseases, as more longitudinal measurements become available over time, provides the physician with an evidence-based understanding of the prognostic implication of changes in the patient’s disease condition. Importantly, the calculated probabilities for survival and re-intervention can be used as an early warning system, allowing the necessary time for the physicians to plan an intervention. In this work we presented dynamic predictions for a competing risk setting using multiple longitudinal outcomes. Specifically, we performed different functional forms to relate the longitudinal and the survival processes and different structure (by adding clinical relevant baseline covariates) of one longitudinal model and investigate the predictions of two patients that we originally excluded from the dataset. Finally, since the choice of a single model ignores the model uncertainty issue, we combined all models and derived predictions for the survival outcomes for the same two patients using the BMA. This method explicitly accounts for model uncertainty and for the fact that not all future patients have the same prognostic model. Despite the usefulness of the BMA approach, there are some issues that we need to be caution for. In particular, the specification of the prior distribution of the models is challenging and has received little attention. Moreover, the number of models that could be combined may be high resulting to difficulties in calculating the summation. We should, furthermore, make clear that we cannot identify the true model from the pool of all candidate models.
An alternative measure to the survival time is the cause-specific residual life that provides the expected value for the lifetime remaining at any time t, given that the subject is known to have survived up to t. 18 The advantage of the residual life function over the survival function lies in its interpretation in many applications where the primary aim is to characterize the remaining time expectancy of a subject instead of the failure rate. For instance, patients may wish to learn from their physicians an estimate of their expected survival given that they have started a treatment or received an intervention at a given time. This idea is similar to the conditional cumulative incidence functions for death and reoperation that we presented in the paper, where death and reoperation probabilities are calculated at time u given that the patient did not experience any event at t, t < u.
We limited our work so that the association between aortic gradient and aortic regurgitation with the survival outcomes was the same. However, a different functional form for each biomarker would be also possible and interesting to investigate. A disadvantage, nevertheless, is that it will be more computationally intensive, since more models would be fitted. Moreover, additional ways to link the longitudinal and survival processes, such as the weighted area under the curve, the lag effect, and the shared random effect parameterization, were not addressed in this paper. Nevertheless, a bigger dataset with patients followed for a longer period is probably needed in order to obtain a variety of evolutions and capture the special characteristics of the patients. Furthermore, we performed models including extra confounders only for the continuous longitudinal outcome. Hence, more factors in the model of the ordinal outcome and also of the survival outcomes could be included and further investigated. In this paper we did not perform any formal validation of the derived predictions. Measures for the evaluation of calibration and discrimination of prognostic survival models can be easily adapted to the competing risks setting.19,20 Within the joint modeling framework, calibration and discrimination has been previously introduced.4,5 However to our knowledge these proposed measures of calibration and discrimination are not directly applicable on competing risk settings. Thus, validation in that setting may be an interesting topic for future research. In addition, in the paper, for the calculation of the marginal densities
Supplemental Material
sj-pdf-1-smm-10.1177_0962280215588340 - Supplemental material for Combined dynamic predictions using joint models of two longitudinal outcomes and competing risk data
Supplemental material, sj-pdf-1-smm-10.1177_0962280215588340 for Combined dynamic predictions using joint models of two longitudinal outcomes and competing risk data by Eleni-Rosalina Andrinopoulou, D Rizopoulos, Johanna JM Takkenberg, E Lesaffre, in Statistical Methods in Medical Research
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
