Skip to main content
  • Research article
  • Open access
  • Published:

Direct modelling of age standardized marginal relative survival through incorporation of time-dependent weights

Abstract

Background

When quantifying the probability of survival in cancer patients using cancer registration data, it is common to estimate marginal relative survival, which under assumptions can be interpreted as marginal net survival. Net survival is a hypothetical construct giving the probability of being alive if it was only possible to die of the cancer under study, enabling comparisons between populations with differential mortality rates due to causes other the cancer under study. Marginal relative survival can be estimated non-parametrically (Pohar Perme estimator) or in a modeling framework. In a modeling framework, even when just interested in marginal relative survival it is necessary to model covariates that affect the expected mortality rates (e.g. age, sex and calendar year). The marginal relative survival function is then obtained through regression standardization. Given that these covariates will generally have non-proportional effects, the model can become complex before other exposure variables are even considered.

Methods

We propose a flexible parametric model incorporating restricted cubic splines that directly estimates marginal relative survival and thus removes the need to model covariates that affect the expected mortality rates. In order to do this the likelihood needs to incorporate the marginal expected mortality rates at each event time taking account of informative censoring. In addition time-dependent weights are incorporated into the likelihood. An approximation is proposed through splitting the time scale into intervals, which enables the marginal relative survival model to be fitted using standard software. Additional weights can be incorporated when standardizing to an external reference population.

Results

The methods are illustrated using national cancer registry data. In addition, a simulation study is performed to compare different estimators; a non-parametric approach, regression-standardization and the new marginal relative model. The simulations study shows the new approach is unbiased and has good relative precision compared to the non-parametric estimator.

Conclusion

The approach enables estimation of standardized marginal relative survival without the need to model covariates that affect expected mortality rates and thus reduces the chance of model misspecification.

Peer Review reports

Background

When quantifying the probability of survival in cancer patients using cancer registration data it is common to estimate marginal relative survival, which under assumptions can be interpreted as marginal net survival [1, 2]. Net survival is a hypothetical construct giving the probability of being alive if it was only possible to die of the cancer under study. In these studies cause of death information is either unavailable or, more commonly, deemed unreliable [3, 4] and so expected mortality rates are incorporated so that mortality in excess of that expected can be estimated.

Relative survival is likely to vary by demographic factors such as age and sex, and disease severity, but is often reported as a marginal estimate giving the average survival in the study population. To ensure fair comparisons between population groups age-standardization is generally performed [5] where a common age distribution is forced on the groups being compared. The arguments for standardization could also apply to other covariates that vary between groups being compared.

Marginal relative survival can be estimated non-parametrically using the Pohar Perme estimator [6]. In order to age-standardize when using the Pohar Perme estimator it is usual to obtain separate estimates for different age groups, with the marginal estimate obtained through a weighted average of the age group specific effects. An alternative is to up or down weight individuals relative to a reference population [7, 8]. Standardization can also be performed using a modeling approach [911], which allows estimation of conditional effects and well as marginal estimates, through regression standardization [2, 12].

In a standard survival model, i.e. not in the relative survival framework,if one is just interested in an estimate of marginal survival, then it is not necessary to model any covariates. This means that for a reasonable parametric model with no covariates the estimated survival function should be in good agreement with the non-parametric Kaplan-Meier curve. However, in the relative survival framework there will be a difference between the estimated relative survival from a model with no covariates and a non-parametric (Pohar Perme) estimate. This is due to need to incorporate expected mortality rates that vary between individuals (by age, sex, calendar year etc) in the relative survival model. Thus, red when using a relative survival model to estimate, marginal relative survival, then covariates that affect expected mortality rates should be modelled so that the marginal effect can then be estimated through regression standardization [2, 12].

The need to model covariates that affect expected mortality rates, even when interest only lies in marginal relative survival, increases the likelihood of model misspecification, particularly as it is very common to have to model time-dependent effects (non-proportional excess hazards). In this paper we propose a model that directly estimates marginal relative survival and thus removes the need to model variables that reflect the variation in expected mortality rates. In the “Methods” section we define the conditional and marginal models and describe how incorporation of individual level weights allows external age standardization and enable covariates to be incorporated. The “Results” section includes a simulation study evaluating statistical properties of different estimators and illustrates the methods using an example of individuals diagnosed with melanoma. The paper is concluded with a discussion.

Methods

In the relative survival framework it is assumed that the overall all-cause mortality rate, h(t|Xi), for an individual with covariate pattern Xi, is the sum of the expected mortality rate, h(t|Xi), and the excess mortality rate, λ(t|Xi).

$$ h(t|\pmb{X}_{i}) = h^{*}(t|\pmb{X}_{i}) + \lambda(t|\pmb{X}_{i}) $$
(1)

For simplicity it is assumed that covariates, Xi, are the same for the expected and excess mortality rates, but this can be relaxed. Expected mortality rates are stratified by age, sex, calendar year and potentially other demographic covariates. The relative survival for covariate pattern Xi is,

$$ {R}(t|\pmb{X}_{i}) = \exp\left(-\int_{0}^{t}{\lambda(u|\pmb{X}_{i}) du}\right) $$

The marginal relative survival, Rm(t|X), is the expectation over covariates, X, i.e. EX[R(t|X)]. This can be estimated in a modelling framework when incorporating these covariates, X, by averaging the individual estimates, \(\widehat {R}(t|\pmb {X}_{i})\).

$$ \widehat{R}_{m}(t|\pmb{X}) = \frac{1}{N}\sum_{i=1}^{N}{\widehat{R}(t|\pmb{X}_{i})} $$

The marginal excess mortality rate function, λm(t|X) can be obtained through the usual transformation from survival to hazard function, \(h(t) = -\frac {d \ln [S(t)]}{dt}\).

$$ \lambda_{m}(t|\pmb{X}) = \frac{E_{\pmb{X}}\left[R(t|\pmb{X})\lambda(t|\pmb{X})\right]}{E_{\pmb{X}}[R(t|\pmb{X})]} $$
(2)

and can be estimated in a parametric modeling framework when modeling covariates X by,

$$ \widehat{\lambda}_{m}(t|\pmb{X}) = \frac{\frac{1}{N}\sum_{i=1}^{N}{\widehat{R}(t|\pmb{X}_{i})\widehat{\lambda}(t|\pmb{X}_{i})}}{\frac{1}{N}\sum_{i=1}^{N}{\widehat{R}(t|\pmb{X}_{i})}} $$

A conditional model with no covariates for the excess mortality rate

Consider the conditional model in Eq. (1) without including covariates Xi for the excess mortality rate.

$$ h(t|\pmb{X}_{i}) = h^{*}(t|\pmb{X}_{i}) + \lambda(t) $$
(3)

This assumes that the excess mortality rate, λ(t), is the same for all individuals. This would mean that all-cause mortality rate would vary between individuals only due to variation in the expected mortality rates and not variation in excess mortality rates. This is different from that defined in Eq. (2) where the individual excess mortality rates vary between individuals.

Likelihood

We adopt a fully parametric approach, so will model how excess mortality rates vary over time from diagnosis and by covariates. For an observed all-cause survival/censoring time, ti and event indicator for death due to any cause, di, the log-likelihood contribution of the ith individual with covariate pattern, Xi, for a relative survival model is

$$ \ln L_{i} = d_{i}\ln\left[h^{*}(t_{i}|\pmb{X}_{i}) + \lambda(t_{i}|\pmb{X}_{i},\pmb{\beta},\pmb{\gamma}) \right] - \Lambda(t_{i}|\pmb{X}_{i},\pmb{\beta},\pmb{\gamma}) $$

where Λ(t|Xi,β,γ) is the cumulative excess mortality function with parameters, β, modelling covariate effects and, γ, modeling the effect of time from diagnosis [11, 13].

For a marginal model with no covariates the marginal excess mortality rate, λm(t|X), as defined in Eq. (2), needs to be directly estimated; X thus denotes covariates that can impact on both expected and excess mortality rates. Rather than incorporate, h(ti|Xi), the individual expected hazard for the ith individual at time ti, a suitable estimate of the marginal expected mortality rate needs to be incorporated. A naive way to do this would be including the mean of h(ti|Xi) among those at risk at time ti. However, net survival is defined in the hypothetical world where it is not possible to die from other causes, but it is estimated in the real world where individuals may die from both their cancer and from other causes. This means that with increasing time from diagnosis individuals with a higher risk of dying from other causes will be underrepresented. This should be taken into account in both the likelihood and when estimating the mean expected mortality rate. A similar idea to that proposed by Pohar Perme et al. [6] for the non-parametric estimator is used by upweighting by the inverse of the expected survival, S(t|Xi), where,

$$S^{*}(t|\pmb{X}_{i}) = \exp\left(-\int_{0}^{t}{h^{*}(u|\pmb{X}_{i}) du}\right) $$

and defining the individual level, time-dependent weights \(w_{i}^{*}(t)\) as

$$ w_{i}^{*}(t) = \frac{1}{S^{*}(t|\pmb{X}_{i})} $$

The mean expected mortality rate at time ti incorporating weights, \(w_{i}^{*}(t_{i})\), is

$$ \bar{h}^{*}(t_{i})=\frac{\sum\limits_{j\in \mathcal{R}(t_{i})} { w_{j}^{*}(t_{i}) h^{*}(t_{i}|\pmb{X}_{j})}}{\sum\limits_{j\in \mathcal{R}(t_{i})} { w_{j}^{*}(t_{i})}} $$
(4)

where \(\mathcal {R}(t_{i})\) is the set of those at risk at time ti.

The weighted marginal expected mortality rates, \(\bar {h}^{*}(t_{i})\) can then be incorporated into the weighted likelihood rather than h(ti|Xi) together with weights, \(w_{i}^{*}(t_{i})\),

$$ \begin{aligned} \ln L_{i} &= d_{i} w_{i}^{*}(t_{i})\ln\left[\bar{h}^{*}(t_{i}) + \lambda_{m}(t_{i}|\pmb{\gamma}) \right] \\&\quad- \int_{0}^{t_{i}}{w_{i}^{*}(u)\lambda_{m}(u|\pmb{\gamma}) du} \end{aligned} $$
(5)

Note that Eq. (4) only needs to be calculated at event times and is not needed for individuals with censored times. The integral in Eq. (5) will generally not be analytically tractable. A numerical integration method, such as Gaussian quadrature, could be incorporated into the estimation process. However, we choose to split the time-scale into a number of intervals and assume that the weight is constant within each interval. The likelihood then becomes,

$$ \begin{aligned} \ln L_{i} &= d_{i} w_{i}^{*}(t_{i})\ln\left[\bar{h}^{*}(t_{i}) + \lambda_{m}(t_{i}|\pmb{\gamma}) \right] \\&\quad- \sum_{k=1}^{M_{i}} {w_{i}^{*}(t_{k})\left(\Lambda_{m}(t_{i(k)}|\pmb{\gamma}) - \Lambda_{m}(t_{i(k-1)}|\pmb{\gamma})\right)} \end{aligned} $$
(6)

where Mi is the number of intervals for the ith subject. An advantage of this approach is that after splitting the time-scale and calculating the weights, standard parametric relative survival models can be used. This requires the software to incorporate both weights and left truncation into the likelihood. There needs to be a choice of how finely to split the time-scale. As the weights depend upon expected mortality rates, the weights will vary continuously, so a choice needs to be made at what point within the interval to calculate the weight. We use the mid-point of the interval [14]. More time intervals will result in greater precision, but increase computational time. The choice of the number of time intervals is investigated in the example in the “Results” section. The weights vary within individuals and leads to within-subject correlation. Therefore, a cluster robust sandwich estimator of the variance is used [15]. This is similar to other methods that use time-dependent weights, such as the Fine and Gray subhazard model [16] or the parametric equivalent [17].

External age-standardization

In order to compare estimates of marginal relative survival between different population groups it is necessary to age-standardize to the same age distribution. In the non-parametric setting the usual approach is to estimate marginal relative survival separately within age groups and then obtain a weighted average of the age-specific estimates, with weights equal to the proportion within each age group in the reference population. In a modelling framework regression standardization is performed with each individual up or downweighted using the ratio of the proportion in the age group to which the individual belongs and the proportion in the reference age group [2]. A similar idea can be used within the marginal model that enables externally age-standardized estimates to be obtained without the need to model, or stratify by, age.

Let \(p^{a}_{i}\) be the proportion in the age group to which the ith individual belongs and \(p^{R}_{i}\) be the corresponding proportion in the reference population. Weights can be defined to upweight or downweight individual relative to the reference population.

$$ w_{i}^{a} = \frac{p^{R}_{i}}{p^{a}_{i}} $$
(7)

These weights can then be combined with the inverse expected survival weights,

$$ w_{i}(t) = w_{i}^{a} w_{i}^{*}(t) $$

These weights are the same as those defined by Sasieni and Brentnall [7] for use in non-parametric relative survival estimators. The weights need to be used when calculating \(\bar {h}^{*}(t_{i})\) by substituting \(w_{i}^{*}(t)\) for wi(t) in Eq. (4) and in the likelihood in Eq. (6). It is common just to standardize by age, but the approach is applicable when standardizing over multiple covariates.

Modeling covariates

When the aim is to make contrasts between different population groups then covariates can be added to the marginal model. For example, these covariate could be different regions/countries, socio-economic groups, time-periods or sexes. As the age distribution may vary between the groups being compared it is important to age-standardize, but now the weights defined in Eq. (7) should be calculated separately within subgroups. In addition, \(\bar {h}^{*}(t_{i})\) should be calculated separately within each subgroup.

Choice of parametric model

The likelihood defined in Eq. (6) could be used for a variety of parametric models. Here we use flexible parametric survival models on the log-cumulative excess hazard scale [9] that incorporate restricted cubic splines to model the effect of time from diagnosis. An advantage of modeling on the log-cumulative excess hazard scale is that the it provides an analytical form for the cumulative excess hazard that is required for the likelihood in Eq. (6). The model for the log cumulative excess hazard, Λ(t|k0,γ), where k0 is a vector of knots and γ the associated splines parameters, is

$$\ln\left[\Lambda(t|\pmb{k_{0}},\pmb{\gamma})\right] = \eta(t|\pmb{k_{0}},\pmb{\gamma}) = s\left(\ln(t)|\pmb{k_{0}},\pmb{\gamma}\right) $$

where s(ln(t)|k0,γ) is a restricted cubic spline function of log time. The number of parameters to model the baseline is determined by the number of knots for the restricted cubic spine function with the number of parameters (including the intercept) being equal to the number of knots. Simulation studies have shown that the models give negligible bias when estimating survival functions across a wide range of scenarios [18, 19]. For more details on these models see Royston and Lambert [13].

After fitting a model the estimated marginal relative survival and marginal excess mortality functions can be estimated,

$$\begin{aligned} \widehat{R}_{m}(t|\pmb{X}) &= \exp[-\exp(\widehat{\eta}(t|\pmb{k_{0}},\pmb{\gamma}))] \ \ \ \ \ \ \ \\ \widehat{\lambda}_{m}(t|\pmb{X}) &= \frac{d s\left(\ln(t)|\pmb{k_{0}},\widehat{\pmb{\gamma}}\right)}{dt} \exp[\widehat{\eta}(t|\pmb{k_{0}},\pmb{\gamma})] \end{aligned} $$

In Appendix I, we describe how a semi-parametric marginal model could be fitted through estimation of a separate parameter for each event type using Poisson regression and how this is a equivalent to the Pohar Perme non-parametric estimate when not modeling covariates in order to demonstrate how the methods are related. However, we do not advocate this approach due to the computational intensity of splitting the time scale at unique time points and estimating a separate parameter for each time interval.

Simulation study

A small simulation study was performed to quantify any potential bias in the different methods, the coverage probabilities and the variation in the estimates. We simulate two scenarios where there is substantial variation in relative survival by age, a situation which leads to higher bias for some methods [2]. In Scenario 1 the variation the excess hazard ratio decreases over time and in Scenario 2 we assume the effect of age is proportional over time. The simulation strategy is outlined below.

  1. 1

    The sample size of each dataset was set at 1000.

  2. 2

    All subjects were assumed to be male and diagnosed in one calendar year, 2009.

  3. 3

    Age at diagnosis was sampled from a Normal distribution with mean 66 and standard deviation 13.

  4. 4

    Times of death due to cancer was generated with the baseline survival (at the mean age) having a Weibull distribution with shape paramater, γ=0.5, and scale parameter, λ=0.2. The hazard function for the Weibull model is λγtγ−1.

  5. 5

    The effect of age at diagnosis (agediag) varied between the 2 scenarios. Scenario 1 agediag was assumed to be non-proportional over time with an excess hazard ratio of exp(0.1−0.01(agediag−66)t). The time-dependent excess hazard ratios and relative survival for selected ages are shown in Fig. 1. Note that the excess hazard ratios get closer to the null as time increases. Scenario 2 agediag was assumed to be non-proportional over time with an excess hazard ratio of exp(0.03(agediag−66)). The excess hazard ratios and relative survival for selected ages are shown in Fig. 1.

    Fig. 1
    figure 1

    True relative survival and excess hazard ratios for selected ages. For the excess hazard ratios the mean age at diagnosis, 66, is the reference

  6. 6

    Times to death from other causes were generated from exponential distributions with mortality rates obtained from expected population mortality rates in England in 2009 [20]. A different rate was used for each year of follow to account for the fact that attained age was increasing. If the simulated value was greater than 1 for any yearly interval then the individual was assumed to be alive at the start of the next interval. If the value was less than one, it was assumed that the individual died in the interval.

  7. 7

    The observed time to death was taken as the minimum of the cancer specific time to death and the other cause time to death.

  8. 8

    The following analyses were performed on each simulated data set. All flexible parametric models used 6 knots to model the effect of follow-up time.

    1. a

      The non-parametric Pohar Perme method to estimate marginal relative survival [6].

    2. b

      A conditional flexible parametric relative survival model with no-covariates, see Eq. (3).

    3. c

      A conditional flexible parametric relative survival model fitting a linear effect of age and assuming proportional hazards, see Eq. (1). The marginal relative survival was estimated using regression standardization.

    4. d

      A conditional flexible parametric relative survival model with the effect of age modelled using restricted cubic splines (4 knots) and allowing for non-proportional hazards with an interaction between the age splines and spines variables to model the effect of follow-up time, see Eq. (1). Marginal relative survival was then estimated using regression standardization.

    5. e

      A flexible parametric marginal model to directly estimate marginal relative survival. The time-scale was split every 0.2 years to incorporate the weights.

  9. 9

    1000 datasets were generated. In estimation of the coverage this gives a standard error of \(100\sqrt (0.95*0.05/1000) = 0.69\) percentage points.

  10. 10

    For each method the bias, the mean square error (MSE), the coverage and relative percentage increase in precision compared to the non-parametric Pohar Perme estimator at 1, 5 and 10 years post diagnosis was reported.

Results

Simulation study

The results of the simulation study for scenarios 1 and 2 are shown in Tables 1 and 2 respectively. In both scenarios bias is negligible for the Pohar Perme method, the marginal model and regression standardization when fitting a model with time-dependent effects for age with coverage close to 95%. Regression standardization when assuming proportional hazards gives a bias of 0.4, 0.9 and 2.0 percentage points at 1, 5 and 10 years respectively for Scenario 1, illustrating the potential issue of fitting an incorrect model. In Scenario 2 bias was lower as the variation in excess mortality reduced as a function of follow-up time [2].

Table 1 Bias (bold font), coverage (italics font) and MSE (normal font) for simulation scenario 1 comparing the non parametric Pohar Perme estimator, a conditional model (without covariates), regression standardization under proportional hazards (PH), regression standardization under non-proportional hazards (Non PH) and a marginal model. Bias is expressed as a difference in probabilities
Table 2 Bias (bold font), coverage (italics font) and MSE (normal font) for simulation scenario 2 comparing the non parametric Pohar Perme estimator, a conditional model (without covariates), regression standardization under proportional hazards (PH), regression standardization under non-proportional hazards (Non PH) and a marginal model. Bias is expressed as a difference in probabilities

Fitting a conditional survival model with no covariates gives a bias of 2.6, 4.9 and 4.6 percentage points at 1, 5 and 10 years respectively for Scenario 1, with bias of 0.0, 2.7 and 4.3 percentage points for Scenario 2.

The marginal relative survival model has a lower MSE at all times when compared to the Pohar Perme method for both scenarios. Regression standardization of the model allowing a non-proportional effect of age has similar MSE to the marginal model at 1 and 5 years, but a lower MSE at 10 years.

Regression standardization with non-proportional hazards and the marginal model had similar relative increases in precision when compared to the Pohar Perme estimator at 1 and 5 years. At 10 years, the marginal model had higher relative precision than the Pohar Perme method, but the increase was notably greater when using regression standardization.

Five (0.5%) of the models with time-dependent effect of age in Scenario 1 and 183 (18.3%) in Scenario 2 failed to converge indicating a potential problem in using complex models to model the non-proportional effects of age when there are very few individuals still at risk towards the end of follow-up.

In summary, the simulation study shows the marginal model has negligible bias and greater precision than the non-parametric Pohar Perme estimator.

Application to melanoma example

The methods are illustrated using data on 4,744 patients diagnosed with melanoma (a type of skin cancer) in a Northern European country between 1985–1994. The data is from a national cancer registry and so attempts to capture all diagnosed cases of Melanoma in the country. These data are distributed with the Stata strs package [21]. The code to implement the methods are shown in Appendix II with datasets freely available.

Figure 2a shows the non-parametric Pohar Perme estimator together with the estimated modeled marginal relative survival. These are internally age standardized estimates as the expectation is over the observed age distribution. Here a model with 6 knots (i.e. five restricted cubic spline variables) were used to model the marginal log cumulative excess hazard. The time-scale was split every 0.2 years when incorporating the weights. Agreement can be seen to be very good. A feature of the Pohar Perme estimator is that it is highly variable towards the end of follow-up. This is due to the impact of the weights, with the small number of older individuals who are still alive having high influence. Also shown on this figure is the conditional model with no-covariates which shows a clear difference when compared to the other methods.

Fig. 2
figure 2

Estimates of marginal relative survival: Panel (a) shows internally age-standardized estimates using the non-parametric Pohar Perme method and the marginal relative survival model. Also shown is the estimate from the conditional relative survival model with no-covariates. Panel (b) shows externally age-standardized estimates from the non-parametric Pohar Perme method and marginal relative survival model. 95% confidence intervals are shown by either dashed lines or the shaded areas

Figure 2b shows the externally age-standardized marginal relative survival using Pohar Perme and the marginal relative survival model. The age groups were <45,45−54,55−64,65−74,75+ years of age with standardization weights as defined in the International Cancer Survival Standard (ICSS) [5]. In order to age-standardize using the Pohar-Perme estimator separate estimates for each age group were obtained and then a weighted average was calculated. The marginal relative survival model does not need to stratify or model the effect of age due to the incorporation of the weights defined in Eq. (7). The Figure shows that, in this example, external age-standardization makes little difference when compared to the internally age-standardized estimates in Fig. 2a as the reference age-distribution was similar to that observed in the study population.

In Fig. 2 the time-scale was split every 0.2 years with the weights taken at the midpoint of each interval and 6 knots (5 spline variables) were used to model the baseline. Table 3 shows the estimated survival at 1, 5 and 10 years using a variety of time splits, at 0.05, 0.1, 0.2, 0.5, 1 and 2.5 years, and various number of knots (4, 6, 8 and 10). There is very little difference in any of the estimates at 1-year with the largest difference, 0.002. With increasing time there is more variation, but for time-splits from 0.05 to 0.2 years yields a difference of 0.002 or less at 10 years. Larger time-splits lead to larger differences, which is not surprising as the weights are forced to be constant within each interval.

Table 3 Estimated marginal relative survival at 1, 5 and 10 years from diagnosis using time splits at 0.05, 0.1, 0.2, 0.5, 1 and 2.5 years and using 4 6 8 and 10 knots to model the baseline

If the mean expected mortality defined in Eq. (4) is calculated separately for males and females then the effect of sex can be modelled. When doing so one will generally want to age-standardize as the age distributions of the groups being compared could differ. There is a choice here about what age distribution to standardize to. This could be the ICSS weights used above, which would allow comparisons with other studies using the same age standard. Alternatively, if measures more relevant to the study population are required then this could be the joint age distribution of male and females or the age distribution of one of the groups. Here the age distribution of males will be used to illustrate some useful interpretations.

It could be simplistically assumed that the marginal excess mortality rates are proportional. Fitting such a model yields a marginal excess mortality rate ratio for females compared to males of 0.70 (95% CI: 0.59 to 0.83). The assumption of proportional marginal hazards can be relaxed by incorporating an interaction between the effect of being female and the effect of time. This was done using a spline term with 4 knots, i.e. three additional parameters. The excess mortality rates for both the proportional hazards model and non-proportional model are shown in Fig. 3a with the corresponding marginal relative survival functions in Fig. 3b. The curves for each sex are very similar as the proportionality assumption appears reasonable here; (a decrease in log-likelihood of 0.71 with 3 degrees of freedom).

Fig. 3
figure 3

Estimated marginal excess mortality rate (panel (a)) and marginal relative survival functions (panel (b)) for proportional and non-proportional models. Estimates have been standardized to the age distribution of the males

As we have standardized each of these estimates to the age distribution of males the curves give the factual estimates for males, i.e. given their own age distribution and the counterfactual effect for males if they had the same relative survival as females. This is because the age-distribution for males is being applied to the effect of females.

Figure 4 shows the difference in the two marginal relative survival functions with 95% confidence intervals. As the age distribution of males has been used as the age reference, this gives an estimate how much the marginal relative survival would increase if males had the same relative survival as females.

Fig. 4
figure 4

Difference in marginal relative survival with 95% confidence interval (shaded area) comparing females to males

Discussion

We have demonstrated how marginal estimates of relative survival can be directly modeled through calculation of appropriately weighted mean expected mortality rates and introducing weights into the likelihood. We have also shown how additional weights can be included so that age-standardized marginal relative survival is directly estimated without the need to model or stratify by age. In addition we have shown that contrasts of marginal relative survival can be directly estimated from the marginal model. Although the relative survival framework has almost exclusively used within cancer, the methods described here could be utilized in other disease areas [22, 23].

When making comparisons between population groups there is generally a need to age-standardize as in most situations there will be a difference in the age distribution and there is usually a strong relationship between age and relative survival. Introducing individual level relative weights enables external age-standardization without the need to stratify or model age. Rutherford et al recently advocated the use of individual relative weights for the non-parametric Pohar Perme estimator for age standardization when data were sparse [8]. An important choice when age standardizing is in the age distribution used. Using ICSS weights allows comparisons with many other studies using the same weights, but using the age-distribution observed in the study population may be more useful for generalizing to a particular population. As separate effects do not need to be calculated by age group the use of narrower categories of age for the relative weights can be used. For example, in practice the oldest age group when standardizing for most cancer types is 75+. There can be substantial variation in relative survival within this age group and the age distribution within this age group may vary between population groups. Using the weighting approach proposed here one could, for example, apply the five year age groups presented by Corazziari et al. [5].

If just an overall summary of marginal relative survival is needed then the non-parametric Pohar Perme estimate could be used. An advantage of the marginal model is that it has negligible bias, but higher relative precision than then non-parametric approach. Our approach is particularly useful for comparisons between different population groups.

A limitation of the method is that the user has to choose the number of knots for the restricted cubic splines. Previous work has demonstrated that estimates are insensitive to the number of knots, as long as their are enough to capture the underlying shape of the hazard functions [18, 19, 24]. However, a sensitivity analysis, such as that displayed in Table 3 can be reassuring to both the analyst and others. The marginal model approximates the integral defined in the likelihood by splitting the time-scale (Eq. 6). The sensitivity analysis shown here shows that as long as the width of the intervals is at 0.2 years or less there is very little impact on the estimates, but smaller intervals would lead to improved accuracy. Although more accurate estimation could be obtained through calculating the integral numerically for each individual in the estimation process, we prefer the split time approach as it enables standard software to be used together with a host of useful post-estimation prediction commands.

In observational studies with time-to-event outcomes it is common to use inverse probability weighting (IPW) to adjust for confounders [25]. However, a conditional relative survival model without covariates gives a biased estimate of the marginal survival function (Fig. 2) and thus the IPW approach in not directly transferable to the relative survival framework. However, applying the IPW weights to the marginal model will enable the IPW approach to be used in the relative survival framework and this is being further investigated in ongoing work.

When there is interest in more detailed comparisons between population groups then a conditional model should be used. For example, to quantify how differences in relative survival vary by age. However, when there is only interest in the marginal relative survival and associated contrasts, the model proposed here simplifies the modelling process, has negligible bias and increased precision over the non parametric Pohar Perme estimate.

Conclusion

When only marginal effects are of interest the model described here is particularly useful as it less prone to model misspecification as covariates that are associated with expected mortality rates do not need to be incorporated into the model.

Software

Stata code to set-up data and fit the models for the melanoma example is shown in Appendix II of the supplementary material.

Availability of data and materials

The melanoma data used in this example is available in the strs Stata package (see reference 16). Code to fit the models is available in the supplementary material.

Declarations

Abbreviations

ICSS:

International cancer survival standard

IPW:

Inverse probability weighting

MSE:

Mean squared error

References

  1. Pavlic K, Pohar Perme M. Using pseudo-observations for estimation in relative survival. Biostatistics. 2018; 20(3):384–99. https://0-doi-org.brum.beds.ac.uk/10.1093/biostatistics/kxy008.

    Article  Google Scholar 

  2. Lambert PC, Dickman PW, Rutherford MJ. Comparison of approaches to estimating age-standardized net survival. BMC Med Res Methodol. 2015; 15:64. https://0-doi-org.brum.beds.ac.uk/10.1186/s12874-015-0057-3.

    Article  Google Scholar 

  3. Johnson CJ, Hahn CG, Fink AK, German RR. Variability in cancer death certificate accuracy by characteristics of death certifiers. Am J Forensic Med Pathol. 2012; 33(2):137–42. https://0-doi-org.brum.beds.ac.uk/10.1097/PAF.0b013e318219877e.

    Article  Google Scholar 

  4. Schaffar R, Rapiti E, Rachet B, Woods L. Accuracy of cause of death data routinely recorded in a population-based cancer registry: impact on cause-specific survival and validation using the Geneva Cancer Registry. BMC Cancer. 2013; 13:609. https://0-doi-org.brum.beds.ac.uk/10.1186/1471-2407-13-609.

    Article  Google Scholar 

  5. Corazziari I, Quinn M, Capocaccia R. Standard cancer patient population for age standardising survival ratios. Eur J Cancer. 2004; 40(15):2307–16. https://0-doi-org.brum.beds.ac.uk/10.1016/j.ejca.2004.07.002.

    Article  Google Scholar 

  6. Pohar Perme M, Stare J, Estève J. On estimation in relative survival. Biometrics. 2012; 68:113–20. https://0-doi-org.brum.beds.ac.uk/10.1111/j.1541-0420.2011.01640.x.

    Article  Google Scholar 

  7. Sasieni P, Brentnall AR. On standardized relative survival. Biometrics. 2016; 73:473–82. https://0-doi-org.brum.beds.ac.uk/10.1111/biom.12578.

    Article  Google Scholar 

  8. Rutherford MJ, Dickman PW, Coviello E, Lambert PC. Estimation of age-standardized net survival, even when age-specific data are sparse. Cancer Epidemiol. 2020; 67:101745. https://0-doi-org.brum.beds.ac.uk/10.1016/j.canep.2020.101745.

    Article  Google Scholar 

  9. Nelson CP, Lambert PC, Squire IB, Jones DR. Flexible parametric models for relative survival, with application in coronary heart disease. Stat Med. 2007; 26(30):5486–98. https://0-doi-org.brum.beds.ac.uk/10.1002/sim.3064.

    Article  Google Scholar 

  10. Dickman PW, Sloggett A, Hills M, Hakulinen T. Regression models for relative survival. Stat Med. 2004; 23(1):51–64. https://0-doi-org.brum.beds.ac.uk/10.1002/sim.1597.

    Article  Google Scholar 

  11. Estève J, Benhamou E, Croasdale M, Raymond L. Relative survival and the estimation of net survival: elements for further discussion. Stat Med. 1990; 9(5):529–38.

    Article  Google Scholar 

  12. Syriopoulou E, Rutherford MJ, Lambert PC. Marginal measures and causal effects using the relative survival framework. Int J Epidemiol. 2020; 49:619–28. https://0-doi-org.brum.beds.ac.uk/10.1093/ije/dyz268.

    Article  Google Scholar 

  13. Royston P, Lambert PC. Flexible Parametric Survival Analysis in Stata: Beyond the Cox Model: Stata Press; 2011.

  14. Seppä K, Hakulinen T, Läärä E, Pitkäniemi J. Comparing net survival estimators of cancer patients. Stat Med. 2016; 35(11):1866–79. https://0-doi-org.brum.beds.ac.uk/10.1002/sim.6833.

    Article  Google Scholar 

  15. Lin DY. On fitting Cox’s proportional hazards models to survey data. Biometrika. 2000; 87:37–47. https://0-doi-org.brum.beds.ac.uk/10.1093/biomet/87.1.37.

    Article  Google Scholar 

  16. Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc. 1999; 446:496–509.

    Article  Google Scholar 

  17. Lambert PC, Wilkes SR, Crowther MJ. Flexible parametric modelling of the cause-specific cumulative incidence function. Stat Med. 2017; 36:1429–46. https://0-doi-org.brum.beds.ac.uk/10.1002/sim.7208.

    Article  Google Scholar 

  18. Rutherford MJ, Crowther MJ, Lambert PC. The use of restricted cubic splines to approximate complex hazard functions in the analysis of time-to-event data: a simulation study. J Stat Comput Simul. 2015; 85:777–93. https://0-doi-org.brum.beds.ac.uk/10.1080/00949655.2013.845890.

    Article  Google Scholar 

  19. Bower H, Crowther MJ, Rutherford MJ, Andersson TML, Clements M, Liu XR, Dickman PW, Lambert PC. Capturing simple and complex time-dependent effects using flexible parametric survival models. Commun Stat - Simul Comput. 2019. https://0-doi-org.brum.beds.ac.uk/10.1080/03610918.2019.1634201.

  20. Lifetables. Cancer Survival Group UK. London School of Hygeine and Tropical Medicine. https://csg.lshtm.ac.uk/life-tables/. Accessed 21 Dec 2020.

  21. Dickman PW, Coviello E. Estimating and modelling relative survival. Stata J. 2015; 15(1):186–215.

    Article  Google Scholar 

  22. Nelson CP, Lambert PC, Squire IB, Jones DR. Relative survival: what can cardiovascular disease learn from cancer?Eur Heart J. 2008; 29(7):941–7. https://0-doi-org.brum.beds.ac.uk/10.1093/eurheartj/ehn079.

    Article  Google Scholar 

  23. Bhaskaran K, Hamouda O, Sannes M, Boufassa F, Johnson AM, Lambert PC, Porter K, CASCADE Collaboration. Changes in the risk of death after HIV seroconversion compared with mortality in the general population. JAMA. 2008; 300(1):51–59. https://0-doi-org.brum.beds.ac.uk/10.1001/jama.300.1.51.

    Article  CAS  Google Scholar 

  24. Syriopoulou E, Mozumder SI, Rutherford MJ, Lambert PC. Robustness of individual and marginal model-based estimates: A sensitivity analysis of flexible parametric models. Cancer Epidemiol. 2018; 58:17–24. https://0-doi-org.brum.beds.ac.uk/10.1016/j.canep.2018.10.017.

    Article  Google Scholar 

  25. Austin PC. The use of propensity score methods with survival or time-to-event outcomes: reporting measures of effect similar to those used in randomized experiments. Stat Med. 2014; 33:1242–58. https://0-doi-org.brum.beds.ac.uk/10.1002/sim.5984.

    Article  Google Scholar 

Download references

Acknowledgments

Not applicable.

Funding

This work was partly funded by Cancerfonden and Vetenskapsrådet. The funders had no role in the design of the study or collection, analysis, and interpretation of data or writing of the manuscript. Open access funding provided by Karolinska Institute.

Author information

Authors and Affiliations

Authors

Contributions

PCL, ES, MR conceived the idea for the study. PCL developed and implemented the statistical analysis, which included multiple discussions with ES and MR. PCL drafted the manuscript with input from ES and MR and all authors read and approved the final manuscript.

Corresponding author

Correspondence to Paul C. Lambert.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

Poisson regression and the Pohar Perme estimator.

Additional file 2

Stata implementation of the models.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lambert, P.C., Syriopoulou, E. & Rutherford, M.R. Direct modelling of age standardized marginal relative survival through incorporation of time-dependent weights. BMC Med Res Methodol 21, 84 (2021). https://0-doi-org.brum.beds.ac.uk/10.1186/s12874-021-01266-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12874-021-01266-1

Keywords