A cautionary note on the use of attributable fractions in cohort studies

Abstract

The attributable fraction is a widely used measure to quantify the public health impact of an exposure on an outcome. It was originally proposed for binary outcomes, but attributable fraction estimators have also been proposed for time-to-event outcomes. In this note, we consider an estimator which was proposed by Benichou (Stats Methods Med Res, 2001) and is supposed to estimate the cohort attributable fraction, i.e. the number of events that would have been prevented in the cohort during follow-up, if the exposure would hypothetically have been eliminated. We show that this estimator is only valid under certain assumptions, which are often likely to be violated in practice. We further argue that the cohort attributable fraction may not be of substantial scientific interest in the first place. We propose a potentially more relevant measure of attributable fraction in cohort studies; the baseline attributable fraction. We show how the baseline attributable fraction can be conveniently estimated in Cox proportional hazards models.

Keywords

attributable fraction causal inference cohort studies Cox proportional hazards model

1 Introduction

The attributable fraction (AF; also known as the attributable risk) is a widely used measure to quantify the public health impact of an exposure on an outcome. It was originally defined for binary outcomes as the proportion of outcomes that would be eliminated in the population if the exposure was hypothetically eliminated.¹ In observational studies, consistent estimation of the AF typically requires appropriate confounder adjustment. Benichou^2,3 gave an overview of various adjusted estimators, including the Mantel–Haenszel estimator, the weighted-sum estimator, and the model-based estimator. The latter is based on the following algebraic rearrangement of the AF

AF = 1 - \sum_{x, z} \frac{ρ_{xz}}{{RR}_{x | z}}

In this expression, the sum is taken over all joint levels of the exposure X and the adjustment covariates Z.

{RR}_{x | z}

is the conditional risk ratio for level x of the exposure, given level z of adjustment covariates, and

ρ_{xz}

is the population proportion with levels

X = x

and

Z = z

among the cases (subjects who have the outcome). The model-based estimator can be used for both case–control studies and cross-sectional studies with binary outcomes, by replacing

ρ_{xz}

with the sample proportion of cases with levels

X = x

and

Z = z

, and by replacing

{RR}_{x | z}

with an estimate obtained from a regression model; in case–control studies, this risk ratio has to be approximated by the corresponding odds ratio.

Benichou^2,3 claimed that the model-based estimator can be adapted to cohort studies with right censored time-to-event outcomes, by replacing ${RR}_{x | z}$ with an estimate of the adjusted (hazard) rate ratio obtained from a Poisson or Cox regression. However, he provided no formal justification for this claim. Furthermore, he did not clarify what to replace $ρ_{xz}$ with when the outcome is no longer binary. We made a survey of recent literature and found that the cohort adaption of the model-based estimator proposed by Benichou^2,3 has been used in many studies.^4–11 In all these studies, $ρ_{xz}$ was replaced with the sample proportion with levels x and z among those who were observed to experience the event before end of follow-up; we refer to the resulting estimator of the AF as the “cohort-adapted estimator.” Without exception, the authors of these studies appear to have interpreted the cohort-adapted estimator as estimating the proportion of events that would have been eliminated in the cohort during follow-up, if the exposure had been eliminated; we refer to this proportion as the “cohort attributable fraction” (CAF). In this note, we show that the cohort-adapted estimator only estimates the CAF under the following two assumptions: (1) the cumulative incidence is small during the whole follow-up, and (2) there is no censoring, apart from administrative censoring at the end of follow-up. When either of these assumptions is violated, the cohort adapted estimator does not estimate the CAF and may have no meaningful interpretation. We further argue that even if both these assumptions hold, the CAF may not be of substantial scientific interest. We propose a potentially more relevant measure of attributable fraction in cohort studies, the baseline attributable fraction (BAF). We show how the BAF can be conveniently estimated in cohort studies, using Cox proportional hazards models.¹² Our estimator does not require the two aforementioned assumptions required by the cohort-adapted estimator.

The paper is organized as follows. In Section 2, we show that the cohort-adapted estimator only converges to the CAF under the aforementioned two assumptions. In Section 3, we argue that the CAF may not be a relevant measure of attributable fraction in the first place, and we propose the BAF as a potentially more relevant measure. In Section 4, we show how the BAF can be conveniently estimated in Cox proportional hazards models. In Section 5, we carry out a small simulation study to demonstrate the consistency of our proposed estimator. In Section 6, we present an application to real data.

2 The underlying assumptions of the cohort-adapted estimator

We first introduce some notation. We consider a binary exposure X, with levels 0 and 1 for “unexposed” and “exposed,” respectively. We let T denote the random time to event. We assume that the event of interest is “absorbing,” so that when the event occurs the individual is no longer part of the cohort, e.g. when the subject dies. We let τ denote the end of follow-up. To begin with, we assume that there is no censoring apart from administrative censoring at the end of follow-up.

We will throughout assume that the conditional log hazard function, given X and adjustment covariates Z, follows the Cox proportional hazards (PH) model

λ (t | X, Z) = μ (t) \exp (β X + γ Z)

(1)

where

μ (t) = λ (t | X = 0, Z = 0)

is the unspecified baseline hazard function. The Cox PH model is not crucial for any of the arguments that we make but simplifies the exposition. In this section, we are only concerned with asymptotic properties (e.g. bias) of the cohort-adapted estimator and not its finite sample behavior. Thus, to further simplify the exposition we will assume that the cohort is “infinitely large”, so that we do not need to bother about sampling variability. Under the model in (1), the cohort-adapted estimator can be expressed as

1 - \sum_{x, z} Pr (X = x, Z = z | T \leq τ) \exp (- β x) = Pr (X = 1 | T \leq τ) {1 - \exp (- β)}

where

Pr (X = x, Z = z | T \leq τ)

is the proportion in the cohort with levels

X = x

and

Z = z

among those who experience the event during follow-up.

To relate the cohort-adapted estimator to the CAF, we need a formal definition of the latter. Using standard potential outcome notation^13,14 we let T₀ denote the counterfactual time to event that would have been observed for a given subject, had that subject been unexposed. We let $F (t) = Pr (T \leq t)$ denote the factual proportion of events that occurs at or before time t in the cohort (i.e. the cumulative distribution function at $T = t$ ), and we let $F_{0} (t) = P (T_{0} \leq t)$ denote the counterfactual proportion of events that would occur at or before time t in the cohort, had everybody in the cohort been unexposed. In this notation, the CAF is equal to

CAF = 1 - \frac{F_{0} (τ)}{F (τ)}

For instance, if 20% of all subjects in the cohort factually experience the event before the end of follow-up, but only 5% would experience the event had everybody been unexposed, then the CAF equals

1 - 0.05 / 0.2 = 75 %

In randomized trials, exposed and unexposed are exchangeable so that $F_{0} (τ)$ is equal to the proportion of subject who would experience the event before end of follow-up among those factually unexposed, i.e. $F_{0} (τ) = F (τ | X = 0)$ . In observational studies, $F_{0} (τ)$ can be estimated by adjusting for covariates, if these are appropriately selected and sufficient for confounding control. To determine whether a particular set of covariates is sufficient or not is a difficult problem, and one that typically requires strong subject matter knowledge. This problem of covariate selection is beyond the scope of this paper; we proceed by assuming that we have measured a set of covariates Z which is sufficient for confounding control. Under this assumption, $F_{0} (τ)$ is equal to $E_{Z} {F (τ | X = 0, Z)}$ , where the outer expectation is taken over the marginal distribution of Z.^15,16 It follows that the CAF can be rewritten as

1 - \frac{E_{Z} {F (τ | X = 0, Z)}}{F (τ)} = 1 - E_{X, Z} {\frac{F (τ | X = 0, Z)}{F (τ)} \frac{F (τ | X, Z)}{F (τ | X, Z)}} = 1 - E_{X, Z} {\frac{F (τ | X = 0, Z)}{F (τ | X, Z)} | T \leq τ} = 1 - E_{X, Z} {\frac{1 - \exp {- \int_{0}^{τ} λ (u | X = 0, Z) d u}}{1 - \exp {- \int_{0}^{τ} λ (u | X, Z) d u}} | T \leq τ} \approx 1 - E_{X} {\exp (- β X) | T \leq τ} = Pr (X = 1 | T \leq τ) {1 - \exp (- β)}

which is equal to the (asymptotic limit of the) cohort-adapted estimator. Note the approximation just before the last equality in the derivation; this approximation is only valid if the cumulative incidence

\int_{0}^{τ} λ (u | X, Z) d u

is small, in which case

\exp {- \int_{0}^{τ} λ (u | X, Z) d u}

is approximately equal to

1 - \int_{0}^{τ} λ (u | X, Z) d u = 1 - exp (β X + γ Z) \int_{0}^{τ} μ (u) d u

. Thus, the cohort-adapted estimator does indeed approximately estimate the CAF, provided that the cumulative incidence is small throughout follow-up.

To see that the cohort-adapted estimator can be severely biased if the cumulative incidence is large, consider the case when the event of interest is death and the follow-up is infinite; $τ = \infty$ . In this case, the CAF is equal to 0, since $F_{0} (τ) = F (τ) = 1$ (eventually, everyone dies). However, the cohort-adapted estimator equals $Pr (X = 1 | T \leq \infty) {1 - \exp (- β)} = Pr (X = 1) {1 - \exp (- β)}$ , which may be far from 0, unless $β = 0$ (i.e. unless the exposure has no effect).

We next consider the impact of random right censoring. Let C denote the random time to right censoring, so that for each subject we observe the event if $T < min (C, τ)$ .¹⁷ In the presence of random censoring, we cannot observe $Pr (X = 1 | T \leq τ)$ , but have to replace this proportion in the cohort-adapted estimator with $Pr {X = 1 | T \leq min (C, τ)}$ , that is, with the proportion in the cohort who are exposed among those who are observed to experience the event during follow-up. To see that this modification may induce bias, suppose that the censoring is heavy, so that a large fraction is censored early during follow-up. Those who contribute to $Pr {X = 1 | T \leq min (C, τ)}$ will then mainly be those who die early. If there is an association between the exposure and the time to death, then there will be an overrepresentation of exposed subjects in this group, as compared to those who die before end of follow-up. This implies that $Pr {X = 1 | T \leq min (C, τ)} > Pr (X = 1 | T \leq τ)$ , so that the cohort-adapted estimator becomes biased upwards. We note that this bias will be present even if C is completely independent of both X, Z, and T.

3 The baseline attributable fraction

Many cohort studies have relatively long follow-up, or relatively high baseline hazard, so that the assumption of small cumulative incidence is not tenable. Censoring, either random or nonrandom, is also commonly present. Thus, the cohort-adapted estimator may often be biased, as an estimator of the CAF. Recently, methods have been proposed which can be used to consistently estimate the CAF even if the cumulative incidence is not small and random censoring is present.^18,19 However, it can be questioned whether the CAF is a relevant measure of attributable fraction in the first place. The problem is that the CAF depends on the actual follow-up time τ in a rather arbitrary fashion. In particular, as already noted, the CAF is 0 when the event of interest is death and τ is sufficiently large, since eventually everyone dies regardless of whether we are able to eliminate the exposure or not. To bypass this problem, one could instead present (an estimate of) the whole AF function

AF (t) = 1 - \frac{F_{0} (t)}{F (t)}

which measures the proportion of events that would have been eliminated in the cohort as a function of time, if the exposure had been eliminated.^18,19 A strength of the AF function is that it preserves the time dynamics of the cohort study by not summarizing the exposure impact into a scalar measure. This strength may also be a weakness though; it is typically more difficult to communicate and interpret a (possibly complicated) function than a single number. If we wish to summarize the exposure impact by one scalar measure, then we may think of various alternatives. One option may be the maximum value

{sup}_{t > 0} AF (t)

, which provides a global upper bound on the AF function. This measure may be relevant in scenarios where the AF function starts and remains at zero for some time after the study start before increasing, for instance when the exposure is (absence of) a treatment which takes a while to build up and become effective. Another option may be the limit value

{lim}_{t \to \infty} AF (t)

, which measures the proportion of events that would have been eliminated in the cohort in the long run, if the exposure had been eliminated. This measure may be relevant when the event of interest is, in contrast to death, not inevitable in the long run, for instance when the event is the occurrence of a particular disease.

We will focus on the limit value ${lim}_{t \to 0} AF (t)$ , which measures the proportion of events that would have been eliminated during a short (infinitesimal) period of time immediately after baseline in the cohort, if the exposure had been eliminated; we refer to this measure as the “baseline attributable fraction” (BAF). The BAF coincides with the maximum value of the AF function when the AF function is monotonically decreasing. An attractive feature of the BAF is that it does not only apply to the cohort but also to the population from which the cohort was drawn. This is not generally true for other summaries of the AF function, since the characteristics of a (closed) cohort often change and diverge from the characteristics of the source population over time, as subjects gradually drop out from the cohort due to death or for other reasons. At baseline though, the cohort is still guaranteed to be representative for the source population, and thus the BAF can be interpreted as the proportion of events that would have been eliminated during a short period of time in the source population, if the exposure had been eliminated. We emphasize that the BAF may not always be a relevant measure of attributable fraction, e.g. when the exposure has no short-term effect, so that the BAF is equal to 0 even though the exposure may have a long-term effect. However, in many scenarios, in particular when the effect of the exposure is relatively constant over time, the BAF summary measure may offer a useful compromise between relevance and parsimony.

Under Cox PH models, the BAF has a simple and instructive expression. Consider the following Cox PH model, which generalizes the model in (1) in that it allows for both X and Z to vary over time

λ {t | X (t), Z (t)} = μ (t) \exp {β X (t) + γ Z (t)}

(3)

We emphasize that even though this model allows for the exposure to vary over time, it assumes that the effect of the exposure (

= β

) is constant over time. Let

p = Pr {X (0) = 1}

denote the proportion exposed at baseline, and let

Ω = \log {p / (1 - p)}

denote the log odds of being exposed at baseline. Let

α_{1}

and

α_{0}

denote the log of the mean of

\exp {γ Z (0)}

, for the exposed and unexposed at baseline, respectively, i.e.

α_{x} = \log E [\exp {γ Z (0)} | X (0) = x]

. If Z is sufficient for confounding control, then it can be shown (see Appendix 1) that under the model in (3), the BAF can be written as

BAF = \frac{1 - \exp (- β)}{1 + \exp {- (β + Ω + α_{1} - α_{0})}}

(4)

To gain some intuition for this expression, it is useful to study how the RHS of (4) depends on the causal effect of X on T, on the proportion exposed at baseline, and on the strength of confounding. It is easy to show that the RHS of (4) increases with both β and

Ω

, and equals 0 if either

β = 0

Ω = - \infty

. This is reasonable; if the exposure has no effect on the outcome (so that

β = 0

) or if nobody is exposed (so that

Ω =

- \infty

), then no events can be attributed to the exposure. When β goes to ∞, the RHS of (4) goes to 1. This is also reasonable; if the exposure has a strong effect on the outcome, then most of the events could be attributed to the exposure. The term

α_{1} - α_{0}

depends on the strength of confounding, and equals 0 if either Z and T are independent (so that

γ = 0)

or Z and X are independent (so that

E [\exp {γ Z (0)} | X (0)]

does not depend on

X (0)

4 Estimation of the BAF under Cox PH models

The Cox PH model in (3) can be fitted to data with standard software to obtain estimates $\hat{β}$ and $\hat{γ}$ . Subsequently, the log of the sample means of $\exp {\hat{γ} Z (0) | X (0) = 1}$ and $\exp {\hat{γ} Z (0) | X (0) = 0}$ can be used as estimates $\hat{α_{1}}$ and $\hat{α_{0}}$ , respectively. The sample log odds of being exposed at baseline can be used as an estimate $\hat{Ω}$ . Finally, the estimates $(\hat{β}, \hat{Ω}, \hat{α_{1}}, \hat{α_{0}})$ can be plugged into (4), to obtain an estimate $\hat{BAF}$ .

To assess the statistical uncertainty in $\hat{BAF}$ , it is desirable to derive its asymptotic distribution. Using the multivariate delta method,²⁰ we have that $\hat{BAF}$ has an asymptotic normal distribution with variance $var (\hat{BAF}) = g T Σ g$ , where g is the column vector of derivatives of BAF with respect to $(β, Ω, α_{1}, α_{0})$ , and $Σ$ is the asymptotic variance–covariance matrix of $(\hat{β}, \hat{Ω}, \hat{α_{1}}, \hat{α_{0}})$ . By differentiating the BAF with respect to $(β, Ω, α_{1}, α_{0})$ , we have that

\frac{\partial BAF}{\partial β} = \frac{\exp (- β) [1 + \exp {- (Ω + α_{1} - α_{0})}]}{[1 + \exp {- (β + Ω + α_{1} - α_{0})}] 2}

and

\frac{\partial BAF}{\partial Ω} = \frac{\partial BAF}{\partial α_{1}} = - \frac{\partial BAF}{\partial α_{0}} = \frac{{1 - \exp (- β)} \exp {- (β + α_{1} - α_{0} + Ω)}}{[1 + \exp {- (β + α_{1} - α_{0} + Ω)}] 2}

Define

θ = (β, γ, Ω, α_{1}, α_{0}) T

. To derive

Σ

we note that

\hat{θ}

solves the estimating equation

\sum_{i = 1}^{n} U_{i} (θ) = 0

, where

U_{i} (θ) = (S_{i} (β, γ) X_{i} (0) - expit (Ω) X_{i} (0) [\exp {γ Z_{i} (0)} - \exp {α_{1}}] {1 - X_{i} (0)} [\exp {γ Z_{i} (0)} - \exp {α_{0}}])

is the estimating equation contribution from subject i, and

S_{i} (β, γ)

is the Cox partial likelihood score contribution from subject i. We note that the

S_{i} (β, γ)

’s are not independent since the same subject may appear in several risk sets during follow-up. However, it follows from results in the Appendix of Lee et al.²¹ that this dependence may be asymptotically ignored. Treating the

S_{i} (β, γ)

’s as independent it then follows from standard results on estimating equations²² that

\hat{θ}

has an asymptotic normal distribution with variance–covariance matrix equal to

h T \tilde{Σ} h

, where

h = E {\partial U_{i} (θ) / \partial θ}

and

\tilde{Σ} = var {U_{i} (θ)}

. Define

θ * = (β, Ω, α_{1}, α_{0}) T

and let

(h T \tilde{Σ} h) θ *

denote the submatrix of

h T \tilde{Σ} h

corresponding to the elements of

θ *

. We then have that

var (\hat{BAF}) = g T (h T \tilde{Σ} h) θ * g

. Estimates of g, h, and

\tilde{Σ}

can be obtained by replacing the true value of θ and the population moments in

\tilde{Σ}

with the estimate

\hat{θ}

and the corresponding sample moments.

The derivation of $var (\hat{BAF})$ outlined above assumes that the subjects are independent. Often, this assumption may be violated due to clustering, e.g. in studies of siblings or twins. When the clusters are independent, it is easy to correct $var (\hat{BAF})$ for dependencies within clusters. Let $U_{ij} (θ)$ denote the estimating equation contribution from subject j in cluster i, and define $U_{i} (θ) = \sum_{j} U_{ij} (θ)$ . With this minor modification, the $U_{i} (θ)$ ’s are now independent and $var (\hat{BAF})$ is equal to $g T (h T \tilde{Σ} h) θ * g$ as before.²¹

5 Simulation study

To investigate the finite sample properties of the proposed methods, we carried out a small simulation study. In all simulated scenarios, we generated a time-stationary covariate Z from a normal distribution with mean 0 and standard deviation σ. We generated a time-stationary exposure X from the logistic model

logit (p) = ω + Z

. For each simulated scenario, ω was set to achieve a desired value of p, as described below. We generated survival times T from the Cox proportional hazards model in (1), with

μ (t) = constant = 1

. We generated random censoring times C from the same conditional distribution as T, given X and Z. Under this scheme, C is associated with X and Z, but conditionally independent of T, given X and Z. The marginal (over X and Z) censoring rate is 50%. From this model, we generated 1000 samples of

n = 500

independent subjects, for each of the eight possible parameter combinations of

σ = (0.5, 1.5)

p = (0.05, 0.15)

, and

β = (0.5, 1.5)

. For each sample, we estimated the BAF and its standard error using the method described in Section 4. Table 1 displays the mean (over the 10,000 samples) estimated BAF (

\hat{BAF}

) together with the true value (

BAF

), and the mean estimated standard error (

SE

) together with the empirical standard error (

ESE

) for each scenario. We observe that the mean estimated BAF is close to the true value and that the mean estimated standard error agrees well with the empirical standard error, for all scenarios.

Table 1.

Simulation results for estimation of the BAF, for 1000 samples of $n = 500$ independent subjects.

σ	p	β	$BAF$	$\hat{BAF}$	$SE$	$ESE$
0.5	0.05	0.5	0.04	0.04	0.03	0.03
0.5	0.05	1.5	0.18	0.18	0.06	0.07
0.5	0.15	0.5	0.11	0.11	0.04	0.05
0.5	0.15	1.5	0.39	0.39	0.07	0.06
1.5	0.05	0.5	0.14	0.14	0.08	0.10
1.5	0.05	1.5	0.46	0.47	0.10	0.13
1.5	0.15	0.5	0.24	0.23	0.07	0.09
1.5	0.15	1.5	0.63	0.62	0.06	0.08

BAF: true value of the baseline attributable fraction; $\hat{BAF}$ : mean estimated baseline attributable fraction; $SE$ : mean estimated standard error; $ESE$ : empirical standard error.

6 Real data illustration

In this section, we present an application to real data, borrowed from Carlsson et al.²³ These authors aimed to study the association between body mass index (BMI) and mortality. Their cohort data comprise 44,258 Swedish same-sex twins, of which 16,793 are monozygotic (MZ) and 27,465 are dizygotic (DZ). All twins filled in a questionnaire in 1967 or 1972 on lifestyle factors, health, height, and weight. Death records were obtained by linkage to the National Causes of Death Registry for the years 1972–2004. The authors stratified the analysis on zygosity and sex, and controlled for smoking in the statistical models.

We reanalyzed the Carlsson et al data as follows. Baseline (i.e. $t = 0$ ) was defined as the date when the questionnaire was filled in. BMI was dichotomized into BMI ≤ 25 (unexposed) and BMI > 25 (exposed). The BAF was estimated for the DZ and MZ twins separately, controlling for smoking, sex, and age at baseline. Clustered standard errors were computed to account for the correlation between twins in the same pair, as described in Section 4. These standard errors were subsequently used to construct 95% Wald confidence intervals (CIs). For the DZ twins we obtained $\hat{BAF}$ = 0.06, with 95% CI ranging from 0.02 to 0.10. For the MZ twins, we obtained $\hat{BAF}$ = 0.07, with 95% CI ranging from −0.11 to 0.25. These results indicate that around 6–7% of all deaths would have been eliminated in this cohort of DZ/MZ twins, immediately after baseline, if all twins would have had BMI ≤ 25 at baseline.

Before leaving this example, we take the opportunity to note that attributable fractions are conceptually difficult when the exposure of interest is BMI. The reason is that it may not be entirely clear what it means to be “unexposed,” e.g. to have BMI ≤ 25. Clearly, there are many ways to reduce BMI; we may for instance drastically reduce the amount of body fat or the amount of muscles on everybody. These two hypothetical interventions may both render everybody “unexposed,” but the mortality rates under these interventions may be very different. Thus, unless we make the exposure definition more precise, we may consider the attributable fraction as somewhat ill-defined. We therefore conclude by recommending that researchers who wish to study the public health impact of obesity should make an effort to make their definition of obesity/nonobesity as precise as possible, in order to reduce vagueness in the research question being posed.

7 Discussion

We have studied the commonly used cohort-adapted estimator, which is supposed to estimate the CAF. We have shown that this estimator relies on two assumptions, which are often not tenable in practice. We have argued that the CAF may not be a relevant measure of public health impact in the first place, and we have proposed to use the BAF instead. We have shown how the BAF can be conveniently estimated in Cox proportional hazards models.

A major obstacle for practitioners who wish to use new methodology is often the lack of implementation in standard software. To facilitate the proposed methods, we have written a convenient R-function, which implements the BAF estimator proposed in Section 4. This R-function is available at http://www.meb.ki.se/personal/arvsjo/, and can be obtained from the author upon request. We describe its usage in the Appendix 1.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Swedish Research Council (grant number 340-2012-6007).

Appendix 1

References

Levin

. The occurrence of lung cancer in man. Acta Unio Int Contr 1953; 9: 531–541.

Benichou

. A review of adjusted estimators of attributable risk. Stats Methods Med Res 2001; 10: 195–216.

Benichou

Attributable risk. In: Armitage

Colton

(eds). Encyclopedia of Biostatistics, 2005, 2nd ed. 2005, DOI: 10.1002/0470011815.b2a03007.

Heuschman

Kolominsky-Rabas

Misselwitz

et al.

Predictors of in-hospital mortality and attributable risks of death after ischemic stroke. Arch Intern Med 2004; 164: 1761–1768.

Pedersen

Mortensen

. Family history, place and season of birth as risk factors for schizophrenia in Denmark: a replication and reanalysis. Br J Psychiatry 2001; 179: 46–52.

Natarajan

Lipsitz

Rimm

. A simple method of determining confidence intervals for population attributable risks from complex surveys. Statist Med 2007; 26: 3229–3239.

Pischon

Möhlig

Hofman

et al.

Comparison of relative and attributable risk of myocardial infarction and stroke according to C-reactive protein and low-density lipoprotein cholesterol levels. Eur J Epidemiol 2007; 22: 429–438.

McAuley

Sui

Church

et al.

The joint effects of cardiorespiratory fitness and adiposity on mortality risk in men with hypertension. Am J Hypertens 2009; 22: 1062–1069.

Mehta

Chang

. Mortality attributable to obesity among middle-aged adults in the United States. Demography 2009; 46: 851–872.

10.

Tellez-Plaza

Navas-Acien

Menke

et al.

Cadmium exposure and all-cause and cardiovascular mortality in the U.S. general population. Environ Health Perspect 2012; 120: 1017–1022.

11.

Landman

GWD

Kleefstra

van Hateren

KJJ

et al.

Educational disparities in mortality among patients with type 2 diabetes in the Netherlands (ZODIAC-23). Neth J Med 2013; 71: 76–80.

12.

Cox

. Regression models and life tables. J Roy Stat Sec B Met 1972; 34: 187–220.

13.

Rubin

. Estimating causal effects of treatments in randomized and non-randomized studies. J Educ Psychol 1974; 66: 688–701.

14.

Pearl

. Causality: Models, reasoning, and inference, Cambridge: MIT Press, 2000.

15.

Sjölander

. Estimation of attributable fractions through inverse probability weighting. Stats Methods Med Res 2011; 20: 415–428.

16.

Sjölander

Vansteelandt

. Doubly robust estimation of attributable fractions. Biostatistics 2011; 12: 112–121.

17.

Klein

Moeschberger

. Survival analysis. Techniques for censored and truncated data, New York: Springer, 1997.

18.

Chen

Wang

. Attributable risk function in the proportional hazards model for censored time-to-event. Biostatistics 2006; 7: 515–529.

19.

Chen

Lin

Zeng

. Attributable fraction functions for censored event times. Biometrika 2010; 97: 713–726.

20.

Casella

Berger

. Statistical inference, 2nd ed. Duxbury: Pacific Grove, 2002.

21.

Lee

Wei

Amamoto

Cox-type regression analysis for large numbers of small groups of correlated failure time observations. In: Klein

Goel

(eds). Survival analysis: State of the art, Dordrecht: Kluwer Academic Publishers, 1992, pp. 237–247.

22.

Newey

McFadden

Large sample estimation and hypothesis testing. In: Engle

McFadden

(eds). Handbook of econometrics, New York: Elsevier Science, 1994, pp. 2111–2245.

23.

Carlsson

Andersson

de Faire

et al.

Body mass index and mortality: is the association explained by genetic factors?

Epidemiology 2011; 22: 98–103.