Defining General, Hypothetical Interventions

Modified

June 26, 2024

WebR Status

🟡 Loading...

Have you ever begun reading a paper in the methodological causal inference literature and encountered the phrase “assume the treatment or exposure is a binary…”? (Most papers we read assume this!!) While assuming exposure variables are binary can simplify the definition of causal effects, many exposures of interest in reality are not binary.

Instead, we will work in situations where A is a binary, categorical, multivariate, or continuous variable!


In the previous section, we defined d(at,ht,ϵt) as a function that maps At, Ht, and potentially a randomizer ϵt to a new value of At. Our focus henceforth is estimating the causal effect of an intervention, characterized by d on the outcome Y, through the causal parameter

θ=E[YA¯d],

where YA¯d is the counterfactual outcome in a world, where possibly contrary to fact, each entry of A¯=(A1,,Aτ) was modified according to the function d.

But what is this function d, how can it be defined, and how does using this function to define interventions solve the problem? Let’s start from simple to more complex examples of functions d.

Static Interventions

Let A denote a binary vector, such as receiving a medication, and define d(at,ht,ϵ)=1. This intervention characterizes a hypothetical world where all members of the population receive treatment.

Static intervention

An intervention is static if the function dt always returns the same value regardless of the input.

Example

Let’s say we were interested in the effect of randomizing patients with opioid use disorder to injection naltrexone (A=1) vs. sublingual buprenorphine (A=0). We would contrast the counterfactual outcomes in a hypothetical world in which all units were treated with injection naltrexone d1=1 versus a hypothetical world in which all units were treated with buprenorphine d0=0. This gives us the well-known average (comparative) treatment effect (ATE). E[YA¯d1YA¯d0]=E[YA=1YA=0].

Dynamic Treatment Regime

Let At denote a binary vector, such as receiving a medication, and Ht a numeric vector, such as a measure of discomfort. For a given value of δ, define d(at,ht,ϵ)={1 if ht>δ0 otherwise.

Dynamic treatment regime

Interventions where the output of the function d depends only on the covariates Ht are referred to as being dynamic.

Example

Rudolph, Williams, et al. () examined a Buprenorphine (BUP-NX) dosing strategy among a population of patients who were taking BUP-NX for opioid use disorder. Under the hypothetical intervention, patients who reported opioid use during the week prior to a physicians exam received a BUP-NX dose increase while patients who did not report prior-week opioid use maintained the same dose. Let At be a binary indicator for BUP-NX dose increase at week t compared to week t1 and Xt be an indicator for opioid use at week t. Then,

d(at,ht,ϵ)={1 if xt1=10 otherwise

Modified Treatment Policies

While much attention is given to static and dynamic interventions, their use is often accompanied by a few key problems.

  1. Defining causal effects in terms of hypothetical interventions where treatment is applied to all units may be inconceivable. For example, we may be interested to know if reducing surgery time reduces surgical complications. However, it’s inconceivable to set all surgeries to a given duration, even if this duration depends on patient covariates.

  2. Defining causal effects in terms of hypothetical interventions where treatment is applied to all units may induce positivity violations.

A solution to these problems is to instead define causal effects using modified treatment policies (MTP).

Modified treatment policy

An intervention characterized by a hypothetical world where the natural value of treatment is modified is called a modified treatment policy.

Additive and multiplicative shift MTP

Let At denote a numeric vector. Assume that At has support in the data such that P(Atu(ht)Ht=ht)=1. For an analyst-defined value of δ, define d(at,ht,ϵ)={at+δ if at+δu(ht)at otherwise.

Under this intervention, the natural value of exposure at time t is increased by the analyst-defined value δ, whenever such an increase is feasible. This MTP is referred to as an additive shift MTP.

Example

Dı́az et al. () estimated the effect of increasing P/F ratio (a measure of hypoxemia) by 50 units on survival among those patients with acute respiratory failure (a P/F ratio < 300).

dt(at,ht,ϵ)={at+50 if at300at otherwise

We can similarly define a multiplicative shift MTP as

d(at,ht,ϵ)={at×δ if at×δu(ht)at otherwise.

Example

Nugent and Balzer () evaluated the association between county-level measures of mobility and incident COVID-19 cases in the United States in the Summer and Fall of 2022. They considered both hypothetical additive and multiplicative MTPs; for example, they defined a multiplicative MTP where a measure for the density of mobile devices visiting commercial locations was decreased by 25%:

d(at,ht,ϵ)=at×0.75.

Randomized Interventions

Let A denote a binary vector, ϵU(0,1), and ϵ be an analyst-defined value between 0 and 1. We may then define randomized interventions. For example, imagine we are interested in a hypothetical world where half of all smokers quit smoking. This intervention would be defined as

d(at,ϵt)={0 if ϵt<0.5 and at=1at otherwise.

Incremental Propensity Score Interventions Based on the Risk Ratio

Let A denote a binary variable, ϵU(0,1), and δ be an analyst-defined risk ratio limited to be between 0 and 1. In addition, define P(At=atHt)=g(atHt).

If we were interested in an intervention that decreased the likelihood of receiving treatment, define

dt(at,ht,ϵt)={at if ϵt<δ0 otherwise. In this case, we have gd(atHt)=atδgt(1Ht)+(1at)(1δgt(1Ht)), which leads to a risk ratio of gtd(1Ht)/gt(1Ht)=δ for comparing the propensity score post- vs pre-intervention.

Conversely, if we were interested in an intervention that increased the likelihood of receiving treatment, define

dt(at,ht,ϵt)={at if ϵt<δ1 otherwise.

Now gtd(atHt)=at(1δgt(0Ht))+(1at)δgt(0Ht), which implies a risk ratio gtd(0Ht)/g(0Ht)=δ.

Incremental propensity score intervention

An intervention where the conditional probability of treatment is shifted. (As an aside, interventions where the shift is in the odds ratio scale where previously proposed, but the effects of odds-ratio shifts should not be estimated with the lmtp package, we will discuss this further.

Interventions where the shift is in the odds ratio scale were previously proposed, but the effects of odds-ratio shifts should not be estimated with lmtp, we will discuss this more later.

Example

Using electronic health record data, Wen, Marcus, and Young () estimated the effect of increasing the proportion of PrEP uptake on bacterial STI among cis-gender males being tested for STIs and that do not have HIV. Let At be a binary indicator for PrEP initiation at week t, and Lt be a binary indicator for any STI testing and being HIV-free at week t. They defined a “medium” successful PrEP uptake intervention as

d(at,ht,ϵ)={at if lt=1 and ϵt<0.851 otherwise.

Identification of the causal parameter

Recall that the fundamental problem of causal inference is that we can’t observe the alternative worlds which we use to define causal effects. If we can’t observe counterfactual variables, then how can we learn a causal effect? Under a set of certain assumptions, we can identify a causal parameter from observed data. These assumptions are called identification assumptions.

Identification Assumptions
  1. Positivity. If (at,ht)supp{At,Ht} then d(at,ht)supp{At,Ht} for t{1,...,τ}.

    If there is a unit with observed treatment value at and covariates ht, there must also be a unit with treatment value d(at,ht) and covariates ht.

  2. No unmeasured confounders. All the common causes of At and (Ls,As,Y) are measured and contained in Ht for all s{t+1,...,τ}.

    For all times t, the history Ht contains sufficient variables to adjust for confounding of At and any subsequent variables, including future treatment.

Question

When might these assumptions be violated?

Assuming the above, θ is identified from the observed data with:

Theorem

Set mτ+1=Y For t=τ,...,1, recursively define

mt:(at,ht)E[mt+1(At+1d,Ht+1)At=at,Ht=ht],

and define θ=E[m1(A1d,L1)].

As an example, consider the following data where τ=2.

Loading webR...

We can compute the identification formula in the following steps:

  1. Set m3(A3d,H3)=Y

  2. Compute the regression of m3(A3d,H3) on (A2,H2). This gives a predictive function, call that predictive function m2(A2,H2).

  3. Use the predictive function to compute what would have occurred if the intervention had been implemented at time t=2, i.e., compute m2(A2d,H2).

  4. Compute the regression of m2(A2d,H2) on (A1,H1). This gives a predictive function, call that predictive function m1(A1,H1).

  5. Use the predictive function to compute what would have occurred if the intervention had been implemented at time t=1, i.e., compute m1(A1d,H1).

  6. Compute the mean of m1(A1d,H1). This mean is equal to θ.

References

Dı́az, Iván, Katherine L Hoffman, and Nima S Hejazi. 2024. “Causal Survival Analysis Under Competing Risks Using Longitudinal Modified Treatment Policies.” Lifetime Data Analysis 30 (1): 213–36.
Dı́az, Iván, Nicholas Williams, Katherine L Hoffman, and Edward J Schenck. 2023. “Nonparametric Causal Effects Based on Longitudinal Modified Treatment Policies.” Journal of the American Statistical Association 118 (542): 846–57.
Haneuse, Sebastian, and Andrea Rotnitzky. 2013. “Estimation of the Effect of Interventions That Modify the Received Treatment.” Statistics in Medicine 32 (30): 5260–77.
Kennedy, Edward H. 2019. “Nonparametric Causal Effects Based on Incremental Propensity Score Interventions.” Journal of the American Statistical Association 114 (526): 645–56.
Nugent, Joshua R, and Laura B Balzer. 2023. “A Demonstration of Modified Treatment Policies to Evaluate Shifts in Mobility and COVID-19 Case Rates in US Counties.” American Journal of Epidemiology 192 (5): 762–71.
Rudolph, Kara E, Catherine Gimbrone, Ellicott C Matthay, Ivan Diaz, Corey S Davis, Katherine Keyes, and Magdalena Cerdá. 2022. “When Effects Cannot Be Estimated: Redefining Estimands to Understand the Effects of Naloxone Access Laws.” Epidemiology 33 (5): 689–98.
Rudolph, Kara E, Nicholas T Williams, Alicia T Singham Goodwin, Matisyahu Shulman, Marc Fishman, Iván Dı́az, Sean Luo, John Rotrosen, and Edward V Nunes. 2022. “Buprenorphine & Methadone Dosing Strategies to Reduce Risk of Relapse in the Treatment of Opioid Use Disorder.” Drug and Alcohol Dependence 239: 109609.
Wen, Lan, Julia L Marcus, and Jessica G Young. 2023. “Intervention Treatment Distributions That Depend on the Observed Treatment Process and Model Double Robustness in Causal Survival Analysis.” Statistical Methods in Medical Research 32 (3): 509–23.