On the Probability of Immunity

Jose M. Peña¹ ¹Linköping University, Sweden. jose.m.pena@liu.se

Abstract.

This work is devoted to the study of the probability of immunity, i.e. the effect occurs whether exposed or not. We derive necessary and sufficient conditions for non-immunity and $\epsilon$ -bounded immunity, i.e. the probability of immunity is zero and $\epsilon$ -bounded, respectively. The former allows us to estimate the probability of benefit (i.e., the effect occurs if and only if exposed) from a randomized controlled trial, and the latter allows us to produce bounds of the probability of benefit that are tighter than the existing ones. We also introduce the concept of indirect immunity (i.e., through a mediator) and repeat our previous analysis for it. Finally, we propose a method for sensitivity analysis of the probability of immunity under unmeasured confounding.

1. Introduction

Let $X$ and $Y$ denote an exposure and its outcome, respectively. Let $X$ and $Y$ be binary taking values in $\{x,x^{\prime}\}$ and $\{y,y^{\prime}\}$ . Let $Y_{x}$ and $Y_{x^{\prime}}$ denote the counterfactual outcome when the exposure is set to level $X=x$ and $X=x^{\prime}$ . Let $y_{x}$ , $y^{\prime}_{x}$ , $y_{x^{\prime}}$ and $y^{\prime}_{x^{\prime}}$ denote the events $Y_{x}=y$ , $Y_{x}=y^{\prime}$ , $Y_{x^{\prime}}=y$ and $Y_{x^{\prime}}=y^{\prime}$ . For instance, let $X$ represent whether a patient gets treated or not for a deadly disease, and $Y$ represent whether she survives it or not. Individual patients can be classified into immune (they survive whether they are treated or not, i.e. $y_{x}\land y_{x^{\prime}}$ ), doomed (they die whether they are treated or not, i.e. $y^{\prime}_{x}\land y^{\prime}_{x^{\prime}}$ ), benefited (they survive if and only if treated, i.e. $y_{x}\land y^{\prime}_{x^{\prime}}$ ), and harmed (they die if and only if treated, i.e. $y^{\prime}_{x}\land y_{x^{\prime}}$ ).

In general, the average treatment effect (ATE) estimated from a randomized controlled trial (RCT) does not inform about the probability of benefit (or of any of the other response types, i.e. harm, immunity, and doom). However, it may do it under certain conditions. For instance,

	$\displaystyle ATE=p(y_{x})-p(y_{x^{\prime}})$	$\displaystyle=p(y_{x},y_{x^{\prime}})+p(y_{x},y^{\prime}_{x^{\prime}})-[p(y_{x},y_{x^{\prime}})+p(y^{\prime}_{x},y_{x^{\prime}})]$
		$\displaystyle=p(\text{benefit})-p(\text{harm})$		(1)

and thus $p(\text{benefit})=ATE$ if $p(\text{harm})=0$ (a.k.a. monotonicity). Necessary and sufficient conditions are derived by Mueller and Pearl [1] to determine from observational and experimental data if monotonicity holds. In this work, we derive similar conditions for non-immunity, i.e. $p(\text{immunity})=p(y_{x},y_{x^{\prime}})=0$ . These are interesting because under non-monotonicity, they turn an RCT informative about the probabilities of benefit and harm. To see it, consider

ATE=p(y_{x})-p(y_{x^{\prime}})

where the terms on the right-hand side of the equation are estimated from an RCT. Moreover,

	$\displaystyle p(y_{x})$	$\displaystyle=p(y_{x},y_{x^{\prime}})+p(y_{x},y^{\prime}_{x^{\prime}})=p(\text{immunity})+p(\text{benefit})$		(2)
	$\displaystyle p(y_{x^{\prime}})$	$\displaystyle=p(y_{x},y_{x^{\prime}})+p(y^{\prime}_{x},y_{x^{\prime}})=p(\text{immunity})+p(\text{harm})$		(3)

and thus $p(\text{benefit})=p(y_{x})$ and $p(\text{harm})=p(y_{x^{\prime}})$ if $p(\text{immunity})=0$ .

In some cases, non-immunity is assured. For instance, when evaluating the effect of advertising on the purchase of a new product. The control group not being exposed to the ad has no way of purchasing the product, i.e. $p(y_{x^{\prime}})=0$ and thus $p(y_{x},y_{x^{\prime}})=0$ . In other cases, non-immunity cannot be assured. For instance, when evaluating the effect of a drug. An individual may carry a gene variant that makes her recover from the disease regardless of whether she takes the drug or not, i.e. $p(y_{x},y_{x^{\prime}})\geq 0$ . However, it may still be bounded as $p(y_{x},y_{x^{\prime}})\leq\epsilon$ from expert knowledge. We show that our necessary and sufficient conditions for non-immunity can trivially be adapted to $\epsilon$ -bounded immunity. Moreover, we show that the knowledge of $\epsilon$ -bounded immunity may tighten the bounds of the probabilities of benefit and harm by Tian and Pearl [2]. We also introduce the concepts of indirect benefit and harm (i.e., through a mediator) and repeat our previous analysis for them. Finally, we propose a method for sensitivity analysis of immunity under unmeasured confounding. We illustrate our results with concrete examples.

2. Conditions for Non-Immunity

Consider the bounds of $p(\text{benefit})$ derived by Tian and Pearl [2]:

\max\left\{\begin{array}[]{cc}0,\\ p(y_{x})-p(y_{x^{\prime}}),\\ p(y)-p(y_{x^{\prime}}),\\ p(y_{x})-p(y)\end{array}\right\}\leq p(\text{benefit})\leq\min\left\{\begin{array}[]{cc}p(y_{x}),\\ p(y^{\prime}_{x^{\prime}}),\\ p(x,y)+p(x^{\prime},y^{\prime}),\\ p(y_{x})-p(y_{x^{\prime}})+\\ p(x,y^{\prime})+p(x^{\prime},y)\end{array}\right\}.

(4)

Then, combining Equations 2 or 3 with 4 gives

\max\left\{\begin{array}[]{cc}0,\\ p(y_{x})-p(y^{\prime}_{x^{\prime}}),\\ p(y_{x})-p(x,y)-\\ p(x^{\prime},y^{\prime}),\\ p(y_{x^{\prime}})-p(x,y^{\prime})-\\ p(x^{\prime},y)\end{array}\right\}\leq p(\text{immunity})\leq\min\left\{\begin{array}[]{cc}p(y_{x}),\\ p(y_{x^{\prime}}),\\ p(y_{x})-p(y)+\\ p(y_{x^{\prime}}),\\ p(y)\end{array}\right\}.

(5)

A sufficient condition for $p(\text{immunity})=0$ to hold is that some argument to the min function in Equation 5 is equal to 0, that is

p(y_{x})=0\text{ or }p(y_{x^{\prime}})=0\text{ or }p(y_{x})+p(y_{x^{\prime}})=p(y)\text{ or }p(y)=0.

(6)

Likewise, a necessary condition for $p(\text{immunity})=0$ to hold is that all the arguments to the max function are non-positive, that is

		$\displaystyle p(y_{x})+p(y_{x^{\prime}})\leq 1\text{ and }$
		$\displaystyle p(y_{x})\leq p(x,y)+p(x^{\prime},y^{\prime})\text{ and }$
		$\displaystyle p(y_{x^{\prime}})\leq p(x,y^{\prime})+p(x^{\prime},y).$		(7)

2.1. Conditions for $\epsilon$ -Bounded Immunity

The conditions in the previous section can be relaxed to allow certain degree of immunity (e.g., based on expert knowledge), making them more applicable in practice. Specifically, a sufficient condition for $p(\text{immunity})\leq\epsilon$ to hold is

p(y_{x})\leq\epsilon\text{ or }p(y_{x^{\prime}})\leq\epsilon\text{ or }p(y_{x})+p(y_{x^{\prime}})\leq p(y)+\epsilon\text{ or }p(y)\leq\epsilon.

Likewise, a necessary condition for $p(\text{immunity})\leq\epsilon$ to hold is

		$\displaystyle p(y_{x})+p(y_{x^{\prime}})\leq 1+\epsilon\text{ and }$
		$\displaystyle p(y_{x})\leq p(x,y)+p(x^{\prime},y^{\prime})+\epsilon\text{ and }$
		$\displaystyle p(y_{x^{\prime}})\leq p(x,y^{\prime})+p(x^{\prime},y)+\epsilon.$		(8)

2.2. $\epsilon$ -Bounds on Benefit and Harm

Assuming $\epsilon$ -bounded immunity (e.g., based on expert knowledge) can help narrowing the bounds on $p(\text{benefit})$ and $p(\text{harm})$ . Specifically, if $p(\text{immunity})\leq\epsilon$ then Equation 2 gives

p(y_{x})-\epsilon\leq p(\text{benefit})\leq p(y_{x}).

Incorporating this into Equation 4 gives

\max\left\{\begin{array}[]{cc}0,\\ p(y_{x})-p(y_{x^{\prime}}),\\ p(y)-p(y_{x^{\prime}}),\\ p(y_{x})-p(y),\\ p(y_{x})-\epsilon\end{array}\right\}\leq p(\text{benefit})\leq\min\left\{\begin{array}[]{cc}p(y_{x}),\\ p(y^{\prime}_{x^{\prime}}),\\ p(x,y)+p(x^{\prime},y^{\prime}),\\ p(y_{x})-p(y_{x^{\prime}})+\\ p(x,y^{\prime})+p(x^{\prime},y)\end{array}\right\}

(9)

which can potentially return a tighter lower bound than Equation 4, i.e. if $\epsilon<\min(p(y_{x^{\prime}}),p(y))$ . Although the value of $\epsilon$ is typically determined from expert knowledge and not from data, the experimental and observational data available do restrict the values that are valid, as indicated by Equation 2.1. In short, $\epsilon$ can take any value as long as the lower bound is not greater than the upper bound in Equation 9. Moreover, $p(\text{harm})$ can likewise be bounded by simply swapping $x$ and $x^{\prime}$ in Equation 9.

2.3. Examples

This section illustrates the results above with two concrete examples.¹¹1R code for the examples can be found at https://tinyurl.com/2s3bxmyu.

2.3.1. Example 1

A pharmaceutical company wants to market their drug to cure a disease by claiming that no one is immune. The RCT they conducted for the drug approval yielded the following:

	$\displaystyle p(y_{x})$	$\displaystyle=0.76$
	$\displaystyle p(y_{x^{\prime}})$	$\displaystyle=0.31$

which correspond to the following unknown data generation model:

$\displaystyle p(u)=0.3$	$\displaystyle p(x\|u)=0.2$	$\displaystyle p(y\|x,u)$	$\displaystyle=0.9$
$\displaystyle p(y\|x,u^{\prime})$	$\displaystyle=0.7$
	$\displaystyle p(x\|u^{\prime})=0.9$	$\displaystyle p(y\|x^{\prime},u)$	$\displaystyle=0.8$
$\displaystyle p(y\|x^{\prime},u^{\prime})$	$\displaystyle=0.1.$

Therefore, the necessary condition for non-immunity in Equation 2 does not hold, and thus the company is not entitled to make the claim they intended to make. The company changes strategy and now wishes to market their drug as having a minimum of 50 % efficacy, i.e. benefit. To do so, they first conduct an observational study that yields the following:

	$\displaystyle p(x,y)=0.5$	$\displaystyle p(x,y^{\prime})$	$\displaystyle=0.2$
	$\displaystyle p(x^{\prime},y)=0.2$	$\displaystyle p(x^{\prime},y^{\prime})$	$\displaystyle=0.1.$

Then, they apply Equation 4 to the RCT and observational results to conclude that $0.45\leq p(\text{benefit})\leq 0.61$ . Again, the company cannot proceed with their marketing strategy. A few months later, a research publication reports that no more than 25 % of the population is immune. The company realizes that this value is compatible with their RCT and observational results, by checking the necessary condition for $\epsilon$ -bounded immunity in Equation 2.1. More importantly, the company realizes that Equation 9 with $\epsilon=0.25$ allows to conclude that $0.51\leq p(\text{benefit})\leq 0.61$ , and thus they can resume their marketing strategy.

2.3.2. Example 2

The previous example has shown that expert knowledge on immunity may complement experimental and observational data. While data alone rarely provide precise information on immunity (or on any other response type, for that matter), there are cases where data alone provide enough actionable information. The following example illustrates this.

A pharmaceutical company is concerned by the poor sales of a drug to cure a disease. The RCT conducted for the drug approval and a subsequent observational study yielded the following:

	$\displaystyle p(y_{x})$	$\displaystyle=0.48$
	$\displaystyle p(y_{x^{\prime}})$	$\displaystyle=0.36$

and

	$\displaystyle p(x,y)=0.08$	$\displaystyle p(x,y^{\prime})$	$\displaystyle=0.2$
	$\displaystyle p(x^{\prime},y)=0.25$	$\displaystyle p(x^{\prime},y^{\prime})$	$\displaystyle=0.47.$

which correspond to the following unknown data generation model:

$\displaystyle p(u)=0.4$	$\displaystyle p(x\|u)=0.1$	$\displaystyle p(y\|x,u)$	$\displaystyle=0.9$
$\displaystyle p(y\|x,u^{\prime})$	$\displaystyle=0.2$
	$\displaystyle p(x\|u^{\prime})=0.4$	$\displaystyle p(y\|x^{\prime},u)$	$\displaystyle=0.3$
$\displaystyle p(y\|x^{\prime},u^{\prime})$	$\displaystyle=0.4.$

The fact that 36 % of the untreated recover from the disease makes the company suspect that the low sales are due to a large part of the population being immune. Equation 5 allows to conclude that $0\leq p(\text{immunity})\leq 0.34$ , which suggests that the explanation offered by the company is rather unlikely. A more plausible explanation for the low sales may be that the efficacy or benefit of the drug is not very high, as $0.14\leq p(\text{benefit})\leq 0.48$ by Equation 4.

3. Indirect Benefit and Harm

In the previous sections, the causal graph of the domain under study was unknown. In this section, we assume that the graph is available (e.g., from expert knowledge) and discuss two advantages that follow with it. Specifically, suppose that the domain under study corresponds to the following causal graph:

and thus $p(y_{x})=p(y|x)$ and $p(y_{x^{\prime}})=p(y|x^{\prime})$ . Then, $p(y_{x})$ and $p(y_{x^{\prime}})$ can be estimated from observational data and thus, unlike in the previous sections, no RCT is required. A further advantage is that we can now compute the probabilities of benefit and harm mediated by $Z$ . We elaborate on this below.

The effect of $X$ on $Y$ mediated by $Z$ (a.k.a. indirect effect) corresponds to the effect due to the indirect path $X\rightarrow Z\rightarrow Y$ , i.e. after deactivating the direct path $X\rightarrow Y$ . Different ways of deactivating the direct path have resulted in different indirect effect measures in the literature. Pearl [3] proposes deactivating the direct path by setting $X$ to non-exposure and comparing the expected outcome when $Z$ takes the value it would under exposure and non-exposure:

NIE=E[Y_{x^{\prime},Z_{x}}]-E[Y_{x^{\prime}}]

which is known as the average natural (or pure) indirect effect. Geneletti [4] also proposes deactivating the direct path by setting $X$ to non-exposure but instead, she proposes comparing the expected outcome when $Z$ is drawn from the distributions $\mathcal{Z}_{x}$ and $\mathcal{Z}_{x^{\prime}}$ of $Z_{x}$ and $Z_{x^{\prime}}$ :

IIE=E[Y_{x^{\prime},\mathcal{Z}_{x}}]-E[Y_{x^{\prime},\mathcal{Z}_{x^{\prime}}}]

which is known as the interventional indirect effect. Although $NIE$ and $IIE$ do not coincide in general, they coincide for the causal graph above [5]. Finally, Fulcher et al. [6] proposes deactivating the direct path by setting $X$ to its natural (observed) value and comparing the expected outcome when $Z$ takes its natural value and the value it would under no exposure:

PIIE=E[Y_{X,Z_{X}}]-E[Y_{X,Z_{x^{\prime}}}]

which is also known as the population intervention indirect effect. This measure is suitable when the exposure is harmful (e.g., smoking), and thus one may be more interested in elucidating the effect (e.g., disease prevalence) of eliminating the exposure rather than in contrasting the effects of exposure and non-exposure.

We propose an alternative way of deactivating the direct path $X\rightarrow Y$ and measuring the indirect effect of $X$ on $Y$ through $Z$ . Specifically, we assume that the direct path $X\rightarrow Y$ is actually mediated by an unmeasured random variable $U$ that is left unmodelled. This arguably holds in most domains. The identity of $U$ is irrelevant. Let $G$ denote the causal graph below, i.e. the original causal graph refined with the addition of $U$ .

Now, deactivating the direct path $X\rightarrow Y$ in the original causal graph can be achieved by adjusting for $U$ in $G$ , i.e. $\sum_{u}E[Y|x,u]p(u)$ . Unfortunately, $U$ is unmeasured. Instead, we propose the following way of deactivating $X\rightarrow Y$ . Let $H$ denote the causal graph below, i.e. the result of reversing the edge $X\rightarrow U$ in $G$ .

The average total effect of $X$ on $Y$ in $H$ can be computed by the front-door criterion [7]:

	$\displaystyle TE$	$\displaystyle=E[Y_{x}]-E[Y_{x^{\prime}}]$
		$\displaystyle=\sum_{z}p(z\|x)\sum_{\dot{x}}E[Y\|\dot{x},z]p(\dot{x})-\sum_{z}p(z\|x^{\prime})\sum_{\dot{x}}E[Y\|\dot{x},z]p(\dot{x}).$

Note that $G$ and $H$ are distribution equivalent, i.e. every probability distribution that is representable by $G$ is representable by $H$ and vice versa [7]. Then, evaluating the second line of the equation above in $G$ or $H$ gives the same result. If we evaluate it in $H$ , then it corresponds to the part of association between $X$ and $Y$ that is attributable to the path $X\rightarrow Z\rightarrow Y$ . If we evaluate it in $G$ , then it corresponds to the part of $TE$ in $G$ that is attributable to the path $X\rightarrow Z\rightarrow Y$ , because $TE$ in $G$ equals the association between $X$ and $Y$ , since $G$ has only directed paths from $X$ to $Y$ . Therefore, the second line in the equation above corresponds to the part of $TE$ in the original causal graph that is attributable to the path $X\rightarrow Z\rightarrow Y$ , thereby deactivating the direct path $X\rightarrow Y$ . We propose to use the second line in the equation above as a measure of the indirect effect of $X$ on $Y$ in the original causal graph.

The reasoning above can be extended to the probabilities of benefit and harm, and thereby measure the benefit and harm mediated by $Z$ . As mentioned above, the causal graphs $G$ and $H$ represent different data generation mechanisms but the same probability distribution over $X$ , $Y$ and $Z$ . Therefore, the mechanisms agree on observational probabilities but may disagree on counterfactual probabilities. We use $p()$ to denote observational probabilities obtained from either mechanism, and $q()$ to denote counterfactual probabilities obtained from the mechanism corresponding to $H$ . The probabilities of benefit and harm of $X$ on $Y$ mediated by $Z$ in $G$ and thus in the original causal graph (henceforth indirect benefit and harm, or $IB$ and $IH$ ) can be computed by applying Equation 2 to $H$ . That is,

IB=q(\text{benefit})=q(y_{x})=\sum_{z}p(z|x)\sum_{\dot{x}}p(y|\dot{x},z)p(\dot{x})

where the second equality holds if $q(\text{immunity})=0$ , and the third is due to the front-door criterion on $H$ . Likewise for $IH$ simply replacing $x$ by $x^{\prime}$ . Applying Equation 5 to $H$ yields necessary and sufficient conditions for $q(\text{immunity})=0$ . That is,

		$\displaystyle\sum_{z}p(z\|x)\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})=0\text{ or }$
		$\displaystyle\sum_{z}p(z\|x^{\prime})\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})=0\text{ or }$
		$\displaystyle\sum_{z}[p(z\|x)+p(z\|x^{\prime})]\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})=p(y)\text{ or }$
		$\displaystyle p(y)=0$		(10)

is a sufficient condition, whereas

		$\displaystyle\sum_{z}[p(z\|x)+p(z\|x^{\prime})]\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})\leq 1\text{ and }$
		$\displaystyle\sum_{z}p(z\|x)\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})\leq p(x,y)+p(x^{\prime},y^{\prime})\text{ and }$
		$\displaystyle\sum_{z}p(z\|x^{\prime})\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})\leq p(x,y^{\prime})+p(x^{\prime},y)$		(11)

is a necessary condition. Necessary and sufficient conditions for $\epsilon$ -bounded immunity on $H$ (i.e., $q(\text{immunity})\leq\epsilon$ ) can be obtained much like in Section 2.1. That is, it suffices to add $\epsilon$ to the right-hand sides of the conditions above and replace $=$ with $\leq$ . Finally, we can adapt accordingly the equations in Section 2.2 to obtain $\epsilon$ -bounds on $IB$ and $IH$ . Note that the analysis of indirect benefit and harm presented here does not require an RCT, i.e. all the expressions involved can be estimated from just observational data.

3.1. Example

This section illustrates the results above with a concrete example borrowed from Pearl [8]. It concerns the following causal graph:

where $X$ represents a drug treatment, $Z$ the presence of a certain enzyme in a patient’s blood, and $Y$ recovery. Moreover, we have that

	$\displaystyle p(z\|x)=0.75$	$\displaystyle p(y\|x,z)=0.8$
		$\displaystyle p(y\|x,z^{\prime})=0.4$
	$\displaystyle p(z\|x^{\prime})=0.4$	$\displaystyle p(y\|x^{\prime},z)=0.3$
		$\displaystyle p(y\|x^{\prime},z^{\prime})=0.2.$

Since $p(x)$ is not given in the original example, we take $p(x)=0.6$ .

Pearl imagines a scenario where the pharmaceutical company plans to develop a cheaper drug that is equal to the existing one except for the lack of direct effect on recovery, i.e. it just stimulates enzyme production as much as the existing drug. Therefore, the probability of benefit of the planned drug is the probability of benefit of the existing drug that is mediated by the enzyme. The company wants to market their drugs by claiming that no one is immune. The sufficient conditions for non-immunity in Equations 6 and 3 do not hold for the drugs. However, while the existing drug satisfies the necessary condition for non-immunity in Equation 2, the planned drug does not satisfy the corresponding condition in Equation 3. Therefore, the company should either abandon their marketing strategy or abandon the plan to develop the new drug and instead focus on trying to confirm non-immunity for the existing drug.

4. Sensitivity Analysis of Immunity

In this section, like in the previous section, we assume that the causal graph of the domain under study is available, e.g. from expert knowledge. We also assume that we only have access to observational data, i.e. no RCT is available. Specifically, consider the following causal graph:

which includes potential unmeasured exposure-outcome confounding. Since $p(y_{x})=\sum_{z}p(z|x)\sum_{\dot{x}}E[Y|\dot{x},z]p(\dot{x})$ by the front-door criterion, we can proceed as in the previous section to derive necessary and sufficient conditions for non-immunity. Suppose now that $Z$ is unmeasured or that the effect of $X$ on $Y$ is direct rather than mediated by $Z$ . Then, $p(y_{x})$ is unidentifiable from observational data [7], and thus we cannot proceed as in the previous section. We therefore take an alternative approach to inform the analyst about the probability of immunity and thereby help her in decision making. In particular, we propose a sensitivity analysis method to bound the probability of immunity as a function of the observed data distribution and some intuitive sensitivity parameters. Our method is an straightforward adaption of the method by Peña [9], originally developed to bound the probabilities of benefit and harm.

Let $U$ denote the unmeasured exposure-outcome confounders. For simplicity, we assume that all these confounders are categorical, but our results also hold for ordinal and continuous confounders.²²2If $U$ is continuous then sums/maxima/minimima over $u$ should be replaced by integrals/suprema/infima. For simplicity, we treat $U$ as a categorical random variable whose levels are the Cartesian product of the levels of the elements in the original $U$ .

Note that

p(y_{x})=p(y_{x}|x)p(x)+p(y_{x}|x^{\prime})p(x^{\prime})=p(y|x)p(x)+p(y_{x}|x^{\prime})p(x^{\prime})

where the second equality follows from counterfactual consistency, i.e. $X=x\Rightarrow Y_{x}=Y$ . Moreover,

p(y_{x}|x^{\prime})=\sum_{u}p(y_{x}|x^{\prime},u)p(u|x^{\prime})=\sum_{u}p(y|x,u)p(u|x^{\prime})\leq\max_{u}p(y|x,u)

where the second equality follows from $Y_{x}\!\perp\!X|U$ for all $x$ , and counterfactual consistency. Likewise,

p(y_{x}|x^{\prime})\geq\min_{u}p(y|x,u).

Now, let us define

M_{x}=\max_{u}p(y|x,u)

and

m_{x}=\min_{u}p(y|x,u)

and likewise $M_{x^{\prime}}$ and $m_{x^{\prime}}$ . Then,

p(x,y)+p(x^{\prime})m_{x}\leq p(y_{x})\leq p(x,y)+p(x^{\prime})M_{x}

and likewise

p(x^{\prime},y)+p(x)m_{x^{\prime}}\leq p(y_{x^{\prime}})\leq p(x^{\prime},y)+p(x)M_{x^{\prime}}.

These equations together with Equation 5 give

\max\left\{\begin{array}[]{cc}0,\\ p(x^{\prime})m_{x}+p(x)m_{x^{\prime}}-p(y^{\prime}),\\ p(x^{\prime})m_{x}-p(x^{\prime},y^{\prime}),\\ p(x)m_{x^{\prime}}-p(x,y^{\prime})\end{array}\right\}\leq p(\text{immunity})\leq\min\left\{\begin{array}[]{cc}p(x,y)+p(x^{\prime})M_{x},\\ p(x^{\prime},y)+p(x)M_{x^{\prime}},\\ p(x^{\prime})M_{x}+p(x)M_{x^{\prime}},\\ p(y)\end{array}\right\}

(12)

where $m_{x}$ , $M_{x}$ , $m_{x^{\prime}}$ and $M_{x^{\prime}}$ are sensitivity parameters. The possible regions for $m_{x}$ and $M_{x}$ are

0\leq m_{x}\leq p(y|x)\leq M_{x}\leq 1

(13)

and likewise for $m_{x^{\prime}}$ and $M_{x^{\prime}}$ .

Our lower bound in Equation 12 is informative if and only if³³3Note that the second row in the maximum equals the third plus the fourth rows.

0<p(x^{\prime})m_{x}-p(x^{\prime},y^{\prime})

0<p(x)m_{x^{\prime}}-p(x,y^{\prime}).

Then, the informative regions for $m_{x}$ and $m_{x^{\prime}}$ are

p(y^{\prime}|x^{\prime})<m_{x}\leq p(y|x)

and

p(y^{\prime}|x)\leq m_{x^{\prime}}<p(y|x^{\prime}).

On the other hand, our upper bound in Equation 12 is informative⁴⁴4Note that we already know that $p(\text{immunity})\leq p(y)$ by Equation 5. if and only if⁵⁵5Note that the third row in the minimum equals the first plus the second minus the fourth rows.

p(x,y)+p(x^{\prime})M_{x}<p(y)

p(x^{\prime},y)+p(x)M_{x^{\prime}}<p(y)

which occurs if and only if $p(y|x)<p(y|x^{\prime})$ or $p(y|x^{\prime})<p(y|x)$ .⁶⁶6To see it, rewrite $p(y)=p(x,y)+p(x^{\prime},y)$ and recall Equation 13. Therefore, our upper bound is always informative, and thus the informative regions for $M_{x}$ and $M_{x^{\prime}}$ coincide with their possible regions.

Refer to caption — Figure 1. Lower and upper bounds of $p(\text{immunity})$ in the example in Section 4.1 as functions of the sensitivity parameters $m_{x}$ , $m_{x^{\prime}}$ , $M_{x}$ and $M_{x^{\prime}}$ .

4.1. Example

We illustrate our method for sensitivity analysis of $p(\text{immunity})$ with the following fictitious epidemiological example. Consider a population consisting of a majority and a minority group. Let the binary random variable $U$ represent the group an individual belongs to. Let $X$ represent whether the individual gets treated or not for a certain disease. Let $Y$ represent whether the individual survives the disease. Assume that the scientific community agrees that $U$ is a confounder for $X$ and $Y$ . Assume also that it is illegal to store the values of $U$ , to avoid discrimination complaints. In other words, the identity of the confounder is known but its values are not. More specifically, consider the following unknown data generation model:

$\displaystyle p(u)=0.2$	$\displaystyle p(x\|u)=0.4$	$\displaystyle p(y\|x,u)=0.9$
$\displaystyle p(y\|x,u^{\prime})=0.8$
	$\displaystyle p(x\|u^{\prime})=0.2$	$\displaystyle p(y\|x^{\prime},u)=0.2$
$\displaystyle p(y\|x^{\prime},u^{\prime})=0.7.$

Since this model does not specify the functional forms of the causal mechanisms, we cannot compute the true $p(\text{immunity})$ [7]. However, we can bound it by Equation 5 and the fact that $p(y_{x})=\sum_{u}p(y|x,u)p(u)$ [7], which yields $p(\text{immunity})\in[0.42,0.6]$ . Note that these bounds cannot be computed in practice because $U$ is unmeasured.

Figure 1 (top) shows the lower bound of $p(\text{immunity})$ in Equation 12 as a function of the sensitivity parameters $m_{x}$ and $m_{x^{\prime}}$ . The axes span the possible regions of the parameters. The dashed lines indicate the informative regions of the parameters. Specifically, the bottom left quadrant corresponds to the non-informative region, i.e. the lower bound is zero. In the data generation model considered, $m_{x}=0.8$ and $m_{x^{\prime}}=0.2$ . These values are unknown to the epidemiologist, because $U$ is unobserved. However, the figure reveals that the epidemiologist only needs to have some rough idea of these values to confidently conclude that $p(\text{immunity})$ is lower bounded by 0.2. Figure 1 (bottom) shows our upper bound of $p(\text{immunity})$ in Equation 12 as a function of the sensitivity parameters $M_{x}$ and $M_{x^{\prime}}$ . Likewise, having some rough idea of the unknown values $M_{x}=0.9$ and $M_{x^{\prime}}=0.7$ enables the epidemiologist to confidently conclude that the $p(\text{immunity})$ is upper bounded by 0.65. Applying Equation 5 with just observational data produces looser bounds, namely 0 and 0.67. Recall that $p(\text{immunity})\in[0.42,0.6]$ in truth.

5. Discussion

The analysis in this work can be repeated for $p(\text{doom})$ instead of $p(\text{immunity})$ by simply swapping $y$ and $y^{\prime}$ , and $p(\text{benefit})$ and $p(\text{harm})$ . Additionally, the analysis of indirect benefit can be repeated for $p(\text{harm})$ instead of $p(\text{immunity})$ due to Equation 1, and thereby extend the analysis by Mueller and Pearl [1].

References

Mueller and Pearl [2023] S. Mueller and J. Pearl. Monotonicity: Detection, Refutation, and Ramification. UCLA Cognitive Systems Laboratory, Technical Report (R-529), 2023.
Tian and Pearl [2000] J. Tian and J. Pearl. Probabilities of Causation: Bounds and Identification. Annals of Mathematics and Artificial Intelligence, 28:287–313, 2000.
Pearl [2001] J. Pearl. Direct and Indirect Effects. In Proceedings of the 17th Conference on Uncertainty in Artificial Intelligence, pages 411–420, 2001.
Geneletti [2007] S. Geneletti. Identifying Direct and Indirect Effects in a Non-Counterfactual Framework. Journal of the Royal Statistical Society Series B, 69:199–215, 2007.
VanderWeele et al. [2014] T. J. VanderWeele, S. Vansteelandt, and J. M. Robins. Effect Decomposition in the Presence of an Exposure-Induced Mediator-Outcome Confounder. Epidemiology, 25:300–306, 2014.
Fulcher et al. [2020] I. R. Fulcher, I. Shpitser, S. Marealle, and E. J. Tchetgen Tchetgen. Robust Inference on Population Indirect Causal Effects: The Generalized Front Door Criterion. Journal of the Royal Statistical Society Series B, 82:199–214, 2020.
Pearl [2009] J. Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, 2009.
Pearl [2012] J. Pearl. The Causal Mediation Formula - A Guide to the Assessment of Pathways and Mechanisms. Prevention Science, 13:426–436, 2012.
Peña [2023] J. M. Peña. Bounding the Probabilities of Benefit and Harm Through Sensitivity Parameters and Proxies. Journal of Causal Inference, 11:20230012, 2023.

$\displaystyle p(u)=0.3$	$\displaystyle p(x\|u)=0.2$	$\displaystyle p(y\|x,u)$	$\displaystyle=0.9$
$\displaystyle p(y\|x,u^{\prime})$	$\displaystyle=0.7$
	$\displaystyle p(x\|u^{\prime})=0.9$	$\displaystyle p(y\|x^{\prime},u)$	$\displaystyle=0.8$
$\displaystyle p(y\|x^{\prime},u^{\prime})$	$\displaystyle=0.1.$

$\displaystyle p(u)=0.4$	$\displaystyle p(x\|u)=0.1$	$\displaystyle p(y\|x,u)$	$\displaystyle=0.9$
$\displaystyle p(y\|x,u^{\prime})$	$\displaystyle=0.2$
	$\displaystyle p(x\|u^{\prime})=0.4$	$\displaystyle p(y\|x^{\prime},u)$	$\displaystyle=0.3$
$\displaystyle p(y\|x^{\prime},u^{\prime})$	$\displaystyle=0.4.$

		$\displaystyle\sum_{z}p(z\|x)\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})=0\text{ or }$
		$\displaystyle\sum_{z}p(z\|x^{\prime})\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})=0\text{ or }$
		$\displaystyle\sum_{z}[p(z\|x)+p(z\|x^{\prime})]\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})=p(y)\text{ or }$
		$\displaystyle p(y)=0$		(10)

		$\displaystyle\sum_{z}[p(z\|x)+p(z\|x^{\prime})]\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})\leq 1\text{ and }$
		$\displaystyle\sum_{z}p(z\|x)\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})\leq p(x,y)+p(x^{\prime},y^{\prime})\text{ and }$
		$\displaystyle\sum_{z}p(z\|x^{\prime})\sum_{\dot{x}}p(y\|\dot{x},z)p(\dot{x})\leq p(x,y^{\prime})+p(x^{\prime},y)$		(11)

	$\displaystyle p(z\|x)=0.75$	$\displaystyle p(y\|x,z)=0.8$
		$\displaystyle p(y\|x,z^{\prime})=0.4$
	$\displaystyle p(z\|x^{\prime})=0.4$	$\displaystyle p(y\|x^{\prime},z)=0.3$
		$\displaystyle p(y\|x^{\prime},z^{\prime})=0.2.$