Abstract
Through a sensitivity analysis, the analyst attempts to determine whether a conclusion of causal inference could be easily reversed by a plausible violation of an identification assumption. Analytic conclusions that are harder to alter by such a violation are expected to add a higher value to scientific knowledge about causality. This article presents a weighting-based approach to sensitivity analysis for causal mediation studies. Extending the ratio-of-mediator-probability weighting (RMPW) method for identifying natural indirect effect and natural direct effect, the new strategy assesses potential bias in the presence of omitted pretreatment or posttreatment covariates. Such omissions may undermine the causal validity of analytic conclusions. The weighting approach to sensitivity analysis reduces the reliance on functional form assumptions and removes constraints on the measurement scales for the mediator, the outcome, and the omitted covariates. In its essence, the discrepancy between a new weight that adjusts for an omitted confounder and an initial weight that omits the confounder captures the role of the confounder that contributes to the bias. The effect size of the bias due to omitted confounding of the mediator–outcome relationship is a product of two sensitivity parameters, one associated with the degree to which the omitted confounders predict the mediator and the other associated with the degree to which they predict the outcome. The article provides an application example and concludes with a discussion of broad applications of this new approach to sensitivity analysis. Online Supplemental Material includes R code for implementing the proposed sensitivity analysis procedure.
Causal mediation analysis in its simplest form decomposes the total treatment effect on an outcome into an indirect effect that transmits the treatment effect through a focal mediator and a direct effect. The indirect effect and the direct effect are identified typically under the assumption that there is no omission of pretreatment or posttreatment confounders. A pretreatment confounder precedes the treatment assignment and may confound the relationship between the treatment and the mediator, between the treatment and the outcome, and between the mediator and the outcome. A posttreatment confounder is an intermediate outcome of the treatment and may confound the mediator–outcome relationship. The latter is sometimes called an “exposure-induced mediator–outcome confounder” in the epidemiology literature. In causal mediation studies, selection bias due to potential confounding of the mediator–outcome relationship is a major concern. This is because even if the treatment assignment is randomized, the mediator value assignment under each treatment condition typically is not. Sensitivity analysis is an attempt to determine whether a plausible violation of an identification assumption could easily reverse a conclusion of statistical inference. Analytic conclusions that are harder to alter by such a violation are expected to add a higher value to scientific knowledge.
An important advance in causal mediation analysis in the recent years is to relax the standard assumptions of linearity and additivity typically invoked by path analysis and structural equation modeling (see Holland, 1988, for an explication of these assumptions). In particular, conventional practice of mediation analysis assumes that the mediator–outcome relationship does not differ between the experimental condition and the control condition. Hence, the direct effect does not depend on any particular value at which the mediator is fixed. In contrast, as we show in Section I, individual-specific direct effect and indirect effect defined under the potential outcomes framework do not imply this arbitrary assumption. Several innovative strategies for causal mediation analysis accommodate the general case in which the mediator–outcome relationship is not necessarily the same across the treatment conditions (see, for reviews, Hong, 2015; VanderWeele, 2015). The methodological advance, however, creates a challenge in sensitivity analysis.
Although the potential risk of omitting pretreatment confounders has received much attention in causal mediation analysis, handling posttreatment confounders requires particular caution. It is well known that statistical adjustment for a posttreatment covariate will lead to bias in identifying the treatment effect on the mediator and its effect on the outcome (Rosenbaum, 1984). When Treatment × Mediator interactions are absent, adjusting for posttreatment confounding of the mediator–outcome relationship is nonetheless possible through specifying marginal structural models and employing inverse probability weighting (VanderWeele, 2009). However, as shown by Avin, Shpitser, and Pearl (2005), when Treatment × Mediator interactions are present, identifying the direct effect and indirect effect becomes infeasible if there exist posttreatment confounders. We will elaborate on this issue in Section IV. Since the existence of posttreatment confounding is often inevitable, there is an acute need to assess the potential bias due to omitted posttreatment confounders, observed or unobserved.
Researchers have proposed different sensitivity analysis strategies in causal mediation studies under treatment randomization when the mediator–outcome relationship is possibly confounded (see, for reviews, Hong, 2015, pp. 261–265; VanderWeele, 2015, pp. 66–97). Most of these strategies use sensitivity parameters to quantify potential bias on the basis of a series of parametric assumptions (e.g., Imai, Keele, & Tingley, 2010; Imai, Keele, & Yamamoto, 2010; Tchetgen Tchetgen & Shpitser, 2012; VanderWeele, 2010, 2015; VanderWeele & Chiba, 2014; Vansteelandt & VanderWeele, 2012), while some others employ bounds for the causal effects of interest (Ding & VanderWeele, 2016; Imai & Yamamoto, 2013; Sjölander, 2009). Each of these different strategies has its unique strengths and may inform conclusions in a certain class of applications. Yet their constraints are also noteworthy. In particular, when parametric assumptions are required for implementing a sensitivity analysis strategy, such assumptions may seem to be arbitrary, overly strong, or apparently implausible in some applications; in the meantime, there are potential concerns that bounds are often too wide to be informative.
This article introduces a weighting-based approach to sensitivity analysis as an alternative to the existing strategies. A related strategy has appeared in past research for assessing sensitivity to hidden bias in the context of a propensity score weighting-based evaluation of a binary treatment (Ridgeway, 2006). With a focus on causal mediation analysis, we propose a new approach that (a) has the generality of assessing potential bias associated with posttreatment covariates as well as bias associated with omitted pretreatment covariates, (b) requires relatively few parametric assumptions, and (c) is unconstrained by the measurement scales for the mediator, the outcome, and the omitted covariates.
The article is organized as follows: Section I defines the causal effects of interest in a mediation analysis. Section II briefly reviews the ratio-of-mediator-probability weighting (RMPW) theorem and the sequential ignorability assumption. Section III derives the selection bias due to an omitted pretreatment confounder. Section IV derives the selection bias due to an omitted posttreatment confounder preceding the focal mediator. Section V discusses practical strategies for empirically assessing the plausibility that potential bias may alter the initial conclusion. Section VI illustrates the implementation with a real-data application. Section VII shows an extension when omitted pretreatment covariates additionally confound the treatment–mediator and treatment–outcome relationships. Section VIII compares the new weighting-based approach to the existing techniques for sensitivity analysis. Section IX concludes and discusses further extensions.
I. Causal Effects of Interest
Let Z denote treatment assignment that takes value 1 or 0 representing an experimental condition and a control condition, respectively; let M denote the focal mediator; and let Y denote the outcome. In addition, X denotes a vector of observed pretreatment covariates, P denotes a potentially omitted pretreatment covariate, and Q denotes a potentially omitted posttreatment covariate preceding the focal mediator. In the absence of P and Q, the relationships between Z, M, Y, and X are illustrated in Figure 1.
We define the causal effects of interest in terms of potential outcomes (Holland, 1988; Pearl, 2001; Robins & Greenland, 1992). Specifically, M(1) and M(0) denote the potential intermediate outcomes of an individual under the experimental condition and the control condition, respectively. The potential outcome under treatment condition z, typically written as Y(z) for z = 0, 1, can equivalently be written as . Hence, and denote the individual’s potential outcomes under the experimental condition and the control condition, respectively. Additionally, denotes the individual’s potential outcome under the experimental condition when the treatment counterfactually does not change the mediator from its value under the control condition M(0) to that under the experimental condition M(1).
Following Pearl’s (2001) terminology, we define the natural indirect effect (NIE) as
| 1 |
This is the population average treatment effect on the outcome transmitted through the treatment-induced change in the mediator—that is, with a change in the mediator value from M(0) to M(1) when all individuals are treated. Note that M(0) and M(1) are each a random variable that can “naturally” take different values for different individuals. The natural direct effect (NDE) is defined as
| 2 |
This is the population average treatment effect on the outcome attributable to other unspecified pathways. Robins and Greenland (1992) called Equations 1 and 2 “the total indirect effect” and “the pure direct effect,” respectively. The sum of NIE and NDE is the total treatment effect on the outcome. This total effect can alternatively be decomposed into “the pure indirect effect” defined as and “the total direct effect” defined as . The two decompositions are nonequivalent in the presence of a Treatment × Mediator interaction. For simplicity, the discussion in this article focuses on NIE and NDE.
II. RMPW and Sequential Ignorability
Hong (2010a) first introduced a new propensity score–based weighting approach to causal mediation analysis named “RMPW.” Hong and others (Hong, 2015; Hong, Deutsch, & Hill, 2011, 2015; Hong & Nomi, 2012; Huber, 2014; Lange, Rasmussen, & Thygesen, 2014; Lange, Vansteelandt, & Bekaert, 2012; Qin & Hong, 2014, 2016, 2017; Tchetgen Tchetgen, 2013; Tchetgen Tchetgen & Shpitser, 2012) subsequently extended the RMPW approach and its several variants to quasi-experimental data to discrete and continuous mediators and outcomes, to multiple mediators, and to multisite mediation analysis.
RMPW identifies NIE and NDE under the sequential ignorability assumption summarized by Imai and colleagues (2010). Pearl (2001) stated a set of assumptions in slightly different forms:
A1: The treatment assignment is ignorable, given pretreatment covariates X.
B1: The mediator value assignment under each treatment condition is ignorable, given X.
A1 is satisfied if the treatment assignment is in fact randomized or is as if randomized within levels of X = x. Such a treatment assignment mechanism allows Z to be independent of the potential mediators M(1) and M(0) and of the potential outcomes and , given X. It also implies that, for those with the same pretreatment characteristics x, the probability of being assigned to either treatment condition is nonzero (known as the positivity assumption).
B1 is satisfied if, under each treatment condition, the mediator value assignment is as if randomized within levels of X = x. This is plausible when individuals sharing the same pretreatment characteristics happen to display different mediator values due to random events in life. Such a mechanism of mediator value assignment under the experimental condition allows M(1) to be independent of the potential outcomes and , given X. Analogously, a similar mechanism of mediator value assignment under the control condition allows M(0) to be independent of and , given X. B1 also implies that the probability of being assigned to any mediator value is nonzero under each treatment condition.
When the sequential ignorability assumption holds, for z, z′ = 0, 1, and z unequal to z′,
| 3 |
Here, under a simple randomization of the treatment; otherwise, , where is the propensity score for the treatment assignment. This strategy is known as inverse probability of treatment weighting (IPTW; Robins, 2000) or model-based direct adjustment (Rosenbaum, 1987). In a nonexperimental study, Wz adjusts for treatment selection associated with pretreatment covariates X.
The RMPW theorem (Hong, 2010a, 2015) states that, to identify , the weight for a discrete mediator is
| 4 |
Here, , for z = 0, 1, is the propensity score for the mediator value assignment under treatment condition z. This weight applies to individuals in the experimental group who displays mediator value m and pretreatment characteristics x. Once IPTW has removed treatment selection, RMPW effectively transforms the mediator distribution in the experimental group to resemble that in the control group within levels of . If the treatment is randomized, the weight is simplified to be . Hong (2010a, 2015) and colleagues (Hong et al., 2011, 2015; Hong & Nomi, 2012) have shown that RMPW has the flexibility of handling binary and multicategory mediators. Applying the Bayes theorem to Equation 4, it is easy to derive an alternative form of the weight that can be conveniently applied to a continuous mediator:
| 5 |
where . This form of the weight is inherently related to the form presented by Huber (2014). Under a simple randomization of the treatment, , Equation 5 is then simplified to be .
NIE and NDE can each be identified simply by a weighted mean contrast:
Importantly, identification through weighting does not require the analyst to specify the outcome as a function of the treatment, the mediator, and the covariates. Hence, the analyst does not have to take the risk of possibly misspecifying the functional form of the outcome model. In this way, the RMPW method conveniently accommodates Treatment × Mediator interactions, linear or nonlinear, that may naturally vary among individuals. In contrast, all regression-based identification strategies require that Treatment × Mediator interactions be either nonexistent or homogeneous. Imai and Yamamoto (2013) derive the bias when the assumption of homogeneous Treatment × Mediator interaction fails. Applications of regression-based strategies also typically invoke high-stake assumptions such as invariant covariate–outcome relationships across treatment conditions and across mediator levels. The weighting strategy avoids these model-based assumptions, although the analyst must specify the functional form of or the functional form of .
III. Bias Due to an Omitted Pretreatment Confounder
We now consider the case in which there is an additional pretreatment covariate P that is independent of the observed covariates X and may confound the mediator–outcome relationship as shown in Figure 2. The same logic applies when P represents a vector of omitted pretreatment confounders. In general, when the observed covariates X have already been adjusted for in the original analysis, an omitted confounder that is independent of X tends to be more concerning than one that is associated with X.
When the treatment assignment is randomized or is as if randomized within levels of , part (A1) of the sequential ignorability assumption is satisfied. Part (B1) of this assumption now states that the mediator value assignment under each treatment condition is ignorable given X and P. In this case, applying the weight defined in Equation 4 for a discrete mediator will no longer identify . We can easily prove that will be identified instead by , where the new weight is
| 6 |
Here, for z = 0, 1. See Online Appendix A for a proof of this identification result. If the treatment is randomized, the weight is simplified to be . The new weight WP is distinguished from W in that WP makes adjustment for the confounding effects of both X and P, while W adjusts for X only. The new weight for a continuous mediator is
| 7 |
Under a simple randomization of the treatment, we have that . Hence, the weight becomes .
If P is the only omitted confounder, once P is taken into account, NIE can be identified by and NDE can be identified by . In contrast, when the weight defined in Equation 4 or 5 is employed, omitting pretreatment covariate P that confounds the mediator–outcome relationship will lead to a bias. In the attempt to identify NIE, the bias is equal to
It is easy to show that the attempt to identify NDE will lead to a bias equal to . Online Appendix B shows the derivation. Let represent the standard deviation of the outcome in the experimental group. With H as the metric, to identify the effect size of NIE, the bias is a product of two sensitivity parameters:
The bias in identifying the effect size of NDE is simply .
Clearly, the discrepancy between WP and W plays a central role in assessing bias. In identifying NIE and its effect size, the bias is determined by (a) the standard deviation of in the experimental group denoted by σ and (b) the correlation between and Y in the same group denoted by ρ. For transparency, in the weighting-based sensitivity analysis, we will represent the effect size of bias in the NIE estimate and that in the NDE estimate each as a function of these two sensitivity parameters: σ and ρ. The bias is 0 when for all individuals (i.e., when ) or when the correlation between and Y is 0 (i.e., when ). The bias increases with σ in the experimental group; when σ is fixed, the direction and magnitude of bias are consistent with the direction and magnitude of ρ in the experimental group. We can show that σ is associated with the degree to which P predicts mediator value assignment under either treatment condition and that ρ is additionally related to the degree to which P predicts the outcome. Online Appendix C clarifies the inherent connections between these two weighting-based sensitivity parameters and the conventional regression-based sensitivity parameters.
IV. Bias Due to an Omitted Posttreatment Confounder
A posttreatment confounder preceding the focal mediator is a covariate that is an intermediate outcome of the treatment and yet at the same time is a predictor of the focal mediator and of the outcome. In Figure 3, the posttreatment covariate Q plays the role of an additional mediator (or a vector of omitted posttreatment confounders) that precedes the focal mediator M.
Viewing Q and M as consecutive mediators, we may define NIE and NDE as follows:
| 8 |
| 9 |
Note that can be simplified as , that can be simplified as , and that can be simplified as . Hence, seemingly Equation 8 is equivalent to Equation 1; and Equation 9 is equivalent to Equation 2. However, unlike the previous cases, here the causal effect transmitted through M includes the Z-M-Y pathway and the Z-Q-M-Y pathway; the causal effect not transmitted through M includes the Z-Q-Y pathway and the direct Z-Y pathway.
Hong (2015) explicated the identification assumptions under which the causal effects defined in Equations 8 and 9 can be identified and derived the weights needed for the identification. Specifically, the sequential ignorability assumption now consists of three ignorabilities:
A2: The treatment assignment Z is ignorable, given X.
B2: The assignment of values of the first mediator Q under each treatment condition is ignorable, given X.
C2: The assignment of values of the second mediator M under each treatment condition is ignorable, given X and Q.
Assumption (C2), analogous to assumption (B1), views both M(0, Q(0)) and M(1, Q(1)) to be as if randomized within levels of Q = q and X = x. Therefore, M(0, Q(0)) is assumed to be conditionally independent of all future potential outcomes including Y(1, q, m). This is a major distinction between the current set of assumptions and those invoked in past research for multiple mediators (e.g., Imai & Yamamoto, 2013). Instead of assuming that the mediator under the control condition is conditionally independent of the potential outcome under the experimental condition, the latter invokes structural assumptions such as assuming homogeneous Z-by-M interaction and assuming that Q-by-M interaction does not exist.
Under Assumptions (A2), (B2), and (C2), one can identify by applying the following weight to individuals in the experimental group who displays first mediator value q, second mediator value m, and pretreatment characteristics x:
| 10 |
Here, , for z = 0, 1 and for all possible values of q. Online Appendix D provides a proof. If the treatment is randomized, the weight can be simplified to be For a continuous focal mediator M, after applying the Bayes theorem, the weight takes the following form:
| 11 |
Here, and . Under a simple randomization of the treatment assignment, we have that and . The weight is then simplified to be . Therefore, NIE and NDE can be respectively identified by
When Q is omitted, however, the causal effect transmitted through M will be confounded by the Z-Q-Y pathway. Applying the weights defined in Equation 4 or 5 to identify NIE will lead to a bias equal to ; the bias in the identification of NDE will be . Again with as the metric, in identifying the effect size of NIE, the bias is
The bias in identifying the effect size of NDE is its opposite. Hence, the bias associated with the omission of Q is determined by (a) the standard deviation of in the experimental group (denoted by σ) and (b) the correlation between and Y in the same group (denoted by ρ). As before, it can be shown that σ is inherently connected to the predictive relationship between Q and M and that ρ is additionally connected to the predictive relationship between Q and Y.
V. Assessing Potential Bias
Sections III and IV have examined two types of covariates—an omitted pretreatment covariate P and an omitted posttreatment covariate Q—that may confound the mediator–outcome relationship. The discrepancy between the new weight that adjusts for the confounder and the initial weight that omits the confounder is central to determining the bias in identifying the natural indirect and direct effects and their effect sizes. To summarize, we now use W# as a general form of the new weights standing for WP or WQ that make respective adjustments for P or Q along with X in identifying . In an application in which pretreatment and posttreatment confounders both exist, one may compute their aggregate bias.
When one or more confounders are omitted, the bias in identifying NIE can be represented as the covariance between the weight discrepancy and the outcome in the experimental group:
| 12 |
The bias in identifying NDE is its opposite. The bias in identifying the effect sizes of NIE and NDE can be further represented as a product of two sensitivity parameters: (a) the standard deviation of the weight discrepancy and (b) the correlation between the weight discrepancy and the outcome in the experimental group. The former is associated with the degree to which the omitted confounder is associated with the focal mediator value assignment; the latter is additionally related to the degree to which the omitted confounder predicts the outcome. We now discuss how to apply these results in practice.
Our first device is to graphically represent plausible bias in relation to the sensitivity parameters. Figure 4 illustrates a template for assessing bias in identifying the effect size of NDE and that in NIE. Here, σ has a lower bound 0 and an upper bound with some finite value. When the analyst makes additional covariance adjustment for strong predictors of the outcome, ρ would likely be bounded away from −1 and 1. The bias is 0 when σ = 0 or when ρ = 0. In this figure, the curves associated with nonzero correlation values and nonzero standard deviation values each represent a fixed amount of bias equal to ρσ. The bold dashed curve on each side corresponds to a threshold value of bias. The bias values between these two dashed curves are too small in magnitude to change the conclusion of statistical inference, while those in the outskirts of these dashed curves indicate bias values big enough in magnitude to alter the conclusion.

Figure 4. Graphical display of bias in the effect size of NDE and that in NIE. The solid curves indicate amounts of bias that may lead to a qualitative change in the conclusion of statistical inference. The number at the end of each curve denotes the bias value represented by that curve. The bold dashed curve on each side corresponds to a threshold value of bias that is just great enough to start changing the analytic conclusion. Each “*” corresponds to an observed covariate or a set of covariates that, if omitted, would contribute a bias great enough to change the analytic conclusion; each “Δ” corresponds to a covariate (or a set of covariates), the omission of which would be inconsequential. These covariates are drawn from the application example and are listed in Table 1.
To illustrate, suppose that an initial estimate of the effect size of NIE is negative but is not statistically significant and that its 95% confidence interval is . After the removal of a plausible positive bias, , the new confidence interval for the effect size of NIE would shift entirely to the negative side and would lead to a new conclusion that the NIE is negative and statistically significant. After a plausible negative bias, , is removed, the new confidence interval would shift entirely to the positive side. The analyst would then reach a new conclusion that the NIE is positive and statistically significant.
Arguably, the entire set of observed pretreatment and posttreatment covariates supply a range of plausible reference values for the sensitivity parameters. Such information will enable applied researchers to determine the amount of plausible bias. Therefore, one may supplement Figure 4 with a sensitivity analysis table listing the estimated bias that each observed pretreatment or posttreatment covariate or each subset of the observed covariates would potentially and uniquely have contributed if it had been omitted from the analysis. Even though some of the pretreatment covariates may have been adjusted for already in the original analysis, they nonetheless provide sensitivity information should there be a comparable unmeasured confounder, while some others may have been omitted for the purpose of keeping the propensity score models parsimonious. Table 1 provides examples of omitted observed confounders in the context of the application to be discussed in the next section. Considered on the basis of these observed covariates, the plausible bias values that are great enough in magnitude to alter the initial conclusion are flagged with an “*” in Table 1 and are marked with an “*” in Figure 4. The plausible bias values due to the omission of a covariate or a subset of covariates, if inconsequential, are marked with a “Δ” in Figure 4.
|
Table 1. Sensitivity Analysis: National Evaluation of Welfare-to-Work Strategies Application

In some applications, researchers could argue on scientific grounds that the study has collected some of the most important confounders, pretreatment or posttreatment. This is often the case, for example, in educational, behavioral, or health research in which the list of covariates adjusted for in the initial analysis includes one or more pretest scores or detailed health records (see Shadish, Clark, & Steiner, 2008). The remaining observed and unobserved covariates, when omitted individually or collectively, might not contribute a bias great enough to qualitatively alter the conclusion. When this is the case, then by convention, one may decide that the current results are not highly sensitive to the omitted confounding. In contrast, when the existing collection of observed covariates is far from being nearly complete, one may consider the results to be relatively sensitive if an omitted covariate with a rather small value of bias would lead to a change in the conclusion.
This sensitivity analysis strategy is easy to implement in standard statistical software programs. We provide in the Online Supplemental Material a set of R code for conducting the proposed sensitivity analysis procedure when the treatment is or is not randomized and when the mediator is measured on various scales. The R code computes the effect size of bias associated with each omitted pretreatment or posttreatment covariate or their combinations; it also computes the effect size of bias due to unmeasured confounders comparable to the observed pretreatment confounders. When the omitted or unmeasured pretreatment covariates do not confound the treatment assignment, the code generates both tabular and graphical displays of results (as in Table 1 and Figure 4). Otherwise, only a sensitivity analysis table is displayed. The R code has been incorporated into the “rmpw” package available online at https://cran.r-project.org/web/packages/rmpw/index.html.
Should an omitted covariate become available for the causal analysis, there would likely be a change not only in the point estimate but also in the asymptotic variance of the estimate for the NIE and the NDE. When the focus of a sensitivity analysis is solely on bias rather than efficiency, one may consider the plausible reference values for the sensitivity parameters to be given rather than estimated. Hence, removing a plausible value of bias from an initial estimate would lead to a change in the lower and upper bounds but not the width of the 95% confidential interval for the NIE and that for the NDE. If the impact of an omitted confounder on the estimation efficiency is of interest as well, the analyst may use bootstrapping to obtain a new confidence interval accordingly.
VI. An Application Example
Here, we illustrate with the National Evaluation of Welfare-to-Work Strategies (NEWWS) Riverside study. Immediately preceding the welfare reform nationwide in the mid-1990s, welfare applicants in Riverside, California, were assigned at random to either a labor force attachment (LFA) program (Z = 1) or a control condition (Z = 0) in an experimental study. The LFA program, with the goal of eventually weaning participants from the welfare system, emphasized seeking and securing employment, offered job search services, and provided incentives including a threat of sanctions should one fail to meet the program requirements, while the control group members were guaranteed cash assistance without the requirement for employment. Hong, Deutsch, and Hill (2015) investigated whether a treatment-induced increase in employment mediated the treatment impact 2 years later on maternal depression (Y) among participants, most of whom were single mothers with preschool-age children and were disproportionately depressed at the baseline. As a summary score over 12 self-administered items, the measure of depression had a mean equal to 7.49 and a standard deviation equal to 7.74. The mediator (M) was a binary indicator for whether one was employed in any quarter during the 2 years after randomization.
Although the average intention-to-treat (ITT) effect of being assigned to LFA rather than the control condition on maternal depression was indistinguishable from zero, this could possibly be due to a cancellation of the indirect effect and the direct effect. The researchers theorized that, on one hand, assignment to LFA would reduce depression on average if the program induced a large number of individuals to participate in the labor force (i.e., a negative NIE); on the other hand, assignment to LFA would increase depression on average if the employment rate remained at a relatively low level (i.e., a positive NDE). Applying the RMPW method to the data, the researchers reported a negative estimate of NIE (coefficient = −0.88, standard error [SE] = .47, t = −1.87, p = .06) and a positive estimate of NDE (coefficient = 1.26, SE = .86, t = 1.47, p = .14). With the standard deviation of the outcome in the experimental group as the metric, the estimated effect size of NIE is about −0.111; and the estimated effect size of NDE is about 0.158. The 95% confidence interval for the effect size of NIE is [−0.227, 0.006] and that for the effect size of NDE is [−0.054, 0.371]. Even though the null hypotheses are retained in both cases, the confidence interval for NIE is predominantly on the negative side while that for NDE is predominantly on the positive side. These results suggested that, should all individuals have been assigned to LFA, the treatment-induced change in employment (an increase in employment rate from 39.5% to 65.4%) would produce a considerable amount of reduction in maternal depression on average. Without this increase in employment, however, maternal depression would have increased on average by a considerable amount should all individuals have been assigned to LFA rather than to the control condition. Yet neither estimate reached the conventional statistical significance level for rejecting the null hypothesis. A subsequent reanalysis of the data that employed a generalized method-of-moments procedure to account for the sampling error in the estimated propensity scores generated similar point estimates and SE estimates (Bein et al., in press).
The NEWWS sample included 208 experimental units and 486 control units. To avoid overfitting the propensity score model for the mediator under each treatment condition, Hong, Deutsch, and Hill (2015) selected nine pretreatment covariates for each of these models. On the basis of theoretical reasoning, one may speculate that some observed pretreatment and posttreatment covariates omitted from the initial analysis as well as some unobserved covariates might bias the NIE and NDE estimates. Below are two examples of observed confounders that were omitted from the initial analysis.
The data show that individual preference for working rather than taking care of family full time, independent of the baseline employment experience and baseline depression level, predicts labor force participation under LFA. Individuals with a higher preference for working tend to be overrepresented among the ever employed and underrepresented among the never employed in the LFA group. Hence, once additional adjustment is made for this covariate, people with a higher preference and ever employed and those with a lower preference and never employed, due to their overrepresentation, are expected to receive a new weight less than the initial weight (i.e., ). These people tended to experience relatively less depression at the 2-year follow-up perhaps because their preferences were seemingly satisfied. In contrast, individuals with a higher preference yet never employed and those with a lower preference yet ever employed, due to their under-representation, are expected to receive a new weight greater than the initial weight, . These people would likely experience relatively more depression. A positive correlation between weight discrepancy and the outcome () will contribute a positive bias to the original NIE estimate. The bias in the original NDE estimate will be negative and will have the same magnitude due to the treatment randomization.
A second example is whether an individual was ever in a situation of not receiving welfare during the first year after randomization. About 30.3% of the LFA group members and 21.6% of the control group members had ever been off welfare during the first year. For low-income single mothers who had a hard time to make the ends meet, losing the public safety net would likely trigger or aggravate depression; yet being off welfare might also propel an individual to make her best effort in securing employment in the job market. Therefore, having ever been off welfare during the first year is an observed posttreatment covariate that may confound the relationship between the mediator and the outcome.
For illustration, Table 1 lists the above two observed covariates along with two other covariates that were also omitted from the initial adjustment. Three of them are pretreatment and one is posttreatment. Here, X1 denotes whether an individual had been on welfare for at least 36 months during the 60 months prior to randomization; X2 measures an individual’s level of preference for working as opposed to taking care of family full time at the baseline; X3 indicates whether an individual had ever obtained a high school diploma or a General Educational Development (GED) certificate before randomization; and X4 is an indicator for whether an individual was ever off welfare during the first year after randomization. We enter each of these four variables one at a time as an additional covariate in the propensity score model and obtain a new weight accordingly. We also assess the amounts of bias when different combinations of these covariates (i.e., X1X2, X1X3, X1X4, X2X3, X2X4, X3X4, X1X2X3, X1X2X4, X1X3X4, X2X3X4, and X1X2X3X4) are omitted from the analysis.
Table 1 lists, for each of these omitted covariates, the standard deviation of the discrepancy between the new weight and the initial weight (σ) and the correlation between the weight discrepancy and the outcome (ρ) in the experimental group. Among the four covariates in this example, X2 contributes a bias of the largest magnitude when omitted. As we have reasoned earlier, this omission indeed results in a positive bias for the NIE estimate and a negative bias for the NDE estimate. Once this bias is removed, the estimated effect size of NIE is modified to be −.132 and becomes statistically significant. The estimated effect size of NDE increases to 0.179 though remains insignificant statistically. Because the initial 95% confidence interval for NIE is predominantly on the negative side, any omitted covariate with a relatively small positive bias may alter the conclusion once an adjustment is made for such a covariate. This is true with five of the covariates and covariate combinations considered in the current example (marked with “*” in Table 1). Hence, we conclude that the results of the initial NEWWS application are sensitive to the influence of potential confounders.
VII. Bias in the Presence of Omitted Pretreatment Covariates Predicting Treatment Assignment
The weighting-based approach to sensitivity analysis is suitable for quantifying the consequences of both unadjusted treatment selection and unadjusted mediator value selection. Suppose that the treatment assignment and the mediator value assignment are ignorable, given pretreatment covariates X and P, yet P has been omitted in the initial analysis. We may apply IPTW to adjust for treatment selection, given the identification result that in which for z = 0, 1. Here, . Unlike Wz that adjusts for treatment selection associated with X only, adjusts for treatment selection associated with both X and P. Multiplying the IPTW with the RMPW, we are able to identify by , where for a discrete mediator
| 13 |
For a continuous mediator, the weight is
| 14 |
Hence, identifies NIE, while identifies NDE.
Omitting a pretreatment covariate P that confounds the treatment–mediator, treatment–outcome, and mediator–outcome relationships, the analyst instead applies the weight specified in Equation 4 or 5. The bias in identifying NIE and NDE will each have an additional component. This is because, when P is omitted, there will be a bias not only in identifying but also in identifying and . The bias in identifying NIE is equal to
In identifying NDE, the omission will lead to a bias equal to
See Online Appendix E for the derivation. Importantly, when the omission of P also leads to a bias in identifying the ITT effect of the treatment on the outcome, the bias in identifying NIE and that in identifying NDE no longer sum to 0. The bias in the identification of the ITT effect is
This result indicates that the weighting approach to sensitivity analysis is applicable to a broad range of causal investigations including ITT analysis. Finally, with as the metric, in identifying the effect size of NIE, the bias is
The bias in identifying the effect size of NDE is
In each case, the bias increases in magnitude when the standard deviation of the discrepancy between the true weight and the ill-conceived weight increases. This quantity is associated with the degree to which P predicts Z or M after adjustment for X. The bias also increases when the correlation between this discrepancy and Y increases in the experimental group or the control group, which is additionally related to the degree to which P predicts Y.
VIII. Comparisons With Other Existing Strategies for Sensitivity Analysis
This section reviews a number of sensitivity analysis techniques that have emerged in the recent years for causal mediation analysis and highlights their differences and connections with the weighting-based approach described above. The weighting-based bias formula is inherently related to the general form of the bias formula derived by VanderWeele (2010) under treatment randomization. This is because the discrepancy between a new weight and an initial weight due to an omitted pretreatment covariate fully captures the association between the omitted covariate and the mediator. Online Appendix F provides derivation that reveals this equivalency. However, the weighting-based approach has some important distinctions from the existing strategies for sensitivity analysis.
VanderWeele (2010) represents the amount of omitted confounding effect as a product of two hypothetical sensitivity parameters: The first parameter specifies an omitted covariate’s association with the treatment at a fixed mediator level and the second specifies its association with the outcome when the treatment and the mediator level are both fixed. In implementation, one must make simplifying assumptions (a) that the omitted covariate is binary; additionally in predicting the outcome, one must assume (b) that the omitted covariate does not interact with the treatment or the mediator and (c) that the observed pretreatment covariates do not interact with the treatment or the mediator. These simplifying model-based assumptions make it feasible and relatively convenient to implement the regression model-based sensitivity analysis techniques. However, these assumptions may prove to be scientifically unrealistic in some cases. For example, in the NEWWS data, some observed pretreatment covariates clearly interact with the treatment or the mediator in predicting the outcome. The sensitivity analysis strategies proposed within the regression framework would become overwhelmingly difficult when linear or nonlinear interactions exist between omitted covariates and the treatment or between omitted covariates and the mediator. Incorrect parametric assumptions are consequential for bias assessment. The analyst would also face challenges when an omitted covariate is multivalued. In contrast, the weighting-based bias formula does not require an omitted covariate to be binary and can be easily extended to nonexperimental treatment assignment.
In Imai and colleagues’ approach (Imai, Keele, & Tingley, 2010; Imai, Keele, & Yamamoto, 2010; Imai & Yamamoto, 2013), the key sensitivity parameter representing the severity of confounding is a hypothetical correlation between the error in the mediator model and that in the outcome model. This correlation coefficient can be related to the proportion of variance in the mediator and that in the outcome explained by a hypothetical omitted confounder. The implementation requires that the analyst explicitly specifies a parametric mediator model and a parametric outcome model consistent with the corresponding parametric models utilized for estimating NIE and NDE. Under the linear structural equation modeling framework, this strategy has a great appeal because it can accommodate some nonlinear and nonadditive terms in the parametric models and because the mediator and the omitted covariates are not constrained to be binary. Nonetheless, the model does not allow the Treatment × Mediator interaction to be heterogeneous among individuals. Additional sensitivity analysis would be required, as is the case in applications of all other regression-based methods, for assessing the consequences of outcome model misspecifications. Imai and Yamamoto (2013) propose a solution that involves two additional sensitivity parameters, one representing the correlation between the mediator and the Treatment × Mediator interaction and the other representing the variance of the Treatment × Mediator interaction. This additional sensitivity analysis becomes unnecessary when the RMPW strategy is employed for identification and when the weighting-based approach is utilized for sensitivity analysis.
Tchetgen Tchetgen and Shpitser (2012) propose to represent omitted confounding as a function of the treatment, the mediator, and the observed covariates. The sensitivity bias function is indexed by a vector of parameters possibly with a high dimension when the number of observed confounders is relatively large. This function is assumed to be known and thus must be specified by the analyst. A change in the hypothetical parameter values may lead to a change in the result of inference. To implement, the analyst would need much guidance as to how to choose the hypothetical parameter values in the selection bias function. This approach also requires that the mediator can take only a finite number of values and that the functional relationship between the mediator and the observed covariates under each treatment condition is known. The computation involves summing over the distribution of the mediator, which may become intensive or require additional parametric assumptions for multivalued mediators.
To identify NIE and NDE on the treated, Vansteelandt and VanderWeele (2012) similarly propose a selection bias function associated with an observed or unobserved posttreatment confounder. Conditioning on the posttreatment confounder, this function is meant to capture the unknown difference between treatment groups in the mediator effect on the outcome under the counterfactual experimental condition. Such a difference would indicate a confounding of the mediator–outcome relationship. The selection bias function again can take numerous possible forms; and implementation typically requires a series of additional simplifying assumptions. And finally, VanderWeele and Chiba (2014, also see VanderWeele, 2015) propose a sensitivity function for assessing bias associated with a posttreatment confounder. This sensitivity function similarly involves parameters related to the counterfactual outcomes and hence cannot be quantified by referencing to the observed posttreatment covariates.
Unlike these existing strategies, the weighting-based approach to sensitivity analysis makes no parametric assumptions about the outcome model or about the selection bias functions. Rather than relying entirely on speculation, this new approach makes use of the empirical information in the application data, including observed pretreatment and posttreatment covariates, to determine a plausible range of potential bias. This is particularly useful when researchers have measured a rich set of theoretically important covariates in a given application.
A limitation of the weighting-based approach is that, similar to the strategy proposed by Tchetgen Tchetgen and Shpitser (2012), the analyst must assume a known functional relationship between the mediator and the observed covariates under each treatment condition. This is especially true when the weight is obtained parametrically. Although the functional relationship may never be fully known, one may view a misspecified mediator model as one that has omitted certain confounders. In the NEWWS example, if the preference for taking care of family as opposed to work has a quadratic rather than a linear relationship with the logit of the mediator, the omitted quadratic form of this covariate could be entered in obtaining a new weight; the selection bias associated with this omission therefore could be quantified.
Importantly, semiparametric weighting in replacement of parametric weighting greatly reduces the reliance on the parametric assumptions that one makes in specifying propensity score models. Hong and colleagues (Hong, 2015; Hong et al., 2015) have proposed a semiparametric way of estimating RMPW by stratifying the sample on the estimated propensity scores. Researchers (Hong, 2010b, 2012, 2015; Huang, Frangakis, Dominici, Diette, & Wu, 2005) have also developed a marginal mean weighting through stratification method as an alternative to IPTW for removing bias in treatment evaluation. According to the simulation results for these methods, semiparametric weighting generates results that are considerably more robust than parametric weighting when the propensity score models are misspecified. The weighting-based sensitivity analysis can similarly take a semiparametric form.
IX. Conclusion
This article introduces a novel weighting-based approach to sensitivity analysis as an alternative to the existing strategies in causal mediation studies. The approach is consistent with the logic of using propensity score–based weighting to reduce selection bias. Therefore, it constitutes a coherent step in causal inference studies that employ weighting. In its essence, the discrepancy between a new weight that adjusts for an omitted confounder and an initial weight that omits the confounder captures the role of the confounder that contributes to the bias. The selection bias is then represented simply as the covariance between the weight discrepancy and the outcome in the experimental group. The effect size of the bias is a product of two sensitivity parameters, one associated with the degree to which the omitted confounder predicts the mediator and the other associated with the degree to which it predicts the outcome.
This new alternative displays the following features. First, the weighting strategy does not require specifying a parametric outcome model or a parametric selection bias function and thereby reducing the reliance on functional form assumptions. Also, because of this, the new strategy flexibly accommodates all the cases in which the observed or unobserved covariates predict the outcome differently across the treatment conditions or across the mediator levels under a given treatment. Second, this new strategy enables analysts to assess the consequences of omitting posttreatment as well as pretreatment confounders. Third, it makes possible to assess the aggregate bias associated with multiple omitted covariates. Fourth, the graphical display indicates the threshold value of a bias great enough to alter the inference. Fifth, this strategy can be conveniently extended to mediation studies in which neither the treatment nor the mediator is randomized. And last, the new strategy has the flexibility of allowing the mediator and the omitted covariates to be discrete or continuous. We provide R code for the implementation of sensitivity analysis with this new strategy.
In our future research, we will investigate extensions of the proposed method to multisite causal mediation analysis in which the between-site variance of NIE and that of NDE may be estimated with bias in the presence of omitted confounders. We also plan to extend the method to studies of complex mediation mechanisms that involve two or more mediators of focal interest.
Acknowledgments
The authors thank Ed Bein, Jonah Deutsch, Ken Frank, Martin Huber, Kristin Porter, and Michael Sobel for their contributions of ideas and comments on earlier versions of this article. We also thank the editor and two anonymous reviewers.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study received funding from the National Science Foundation (SES 1659935). In addition, the second author received support from a Quantitative Methods in Education and Human Development Research Predoctoral Fellowship awarded by the University of Chicago, a subcontract from MDRC funded by the Spencer Foundation, and a National Academy of Education/Spencer Foundation Dissertation Fellowship.
References
|
Avin, C., Shpitser, I., Pearl, J. (2005). Identifiability of path-specific effects. In Proceedings of the International Joint Conferences on Artificial Intelligence (pp. 357–363). Burlington, MA: Morgan Kaufmann Publishers Inc. Google Scholar | |
|
Bein, E., Deutsch, J., Hong, G., Porter, K., Qin, X., Yang, C. (in press). Two-step estimation in ratio-of-mediator-probability weighted causal mediation analysis. Statistics in Medicine. Google Scholar | |
|
Ding, P., Vanderweele, T. J. (2016). Sharp sensitivity bounds for mediation under unmeasured mediator-outcome confounding. Biometrika, 103, 483–490. Google Scholar | Medline | |
|
Holland, P. (1988). Causal inference, path analysis, and recursive structural equations models. Sociological Methodology, 18, 449–484. Google Scholar | |
|
Hong, G . (2010a). Ratio of mediator probability weighting for estimating natural direct and indirect effects. In Proceedings of the American Statistical Association, Biometrics Section (pp. 2401–2415). Alexandria, VA: American Statistical Association. Google Scholar | |
|
Hong, G. (2010b). Marginal mean weighting through stratification: Adjustment for selection bias in multilevel data. Journal of Educational and Behavioral Statistics, 35, 499–531. Google Scholar | SAGE Journals | |
|
Hong, G. (2012). Marginal mean weighting through stratification: A generalized method for evaluating multi-valued and multiple treatments with non-experimental data. Psychological Methods, 17, 44–60. Google Scholar | Medline | |
|
Hong, G. (2015). Causality in a social world: Moderation, mediation and spill-over. West Sussex, England: John Wiley & Sons. Google Scholar | |
|
Hong, G., Deutsch, J., Hill, H. D. (2011). Parametric and non-parametric weighting methods for estimating mediation effects: An application to the national evaluation of welfare-to-work strategies. In Proceedings of the American Statistical Association, Social Statistics Section (pp. 3215–3229). Alexandria, VA: American Statistical Association. Google Scholar | |
|
Hong, G., Deutsch, J., Hill, H. D. (2015). Ratio-of-mediator-probability weighting for causal mediation analysis in the presence of treatment-by-mediator interaction. Journal of Educational and Behavioral Statistics, 40, 307–340. Google Scholar | SAGE Journals | |
|
Hong, G., Nomi, T. (2012). Weighting methods for assessing policy effects mediated by peer change. Journal of Research on Educational Effectiveness, 5, 261–289. Google Scholar | |
|
Huang, I.-C., Frangakis, C., Dominici, F., Diette, G. B., Wu, A. W. (2005). Approach for risk adjustment in profiling multiple physician groups on asthma care. Health Services Research, 40, 253–278. Google Scholar | Medline | |
|
Huber, M. (2014). Identifying causal mechanisms (primarily) based on inverse probability weighting. Journal of Applied Econometrics, 29, 920–943. Google Scholar | |
|
Imai, K., Keele, L., Tingley, D. (2010). A general approach to causal mediation analysis. Psychological Methods, 15, 309. Google Scholar | Medline | |
|
Imai, K., Keele, L., Yamamoto, T. (2010). Identification, inference and sensitivity analysis for causal mediation effects. Statistical Science, 25, 51–71. Google Scholar | |
|
Imai, K., Yamamoto, Y. (2013). Identification and sensitivity analysis for multiple causal mechanisms: Revisiting evidence from framing experiments. Political Analysis, 21, 141–171. Google Scholar | |
|
Lange, T., Rasmussen, M., Thygesen, L. (2014). Assessing natural direct and indirect effects through multiple pathways. American Journal of Epidemiology, 179, 513. Google Scholar | Medline | |
|
Lange, T., Vansteelandt, S., Bekaert, M. (2012). A simple unified approach for estimating natural direct and indirect effects. American Journal of Epidemiology, 176, 190–195. Google Scholar | Medline | |
|
Pearl, J. (2001). Direct and indirect effects. In Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (pp. 411–420). Burlington, MA: Morgan Kaufmann Publishers Inc. Google Scholar | |
|
Qin, X., Hong, G. (2014). Causal mediation analysis in multi-site trials: An application of ratio-of-mediator-probability weighting to the Head Start Impact Study. In JSM Proceedings, Social Statistics Section (pp. 912–926). Alexandria, VA: American Statistical Association. Google Scholar | |
|
Qin, X., Hong, G. (2016). Analyzing heterogeneous causal mediation effects in multi-site trials with application to the National Job Corps Study. In JSM Proceedings, Survey Research Methods Section (pp. 910–938). Alexandria, VA: American Statistical Association. Google Scholar | |
|
Qin, X., Hong, G. (2017). A weighting method for assessing between-site heterogeneity in causal mediation mechanism. Journal of Educational and Behavioral Statistics, 42, 308–340. Google Scholar | SAGE Journals | |
|
Ridgeway, G. (2006). Assessing the effect of race bias in post-traffic stop outcomes using propensity scores. Journal of Quantitative Criminology, 22, 1–29. Google Scholar | |
|
Robins, J. M. (2000). Marginal structural models versus structural nested models as tools for causal inference. In Halloran, M. Elizabeth, Berry, Donald (Eds.), Statistical models in epidemiology, the environment, and clinical trials (pp. 95–133). New York, NY: Springer. Google Scholar | |
|
Robins, J. M., Greenland, S. (1992). Identifiability and exchangeability for direct and indirect effects. Epidemiology, 3, 143–155. Google Scholar | Medline | |
|
Rosenbaum, P. R. (1984). The consequence of adjustment for a concomitant variable that has been affected by the treatment. Journal of the Royal Statistical Society. Series A (General), 147, 656–666. Google Scholar | |
|
Rosenbaum, P. R. (1987). Model-based direct adjustment. Journal of the American Statistical Association, 82, 387–394. Google Scholar | |
|
Shadish, W. R., Clark, M. H., Steiner, P. M. (2008). Can nonrandomized experiments yield accurate answers? A randomized experiment comparing random and nonrandom assignments. Journal of the American Statistical Association, 103, 1334–1356. Google Scholar | |
|
Sjölander, A. (2009). Bounds on natural direct effects in the presence of confounded intermediate variables. Statistics in Medicine, 28, 558–571. Google Scholar | Medline | |
|
Tchetgen Tchetgen, E. J. (2013). Inverse odds ratio-weighted estimation for causal mediation analysis. Statistics in Medicine, 32, 4567–4580. Google Scholar | Medline | |
|
Tchetgen Tchetgen, E. J., Shpitser, I. (2012). Semiparametric theory for causal mediation analysis: Efficiency bounds, multiple robustness and sensitivity analysis. The Annals of Statistics, 40, 1816–1845. Google Scholar | Medline | |
|
VanderWeele, T. J. (2009). Marginal structural models for the estimation of direct and indirect effects. Epidemiology, 20, 18–26. Google Scholar | Medline | |
|
VanderWeele, T. J. (2010). Bias formulas for sensitivity analysis for direct and indirect effects. Epidemiology, 21, 540–551. Google Scholar | Medline | |
|
VanderWeele, T. J. (2015). Explanation in causal inference: Methods for mediation and interaction. New York, NY: Oxford University Press. Google Scholar | |
|
VanderWeele, T. J., Chiba, Y. (2014). Sensitivity analysis for direct and indirect effects in the presence of exposure-induced mediator-outcome confounders. Epidemiology, Biostatistics, and Public Health, 11, 1–16. Google Scholar | |
|
Vansteelandt, S., VanderWeele, T. J. (2012). Natural direct and indirect effects on the exposed: Effect decomposition under weaker assumptions. Biometrics, 68, 1019–1027. Google Scholar | Medline |
Authors
GUANGLEI HONG is an associate professor in the Department of Comparative Human Development, Committee on Education, and College at the University of Chicago, 1126 E. 59th St., Chicago, IL 60637; email: [email protected]
XU QIN is a PhD candidate in the Department of Comparative Human Development and the Committee on Education at the University of Chicago, 1126 E. 59th St., Chicago, IL 60637; email: [email protected]
FAN YANG is an assistant professor in the Department of Biostatistics and Informatics at the University of Colorado Denver, 13001 E. 17th Place, Aurora, CO 80045; email: fan.



