Abstract
Correlation does not imply causation, but observational data are often the only option even when the research question at hand is causal. This article discusses causal inference based on observational data and introduces readers to graphical causal models, a powerful tool for thinking more clearly about the interrelations between variables. Topics covered include the rationale behind the statistical control of third variables, common procedures for statistical control, and what can go wrong during their implementation. Certain types of third variables (colliders and mediators) should not be controlled for, because doing so can move the estimate of an association away from the value of the causal effect of interest. More subtle variations of such harmful control include using unrepresentative samples, which can undermine the validity of causal conclusions, and statistically controlling for mediators. Drawing valid causal inferences on the basis of observational data is not a mechanistic procedure; it always depends on assumptions that require domain knowledge and that can be more or less plausible. This caveat, however, holds not only for research based on observational data but for all empirical research endeavors.
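The claim that conditioning on a collider can move an association away from the causal effect can be illustrated with a small simulation. The sketch below is not taken from the article; the data-generating process, the cutoff of 1.0, and the names `pearson` and `simulate_collider` are assumptions chosen purely for illustration. Two variables are generated independently, so the causal effect of one on the other is zero by construction, and a third variable is caused by both. Restricting the sample to cases with a high value on that third variable (a form of unrepresentative sampling) induces an association between the first two.

```python
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def simulate_collider(n=20000, seed=1):
    """Return (r in full sample, r in sample selected on the collider)."""
    rng = random.Random(seed)
    # Assumption: x and y are drawn independently, so x has no causal effect on y.
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [rng.gauss(0, 1) for _ in range(n)]
    # c is a collider: it is caused by both x and y, plus independent noise.
    c = [xi + yi + rng.gauss(0, 1) for xi, yi in zip(x, y)]

    r_full = pearson(x, y)

    # Conditioning on the collider through selection: keep only cases with c > 1.0.
    selected = [(xi, yi) for xi, yi, ci in zip(x, y, c) if ci > 1.0]
    r_selected = pearson([s[0] for s in selected], [s[1] for s in selected])
    return r_full, r_selected


if __name__ == "__main__":
    r_full, r_selected = simulate_collider()
    print(f"full sample:     r = {r_full:.3f}")
    print(f"selected sample: r = {r_selected:.3f}")
```

Under these assumptions, the correlation in the full sample should be close to zero, whereas the correlation in the selected sample should be clearly negative, even though neither variable affects the other.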

