Correlation does not imply causation; but often, observational data are the only option, even though the research question at hand involves causality. This article discusses causal inference based on observational data, introducing readers to graphical causal models that can provide a powerful tool for thinking more clearly about the interrelations between variables. Topics covered include the rationale behind the statistical control of third variables, common procedures for statistical control, and what can go wrong during their implementation. Certain types of third variables—colliders and mediators—should not be controlled for because that can actually move the estimate of an association away from the value of the causal effect of interest. More subtle variations of such harmful control include using unrepresentative samples, which can undermine the validity of causal conclusions, and statistically controlling for mediators. Drawing valid causal inferences on the basis of observational data is not a mechanistic procedure but rather always depends on assumptions that require domain knowledge and that can be more or less plausible. However, this caveat holds not only for research based on observational data, but for all empirical research endeavors.

Achen, C. H. (2005). Let’s put garbage-can regressions and garbage-can probits where they belong. Conflict Management and Peace Science, 22, 327339.
Google Scholar | SAGE Journals | ISI
Angrist, J. D., Pischke, J. S. (2010). The credibility revolution in empirical economics: How better research design is taking the con out of econometrics. The Journal of Economic Perspectives, 24(2), 330.
Google Scholar | Crossref | ISI
Asendorpf, J. B. (2012). Bias due to controlling a collider: A potentially important issue for personality research. European Journal of Personality, 26, 391392.
Google Scholar
Baron, R. M., Kenny, D. A. (1986). The moderator–mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. Journal of Personality and Social Psychology, 51, 11731182.
Google Scholar | Crossref | ISI
Borsboom, D., Mellenbergh, G. J., van Heerden, J. (2003). The theoretical status of latent variables. Psychological Review, 110, 203219.
Google Scholar | Crossref | ISI
Borsboom, D., van der Sluis, S., Noordhof, A., Wichers, M., Geschwind, N., Aggen, S. H., . . . Cramer, A. O. J. (2012). What kind of causal modelling approach does personality research need? European Journal of Personality, 26, 392393.
Google Scholar
Bullock, J. G., Green, D. P., Ha, S. E. (2010). Yes, but what’s the mechanism? (don’t expect an easy answer). Journal of Personality and Social Psychology, 98, 550558.
Google Scholar | Crossref | ISI
Cartwright, N. (2007). Are RCTs the gold standard? BioSocieties, 2, 1120.
Google Scholar | Crossref
Cook, T. D., Campbell, D. T. (1979). Quasi-experimentation: Design & analysis issues for field settings. Boston, MA: Houghton Mifflin.
Google Scholar
Ding, P., Miratrix, L. W. (2015). To adjust or not to adjust? Sensitivity analysis of M-Bias and butterfly-bias. Journal of Causal Inference, 3, 4157.
Google Scholar
Duncan, G. J., Engel, M., Claessens, A., Dowsett, C. J. (2014). Replication and robustness in developmental research. Developmental Psychology, 50, 24172425.
Google Scholar | Crossref | ISI
Dunning, T. (2012). Natural experiments in the social sciences: A design-based approach. Cambridge, England: Cambridge University Press.
Google Scholar | Crossref
Elwert, F. (2013). Graphical causal models. In Morgan, S. L. (Ed.), Handbook of causal analysis for social research (pp. 245273). Dordrecht, The Netherlands: Springer.
Google Scholar | Crossref
Elwert, F., Winship, C. (2014). Endogenous selection bias: The problem of conditioning on a collider variable. Annual Review of Sociology, 40, 3153.
Google Scholar | Crossref | ISI
Greenland, S. (2003). Quantifying biases in causal models: Classical confounding vs. collider-stratification bias. Epidemiology, 14, 300306.
Google Scholar | Crossref | ISI
Hayduk, L., Cummings, G., Stratkotter, R., Nimmo, M., Grygoryev, K., Dosman, D., . . . Boadu, K. (2003). Pearl’s d-separation: One more step into causal thinking. Structural Equation Modeling, 10, 289311.
Google Scholar | Crossref | ISI
Hayes, A. F. (2009). Beyond Baron and Kenny: Statistical me-diation analysis in the new millennium. Communication Monographs, 76, 408420.
Google Scholar | Crossref | ISI
Jackson, J. J., Thoemmes, F., Jonkmann, K., Lüdtke, O., Trautwein, U. (2012). Military training and personality trait development: Does the military make the man, or does the man make the military? Psychological Science, 23, 270277.
Google Scholar | SAGE Journals | ISI
King, G., Nielsen, R. (2016). Why propensity scores should not be used for matching. Retrieved from https://gking.harvard.edu/publications/why-propensity-scores-should-not-be-used-formatching
Google Scholar
Lee, J. J. (2012). Correlation and causation in the study of personality. European Journal of Personality, 26, 372390.
Google Scholar | Crossref | ISI
Mohan, K., Pearl, J., Tian, J. (2013). Graphical models for inference with missing data. In Burges, C. J. C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K. Q. (Eds.), Advances in Neural Information Processing Systems 26 (NIPS 2013) (pp. 12771285). Red Hook, NY: Curran Associates.
Google Scholar
Morgan, S. L., Winship, C. (2015). Counterfactuals and causal inference: Methods and principles for social research (2nd ed.). New York, NY: Cambridge University Press.
Google Scholar
Mukherjee, S. (2010). The emperor of all maladies: A biography of cancer. New York, NY: Simon & Schuster.
Google Scholar
Munafò, M. R., Tilling, K., Taylor, A. E., Evans, D. M., Smith, G. D. (2017). Collider scope: When selection bias can substantially influence observed associations. bioRxiv. doi:10.1101/079707
Google Scholar | Crossref
Pearl, J. (1993). Graphical models, causality and intervention. Statistical Science, 8, 266269.
Google Scholar | Crossref
Pearl, J. (1995). Causal diagrams for empirical research. Biometrika, 82, 669688.
Google Scholar | Crossref | ISI
Pearl, J., Glymour, M., Jewell, N. P. (2016). Causal inference in statistics: A primer. Chichester, England: John Wiley & Sons.
Google Scholar
Piff, P. K., Kraus, M. W., Côté, S., Cheng, B. H., Keltner, D. (2010). Having less, giving more: The influence of social class on prosocial behavior. Journal of Personality and Social Psychology, 99, 771784.
Google Scholar | Crossref | ISI
Robins, J. M., Wasserman, L. (1999). On the impossibility of inferring causation from association without background knowledge. In Cooper, G. F., Glymour, C. (Eds.), Computation, Causation, & Discovery (pp. 305321). Cambridge, MA: MIT Press.
Google Scholar
Rohrer, J. (2017a, March 14). That one weird third variable problem nobody ever mentions: Conditioning on a collider [Web log post]. Retrieved from http://www.the100.ci/2017/03/14/that-one-weird-third-variable-problem-nobody-ever-mentions-conditioning-on-a-collider/
Google Scholar
Rohrer, J. (2017b, April 21). What’s an age-effect net of all time-varying covariates? [Web log post]. Retrieved from http://www.the100.ci/2017/04/21/whats-an-age-effect-net-of-all-time-varying-covariates/
Google Scholar
Rosenbaum, P. R. (1984). The consequences of adjustment for a concomitant variable that has been affected by the treatment. Journal of the Royal Statistical Society: Series A, 147, 656666.
Google Scholar | Crossref | ISI
Rozin, P. (2001). Social psychology and science: Some lessons from Solomon Asch. Personality and Social Psychology Review, 5, 214.
Google Scholar | SAGE Journals | ISI
Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66, 688701.
Google Scholar | Crossref | ISI
Schafer, J. L., Graham, J. W. (2002). Missing data: Our view of the state of the art. Psychological Methods, 7, 147177.
Google Scholar | Crossref | ISI
Silver, N. (2012). The signal and the noise: The art and science of prediction. New York, NY: Penguin Press.
Google Scholar
Sjölander, A. (2009). Propensity scores and M-structures. Statistics in Medicine, 28, 14161420.
Google Scholar | Crossref | ISI
Smith, G. C., Pell, J. P. (2003). Parachute use to prevent death and major trauma related to gravitational challenge: Systematic review of randomised controlled trials. British Medical Journal, 327, Article 1459. doi:10.1136/bmj.327.7429.1459
Google Scholar | Crossref
Spector, P. E., Brannick, M. T. (2011). Methodological urban legends: The misuse of statistical control variables. Organizational Research Methods, 14, 287305.
Google Scholar | SAGE Journals | ISI
Spirtes, P., Glymour, C. N., Scheines, R. (2000). Causation, prediction, and search. Cambridge, MA: MIT Press.
Google Scholar
Steegen, S., Tuerlinckx, F., Gelman, A., Vanpaemel, W. (2016). Increasing transparency through a multiverse analysis. Perspectives on Psychological Science, 11, 702712.
Google Scholar | SAGE Journals | ISI
Thoemmes, F. (2015). Reversing arrows in mediation models does not distinguish plausible models. Basic and Applied Social Psychology, 37, 226234.
Google Scholar | Crossref | ISI
Thoemmes, F., Mohan, K. (2015). Graphical representation of missing data problems. Structural Equation Modeling: A Multidisciplinary Journal, 22, 631642.
Google Scholar | Crossref
Tiefenbach, T., Kohlbacher, F. (2015). Individual differences in the relationship between domain satisfaction and happiness: The moderating role of domain importance. Personality and Individual Differences, 86, 8287.
Google Scholar | Crossref
Turkheimer, E. (2000). Three laws of behavior genetics and what they mean. Current Directions in Psychological Science, 9, 160164.
Google Scholar | SAGE Journals | ISI
Turkheimer, E., Harden, K. P. (2014). Behavior genetic research methods: Testing quasi-causal hypotheses using multivariate twin data. In Reis, H. T., Judd, C. M. (Eds.), Handbook of research methods in social and personality psychology (pp. 159187). New York, NY: Cambridge University Press.
Google Scholar
Voelkle, M. C., Oud, J. H., Davidov, E., Schmidt, P. (2012). An SEM approach to continuous time modeling of panel data: Relating authoritarianism and anomia. Psychological Methods, 17, 176192.
Google Scholar | Crossref | ISI
Westfall, J., Yarkoni, T. (2016). Statistically controlling for confounding constructs is harder than you think. PLOS ONE, 11(3), Article e0152719. doi:10.1371/journal.pone.0152719
Google Scholar | Crossref
Access Options

My Account

Welcome
You do not have access to this content.



Chinese Institutions / 中国用户

Click the button below for the full-text content

请点击以下获取该全文

Institutional Access

does not have access to this content.

Purchase Content

24 hours online access to download content

Research off-campus without worrying about access issues. Find out about Lean Library here

Your Access Options


Purchase

AMP-article-ppv for $37.50

Cookies Notification

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Find out more.
Top