Abstract
This article proposes an extension of dominance analysis that allows researchers to determine the relative importance of predictors in logistic regression models. Criteria for choosing logistic regression R2 analogues were determined and measures were selected that can be used to perform dominance analysis in logistic regression. A simulation study, using both simple random sampling from a known population and bootstrap sampling from a single (parent) random sample, was performed to evaluate the bias, sampling distribution, and confidence intervals of quantitative dominance measures as well as the reproducibility of qualitative dominance measures. Results indicated that the bootstrap procedure is feasible and can be used in applied research to generalize logistic regression dominance analysis results to the population of interest. The procedures for determining and interpreting the general dominance of predictors in a logistic regression context are illustrated with an empirical example.
References
|
Agresti, A. (1996). An introduction to categorical data analysis. New York: Wiley. Google Scholar | |
|
Amemiya, T. (1981). Qualitative response models: A survey. Journal of Economic Literature, 19, 1483–1536. Google Scholar | |
|
Azen, R., Budescu, D. V. (2003). The dominance analysis approach for comparing predictors in multiple regression. Psychological Methods, 8, 129–148. Google Scholar | Crossref | Medline | |
|
Azen, R., Sass, D. A. (2008). Comparing the squared multiple correlation coefficients of non-nested models: An examination of confidence intervals and hypothesis testing. British Journal of Mathematical and Statistical Psychology, 61, 163–178. Google Scholar | Crossref | Medline | |
|
Bezilla, R. , (1994). Teenage attitudes and behavior concerning tobacco, June—July 1992: United States. Computer file. Princeton, NJ: The George H. Gallup International Institute [producer], 1992. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 1994 Google Scholar | |
|
Bring, J. (1994). How to standardize regression coefficients. The American Statistician, 48, 209–213. Google Scholar | |
|
Budescu, D. V. (1993). Dominance analysis: A new approach to the problem of relative importance of predictors in multiple regression. Psychological Bulletin, 114, 542–551. Google Scholar | Crossref | |
|
Chevan, A., Sutherland, M. (1991). Hierarchical partitioning The American Statistician, 45, 90–96. Google Scholar | Crossref | |
|
Cox, L. A. (1985). A new measure of attributable risk for public health applications. Management Science, 31, 800–813. Google Scholar | Crossref | |
|
Estrella, A. (1998). A new measure of fit for equations with dichotomous dependent variables. Journal of Business and Economic Statistics, 16, 198–205. Google Scholar | |
|
Grömping, U. (2007). Estimators of relative importance in linear regression based on variance decomposition. The American Statistician, 61, 139–146. Google Scholar | Crossref | |
|
Johnson, J. W., LeBreton, J. M. (2004). History and use of relative importance indices in organizational research. Organizational Research Methods, 7, 238–257. Google Scholar | SAGE Journals | |
|
Kvålseth, T. O. (1985). Cautionary notes about R2. The American Statistician, 39, 279–285. Google Scholar | |
|
Liao, J. G., McGee, D. (2003). Adjusted coefficients of determination for logistic regression. The American Statistician, 57, 161–165. Google Scholar | Crossref | |
|
McFadden, D. (1974). The measurement of urban travel demand. Journal of Public Economics, 3, 303–328. Google Scholar | Crossref | |
|
Menard, S. (2000). Coefficients of determination for multiple logistic regression analysis. The American Statistician, 54, 17–24. Google Scholar | |
|
Menard, S. (2004a). Six approaches to calculating standardized logistic regression coefficients. The American Statistician, 58, 218–223. Google Scholar | Crossref | |
|
Menard, S. (2004b). Correction to “Six approaches to calculating standardized logistic regression coefficients.” The American Statistician, 58, 364. Google Scholar | Crossref | |
|
Mittlbock, M., Schemper, M. (1996). Explained variation for logistic regression. Statistics in Medicine, 15, 1987–1997. Google Scholar | Crossref | Medline | |
|
Nagelkerke, N. J. D. (1991). A note on a general definition of the coefficient of determination. Biometrika, 78, 691–692. Google Scholar | Crossref | |
|
Pedhazur, E. J. (1997). Multiple regression in behavioral research (3rd ed.). Forth Worth, TX: Harcourt. Google Scholar | |
|
Schumacker, R. E., Lomax, R. G.(1996). A beginner’s guide to structural equation modeling. Mahwah, NJ: Lawrence Erlbaum Associates. Google Scholar | |
|
Soofi, E. S. (1994). Capturing the intangible concept of information. Journal of the American Statistical Association, 89, 1243–1254. Google Scholar | Crossref | |
|
Soofi, E. S., Retzer, J. J., Yasai-Ardekani, M. (2000). A framework for measuring the importance of variables with applications to management research and decision models. Decision Sciences, 31, 595–625. Google Scholar | Crossref | |
|
Stine, R. (1989). An introduction to bootstrap methods. Sociological Methods and Research, 18, 243–291. Google Scholar | SAGE Journals | |
|
Van den Burg, W., Lewis, C. (1988). Some properties of two measures of multivariate association. Psychometrika, 53, 109–122. Google Scholar | Crossref | |
|
Zheng, B., Agresti, A. (2000). Summarizing the predictive power of a generalized linear model. Statistics in Medicine, 19, 1771–1781. Google Scholar | Crossref | Medline |
