Abstract
Conway–Maxwell–Poisson (CMP) distributions are flexible generalizations of the Poisson distribution for modelling overdispersed or underdispersed counts. The main hindrance to their wider use in practice seems to be the inability to directly model the mean of counts, making them not compatible with nor comparable to competing count regression models, such as the log-linear Poisson, negative-binomial or generalized Poisson regression models. This note illustrates how CMP distributions can be parametrized via the mean, so that simpler and more easily interpretable mean-models can be used, such as a log-linear model. Other link functions are also available, of course. In addition to establishing attractive theoretical and asymptotic properties of the proposed model, its good finite-sample performance is exhibited through various examples and a simulation study based on real datasets. Moreover, the MATLAB routine to fit the model to data is demonstrated to be up to an order of magnitude faster than the current software to fit standard CMP models, and over two orders of magnitude faster than the recently proposed hyper-Poisson model.
References
| Cameron, AC, Johansson, P (1997) Count data regression using series expansions: With applications. Journal of Applied Econometrics, 12, 203–23. Google Scholar | Crossref | ISI | |
| Consul, PC, Famoye, F (1992) Generalized poisson regression model. Communications in Statistics—Theory and Methods, 21, 89–109. Google Scholar | Crossref | ISI | |
| Conway, RW, Maxwell, WL (1962) A queuing model with state dependent service rates. Journal of Industrial Engineering, 12, 132–36. Google Scholar | |
| Croissant, Y (2011) Ecdat: Datasets for econometrics, R Package, version 0.1–6.1. Google Scholar | |
| Fahrmeir, L, Kaufman, H (1985) Consistency and asymptotic normality of the maximum likelihood estimator in generalized linear models. Annals of Statistics, 13, 342–68. Google Scholar | Crossref | ISI | |
| Famoye, F (1993) Restricted generalized poisson regression model. Communications in Statistics—Theory and Methods, 22, 1335–54. Google Scholar | Crossref | ISI | |
| Guikema, SD, Coffelt, JP (2008) A flexible count data regression model for risk analysis. Risk Analysis, 28, 213–23. Google Scholar | Crossref | Medline | ISI | |
| Huang, A, Rathouz, PJ (2017) Orthogonality of the mean and error distribution in generalized linear models. Communications in Statistics—Theory and Methods, 46, 3290–96. Google Scholar | Crossref | Medline | ISI | |
| Lord, D, Guikema, SD, Geedipally, SR (2008) Application of the Conway–Maxwell–Poisson generalized linear model for analyzing motor vehicle crashes. Accident Analysis and Prevention, 40, 1123–34. Google Scholar | Crossref | Medline | ISI | |
| Lord, D, Guikema, SD, Geedipally, SR (2010) Extension of the application of Conway–Maxwell–Poisson models: Analyzing traffic crash data exhibiting underdispersion. Risk Analysis, 30, 1268–76. Google Scholar | Crossref | Medline | ISI | |
| McCullagh, P, Nelder, JA (1989) Generalized linear models. Boca Raton: Chapman and Hall. Google Scholar | Crossref | |
| Sáez-Castillo, AJ, Conde-Sánchez, A (2013) A hyper-Poisson regression model for overdispersed and underdispersed count data. Computational Statistics & Data Analysis, 61, 148–57. Google Scholar | Crossref | ISI | |
| Sellers, KF, Lotze, T (2015) COM Poisson Reg: Conway–Maxwell–Poisson regression models, R Package, version 0.3.5. Google Scholar | |
| Sellers, KF, Shmueli, G (2010) A flexible regression model for count data. Annals of Applied Statistics, 4, 943–61. Google Scholar | Crossref | ISI | |
| Sellers, KF, Borle, S, Shmueli, G (2012) The COM-Poisson model for count data: A survey of methods and applications. Applied Stochastic Models in Business, 28, 104–16. Google Scholar | Crossref | ISI | |
| Shmueli, G, Minka, TP, Kadane, JB, Borle, S, Boatwright, P (2005) A useful distribution for fitting discrete data: Revival of the Conway–Maxwell–Poisson distribution. Journal of the Royal Statistical Society, Series C, (Applied Statistics), 54, 127–42. Google Scholar | Crossref | ISI | |
| Smith, JQ (1985) Diagnostic checks of non- standard time series models. Journal of Forecasting, 4, 283–91. Google Scholar | Crossref | ISI | |
| Smyth, GK (1989) Generalized linear models with varying dispersion. Journal of the Royal Statistical Society. Series B, 51, 47–60. Google Scholar | |
| Ridout, MS, Besbeas, P (2004) An empirical model for underdispersed count data. Statistical Modelling, 4, 77–89. Google Scholar | SAGE Journals | ISI | |
| Zeviani, WM, Riberio, PJ, Bonat, WH, Shimakura, SE, Muniz, JA (2014) The Gamma-count distribution in the analysis of experimental underdispersed data. Journal of Applied Statistics, 41, 2616–26. Google Scholar | Crossref | ISI |
