Abstract
Count time series are frequently encountered in biomedical, epidemiological and public health applications. In principle, such series may exhibit three distinctive features: overdispersion, zero-inflation and temporal correlation. Developing a modelling framework that is sufficiently general to accommodate all three of these characteristics poses a challenge. To address this challenge, we propose a flexible class of dynamic models in the state-space framework. Certain models that have been previously introduced in the literature may be viewed as special cases of this model class. For parameter estimation, we devise a Monte Carlo Expectation-Maximization (MCEM) algorithm, where particle filtering and particle smoothing methods are employed to approximate the high-dimensional integrals in the E-step of the algorithm. To illustrate the proposed methodology, we consider an application based on the evaluation of a participatory ergonomics intervention, which is designed to reduce the incidence of workplace injuries among a group of hospital cleaners. The data consists of aggregated monthly counts of work-related injuries that were reported before and after the intervention.
References
| Akaike, H (1974) A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19, 716–23. Google Scholar | Crossref | ISI | |
| Andrieu, C, Doucet, A, Holenstein, R (2010) Particle Markov chain Monte Carlo methods. Journal of the Royal Statistical Society Series B, 72, 269–342. Google Scholar | Crossref | |
| Cameron, AC, Trivedi, PK (2013) Regression analysis of count data. 2nd edn. Cambridge University Press. Google Scholar | Crossref | |
| Chan, KS, Ledolter, J (1995) Monte Carlo EM estimation for time series models involving counts. Journal of the American Statistical Association, 90, 242–52. Google Scholar | Crossref | ISI | |
| Cox, DR (1981) Statistical analysis of time series: Some recent developments. Scandinavian Journal of Statistics, 8, 93–115. Google Scholar | ISI | |
| Dalrymple, ML, Hudson, IL, Ford, RPK (2003) Finite mixture, zero-inflated Poisson and hurdle models with application to SIDS. Computational Statistics & Data Analysis, 41, 491–504. Google Scholar | Crossref | ISI | |
| Davis, RA, Dunsmuir, WTM, Streett, SB (2003) Observation-driven models for Poisson counts. Biometrika, 90, 777–90. Google Scholar | Crossref | ISI | |
| Davis, RA, Wu, R (2009) A negative binomial model for time series of counts. Biometrika, 96, 735–49. Google Scholar | Crossref | ISI | |
| Dempster, AP, Laird, NM, Rubin, DB (1977) Maximum likelihood estimation from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B, 39, 1–39. Google Scholar | |
| Doucet, A, Freitas, ND, Gordon, N (2001) Sequential Monte Carlo methods in practice. New York: Springer. Google Scholar | Crossref | |
| Fokianos, K (2011) Some recent progress in count time series. Statistics, 45, 49–58. Google Scholar | Crossref | ISI | |
| Fokianos, K, Rahbek, A, Tjøstheim, D (2009) Poisson autoregression. Journal of the American Statistical Association, 104, 1430–9. Google Scholar | Crossref | ISI | |
| Freeland, RK, McCabe, BPM (2004) Analysis of low count time series data by Poisson autoregression. Journal of Time Series Analysis, 25, 701–22. Google Scholar | Crossref | ISI | |
| Godsill, SJ, Doucet, A, West, M (2004) Monte Carlo smoothing for nonlinear time series. Journal of the American Statistical Association, 99, 156–68. Google Scholar | Crossref | ISI | |
| Gordon, NJ, Salmond, DJ, Smith, AFM (1993) Novel approach to nonlinear/non-Gaussian Bayesian state estimation. IEE Proceedings F, Radar and Signal Processing, 140, 107–13. Google Scholar | Crossref | |
| Jazi, MA, Jones, G, Lai, CD (2012) First-order integer valued AR processes with zero inflated Poisson innovations. Journal of Time Series Analysis, 33, 954–63. Google Scholar | Crossref | ISI | |
| Kedem, B, Fokianos, K (2002) Regression models for time series analysis. New Jersey: Wiley. Google Scholar | Crossref | |
| Kim, J, Stoffer, DS (2008) Fitting stochastic volatility models in the presence of irregular sampling via particle methods and the EM algorithm. Journal of Time Series Analysis, 29, 811–33. Google Scholar | Crossref | ISI | |
| Lambert, D (1992) Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics, 34, 1–14. Google Scholar | Crossref | ISI | |
| Lawless, JF (1987) Negative binomial and mixed Poisson regression. The Canadian Journal of Statistics, 15, 209–25. Google Scholar | Crossref | ISI | |
| Lee, AH, Wang, K, Yau, KKW, Carrivick, PJW, Stevenson, MR (2005) Modelling bivariate count series with excess zeros. Mathematical Biosciences, 196, 226–37. Google Scholar | Crossref | Medline | ISI | |
| Levine, RA, Casella, G (2001) Implementations of the Monte Carlo EM algorithm. Journal of Computational and Graphical Statistics, 10, 422–39. Google Scholar | Crossref | ISI | |
| Louis, TA (1982) Finding the observed information matrix when using the EM algorithm. Journal of the Royal Statistical Society Series B, 44, 226–33. Google Scholar | |
| Nelson, KP, Leroux, BG (2006) Statistical models for autocorrelated count data. Statistics in Medicine, 25, 1413–30. Google Scholar | Crossref | Medline | ISI | |
| Oh, MS, Lim, YB (2001) Bayesian analysis of time series Poisson data. Journal of Applied Statistics, 28, 259–71. Google Scholar | Crossref | ISI | |
| Rue, H, Martino, S, Chopin, N (2009) Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations (with discussion). Journal of the Royal Statistical Society Series B, 71, 319–92. Google Scholar | Crossref | |
| Shumway, RH, Stoffer, DS (1982) An approach to time series smoothing and forecasting using the EM algorithm. Journal of Time Series Analysis, 3, 253–64. Google Scholar | Crossref | |
| Wang, P (2001) Markov zero-inflated Poisson regression models for a time series of counts with excess zeros. Journal of Applied Statistics, 28, 623–32. Google Scholar | Crossref | ISI | |
| Yang, M, Zamba, GKD, Cavanaugh, JE (2013) Markov regression models for count time series with excess zeros: A partial likelihood approach. Statistical Methodology, 14, 26–38. Google Scholar | Crossref | |
| Yau, KKW, Lee, AH, Carrivick, PJW (2004) Modeling zero-inflated count series with application to occupational health. Computer Methods and Programs in Biomedicine, 74, 47–52. Google Scholar | Crossref | Medline | ISI | |
| Yau, KKW, Wang, K, Lee, AH (2003) Zero-inflated negative binomial mixed regression modeling of over-dispersed count data with extra zeros. Biometrical Journal, 45, 437–52. Google Scholar | Crossref | ISI | |
| Zeger, SL (1988) A regression model for time series of counts. Biometrika, 75, 621–9. Google Scholar | Crossref | ISI | |
| Zeger, SL, Qaqish, B (1988) Markov regression models for time series: A quasi-likelihood approach. Biometrics, 44, 1019–31. Google Scholar | Crossref | Medline | ISI | |
| Zhu, F (2010) A negative binomial integer-valued GARCH model. Journal of Time Series Analysis, 32, 54–67. Google Scholar | Crossref | ISI | |
| Zhu, F (2012) Zero-inflated Poisson and negative binomial integer-valued GARCH models. Journal of Statistical Planning and Inference, 142, 826–39. Google Scholar | Crossref | ISI |
