Abstract
We describe generalized survival models, where g(S(t|z)), for link function g, survival S, time t, and covariates z, is modeled by a linear predictor in terms of covariate effects and smooth time effects. These models include proportional hazards and proportional odds models, and extend the parametric Royston–Parmar models. Estimation is described for both fully parametric linear predictors and combinations of penalized smoothers and parametric effects. The penalized smoothing parameters can be selected automatically using several information criteria. The link function may be selected based on prior assumptions or using an information criterion. We have implemented the models in R. All of the penalized smoothers from the mgcv package are available for smooth time effects and smooth covariate effects. The generalized survival models perform well in a simulation study, compared with some existing models. The estimation of smooth covariate effects and smooth time-dependent hazard or odds ratios is simplified, compared with many non-parametric models. Applying these models to three cancer survival datasets, we find that the proportional odds model is better than the proportional hazards model for two of the datasets.
References
| 1. | Cox, DR . Regression models and life-tables. J R Stat Soc 1972; 34: 187–220. Google Scholar |
| 2. | Zeng, D, Lin, D. Maximum likelihood estimation in semiparametric regression models with censored data. J R Stat Soc B Stat Methodol 2007; 69: 507–564. Google Scholar | Crossref |
| 3. | Bottai, M, Zhang, J. Laplace regression with censored data. Biometrical J 2010; 52: 487–503. Google Scholar | Crossref | Medline | ISI |
| 4. | Younes, N, Lachin, J. Link-based models for survival data with interval and continuous time censoring. Biometrics 1997; 53: 1199–1211. Google Scholar | Crossref | ISI |
| 5. | Royston, P, Parmar, MK. Flexible parametric proportional-hazards and proportional-odds models for censored survival data, with application to prognostic modelling and estimation of treatment effects. Stat Med 2002; 21: 2175–2197. Google Scholar | Crossref | Medline | ISI |
| 6. | Royston, P, Sauerbrei, W. A new measure of prognostic separation in survival data. Stat Med 2004; 23(5): 723–748. Google Scholar | Crossref | Medline |
| 7. | Royston, P, Lambert, PC. Flexible parametric survival analysis using Stata: beyond the Cox model, College Station, TX: Stata Press, 2011. Google Scholar |
| 8. | Lagakos, S . Effects of mismodelling and mismeasuring explanatory variables on tests of their association with a response variable. Stat Med 1988; 7: 257–274. Google Scholar | Crossref | Medline |
| 9. | Groenwold, RH, Klungel, OH, Altman, DG, et al. Adjustment for continuous confounders: an example of how to prevent residual confounding. CMAJ 2013; 185: 401–406. Google Scholar | Crossref | Medline |
| 10. | Sauerbrei, W, Abrahamowicz, M, Altman, DG, et al. Strengthening analytical thinking for observational studies: the STRATOS initiative. Stat Med 2014; 33(30): 5413–5432. Google Scholar | Crossref | Medline |
| 11. | Leffondré, K, Jager, KJ, Boucquemont, J, et al. Representation of exposures in regression analysis and interpretation of regression coefficients: basic concepts and pitfalls. Nephrol Dial Transplant 2014; 29(10): 1806–1814. Google Scholar | Crossref | Medline |
| 12. | Reid, N . A conversation with Sir David Cox. Stat Sci 1994; 9(3): 439–455. Google Scholar | Crossref |
| 13. | Nelder, JA, Wedderburn, RWM. Generalized linear models. J R Stat Soc Ser A 1972; 135: 370–384. Google Scholar | Crossref | ISI |
| 14. | Hastie, T, Tibshirani, R. Generalized additive models. Stat Sci 1986; 1: 297–310. Google Scholar | Crossref |
| 15. | Breslow, NE, Clayton, DG. Approximate inference in generalized linear mixed models. J Am Stat Assoc 1993; 88(421): 9–25. Google Scholar | ISI |
| 16. | Bennett, S . Analysis of survival data by the proportional odds model. Stat Med 1983; 2: 273–277. Google Scholar | Crossref | Medline |
| 17. | Nocedal, J, Wright, SJ. Numerical optimization, New York, NY: Springer, 2006. Google Scholar | Crossref |
| 18. | Wood, S . Generalized additive models: an introduction with R, Boca Raton: CRC Press, 2006. Google Scholar | Crossref |
| 19. | Verweij, PJ, van Houwelingen, HC. Penalized likelihood in Cox regression. Stat Med 1994; 13: 2427–2436. Google Scholar | Crossref | Medline | ISI |
| 20. | Joly, P, Commenges, D, Letenneur, L. A penalized likelihood approach for arbitrarily censored and truncated data: application to age-specific incidence of dementia. Biometrics 1998; 54: 185–194. Google Scholar | Crossref | Medline | ISI |
| 21. | Rondeau, V, Mazroui, Y, Gonzalez, JR. frailtypack: an R package for the analysis of correlated survival data with frailty models using penalized likelihood estimation or parametrical estimation. J Stat Software 2012; 47: 1–28. Google Scholar | Crossref | ISI |
| 22. | Volinsky, CT, Raftery, AE. Bayesian information criterion for censored survival models. Biometrics 2000; 56: 256–262. Google Scholar | Crossref | Medline | ISI |
| 23. | Gray, RJ . Flexible methods for analyzing survival data using splines, with applications to breast cancer prognosis. J Am Stat Assoc 1992; 87: 942–951. Google Scholar | Crossref | ISI |
| 24. | Therneau, TM, Grambsch, PM. Modeling survival data: extending the Cox model, New York, NY: Springer, 2000. Google Scholar | Crossref |
| 25. | Gray, RJ . Spline-based tests in survival analysis. Biometrics 1994; 50: 640–652. Google Scholar | Crossref | Medline |
| 26. | Pregibon, D . Goodness of link tests for generalized linear models. J R Stat Soc Ser C Appl Stat 1980; 29(1): 15–23. 14. Google Scholar | Crossref |
| 27. | Collett, D . Modelling survival data in medical research, Boca Raton, FL: CRC Press, 2015. Google Scholar |
| 28. | Crowley, J, Storer, BE. A reanalysis of the Stanford heart transplant data: comment. J Am Stat Assoc 1983; 78(382): 277–281. Google Scholar |
| 29. | R Core Team . R: a language and environment for statistical computing, Vienna, Austria: R Foundation for Statistical Computing, 2016. http://www.R-project.org/. Google Scholar |
| 30. | Crowther, MJ, Lambert, PC. Simulating biologically plausible complex survival data. Stat Med 2013; 32: 4118–4134. Google Scholar | Crossref | Medline | ISI |
| 31. | Gu, C . Model diagnostics for smoothing spline ANOVA models. Can J Stat 2004; 32: 347–358. Google Scholar | Crossref |
| 32. | Kneib, T, Fahrmeir, L. A mixed model approach for geoadditive hazard regression. Scand J Stat 2007; 34: 207–228. Google Scholar | Crossref | ISI |
| 33. | Cai, T, Betensky, RA. Hazard regression for interval-censored data with penalized spline. Biometrics 2003; 59: 570–579. Google Scholar | Crossref | Medline | ISI |
| 34. | Gu, C . Smoothing spline ANOVA models, New York, NY: Springer Science and Business Media, 2013. Google Scholar | Crossref |
| 35. | Sauerbrei, W, Royston, P. Building multivariable prognostic and diagnostic models: transformation of the predictors by using fractional polynomials. J R Stat Soc Ser A Stat Soc 1999; 162: 71–94. Google Scholar | Crossref | ISI |
| 36. | Lambert, PC, Royston, P. Further development of flexible parametric models for survival analysis. Stata J 2009; 9: 265. Google Scholar | SAGE Journals | ISI |
| 37. | Prentice, RL . Exponential survivals with censoring and explanatory variables. Biometrika 1973; 60: 279–288. Google Scholar | Crossref | ISI |
| 38. | Murphy, S, Rossini, A, van der Vaart, AW. Maximum likelihood estimation in the proportional odds model. J Am Stat Assoc 1997; 92(439): 968–976. Google Scholar | Crossref |
| 39. | Hanson, T, Yang, M. Bayesian semiparametric proportional odds models. Biometrics 2007; 63(1): 88–95. Google Scholar | Crossref | Medline |
| 40. | Hougaard, P . Fundamentals of survival data. Biometrics 1999; 55(1): 13–22. Google Scholar | Crossref | Medline | ISI |
| 41. | Lin, D, Spiekerman, C. Model checking techniques for parametric regression with censored data. Scand J Stat 1996; 23: 157–177. Google Scholar |
| 42. | Wood, SN . Fast stable direct fitting and smoothness selection for generalized additive models. J R Stat Soc Series B Stat Methodol 2008; 70: 495–518. Google Scholar | Crossref | ISI |
