Abstract
We propose a penalized variable selection method for the Cox proportional hazards model with interval censored data. It conducts a penalized nonparametric maximum likelihood estimation with an adaptive lasso penalty, which can be implemented through a penalized EM algorithm. The method is proven to enjoy the desirable oracle property. We also extend the method to left truncated and interval censored data. Our simulation studies show that the method possesses the oracle property in samples of modest sizes and outperforms available existing approaches in many of the operating characteristics. An application to a dental caries data set illustrates the method's utility.
References
| 1. | Fisher-Owens, SA, Gansky, SA, Platt, LJ, et al. Influences on children's oral health: a conceptual model. Pediatrics 2007; 120: e510–e520. Google Scholar | Crossref | Medline | ISI |
| 2. | Ismail, A, Sohn, W, Lim, S, et al. Predictors of dental caries progression in primary teeth. J Dental Res 2009; 88: 270–275. Google Scholar | SAGE Journals | ISI |
| 3. | Hastie T, Tibshirani R and Friedman J. The elements of statistical learning: data mining, inference and prediction. 2nd ed. Springer, 2009. Google Scholar |
| 4. | Tibshirani, R . Regression shrinkage and selection via the lasso. J Royal Stat Soc Ser B (Methodological) 1996; 58: 267–288. Google Scholar |
| 5. | Fan, J, Li, R. Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 2001; 96: 1348–1360. Google Scholar | Crossref | ISI |
| 6. | Zou, H . The adaptive lasso and its oracle properties. J Am Stat Assoc 2006; 101: 1418–1429. Google Scholar | Crossref | ISI |
| 7. | Tibshirani, R . The lasso method for variable selection in the Cox model. Stat Med 1997; 16: 385–395. Google Scholar | Crossref | Medline | ISI |
| 8. | Fan, J, Li, R. Variable selection for Cox's proportional hazards model and frailty model. Annals Stat 2002; 30: 74–99. Google Scholar | ISI |
| 9. | Zhang, HH, Lu, W. Adaptive lasso for Cox's proportional hazards model. Biometrika 2007; 94: 691–703. Google Scholar | Crossref | ISI |
| 10. | Vanichseni, S, Kitayaporn, D, Mastro, TD, et al. Continued high HIV-1 incidence in a vaccine trial preparatory cohort of injection drug users in Bangkok, Thailand. AIDS 2001; 15: 397–405. Google Scholar | Crossref | Medline |
| 11. | Zhang, Z, Sun, J. Interval censoring. Stat Meth Med Res 2010; 19: 53–70. Google Scholar | SAGE Journals | ISI |
| 12. | Schick, A, Yu, Q. Consistency of the GMLE with mixed case interval-censored data. Scand J Stat 2000; 27: 45–55. Google Scholar | Crossref |
| 13. | Zhang, Y, Hua, L, Huang, J. A spline-based semiparametric maximum likelihood estimation method for the Cox model with interval-censored data. Scand J Stat 2010; 37: 338–354. Google Scholar | Crossref | ISI |
| 14. | Zeng, D, Mao, L, Lin, DY. Maximum likelihood estimation for semiparametric transformation models with interval-censored data. Biometrika 2016; 103: 253–271. Google Scholar | Crossref | Medline |
| 15. | Wu, Y, Cook, RJ. Penalized regression for interval-censored times of disease progression: selection of HLA markers in psoriatic arthritis. Biometrics 2015; 71: 782–791. Google Scholar | Crossref | Medline |
| 16. | Scolas, S, El Ghouch, A, Legrand, C, et al. Variable selection in a flexible parametric mixture cure model with interval-censored data. Stat Med 2016; 35: 1210–1225. Google Scholar | Crossref | Medline |
| 17. | Zhao H, Wu Q, Li G, et al. Simultaneous estimation and variable selection for interval-censored data with broken adaptive ridge regression. J Am Stat Assoc. Epub ahead of print 22 April 2019. DOI: 10.1080/01621459.2018.1537922. Google Scholar |
| 18. | Alioum, A, Commenges, D. A proportional hazards model for arbitrarily censored and truncated data. Biometrics 1996; 52: 512–524. Google Scholar | Crossref | Medline |
| 19. | Wang, L, McMahan, CS, Hudgens, MG, et al. A flexible, computationally efficient method for fitting the proportional hazards model to interval-censored data. Biometrics 2016; 72: 222–231. Google Scholar | Crossref | Medline |
| 20. | Lu, W, Zhang, HH. Variable selection for proportional odds model. Stat Med 2007; 26: 3771–3781. Google Scholar | Crossref | Medline |
| 21. | Murphy, SA, van der Vaart, AW. Observed information in semi-parametric models. Bernoulli 1999; 5: 381–412. Google Scholar | Crossref |
| 22. | Zhang, Y, Li, R, Tsai, CL. Regularization parameter selections via generalized information criterion. J Am Stat Assoc 2010; 105: 312–323. Google Scholar | Crossref | Medline | ISI |
| 23. | Hui, FKC, Warton, DI, Foster, SD. Tuning parameter selection for the adaptive lasso using ERIC. J Am Stat Assoc 2015; 110: 262–269. Google Scholar | Crossref |
| 24. | Zeng, D, Gao, F, Lin, DY. Maximum likelihood estimation for semiparametric regression models with multivariate interval-censored data. Biometrika 2017; 104: 505–525. Google Scholar | Crossref | Medline |
| 25. | Murphy, SA, van der Vaart, AW. On profile likelihood. J Am Stat Assoc 2000; 95: 449–465. Google Scholar | Crossref | ISI |

