Abstract
Dependent censoring arises in biomedical studies when the survival outcome of interest is censored by competing risks. In survival data with microarray gene expressions, gene selection based on the univariate Cox regression analyses has been used extensively in medical research, which however, is only valid under the independent censoring assumption. In this paper, we first consider a copula-based framework to investigate the bias caused by dependent censoring on gene selection. Then, we utilize the copula-based dependence model to develop an alternative gene selection procedure. Simulations show that the proposed procedure adjusts for the effect of dependent censoring and thus outperforms the existing method when dependent censoring is indeed present. The non-small-cell lung cancer data are analyzed to demonstrate the usefulness of our proposal. We implemented the proposed method in an R “compound.Cox” package.
References
| 1. | Cox, DR . Regression models and life-tables (with discussion). J R Stat Soc Ser B 1972; 34: 187–220. Google Scholar |
| 2. | Jenssen, TK, Kuo, WP, Stokke, T Association between gene expressions in breast cancer and patient survival. Hum Genet 2002; 111: 411–420. Google Scholar | Medline | ISI |
| 3. | Matsui, S . Predicting survival outcomes using subsets of significant genes in prognostic marker studies with microarrays. BMC Bioinform 2006; 7: 156–156. Google Scholar | Medline | ISI |
| 4. | Chen, HY, Yu, SL, Chen, CH A five-gene signature and clinical outcome in non-small-cell lung cancer. N Engl J Med 2007; 356: 11–20. Google Scholar | Medline | ISI |
| 5. | Matsui, S, Simon, R, Qu, P Developing and validating continuous genomic signatures in randomized clinical trials for predictive medicine. Clin Cancer Res 2012; 18: 21–21. Google Scholar | ISI |
| 6. | Tukey, JW . Tightening the clinical trial. Control Clin Trials 1993; 14: 266–285. Google Scholar | Medline |
| 7. | Radamacher, MD, Mcshane, LM, Simon, R. A paradigm for class prediction using gene expression profiles. J Comput Biol 2002; 9: 505–511. Google Scholar | Medline | ISI |
| 8. | Beer, DG, Kardia, SLR, Huang, CC Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med 2002; 8: 816–824. Google Scholar | Medline | ISI |
| 9. | Emura, T, Chen, YH, Chen, HY. Survival prediction based on compound covariate under Cox proportional hazard models. PLoS One 2012; 7: e47627–e47627. Google Scholar | Medline | ISI |
| 10. | Beyersmann, J, Allignol, A, Schumacher, M. Competing risks and multistate models with R, New York: Springer-Verlag, 2012. Google Scholar |
| 11. | Fine, JP, Gray, RJ. A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc 1999; 94: 548–560. Google Scholar | ISI |
| 12. | Binder, H, Allignol, A, Schumacher, M Boosting for high-dimensional time-to-event data with competing risks. Bioinformatics 2009; 25: 890–896. Google Scholar | Medline | ISI |
| 13. | Bakoyannis, G, Touloumi, G. Practical methods for competing risks data: A review. Stat Method Med Res 2012; 21: 257–272. Google Scholar | SAGE Journals | ISI |
| 14. | Mogensen, UB, Gerds, TA. A random forest approach for competing risks based on pseudo-values. Stat Med 2011; 32: 3102–3114. Google Scholar | ISI |
| 15. | Nelsen, RB . An introduction to copulas. Springer Series in Statistics, 2nd ed. New York: Springer-Verlag, 2006. Google Scholar |
| 16. | Zheng, M, Klein, JP. Estimates of marginal survival for dependent competing risks based on an assumed copula. Biometrika 1995; 82: 127–138. Google Scholar | ISI |
| 17. | Heckman, JJ, Honore, BE. The identifiability of the competing risks models. Biometrika 1989; 76: 325–330. Google Scholar | ISI |
| 18. | Rivest, LP, Wells, MT. A martingale approach to the copula-graphic estimator for the survival function under dependent censoring. J Mult Anal 2001; 79: 138–155. Google Scholar | ISI |
| 19. | Chen, YH . Semiparametric marginal regression analysis for dependent competing risks under an assumed copula. J R Stat Soc Ser B 2010; 72: 235–251. Google Scholar |
| 20. | Wessels, LFA, Reinders, MJT, Hart, AAM A protocol for building and evaluating predictors of disease state based on microarray data. Bioinformatics 2002; 21: 3755–3762. Google Scholar | ISI |
| 21. | Witten, DM, Tibshirani, R. Survival analysis with high-dimensional covariates. Stat Method Med Res 2010; 19: 29–51. Google Scholar | SAGE Journals | ISI |
| 22. | Andersen, PK, Borgan, O, Gill, RD Statistical models based on counting processes, New York: Springer-Verlag, 1993. Google Scholar |
| 23. | Fleming, TR, Harrington, DP. Counting process and survival analysis, New York: John Wiley and Sons, 1991. Google Scholar |
| 24. | Struthers, CA, Kalbfleish, JD. Misspecified proportional hazard models. Biometrika 1986; 73: 363–369. Google Scholar | ISI |
| 25. | Oakes, D . Bivariate survival models induced by frailties. J Am Stat Assoc 1989; 84: 487–493. Google Scholar | ISI |
| 26. | Kalbfleisch, JD, Prentice, RL. The statistical analysis of failure time data, 2nd ed. New York: John Wiley and Sons, 2002. Google Scholar |
| 27. | Escarela, G, Carriere, JF. Fitting competing risks with an assumed copula. Stat Method Med Res 2003; 12: 333–349. Google Scholar | SAGE Journals | ISI |
| 28. | Braekers, R, Veraverbeke, N. A copula-graphic estimator for the conditional survival function under dependent censoring. Can J Stat 2005; 33: 429–447. Google Scholar | ISI |
| 29. | Emura T and Chen YH. Regression estimation based on the compound shrinkage method under the Cox proportional hazard model. R compound.Cox package, version 1.4, 2013. Google Scholar |
| 30. | Tsiatis, A . A nonidentifiability aspect of the problem of competing risks. Proc Natl Acad Sci USA 1975; 72: 20–22. Google Scholar | Medline | ISI |
| 31. | Verveij, PJM, van Houwelingen, HC. Cross validation in survival analysis. Stat Med 1993; 12: 2305–2314. Google Scholar | Medline | ISI |
| 32. | Harrell, FE, Califf, RM, Pryor, DB Evaluating the yield of medical tests. J Am Med Assoc 1982; 247: 2543–2546. Google Scholar | Medline | ISI |
| 33. | Harrell, FE, Lee, KL, Mark, DB. Multivariate prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med 1996; 15: 361–387. Google Scholar | Medline | ISI |
| 34. | Pepe, MS, Fleming, TR. Weighted Kaplan-Meier statistics: A class of distance tests for censored survival data. Biometrics 1989; 45: 497–507. Google Scholar | Medline | ISI |
| 35. | Frankel, PH, Reid, ME, Marshall, JR. A permutation test for a weighted Kaplan-Meier estimator with application to the nutritional prevention of cancer trial. Contemp Clin Trial 2007; 28: 343–347. Google Scholar | Medline | ISI |
