Abstract
In cancer studies, patients often experience two different types of events: a non-terminal event such as recurrence or metastasis, and a terminal event such as cancer-specific death. Identifying pathways and networks of genes associated with one or both of these events is an important step in understanding disease development and targeting new biological processes for potential intervention. These correlated outcomes are commonly dealt with by modeling progression-free survival, where the event time is the minimum between the times of recurrence and death. However, identifying pathways only associated with progression-free survival may miss out on pathways that affect time to recurrence but not death, or vice versa. We propose a combined testing procedure for a pathway’s association with both the cause-specific hazard of recurrence and the marginal hazard of death. The dependency between the two outcomes is accounted for through perturbation resampling to approximate the test’s null distribution, without any further assumption on the nature of the dependency. Even complex non-linear relationships between pathways and disease progression or death can be uncovered thanks to a flexible kernel machine framework. The superior statistical power of our approach is demonstrated in numerical studies and in a gene expression study of breast cancer.
References
| 1. | Subramanian, A, Tamayo, P, Mootha, VK Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA 2005; 102: 15545–15550. Google Scholar | Crossref | Medline | ISI |
| 2. | McLendon, R, Friedman, A, Bigner, D Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 2008; 455: 1061–1068. Google Scholar | Crossref | Medline | ISI |
| 3. | Jones, S, Zhang, X, Parsons, DW Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science 2008; 321: 1801–1806. Google Scholar | Crossref | Medline | ISI |
| 4. | Goeman, JJ, Van De Geer, SA, De Kort, F A global test for groups of genes: testing association with a clinical outcome. Bioinformatics 2004; 20: 93–99. Google Scholar | Crossref | Medline | ISI |
| 5. | Goeman, JJ, Oosting, J, Cleton-Jansen, AM Testing association of a pathway with survival using gene expression data. Bioinformatics 2005; 21: 1950–1957. Google Scholar | Crossref | Medline | ISI |
| 6. | Liu, D, Lin, X, Ghosh, D. Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models. Biometrics 2007; 63: 1079–1088. Google Scholar | Crossref | Medline | ISI |
| 7. | Liu, D, Ghosh, D, Lin, X. Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models. BMC Bioinf 2008; 9: 292–292. Google Scholar | Crossref | Medline | ISI |
| 8. | Cai, T, Tonini, G, Lin, X. Kernel machine approach to testing the significance of multiple genetic markers for risk prediction. Biometrics 2011; 67: 975–986. Google Scholar | Crossref | Medline | ISI |
| 9. | Lin, X, Cai, T, Wu, MC Kernel machine SNP-set analysis for censored survival outcomes in genome-wide association studies. Genet Epidemiol 2011; 35: 620–631. Google Scholar | Crossref | Medline | ISI |
| 10. | Wu, MC, Lee, S, Cai, T Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet 2011; 89: 82–93. Google Scholar | Crossref | Medline | ISI |
| 11. | Wu, MC, Maity, A, Lee, S Kernel machine SNP-set testing under multiple candidate kernels. Genet Epidemiol 2013; 37: 267–275. Google Scholar | Crossref | Medline | ISI |
| 12. | Sinnott, JA, Cai, T. Omnibus risk assessment via accelerated failure time kernel machine modeling. Biometrics 2013; 69: 861–873. Google Scholar | Crossref | Medline | ISI |
| 13. | Hejblum, BP, Skinner, J, Thiébaut, R. Time-course gene set analysis for longitudinal gene expression data. PLOS Comput Biol 2015; 11: e1004310–e1004310. Google Scholar | Crossref | Medline | ISI |
| 14. | Panageas, KS, Ben-Porat, L, Dickler, MN When you look matters: the effect of assessment schedule on progression-free survival. J Natl Cancer Inst 2007; 99: 428–432. Google Scholar | Crossref | Medline |
| 15. | Burzykowski, T, Buyse, M, Piccart-Gebhart, MJ Evaluation of tumor response, disease control, progression-free survival, and time to progression as potential surrogate end points in metastatic breast cancer. J Clin Oncol 2008; 26: 1987–1992. Google Scholar | Crossref | Medline | ISI |
| 16. | Broglio, KR, Berry, DA. Detecting an overall survival benefit that is derived from progression-free survival. J Natl Cancer Inst 2009; 101: 1642–1642. Google Scholar | Crossref | Medline |
| 17. | Buyse, M, Sargent, DJ, Saad, ED. Survival is not a good outcome for randomized trials with effective subsequent therapies. J Clin Oncol 2011; 29: 4719–4720. Google Scholar | Crossref | Medline | ISI |
| 18. | Amir, E, Seruga, B, Kwong, R Poor correlation between progression-free and overall survival in modern clinical trials: are composite endpoints the answer? Euro J Cancer 2012; 48: 385–388. Google Scholar | Crossref | Medline | ISI |
| 19. | Sherrill, B, Kaye, JA, Sandin, R Review of meta-analyses evaluating surrogate endpoints for overall survival in oncology. OncoTargets Ther 2012; 5: 287–287. Google Scholar | Crossref | Medline | ISI |
| 20. | Fine, JP, Jiang, H, Chappell, R. On semi-competing risks data. Biometrika 2001; 88: 907–919. Google Scholar | Crossref | ISI |
| 21. | Cook, RJ, Lawless, JF. Marginal analysis of recurrent events and a terminating event. Stat Med 1997; 16: 911–924. Google Scholar | Crossref | Medline | ISI |
| 22. | Ghosh, D, Lin, D. Nonparametric analysis of recurrent events and death. Biometrics 2000; 56: 554–562. Google Scholar | Crossref | Medline | ISI |
| 23. | Ghosh, D, Lin, D. Marginal regression models for recurrent and terminal events. Stat Sin 2002; 12: 663–688. Google Scholar | ISI |
| 24. | Jiang, H, Fine, JP, Chappell, R. Semiparametric analysis of survival data with left truncation and dependent right censoring. Biometrics 2005; 61: 567–575. Google Scholar | Crossref | Medline | ISI |
| 25. | Ghosh, D . Semiparametric inferences for association with semi-competing risks data. Stat Med 2006; 25: 2059–2070. Google Scholar | Crossref | Medline | ISI |
| 26. | Peng, L, Fine, JP. Regression modeling of semicompeting risks data. Biometrics 2007; 63: 96–108. Google Scholar | Crossref | Medline | ISI |
| 27. | Lakhal, L, Rivest, LP, Abdous, B. Estimating survival and association in a semicompeting risks model. Biometrics 2008; 64: 180–188. Google Scholar | Crossref | Medline | ISI |
| 28. | Hsieh, JJ, Wang, W, Ding, AA. Regression analysis based on semicompeting risks data. J R Stat Soc B 2008; 70: 3–20. Google Scholar |
| 29. | Fu, H, Wang, Y, Liu, J Joint modeling of progression-free survival and overall survival by a Bayesian normal induced copula estimation model. Stat Med 2013; 32: 240–254. Google Scholar | Crossref | Medline | ISI |
| 30. | Liu, L, Wolfe, RA, Huang, X. Shared frailty models for recurrent events and a terminal event. Biometrics 2004; 60: 747–756. Google Scholar | Crossref | Medline | ISI |
| 31. | Ye, Y, Kalbfleisch, JD, Schaubel, DE. Semiparametric analysis of correlated recurrent and terminal events. Biometrics 2007; 63: 78–87. Google Scholar | Crossref | Medline | ISI |
| 32. | Zeng, D, Lin, D. Semiparametric transformation models with random effects for joint analysis of recurrent and terminal events. Biometrics 2009; 65: 746–752. Google Scholar | Crossref | Medline | ISI |
| 33. | Xu, J, Kalbfleisch, JD, Tai, B. Statistical analysis of illness–death processes and semicompeting risks data. Biometrics 2010; 66: 716–725. Google Scholar | Crossref | Medline | ISI |
| 34. | Zeng, D, Chen, Q, Chen, MH Estimating treatment effects with treatment switching via semicompeting risks models: an application to a colorectal cancer study. Biometrika 2012; 99: 167–184. Google Scholar | Crossref | Medline | ISI |
| 35. | Zhang, Y, Chen, MH, Ibrahim, JG Bayesian gamma frailty models for survival data with semi-competing risks and treatment switching. Lifetime Data Anal 2014; 20: 76–105. Google Scholar | Crossref | Medline | ISI |
| 36. | Hu, C, Tsodikov, A. Joint modeling approach for semicompeting risks data with missing nonterminal event status. Lifetime Data Anal 2014; 20: 563–583. Google Scholar | Crossref | Medline | ISI |
| 37. | Lee, KH, Haneuse, S, Schrag, D Bayesian semiparametric analysis of semicompeting risks data: investigating hospital readmission after a pancreatic cancer diagnosis. J R Stat Soc C 2015; 64: 253–273. Google Scholar | Crossref | Medline | ISI |
| 38. | Gail, MH, Brinton, LA, Byar, DP Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J Natl Cancer Inst 1989; 81: 1879–1886. Google Scholar | Crossref | Medline |
| 39. | Schölkopf, B, Smola, A, Müller, KR. Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput 1998; 10: 1299–1319. Google Scholar | Crossref | ISI |
| 40. | Kimeldorf, G, Wahba, G. A correspondence between bayesian estimation on stochastic processes and smoothing by splines. Ann Math Stat 1970; 41: 495–502. Google Scholar | Crossref |
| 41. | Scholkopf, B, Smola, A. Learning with kernels, Cambridge, MA: MIT Press, 2002. Google Scholar |
| 42. | Commenges, D, Andersen, PK. Score test of homogeneity for survival data. Lifetime Data Anal 1995; 1: 145–156. Google Scholar | Crossref | Medline |
| 43. | Braun M. Spectral properties of the kernel matrix and their application to kernel methods in machine learning. PhD Thesis, University of Bonn, Germany, 2005. Google Scholar |
| 44. | Olopade, OI, Grushko, Ta, Nanda, R Advances in breast cancer: Pathways to personalized medicine. Clin Cancer Res 2008; 14: 7988–7999. Google Scholar | Crossref | Medline | ISI |
| 45. | van de Vijver, MJ, He, YD, van’t Veer, LJ A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 2002; 347: 1999–2009. Google Scholar | Crossref | Medline | ISI |
| 46. | Oh, AS, Lorant, LA, Holloway, JN Hyperactivation of MAPK induces loss of ERalpha expression in breast cancer cells. Mol Endocrinol 2001; 15: 1344–1359. Google Scholar | Medline |
| 47. | Teschendorff, AE, Gomez, S, Arenas, A Improved prognostic classification of breast cancer defined by antagonistic activation patterns of immune response pathway modules. BMC Cancer 2010; 10: 604–604. Google Scholar | Crossref | Medline | ISI |
| 48. | Buck, MB, Knabbe, C. TGF-beta signaling in breast cancer. Ann N Y Acad Sci 2006; 1089: 119–126. Google Scholar | Crossref | Medline | ISI |
| 49. | Barcellos-Hoff, M, Akhurst, RJ. Transforming growth factor-β in breast cancer: too much, too late. Breast Cancer Res 2009; 11: 202–202. Google Scholar | Crossref | Medline | ISI |
| 50. | Fleming, T, Harrington, D. Counting processes and survival analysis 1991; Vol 169, New York: John Wiley & Sons. Google Scholar |
