Objective: Data mining methods have been used to build prediction models from cancer databases. However, the hierarchy and relative contribution of the prognostic factors affecting hypopharyngeal cancer survival have not been reported and were analyzed here using a decision tree and Cox regression models.
Methods: A total of 923 patients with American Joint Committee on Cancer (AJCC) stage III - IV hypopharyngeal cancer were identified from 2007 to 2013 in the Surveillance, Epidemiology, and End Results database. Classification and Regression Tree (CART) and Cox regression models were used to analyze race, staging, insurance status, treatment modality, sex, tumor differentiation, and marital status. The endpoint was the 5-year disease-specific survival.
Results: Univariate and multivariate analyses using Cox regression models revealed the 5-year disease-specific survival rates were higher in (1) non-black patients vs. black patients; (2) AJCC stage III vs. IV patients; (3) the “insured” group vs. the “other” insurance groups; (4) the “operation (OP)+radiotherapy (RT)” group vs. the “RT alone” group and the “RT alone” group vs. the “OP alone” group. The 5-year disease-specific survival curves based on treatment also showed “OP+RT” was superior to “RT alone” and “RT alone” was superior to “OP alone”. The relative importance of variables influencing the survival of patients with hypopharyngeal cancer was generated using the CART algorithm, with race being the most important, followed by staging, insurance status, and treatment modality.
Conclusion: A decision tree algorithm easily elucidates the hierarchies and relative contribution of variables for predicting survival.