Abstract
A new twoparameter probability distribution called hypertabastic is introduced to model the survival or timetoevent data. A simulation study was carried out to evaluate the performance of the hypertabastic distribution in comparison with popular distributions. We then demonstrate the application of the hypertabastic survival model by applying it to data from two motivating studies. The first one demonstrates the proportional hazards version of the model by applying it to a data set from multiple myeloma study. The second one demonstrates an accelerated failure time version of the model by applying it to data from a randomized study of glioma patients who underwent radiotherapy treatment with and without radiosensitizer misonidazole. Based on the results from the simulation study and two applications, the proposed model shows to be a flexible and promising alternative to practitioners in this field.
1. Introduction
Time to event models, commonly known as survival or reliability models, have been studied and applied in a variety of scientific disciplines such as medicine, engineering and business. The Hosmer and Lemeshow [1], Lee and Wang [2], Kleinbaum and Klein [3], and Collet [4] books give a detailed overview of survival data modeling techniques. Nonparametric and semiparametric survival models such as the Cox regression analysis have been the most widely used models in the analysis of time to event survival data [5]. On the other hand, if the assumption for parametric probability distribution is met for the data set under consideration, it will result in more efficient and easier to interpret estimates than nonparametric or semi parametric models. A comprehensive review was given by Efron [6] and Lee and Go [7].
Parametric hazard functions can enable clinicians and researchers to model various disease scenarios, assess disease prognosis and progression, give valuable insights on the pattern of failure, and understand the pathogenesis of a chronic disease and how they are affected by different treatment effects [810]. Estimation of hazard function is also useful in the analysis of changepoint hazard rate models. It helps policy makers with cost effective health care policy decisions [11]. Lundin et al. [12] estimated the survival probabilities in breast cancer patients and concluded that parametric survival estimates may be more precise than KaplanMeier estimates when there are few patients in a particular stratum. Royston and Parmar [13] modeled the baseline distribution function by restricted cubic regression spline. Kay et al. [14] discussed the use of hazard functions in breast cancer studies. They believe that the hazard function is an important tool in investigating disease curability and can help the clinician to express his ideas regarding disease progression and the biology of treatment effect. Foulkes et al. [15] used parametric modeling to assess the prognostic factors in the recurrence of ischemic strokes. Sama et al. [16] used five parametric models to analyze the survival time data of infections and they found that the best fit could be obtained using parametric models. They also indicated that parametric models can be used to model the duration of malaria infections. Kannan et al. [17] used loglogistic probability distribution to model altitude decompression sickness (DCS) risk and symptom onset time. They concluded that the loglogistic model could provide good estimates of the probability of DCS over time. Nardi and Schemper [18,19] emphasized the role of residuals for the selection of survival models. They argued that when empirical data is sufficient, parametric models provide some insight into the shape of the baseline hazard function.
In the Cox model, the baseline hazard function is regarded as a nuisance parameter, while in parametric models, the hazard function reflects the time course of the process under study. In this paper, we introduce a new twoparameter continuous probability distribution called hypertabastic probability distribution. The hypertabastic hazard function can assume a different variety of shapes. It can be used to analyze biomedical data such as cancer recurrence time. Based on the hypertabastic distribution, we introduce the hypertabastic survival model which includes the hypertabastic proportional hazards model with parametric baseline hazard function, the hypertabastic accelerated failure model and the hypertabastic proportional odds model. The hypertabastic distribution can be used to analyze the accelerated hazards regression model of Chen [20]. It can also be used to monitor the disease progression and regression and provide the clinicians with the time interval(s) on which the disease progresses or regresses and the time interval(s) on which the disease progression or regression speeds up or slows down. This vital information will make it easier for the physicians to take appropriate action regarding their patients.
2. Hypertabastic distribution
In this section we introduce a new probability distribution which can be used in many scientific disciplines. Such disciplines may include, but are not limited to, biomedical, engineering, and business fields.
Definition 2.1 (Hypertabastic Distribution) We say a continuous random variable T has a hypertabastic distribution if its cumulative distribution function is
The parameters α and β are both positive and Sech [•] and Coth [•] are hyperbolic secant and hyperbolic cotangent respectively. We often read as "T is hypertabastically distributed with parameters α and β "and write it as H (α, β). The probability density function of T is given by
where W(t) = α(1  t^{β }Coth(t^{β}))/β.
3. Hypertabastic survival and hazard Functions
Definition 3.1 (The Hypertabastic Survival Function) Let T be a continuous random variable representing the waiting time until the occurrence of an event. Then the hypertabastic survival function is defined as
where P (T > t) is the probability that waiting time exceeds t.
Definition 3.2 (The Hypertabastic Hazard Function) The hypertabastic hazard function h(t) which represents the instantaneous failure rate at time t given survival up to time t is defined as
The cumulative hazard function H(t) is defined as
The hazard function is a conditional failure rate which gives the instantaneous potential for failing at time t per unit time for an individual surviving to time t.
The most commonly used baseline hazard functions are Weibull, lognormal and loglogistic. The Weibull baseline hazard function has monotone increasing, monotone decreasing and constant forms. The lognormal baseline hazard function is nonmonotonic. It follows a behavior that increases to a maximum, and then decreases (unimodal shaped). The loglogistic baseline hazard function can be monotone decreasing or have unimodal shape. These different hazard shapes may reveal different biological mechanisms of disease progression and regression and can provide helpful medical information. The hypertabastic baseline hazard function shapes are given as follows along with illustrative examples from published literature that could fit a pattern of that particular behavior where applicable:
1. The hypertabastic baseline hazard function is monotone decreasing if 0 <β ≤ 0.25 (Figure 1a). Clark et al. [21] analyzed data for 825 patients diagnosed with primary epithelial ovarian carcinoma and observed that the hazard rate was initially high after the diagnosis and gradually decreased afterwards.
Figure 1. a) Hypertabastic hazard curve for 0 <β ≤ 0.25; b) Hypertabastic hazard curve for 0.25 <β < 1; c) Hypertabastic hazard curve for β = 1; d) Hypertabastic hazard curve for 1 ≤ β < 2; e) Hypertabastic hazard curve for β = 2; f) Hypertabastic hazard curve for β > 2.
2. The hypertabastic baseline hazard function first increases with time until it reaches its maximum and then decreases (unimodalshaped) if 0.25 <β < 1 (Figure 1b). Demicheli at al. [22] concluded that the hazard rate for nodepositive postmenopausal women was unimodalshaped. Schulman et al. [23] studied the influence of donor and recipient HLA locus mismatching on the development of obliterative bronchiolitis (OB) after lung transplantation and estimated the risk of OB after the transplant. Their estimated hazard function of developing OB indicated a unimodalshaped curve as well.
3. The hypertabastic baseline hazard initially increases with time, then it reaches its horizontal asymptote α provided that β = 1 (Figure 1c). Weitz and Fraser [24] concluded that hazard rate plateaus are explained as a generic consequence of considering death in terms of first passage time for processes undergoing a random walk with drift. They analyzed the hazard rate plateau in populations of fruit flies, yeast and other organisms.
4. The hypertabastic baseline hazard function is increasing with upward concavity until it reaches its inflection point and then it continues to increase with downward concavity thereafter if 1 <β < 2 (Figure 1d). Such a hazard can be used in many applied sciences when failure rate increases with respect to increase in time. For instance, it can be used to model leukemia survival times for patients not responding to treatment [3].
5. The hypertabastic baseline hazard function is increasing with upward concavity for a while and then it becomes a linear function with slope α if β = 2. (Figure 1e).
6. The hypertabastic baseline hazard function is increasing with upward concavity if β > 2. (Figure 1f).
Definition 3.3 (Disease progression and regression) Let the hazard function be defined on interval I and h'(t) be its derivative.
1. Disease progresses on the time interval I if h'(t) > 0 for all t in I.
2. Disease regresses on the time interval I if h'(t) < 0 for all t in I.
3. Disease neither progresses nor regresses on the interval I if h'(t) = 0 for all t in I.
Definition 3.4 (Speed of progression and regression) Let h(t) be a hazard function defined on time interval I and let h'(t) and h"(t) be first and second derivative of h(t) respectively.
1. If h'(t) > 0 and h"(t) > 0 for all t in I, then the disease progression speeds up on the time interval I.
2. If h'(t) > 0 and h"(t) < 0 for all t in I, then the disease progression slows down on the time interval I.
3. If h'(t) < 0 and h"(t) > 0 for all t in I, then the disease regression slows down on the time interval I.
4. If h'(t) < 0 and h"(t) < 0 for all t in I, then the disease regression speeds up on the time interval I.
Definitions 3.3 and 3.4 can assist clinicians, researches and pharmacologists to monitor disease status over time. If for instance the goal of the drug treatment study is to slow down the disease progression and to understand the time course and management of disease, the above mentioned definitions may be very useful. See for example, the paper by Chan and Holford [25] who studied the rate of deterioration of degenerative diseases over time.
4. Hypertabastic proportional hazards model
When the effect of risk factors is to change the baseline hazard function by a proportionate amount at all times t, we call the model a proportional hazards model. Suppose vector X is a pdimensional vector of covariates and assume g(Xθ) is a non negative function of X satisfying the condition that g(0θ) = 1. Let h_{0}(t) which has been defined in section 3 as h(t), be called baseline hazard function. This is the hazard function when there are no covariates in the model. The hypertabastic proportional hazard model assumes a hazard function h(tX, θ) of the form
where θ is a vector of unknown parameters. The hypertabastic survival function S(tX, θ) for the proportional hazards model is defined as
where the baseline survival function S_{0}(t) is defined as S(t) in section 3. The hypertabastic probability density function for the proportional hazard model is denoted by f(tX, θ) and is equal to
where the baseline probability density function f_{0}(t) is defined as f(t) in section 2. Depending on the type of censoring, the maximum likelihood function technique along with an appropriate loglikelihood function may be used to estimate the unknown parameters in this model. Most data used in survival analysis have only right censoring, therefore we will focus on right censoring. Consider a sample of right censored survival time's data of n individuals t_{1}, t_{2}, ..., t_{n }with associated pdimensional covariate vectors X_{1}, X_{2}, ..., X_{n }and an unknown parameter vector θ = (θ_{1}, θ_{2}, ..., θ_{p}).
Then, the hypertabastic proportional hazards loglikelihood function can be written as
where
Some statistical software packages use logarithm of survival time as their survival time variable in their model fittings. If this is the case, then the following alternative formula can be used as the proportional hazards loglikelihood function:
The maximum likelihood estimate of (p+2) dimensional parameter vector λ = (α, β, θ_{1}, θ_{2}, ..., θ_{p}) is the vector . Asymptotically, is normally distributed with mean vector λ and variancecovariance matrix V. An estimate of V can be obtained by calculating the inverse of the observed information matrix.
To assess the statistical significance of model parameters, one can use the well known statistical tests such as likelihood ratio test, Wald test, or the score test.
The hazard ratio HR(X_{i}, X_{j}) for individuals i and j with covariate vectors X_{i }and X_{j }is given by
The most common form of g(Xθ) is exp [θ^{T }X] [1]. If we use , then the hazard ratio HR(X_{i}X_{j}) = exp [θ^{T }(X_{i } X_{j})] where HR(X_{i}, X_{j}) is independent of time t. If X_{i }= X and X_{j }= 0, then the hazard ratio becomes HR(X_{i}, 0) = g(Xθ) and if we use , then , where θ_{j}x_{j }is the elasticity of hazard rate with respect to covariate x_{j}. Symbolically,
The elasticity of hazard function with respect to covariate x_{j }is a measure of the responsiveness of the hazard function to change in covariate x_{j}. Intuitively, elasticity of the hazard function with respect to change in covariate x_{j }is percent change in failure rate divided by percent change in covariate x_{j}.
5. Hypertabastic accelerated failure time model
When the covariates act multiplicatively on the timescale, the model is called accelerated failure time model [3,4]. The hypertabastic accelerated failure time model assumes a hazard function h(tX, θ) of the form
where h_{0}(•) is the baseline hypertabastic hazard function and the hypertabastic survival function for the accelerated failure time model is
where S_{0}(•) is the baseline hypertabastic survival function. Finally, the hypertabastic probability density function for the accelerated failure time model is
where f_{0}(•) is the baseline hypertabastic probability density function. The maximum likelihood technique can be used to estimate the parameters of this model and statistical tests similar to section 4 can be used to assess the significance of model covariates. For the right censored data, the hypertabastic accelerated failure time model would have a loglikelihood function of the form
where Z(t_{i}) = t_{i}g(X_{i}θ).
Again, if someone prefers to use the logarithm of time as the model survival time variable, then the alternative accelerated failure time loglikelihood function is
The maximum likelihood estimate of (p+2) dimensional parameter vector (α, β, θ_{1}, θ_{2}, ..., θ_{p}) and testing of hypothesis regarding model parameters are similar to methods of section 4.
6. Hypertabastic proportional odds model
If the effect of risk factor is to change the odds of survival beyond time t by a proportionate amount, then the model is called proportional odds model. The odds of surviving beyond time t are expressed as
where S_{0}(t) is the baseline hypertabastic survival function. The hypertabastic survival function for the proportional odds model is given by
and the hypertabastic baseline odds function of survival beyond time t is given by
The ratio of the odds of survival of individual i relative to individual j is equal to exp [θ^{T}(X_{i } X_{j})]. The hypertabastic proportional odds model can be fitted by maximizing an appropriate likelihood function.
7. Simulation study
To evaluate the performance of the hypertabastic model we conduct a simulation study in which we compare the overall fit of the proposed model with Weibull, loglogistic and lognormal models. Since all distributions under our consideration have exactly two parameters, we will use the negative of the loglikelihood as a measure of goodnessoffit. This measure would result in the same conclusion as the Akaike's Information Criterion (AIC) [26]. Thus the smallest value suggests a better fit. We conduct 1000 simulations with sample size of 200 and random censoring of approximately 40%. We sample timetoevent from 11 different parameter combinations of two parameter Weibull, lognormal and gamma distributions for a total of 33 combinations or 33000 simulations. We fit four mentioned models and average the log likelihood over 1000 runs to determine which model fits the simulated data with the overall most precision on the average. Simulation results are presented in Table 1.
Table 1. Average log likelihoods and their standard errors for Weibull (W), hypertabastic (HT), loglogistic (LL) and lognormal (LN) models based on 1000 simulations from eleven parameter combinations of two parameter Weibull, lognormal and gamma distributions
In simulations where we sample from a twoparameter Weibull distribution, obviously, the Weibull model fits the data with highest precision in all instances. Hypertabastic model is a close second, outperforming lognormal and loglogistic models for all combinations of parameters, with lognormal being the worst. Similarly when sampling is from a twoparameter lognormal distribution, the lognormal model outperforms all other models. The hypertabastic model and the loglogistic show similar results, with the loglogistic being slightly better in eight combinations of parameters and the Weibull model performing the worst. Finally, when sampling from a two parameter gamma distribution, the Weibull model fits with the most precision in seven out of eleven combinations. That is because the Weibull distribution and the gamma distribution have hazard functions which are similar in shapes. In another four instances, the hypertabastic model slightly outperforms the Weibull model. The loglogistic model comes in third, however it is close to the hypertabastic and the Weibull for several combinations, while the lognormal does the worst in all instances.
8. Application
All models presented in the next two examples were fitted using Mathematica 6.0 (Wolfram Research, Inc.) and SASv9 (SAS Institute Inc., Cary, NC). They were fitted using both time and log(time). However, only log(time) results are presented for the multiple myeloma proportional hazards and brain cancer accelerated failure time models since log(time) is commonly used as a default approach in many packages and procedures. Besides the hypertabastic survival model we fit Weibull, lognormal, loglogistic and Cox models. We present all parameter estimates along with their standard errors and compare 2loglikelihood estimates, as well as AIC, to assess the goodnessoffit of the models. SAS programs used to fit the hypertabastic proportional hazards models with time and log(time) for multiple myeloma data are provided [see Additional file 1].
Additional file 1. Hypertabastic model fitting to multiple myeloma data using SAS PROC NLP. SAS PROC NLP code provided here demonstrates how to fit hypertabastic model to multiple myeloma data using both time and log(time).
Format: DOC Size: 24KB Download file
This file can be viewed with: Microsoft Word Viewer
8.1 Analysis of multiple myeloma data
Multiple myeloma is a cancer formed by abnormal white blood cells, called malignant plasma cells. A malignant monoclonal plasma cell is called plasmacytoma. If the disease spreads throughout multiple bone marrow sites in the body, it is called multiple myeloma. This disease weakens a patient's immune system and is usually difficult to cure. To investigate the performance of the hypertabastic proportional hazards model and to compare it with Cox, Weibull, loglogistic and lognormal models, we analyze a cancer data set obtained from a study conducted by Krall et al. [27]. The data contains information on 65 patients with multiple myeloma in which the patients were treated with alkylating agents. This drug is designed to interfere with the cell's DNA and inhibits cancer cell growth. Of these 65 patients, only 17 survived the duration of the study. The data is rightcensored and the survival time from the date of diagnosis is measured in months. The covariates have been measured at diagnosis and the covariate list consists of a logarithm of white blood cell count, serum calcium, presence or absence of Bence Jones protein, proteinuria, gender, age, percent myeloid cells in peripheral blood, percent lymphocytes in peripheral blood, logarithm of percent plasma cells in bone marrow, total serum protein, presence or absence of infection, serum globin, logarithm of blood urea nitrogen, fractures, platelets, and hemoglobin. Our first task is to select risk factors that are statistically significant. To accomplish this task we use the hypertabastic loglikelihood function for proportional hazards model and follow the general variable selection strategy outlined by Collet [4]. We also examine the possibility of interactions. The two most significant risk factors that we found are the logarithm of blood urea nitrogen and hemoglobin. Using the stepwise regression, we fit the Cox proportional hazards model. The KaplanMeier survival curve for multiple myeloma data is shown in figure 2. The Cox regression did identify the same variables as the most significant prognostic factors. Table 2 gives results for the hypertabastic and the four comparison models. It shows that the 2loglikelihood and AIC statistics are lowest for the hypertabastic model when fitted to the multiple myeloma data. This indicates that the hypertabastic proportional hazards model fits the multiple myeloma data best. The most significant single variable identified by the hypertabastic model is the logarithm of blood urea nitrogen with an estimated chisquare value of 11.11 (p = 0.0014). The second significant variable is hemoglobin. All other models have identified these two variables as the most significant ones. The mean levels of patients' hemoglobin and the logarithm of blood urea nitrogen are 10.20 and 1.39 respectively. At this level, the median survival time for the hypertabastic, lognormal, loglogistic, Weibull and Cox are 20.04, 19.29, 21.01, 21.92 and 19 months respectively. Figures 3a and 3b show the graph of hypertabastic hazard and survival functions for multiple myeloma data. Figure 3a clearly shows that the hypertabastic hazard function is an increasing function of time. By examining figure 3b, we realize that for patients with a Log (BUN) reading of 1.39 and a hemoglobin level of 10.20 there is about a10% chance of survival beyond 65 months. At the above mentioned mean levels for the hemoglobin and logarithm of blood urea nitrogen, the hypertabastic hazard function shows that the failure rate (hazard) reaches its maximum velocity in about 5.15 months. At this point the disease progression has its highest speed (Figure 3c). Figures 4a and 4b show the 3dimensional graphs for survival of multiple myeloma patients as functions of time and Log(BUN), as well as time and HGB. Other models under consideration had monotone increasing hazard functions (graphs not shown).
Figure 2. KaplanMeier survival curve for multiple myeloma.
Table 2. Statistical results for hypertabastic proportional hazards and four comparison models for multiple myeloma data
Figure 3. a) Hypertabastic hazard curve for multiple myeloma; b) Hypertabastic survival curve for multiple myeloma; c) Hypertabastic velocity of hazard curve for multiple myeloma.
Figure 4. a) Hypertabastic 3D survival curve with variables time and Log (BUN) for multiple myeloma data; b) Hypertabastic 3D survival curve with variables time and HGB for multiple myeloma data.
8.2 Analysis of glioma brain cancer data
Glioma is a cancer of the brain which begins in glial cells. These cells support the neurons. These cells have a very high rate of growth which can quickly destroy the normal cells. The primary types of glioma cancers are astrocytomas, ependymomas and oligodendrogliomas. Shin et al. [28] studied ways to improve the effectiveness of radiation in the treatment of cerebral malignant astrocytoma. Their study focused on the assessment of the effect of multiple daily fractionated radiation therapy with and without misonidazole. They concluded that the addition of misonidazole did not significantly improve the patients' survival. In this section we apply the hypertabastic accelerated failure time technique to model the survival time of a sample of 30 patients from the randomized trials of radiotherapy with and without the radiosensitizer misinidazole. The data was obtained from the Medical Research Council Working Party (MRC) on misonidazole in gliomas. This data is right censored and has been previously analyzed for the selection of variables by MRC [29]. Survival time was measured in days and the longest survival time was 1098 days. We compared radiotherapy treatment of brain cancer patients with radiosensitizer misonidazole to radiotherapy without misonidazole. Figure 5 shows a KaplanMeier plot of the estimated survival curves for both groups. The logrank, Wilcoxon, and likelihood ratio tests are all nonsignificant, suggesting no significant difference between survival curves for the two groups. The KaplanMeier estimates of median survival time for radiotherapy with misonidazole group is 258.5 days and for the radiotherapy without misonidazole group is 488 days. The overall median survival time for both groups combined is 361 days. Using the KaplanMeier estimates of survival function, a plot of logcumulative hazard function against the logarithm of the survival time for individuals in two groups indicates that the data is coming from a Weibul distribution. Knowing this information led us to evaluate the performance of the hypertabastic model. First, we fit a hypertabastic accelerated failure time model to analyze the brain cancer data. Then the hypertabastic accelerated failure time model is compared with Weibul, loglogistic, lognormal accelerated failure time models and the Cox proportional hazards model. This model incorporates a binary covariate coded as treatment = 1 for the type of radiotherapy with misonidazole, and as treatment = 0 for radiotherapy without misonidazole. The second covariate is the age of the patient. Thus, this model contains two covariates treatment and age. We use the method of maximum likelihood to maximize hypertabastic accelerated failure time loglikelihood function for right censored data discussed in section 5. Table 3 gives a statistical summary for the glioma data. For instance, the hypertabastic estimated value for the coefficient of the variable radiosensitizer is 0.4387 with a standard error of 0.3437. The Wald and the likelihood ratio statistics associated with this variable are 1.6294 (p = 0.2018) and 1.6254 (p = 0.2023) respectively. Both tests indicate that the effect of individual variable radiosensitizer is nonsignificant. The estimated accelerator factor is 1.5507 (exponentiated value of parameter 0.4387). This means that after controlling for the age of the patient, the probability of a patient treated with "therapy with radiosensitizer misonidazole" surviving t days equals to the probability of a patient treated with "therapy without radiosensitizer misonidazole" surviving 1.5507 t days. For instance, the hypertabastic accelerated failure time model suggests that for 49 year old patients (median age for all patients under study), the probability of a patient treated with " therapy with radiosensitizer misonidazole " surviving 293 days equals to the probability of a patient treated with" therapy without radiosensitizer misonidazole " surviving 454 days. The Wald statistic for the age covariate is 22.18 (p < 0.0001).
Figure 5. KaplanMeier survival curves for glioma brain cancer.
Table 3. Statistical results for the hypertabastic accelerated failure time model and four comparison models for glioma brain cancer data
Using AIC, we came to the conclusion that both Weibull and hypertabastic models fit the data very well. However, the Weibull AIC value was 75.54 which indicates the best fit. For the hypertabastic model, the AIC value was 76.72 which was slightly less than Weibull. The remaining AIC values for lognormal, loglogistic and Cox are 79.15, 78.45, and 107.43 respectively. At the median age of 49, the median survival time for those patients who underwent radiotherapy with radiosensitizer misonidazole using the hypertabastic, Weibul, lognormal, loglogistic, and the Cox models are 289, 316, 270, 277, and 244 days respectively. The corresponding median survival times for the group without the radiosensitizer misonidazole are 449, 453, 423, 456, and 488 days respectively. By examining the KaplanMeier survival functions for the two groups we observe the crossing pattern which is a clear indication of violation of proportional hazards assumption. Figures 6a and 6b show the graphs of hypertabastic hazard and survival functions for both treatment groups at the age level of 49. The hazards for both groups are increasing function of time. For those patients who received radiotherapy treatment with radiosensitizer misonidazole, hazard function reached its maximum velocity in about 200 days. For those who did receive radiotherapy without misonidazole, hazard function reached its maximum velocity in about 311 days. These are the points in time where the speeds of failure rates (hazards) are highest (Figure 6c). Figures 7a and 7b represent 3dimensional graphs of survival by time and age for each treatment group separately.
Figure 6. a) Hypertabastic hazard curves for glioma brain cancer; b) Hypertabastic survival curves for glioma brain cancer; c) Hypertabastic velocity of hazard curves for glioma brain cancer.
Figure 7. a) Hypertabastic 3D survival curve with variables time and age for group using radiotherapy with radiosentisizer misonidazole for glioma brain cancer; b) Hypertabastic 3D survival curve with variables time and age for group using radiotherapy without radiosentisizer misonidazole for glioma brain cancer.
9. Discussion and conclusion
In this paper we have introduced a new survival model called hypertabastic survival model. The overall results of our simulation indicate that the hypertabastic model performs best when the data is generated from the Weibull distribution. When the data comes from the gamma distribution, both the Weibull and hypertabastic distributions perform well. In the case when we generate survival data from the lognormal distribution, the loglogistic and hypertabastic distributions perform well but the gamma and Weibull perform poorly. In the last case, the Weibull distribution performance is the worst. We believe that the hypertabastic survival model is to some extent robust with respect to variations in the distribution. We believe that the hypertabastic hazard function can play a role in modeling failure rates in medical, biological and engineering fields. The hypertabastic model can also assist physicians and clinicians with their treatment planning through its ability to predict patients' outcome as well as the risk of disease recurrence. Gilbert et al. [30] argued that the pattern of instantaneous risk over time is more interesting than the cumulative risk. For instance, in a cancer study where recurrence time of malignant tumor is of interest, a bimodal hazard curve may represent the elevated incidences of early and late recurrences and the magnitude of the hazard rates at the peaks may reveal the intensity of the failure rates. Therefore we recommend that clinicians, practitioners and analysts consider using this model along with other models, and compare it to the models they ordinarily use before making any decision as to which model would provide the best fit and prediction.
Competing interests
The author(s) declare that they have no competing interests.
Authors' contributions
The survival model equations were developed by MAT and ZB based on their previous work on hyperbolastic growth and survival models. Applications of the models to data were done by MAT, ZB, DKW and KPS. Writing of the manuscript was done by MAT and ZB, and final edits by DKW and KPS.
References

Hosmer DW, Lemeshow S: Applied Survival Analysis: Regression Modeling of Time to Event Data. New York: Wiley; 1999.

Lee ET, Wang JW: Statistical Methods for Survival Data Analysis. 3rd edition. New York: Wiley; 2003.

Kleinbaum DG, Klein M: Survival Analysis: A Selflearning Text. 2nd edition. New York: Springer; 2005.

Collet D: Modeling Survival Data in Medical Research. New York: Chapman and Hall/CRC; 1994.

Cox DR: Regression Models and Life Tables.
Journal of the Royal Statistical Society, Ser B 1972, 34:187220.

Efron B: The Efficiency of Cox's Likelihood Function for Censored Data.
Journal of the American statistical Association 1977, 72:557656. Publisher Full Text

Lee ET, Go OT: Survival Analysis in Public Health Research.
Annual Reviews in Public Health 1997, 18:105134. Publisher Full Text

Simes RJ, Zelen M: Exploratory Data Analysis and the Use of the Hazard Function for Interpreting Survival Data: An Investigator's Primer.

Tabatabai M, Williams DK, Bursac Z: Hyperbolastic Growth Models: Theory and Application.
Theoretical Biology and Medical Modeling 2005, 2:113. BioMed Central Full Text

Bursac Z, Tabatabai M, Williams DK: Nonlinear Hyperbolastic Growth Models and Applications in Cranofacial and Stem Cell Growth. In 2005 Proceedings of the American Statistical Association. Biometrics Section [CDROM], Alexandria, VA: American Statistical Association; 2006:190197.

Goodman MS, Li Y, Tiwari RC: Survival Analysis with Change Point Hazard Functions.
Harvard University Biostatistics Working Paper Series 2006.
Working Paper 40

Lundin J, Lundin M, Isola J, Joensuu H: Evaluation of a Webbased System for Survival Estimation in Breast cancer.
Studies in Health Technology and Informatics 2003, 95:788793.

Royston P, Parmar MK: Flexible Parametric Proportionalhazards and Proportionalodds Models for Censored Survival Data, with Application to Prognostic Modeling and Estimation of Treatment Effects.
Statistics in Medicine 2002, 21:21752197. PubMed Abstract  Publisher Full Text

Kay R, Schuerlen H, Schumacher M: On the Use of Hazard Functions in Breast Cancer Studies.

Foulkes MA, Sacco RL, Mohr JP, Hier DB, Price TR, Wolf PA: Parametric Modeling of Stroke Recurrence.
Neuroepidemiology 1994, 13:1927. PubMed Abstract  Publisher Full Text

Sama W, Dietz K, Smith T: Distribution of Survival Times of Deliberate Plasmodium Falciparum Infections in Tertiary Syphilis Patients.
Transactions of the Royal Society of Tropical Medicine and Hygiene 2006, 100:811816. Publisher Full Text

Kannan N, Raychaudhuri A, Pilamanis AA: A Log logistic Model for Altitude Decomposition Sickness.
Aviation, Space and Environmental Medicine 1998, 69:965970.

Nardi A, Schemper M: New Residuals for Cox Regression and Their Application to Outlier Screening.
Biometrics 1999, 55:523529. PubMed Abstract  Publisher Full Text

Nardi A, Schemper M: Comparing Cox and Parametric Models in Clinical Studies.
Statistics in Medicine 2003, 22:35973610. PubMed Abstract  Publisher Full Text

Chen YQ: Accelerated Hazards Regression Model and its Adequacy for Censored Survival Data.
Biometrics 2001, 57:853860. PubMed Abstract  Publisher Full Text

Clark TG, Stewart ME, Altman DG, Gabra H, Smyth JF: a Prognostic Model for Ovarian Cancer.
British Journal of Cancer 2001, 85:944952. PubMed Abstract  Publisher Full Text

Demicheli RM, Bonadonna G, Hrushesky WJ, Retsky MW, Valagussa P: Menopausal Status Dependence of the Timing of Breast Cancer Recurrence After Surgical Removal of the Primary Tumor.
Breast Cancer Research 2004, 6:689696. BioMed Central Full Text

Schulman LL, Weinberg AD, McGregor CC, SuciuFoca NM, Itescu S: Influence of Donor and Recipient HLA Locus Mismatching on Development of Obliterative Bronchiolitis after Lung Transplantation.
American Journal of Respiratory and Critical Care Medicine 2001, 163:437442.

Weitz JS, Fraser HB: Explaining mortality rate plateaus.
Proceedings of the National Academy of Science 2001, 98:1538315386. Publisher Full Text

Chan PL, Holford NH: Drug Treatment Effects on Disease Progression.
Annual Reviews of Pharmacology and Toxicology," 2001, 41:625659. Publisher Full Text

Akaike A: A New Look at the Statistical Model Identification.
IEEE Trans Autom Control AC 1974, 19:716723. Publisher Full Text

Krall JM, Utoff VA, Harley JB: A Stepup Procedure for Selecting Variables Associated with Survival.
Biometrics 1975, 31:4957. PubMed Abstract  Publisher Full Text

Shin KH, Urtasun RC, Fulton D, Geggie PHS, Tanasichuk H, Thomas H, Muller PJ, Curry B, Mielke B: Multiple Daily Fractionated Radiation Therapy and Misonidazole in the Management of Malignant Astrocytoma.
Cancer 1985, 56:758760. PubMed Abstract  Publisher Full Text

Medical Research Council Working Party: A study of the effect of misonidazole in conjunction with radiotherapy for the treatment of grades 3 and 4 astrocytomas. A report from the MRC Working Party on misonidazole in gliomas.

Gilbert PB, Wei LJ, Kosorok MR, Clomens JO: Simultaneous Inferences on the Contrast of Two Hazard Functions with Censored Observations.
Biometrics 2002, 58(4):773780. PubMed Abstract  Publisher Full Text