Chapter 5A

(AST405) Lifetime data analysis

Author

Md Rasel Biswas

5 Inference Procedures for Log-location-scale Distributions

5.1 Inference for location-scale distributions

Location-scale distributions have a survivor function of the form $\begin{matrix} (5.1) & S (y; u, b) = S_{0} (\frac{y - u}{b}), \quad - \infty < y < \infty \end{matrix}$ where $- \infty < u < \infty$ and $b > 0$
Log-lifetime $Y = \log T$ has a location-scale distribution with survivor function of the form Equation 5.1

Lifetime variable $T$ has a log-location-scale distribution with the survivor function $\begin{aligned} S_{T} (t; α, β) & = S_{0} (\frac{\log t - u}{b}) \\ & = S_{0}^{⋆} [(t / α)^{β}] \end{aligned}$
- $S_{0}^{⋆} (w) = S_{0} (\log w)$ for $w > 0$
- $u = \log α$
- $b = (1 / β)$

Lifetime and log-lifetime distributions
- Lifetime $T$: exponential, Weibull, log-logistic, log-normal, etc.
- Log-lifetime $Y = \log T$: extreme-value, logistic, normal, etc.

Likelihood based methods

Goal is to estimate the parameters $(u, b)$ or $(α, β)$
Some advantages of estimating $(u, b)$
- Log-likelihood function for $(u, b)$ is closer to quadratic than that for $(α, β)$
- Large sample normal approximations for $(\hat{u}, \hat{b})$ tend to be more accurate than those for $(\hat{α}, \hat{β})$
A better choice of parameters for obtaining MLEs and implementing normal approximations is $(u, \log b)$ , which is used by most statistical software

Likelihood function

For a censored sample $\{(t_{i}, δ_{i}), i = 1, \dots, n\}$, the likelihood function is $\begin{array}{r} L (u, b) = \prod_{i = 1}^{n} [\frac{1}{b} f_{0} (\frac{y_{i} - u}{b})]^{δ_{i}} [S_{0} (\frac{y_{i} - u}{b})]^{1 - δ_{i}} \end{array}$
- $y_{i} = \log t_{i}$
- $f_{0} (z) = - d S_{0} (z) / d z \to$ pdf

The standardized variable $z_{i} = \frac{y_{i} - u}{b}$
The likelihood function $\begin{array}{r} L (u, b) = \prod_{i = 1}^{n} [\frac{1}{b} f_{0} (z_{i})]^{δ_{i}} [S_{0} (z_{i})]^{1 - δ_{i}} \end{array}$

The corresponding log-likelihood function $\begin{aligned} ℓ (u, b) & = - r \log b + \sum_{i = 1}^{n} [δ_{i} \log f_{0} (z_{i}) + (1 - δ_{i}) \log S_{0} (z_{i})] \\ & = - r \log b + \sum_{i = 1}^{n} ℓ_{i} (z_{i}, δ_{i}) \end{aligned}$
- $r = \sum_{i = 1}^{n} δ_{i}$
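The log-likelihood above can be coded once for any member of the family. The following is a minimal sketch, not the course's own code: the function name `loglik_ls`, its argument names, and the five data values are made up for illustration, and the standard normal is used as the standardized distribution only as an example.

```r
# Log-likelihood of a location-scale model for censored log-lifetimes.
# log_f0 and log_S0 are the log pdf and log survivor function of the
# standardized distribution (hypothetical argument names).
loglik_ls <- function(u, b, y, delta, log_f0, log_S0) {
  z <- (y - u) / b          # standardized values z_i
  r <- sum(delta)           # number of observed failures
  -r * log(b) + sum(delta * log_f0(z) + (1 - delta) * log_S0(z))
}

# Made-up log-lifetimes; delta = 1 marks a failure, 0 a censored time
y     <- c(1.2, 1.9, 2.3, 2.8, 3.1)
delta <- c(1, 1, 0, 1, 0)

# Standard normal as the standardized distribution (illustration only)
ll <- loglik_ls(u = 2, b = 0.8, y = y, delta = delta,
                log_f0 = function(z) dnorm(z, log = TRUE),
                log_S0 = function(z) pnorm(z, lower.tail = FALSE, log.p = TRUE))

# Direct evaluation with the N(u, b^2) density and survivor function
ll_direct <- sum(delta * dnorm(y, mean = 2, sd = 0.8, log = TRUE) +
                 (1 - delta) * pnorm(y, mean = 2, sd = 0.8,
                                     lower.tail = FALSE, log.p = TRUE))
c(ll = ll, ll_direct = ll_direct)
```

The two values agree because $\log [\frac{1}{b} f_{0} (z_{i})]$ contributes $- \log b$ once per failure, which is where the $- r \log b$ term comes from.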

Score functions

$\begin{aligned} ℓ (u, b) & = - r \log b + \sum_{i = 1}^{n} ℓ_{i} (z_{i}, δ_{i}) \\ & = - r \log b + \sum_{i = 1}^{n} [δ_{i} \log f_{0} (z_{i}) + (1 - δ_{i}) \log S_{0} (z_{i})] \end{aligned}$

$\begin{aligned} \frac{\partial ℓ (u, b)}{\partial u} & = \sum_{i = 1}^{n} \frac{\partial ℓ_{i} (z_{i}, δ_{i})}{\partial z_{i}} \times \frac{\partial z_{i}}{\partial u} \\ & = \sum_{i = 1}^{n} \frac{\partial ℓ_{i} (z_{i}, δ_{i})}{\partial z_{i}} \times (\frac{- 1}{b}) \\ & = - \frac{1}{b} \sum_{i = 1}^{n} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}] \end{aligned}$

$ℓ (u, b) = - r \log b + \sum_{i = 1}^{n} [δ_{i} \log f_{0} (z_{i}) + (1 - δ_{i}) \log S_{0} (z_{i})]$

$\begin{aligned} \frac{\partial ℓ (u, b)}{\partial b} & = - \frac{r}{b} + \sum_{i = 1}^{n} \frac{\partial ℓ_{i} (z_{i}, δ_{i})}{\partial z_{i}} \times \frac{\partial z_{i}}{\partial b} \\ & = - \frac{r}{b} + \sum_{i = 1}^{n} \frac{\partial ℓ_{i} (z_{i}, δ_{i})}{\partial z_{i}} \times (\frac{- z_{i}}{b}) \\ & = - \frac{r}{b} - \frac{1}{b} \sum_{i = 1}^{n} z_{i} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}] \end{aligned}$

Hessian matrix

$\frac{\partial ℓ (u, b)}{\partial u} = - \frac{1}{b} \sum_{i = 1}^{n} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}]$

$\begin{aligned} \frac{\partial^{2} ℓ (u, b)}{\partial u^{2}} & = \frac{\partial}{\partial u} [\frac{\partial ℓ (u, b)}{\partial u}] \\ & = \frac{1}{b^{2}} \sum_{i = 1}^{n} [δ_{i} \frac{\partial^{2} \log f_{0} (z_{i})}{\partial z_{i}^{2}} + (1 - δ_{i}) \frac{\partial^{2} \log S_{0} (z_{i})}{\partial z_{i}^{2}}] \end{aligned}$

$\frac{\partial ℓ (u, b)}{\partial b} = - \frac{r}{b} - \frac{1}{b} \sum_{i = 1}^{n} z_{i} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}]$

$\begin{aligned} \frac{\partial^{2} ℓ (u, b)}{\partial b^{2}} & = \frac{r}{b^{2}} + \frac{2}{b^{2}} \sum_{i = 1}^{n} z_{i} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}] \\ & \quad + \frac{1}{b^{2}} \sum_{i = 1}^{n} z_{i}^{2} [δ_{i} \frac{\partial^{2} \log f_{0} (z_{i})}{\partial z_{i}^{2}} + (1 - δ_{i}) \frac{\partial^{2} \log S_{0} (z_{i})}{\partial z_{i}^{2}}] \end{aligned}$

$\begin{aligned} \frac{\partial ℓ (u, b)}{\partial u} & = - \frac{1}{b} \sum_{i = 1}^{n} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}] \\ \frac{\partial^{2} ℓ (u, b)}{\partial u \partial b} & = \frac{1}{b^{2}} \sum_{i = 1}^{n} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}] \\ & \quad + \frac{1}{b^{2}} \sum_{i = 1}^{n} z_{i} [δ_{i} \frac{\partial^{2} \log f_{0} (z_{i})}{\partial z_{i}^{2}} + (1 - δ_{i}) \frac{\partial^{2} \log S_{0} (z_{i})}{\partial z_{i}^{2}}] \end{aligned}$

Score function and information matrix

$\begin{aligned} U (u, b) & = [\begin{array}{c} \frac{\partial ℓ (u, b)}{\partial u} \\ \frac{\partial ℓ (u, b)}{\partial b} \end{array}] \\ I (u, b) & = - H (u, b) = - [\begin{array}{cc} \frac{\partial^{2} ℓ (u, b)}{\partial u^{2}} & \frac{\partial^{2} ℓ (u, b)}{\partial u \partial b} \\ \frac{\partial^{2} ℓ (u, b)}{\partial b \partial u} & \frac{\partial^{2} ℓ (u, b)}{\partial b^{2}} \end{array}] \end{aligned}$

Statistical inference

$(\hat{u}, \hat{b})^{'} = {\arg max}_{(u, b)^{'} \in Θ} ℓ (u, b)$

Variance-covariance matrix

$var (\hat{u}, \hat{b}) = [I (\hat{u}, \hat{b})]^{- 1} = \hat{V}$

Sampling distribution

$(\begin{matrix} \hat{u} \\ \hat{b} \end{matrix}) \sim N_{2} ([\begin{matrix} u \\ b \end{matrix}], [I (\hat{u}, \hat{b})]^{- 1})$

Wald type CIs

For a large $n$ , $(\hat{u}, \hat{b})^{'}$ follows a bivariate normal distribution with mean $(u, b)^{'}$ and variance matrix $\hat{V}$
Standard errors of $\hat{u}$ and $\hat{b}$ can be obtained from the diagonal elements of $\hat{V}$: $s e (\hat{u}) = {\hat{V}}_{11}^{1 / 2}$ and $s e (\hat{b}) = {\hat{V}}_{22}^{1 / 2}$

The following pivotal quantities are approximately standard normal in large samples $\begin{array}{r} Z_{1} = \frac{\hat{u} - u}{s e (\hat{u})}, Z_{2} = \frac{\hat{b} - b}{s e (\hat{b})}, Z_{2}^{'} = \frac{\log \hat{b} - \log b}{s e (\log \hat{b})} \end{array}$
- $s e (\log \hat{b}) = s e (\hat{b}) / \hat{b}$ (by the delta method)
$(1 - p) 100 \%$ confidence intervals for $u$, for $b$, and for $b$ based on $\log \hat{b}$ $\begin{aligned} \hat{u} & \pm z_{1 - p / 2} s e (\hat{u}) \\ \hat{b} & \pm z_{1 - p / 2} s e (\hat{b}) \\ \hat{b} & \exp \{\pm z_{1 - p / 2} s e (\log \hat{b})\} \end{aligned}$
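The three intervals above can be computed in a few lines. This is a sketch under stated assumptions: the values of `u_hat`, `b_hat` and `V_hat` are hypothetical numbers chosen for illustration, not estimates from any data set in these notes.

```r
# Hypothetical MLEs and estimated covariance matrix of (u_hat, b_hat)
u_hat <- 3.5
b_hat <- 0.75
V_hat <- matrix(c(0.075, 0.025,
                  0.025, 0.043), nrow = 2, byrow = TRUE)

p <- 0.05
z <- qnorm(1 - p / 2)                 # z_{1 - p/2}

se_u    <- sqrt(V_hat[1, 1])
se_b    <- sqrt(V_hat[2, 2])
se_logb <- se_b / b_hat               # delta method: se(log b_hat) = se(b_hat) / b_hat

ci_u     <- u_hat + c(-1, 1) * z * se_u
ci_b     <- b_hat + c(-1, 1) * z * se_b          # lower limit can fall below 0
ci_b_log <- b_hat * exp(c(-1, 1) * z * se_logb)  # limits are always positive

rbind(ci_u, ci_b, ci_b_log)
```

The interval based on $\log \hat{b}$ respects the constraint $b > 0$, which is one reason the $\log b$ scale is preferred.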

Quantiles

$p$ th quantile of log lifetime $Y$ $\begin{aligned} P (Y \leq y_{p}) = p \Rightarrow S_{0} (\frac{y_{p} - u}{b}) & = 1 - p \\ \frac{y_{p} - u}{b} & = S_{0}^{- 1} (1 - p) \\ y_{p} & = u + b w_{p} \end{aligned}$
- $w_{p} = S_{0}^{- 1} (1 - p) = F_{0}^{- 1} (p) \to$ $p$ th quantile of the standardized distribution with survivor function $S_{0} (z)$

Estimate of $p$ th quantile and the corresponding standard error $\begin{aligned} {\hat{y}}_{p} & = \hat{u} + \hat{b} w_{p} \\ s e ({\hat{y}}_{p}) & = \sqrt{{\hat{V}}_{11} + w_{p}^{2} {\hat{V}}_{22} + 2 w_{p} {\hat{V}}_{12}} \end{aligned}$
Pivotal quantity $Z_{p} = \frac{{\hat{y}}_{p} - y_{p}}{s e ({\hat{y}}_{p})} \sim N (0, 1)$
$(1 - q) 100 %$ confidence interval for $y_{p}$ ${\hat{y}}_{p} \pm z_{1 - q / 2} s e ({\hat{y}}_{p})$
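A small helper makes the quantile formulas concrete. In this sketch the function name `wald_quantile_ci` and all numerical inputs are hypothetical; the standard logistic distribution is used for $w_{p}$ only as an example of a standardized distribution.

```r
# Wald-type CI for the p-th quantile y_p = u + b * w_p
wald_quantile_ci <- function(u_hat, b_hat, V_hat, w_p, q = 0.05) {
  a    <- c(1, w_p)                            # gradient of y_p w.r.t. (u, b)
  y_p  <- u_hat + b_hat * w_p
  se   <- sqrt(drop(t(a) %*% V_hat %*% a))     # V11 + w_p^2 V22 + 2 w_p V12
  z    <- qnorm(1 - q / 2)
  c(estimate = y_p, se = se, lower = y_p - z * se, upper = y_p + z * se)
}

# Hypothetical estimates; w_p from the standard logistic, F0^{-1}(p) = qlogis(p)
V_hat <- matrix(c(0.075, 0.025,
                  0.025, 0.043), nrow = 2, byrow = TRUE)
res <- wald_quantile_ci(u_hat = 3.5, b_hat = 0.75, V_hat = V_hat,
                        w_p = qlogis(0.10))
res
```

If the lifetime quantile $t_{p} = \exp (y_{p})$ is of interest, the endpoints can be exponentiated.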

Likelihood ratio procedures

Confidence intervals based on the normal approximation can be inaccurate for small samples
As an alternative to the normal approximation, bootstrap simulation can be used to estimate the distributions of pivots
All of these methods can perform poorly in small samples with heavy censoring
Likelihood ratio based confidence intervals are relatively complicated to implement, but they often perform better in small and medium-size samples

To test the hypothesis $H_{0} : u = u_{0}$ , the following likelihood ratio test statistic can be used $Λ_{1} (u_{0}) = 2 ℓ (\hat{u}, \hat{b}) - 2 ℓ (u_{0}, \tilde{b} (u_{0}))$
MLEs $\begin{aligned} (\hat{u}, \hat{b})^{'} & = {\arg max}_{(u, b)^{'} \in Θ} ℓ (u, b) \quad \text{(unrestricted)} \\ \tilde{b} (u_{0}) & = {\arg max}_{b \in Θ_{1}} ℓ (u_{0}, b) \quad \text{(under } H_{0} \text{)} \end{aligned}$

Under $H_{0} : u = u_{0}$ , asymptotically $Λ_{1} (u_{0}) \sim χ_{(1)}^{2}$
- Approximate two-sided $(1 - p) 100 %$ confidence interval for $u$ can be obtained as the set of values of $u_{0}$ for which $Λ_{1} (u_{0}) \leq χ_{(1), 1 - p}^{2}$

Homework - 1

Obtain the expression of likelihood ratio test statistic based confidence interval for the scale parameter $b$

The $p$ th quantile of a location-scale distribution can be expressed as $y_{p} = u + w_{p} b$, where $w_{p} = S_{0}^{- 1} (1 - p)$
To obtain confidence intervals for a quantile, consider the null hypothesis $H_{0} : y_{p} = y_{p_{0}}$
The corresponding likelihood ratio test statistic $\begin{matrix} (5.2) & Λ (y_{p_{0}}) = 2 ℓ (\hat{u}, \hat{b}) - 2 ℓ (\tilde{u}, \tilde{b}) \end{matrix}$

Steps

The estimates $\hat{u}$ and $\hat{b}$ are MLEs under $H_{1}$ $(\hat{u}, \hat{b})^{'} = {\arg max}_{(u, b)^{'} \in Θ} ℓ (u, b)$
Steps to obtain MLEs $\tilde{u}$ and $\tilde{b}$ , under $H_{0} : y_{p} = y_{p_{0}}$
1. Under $H_{0}$ , $y_{p_{0}} = u + w_{p} b \Rightarrow u = y_{p_{0}} - b w_{p}$
2. $\tilde{b} = {\arg max}_{b \in Θ} ℓ (y_{p_{0}} - w_{p} b, b)$
3. $\tilde{u} = y_{p_{0}} - w_{p} \tilde{b}$
$(1 - q) 100 %$ Confidence interval for $y_{p}$ can be obtained from the set of $y_{p_{0}}$ values such that $Λ (y_{p_{0}}) \leq χ_{(1), 1 - q}^{2}$
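The three steps can be sketched numerically. The example below is an illustration under assumptions: it uses the extreme-value standardized distribution, data simulated with `rweibull()` and `runif()` rather than any data set from these notes, and hypothetical object names throughout. The interval endpoints are found by solving $Λ (y_{p_{0}}) = χ_{(1), 0.95}^{2}$ with `uniroot()`.

```r
set.seed(1)
n      <- 40
t_fail <- rweibull(n, shape = 1.5, scale = 10)  # simulated lifetimes
t_cens <- runif(n, 5, 25)                       # simulated censoring times
y      <- log(pmin(t_fail, t_cens))             # observed log-times
delta  <- as.numeric(t_fail <= t_cens)          # 1 = failure, 0 = censored

# Extreme-value log-likelihood: -r log b + sum(delta * z - exp(z))
loglik_ev <- function(u, b) {
  z <- (y - u) / b
  -sum(delta) * log(b) + sum(delta * z - exp(z))
}

# Unrestricted MLE, maximizing over (u, log b)
fit   <- optim(c(mean(y), 0),
               function(par) -loglik_ev(par[1], exp(par[2])),
               method = "BFGS")
u_hat <- fit$par[1]
b_hat <- exp(fit$par[2])
l_max <- -fit$value

p   <- 0.5
w_p <- log(-log(1 - p))          # extreme-value quantile S0^{-1}(1 - p)

# LRT statistic for H0: y_p = yp0, with u = yp0 - w_p * b profiled over log b
lrt <- function(yp0) {
  prof <- optimize(function(logb) loglik_ev(yp0 - w_p * exp(logb), exp(logb)),
                   interval = c(-3, 3), maximum = TRUE)
  2 * l_max - 2 * prof$objective
}

yp_hat <- u_hat + w_p * b_hat
crit   <- qchisq(0.95, df = 1)
lower  <- uniroot(function(x) lrt(x) - crit, c(yp_hat - 2, yp_hat))$root
upper  <- uniroot(function(x) lrt(x) - crit, c(yp_hat, yp_hat + 2))$root
c(lower = lower, estimate = yp_hat, upper = upper)
```

The search brackets (`yp_hat` plus or minus 2) and the range for `logb` are ad hoc choices that suit this simulated sample; other data may need wider ranges.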

To obtain confidence interval for $S (y_{0})$ , consider the null hypothesis $H_{0} : S (y_{0}) = s_{0}$
The same likelihood ratio statistic Equation 5.2 can be used to test the hypothesis $Λ (s_{0}) = 2 ℓ (\hat{u}, \hat{b}) - 2 ℓ (\tilde{u}, \tilde{b})$

Steps

Steps for obtaining MLEs $\tilde{u}$ and $\tilde{b}$ under $H_{0}$
1. Under $H_{0} : S (y_{0}) = s_{0}$ $S (y_{0}) = S_{0} (\frac{y_{0} - u}{b}) = s_{0} \Rightarrow u = y_{0} - S_{0}^{- 1} (s_{0}) b$
2. $\tilde{b} = {\arg max}_{b \in Θ} ℓ (y_{0} - S_{0}^{- 1} (s_{0}) b, b)$
3. $\tilde{u} = y_{0} - S_{0}^{- 1} (s_{0}) \tilde{b}$
The $(1 - p) 100 %$ confidence interval for $S (y_{0})$ can be defined as the set of $s_{0}$ values such that $\begin{array}{r} Λ (s_{0}) \leq χ_{(1), 1 - p}^{2} \end{array}$

The likelihood ratio procedure can provide quite accurate confidence intervals when the number of failures is about 20 or more
Two-sided intervals perform better than one-sided intervals: their coverage is closer to the nominal level

5.2 Weibull and extreme-value distributions

The pdf of Weibull distribution $\begin{array}{r} f (t; α, β) = \frac{β}{α} (\frac{t}{α})^{β - 1} \exp [- (t / α)^{β}] \end{array}$
- $α > 0$ and $β > 0$ are scale and shape parameters, respectively

The pdf of extreme-value distribution $\begin{aligned} f (y; u, b) & = \frac{1}{b} \exp [(y - u) / b] \exp [- e^{(y - u) / b}] \\ & = \frac{1}{b} f_{0} (\frac{y - u}{b}) \end{aligned}$
- $u = \log α$
- $b = (1 / β)$
Extreme-value distribution is used to make inferences about Weibull distribution
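The correspondence between the two parameterizations can be checked numerically with base R. In this sketch the parameter values and time points are arbitrary choices for illustration.

```r
# Arbitrary Weibull parameters and the matching extreme-value parameters
alpha <- 30
beta  <- 1.5
u     <- log(alpha)
b     <- 1 / beta

times <- c(1, 5, 10, 50, 100)
z     <- (log(times) - u) / b

# Survivor functions: Weibull in t versus extreme value in y = log t
S_weibull <- pweibull(times, shape = beta, scale = alpha, lower.tail = FALSE)
S_ev      <- exp(-exp(z))

# Densities: f_T(t) = f_Y(log t) / t (change of variable y = log t)
f_weibull <- dweibull(times, shape = beta, scale = alpha)
f_ev      <- exp(z - exp(z)) / b

all.equal(S_weibull, S_ev)
all.equal(f_weibull, f_ev / times)
```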

Likelihood based inference procedures

Censored sample $\{(t_{i}, δ_{i}), i = 1, \dots, n\}$
Define $y_{i} = \log t_{i}$ and $z_{i} = (y_{i} - u) / b$

General expression of the log-likelihood function $ℓ (u, b) = - r \log b + \sum_{i = 1}^{n} [δ_{i} \log f_{0} (z_{i}) + (1 - δ_{i}) \log S_{0} (z_{i})]$
For extreme-value distribution $\begin{aligned} S_{0} (z) & = \exp (- e^{z}) \\ f_{0} (z) & = - \frac{d}{d z} S_{0} (z) = \exp (z - e^{z}) \end{aligned}$

$ℓ (u, b) = - r \log b + \sum_{i = 1}^{n} [δ_{i} \log f_{0} (z_{i}) + (1 - δ_{i}) \log S_{0} (z_{i})]$

Log-likelihood function for EV distribution $\begin{matrix} (5.3) & ℓ (u, b) = - r \log b + \sum_{i = 1}^{n} (δ_{i} z_{i} - e^{z_{i}}) \end{matrix}$
- $r = \sum_{i} δ_{i}$
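Equation 5.3 follows by substituting $\log f_{0} (z) = z - e^{z}$ and $\log S_{0} (z) = - e^{z}$ into the general log-likelihood. For a single observation:

$\begin{aligned} δ_{i} \log f_{0} (z_{i}) + (1 - δ_{i}) \log S_{0} (z_{i}) & = δ_{i} (z_{i} - e^{z_{i}}) - (1 - δ_{i}) e^{z_{i}} \\ & = δ_{i} z_{i} - e^{z_{i}} \end{aligned}$

Summing over $i$ and adding $- r \log b$ gives Equation 5.3.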
This log-likelihood function $ℓ (u, b)$ is easily maximized to give $\hat{u}, \hat{b}$ (using software)
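One way to carry out the maximization is with a general-purpose optimizer. The sketch below is illustrative only: the data are simulated, the object names are hypothetical, and the optimization is done over $(u, \log b)$ so that $b$ stays positive. The result is compared with `survreg()` from the survival package as a sanity check.

```r
library(survival)

set.seed(2)
n      <- 50
t_fail <- rweibull(n, shape = 1.4, scale = 30)  # simulated lifetimes
t_cens <- runif(n, 10, 60)                      # simulated censoring times
time   <- pmin(t_fail, t_cens)
status <- as.numeric(t_fail <= t_cens)
y      <- log(time)

# Negative extreme-value log-likelihood in the (u, log b) parameterization
negloglik <- function(par) {
  u <- par[1]
  b <- exp(par[2])
  z <- (y - u) / b
  -(-sum(status) * log(b) + sum(status * z - exp(z)))
}

fit   <- optim(c(mean(y), 0), negloglik, method = "BFGS")
u_hat <- fit$par[1]
b_hat <- exp(fit$par[2])

# Reference fit: Weibull model for time, reported on the (u, b) scale
sr <- survreg(Surv(time, status) ~ 1, dist = "weibull")
c(u_hat = u_hat, survreg_u = unname(coef(sr)),
  b_hat = b_hat, survreg_b = sr$scale)
```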

Score functions

The general expression for location-scale family can also help us find the expressions for the first (and also second) derivatives of $ℓ (u, b)$ .
General expressions $\begin{aligned} \frac{\partial ℓ (u, b)}{\partial u} & = - \frac{1}{b} \sum_{i = 1}^{n} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}] \\ \frac{\partial ℓ (u, b)}{\partial b} & = - \frac{r}{b} - \frac{1}{b} \sum_{i = 1}^{n} z_{i} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}] \end{aligned}$

For extreme-value distribution $\begin{aligned} \frac{\partial \log f_{0} (z)}{\partial z} & = \frac{\partial}{\partial z} \log {\exp (z - e^{z})} = 1 - e^{z} \\ \frac{\partial \log S_{0} (z)}{\partial z} & = \frac{\partial}{\partial z} \log {\exp (- e^{z})} = - e^{z} \end{aligned}$
These give straightforward expressions for the first and second derivatives of $ℓ (u, b)$ , which can be used to find the MLEs $\hat{u}, \hat{b}$

Hessian matrix at MLEs $\hat{u}$ and $\hat{b}$ $H (\hat{u}, \hat{b}) = - \frac{1}{{\hat{b}}^{2}} [\begin{matrix} r & \sum_{i = 1}^{n} {\hat{z}}_{i} e^{{\hat{z}}_{i}} \\ \sum_{i = 1}^{n} {\hat{z}}_{i} e^{{\hat{z}}_{i}} & r + \sum_{i = 1}^{n} {\hat{z}}_{i}^{2} e^{{\hat{z}}_{i}} \end{matrix}]$

For extreme-value distribution

$\begin{aligned} \frac{\partial^{2} ℓ (u, b)}{\partial u^{2}} & = \frac{1}{b^{2}} \sum_{i = 1}^{n} [δ_{i} \frac{\partial^{2} \log f_{0} (z_{i})}{\partial z_{i}^{2}} + (1 - δ_{i}) \frac{\partial^{2} \log S_{0} (z_{i})}{\partial z_{i}^{2}}] = - \frac{1}{b^{2}} \sum_{i = 1}^{n} e^{z_{i}} \end{aligned}$ where $\begin{aligned} \frac{\partial^{2} \log f_{0} (z)}{\partial z^{2}} & = \frac{\partial}{\partial z} (1 - e^{z}) = - e^{z} = \frac{\partial^{2} \log S_{0} (z)}{\partial z^{2}} \end{aligned}$

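$\begin{array}{r} \frac{\partial^{2} ℓ (u, b)}{\partial u^{2}} |_{u = \hat{u}, b = \hat{b}} = - \frac{1}{{\hat{b}}^{2}} \sum_{i = 1}^{n} e^{{\hat{z}}_{i}} = - \frac{r}{{\hat{b}}^{2}} \end{array}$

The last equality uses the score equation $\partial ℓ / \partial u = 0$ at the MLE, which gives $\sum_{i = 1}^{n} e^{{\hat{z}}_{i}} = r$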

From Section 5.1.4, it can be shown that the Hessian matrix will be

$\begin{aligned} \frac{\partial^{2} ℓ (u, b)}{\partial b^{2}} & = \frac{r}{b^{2}} + \frac{2}{b^{2}} \sum_{i = 1}^{n} z_{i} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}] \\ & \quad + \frac{1}{b^{2}} \sum_{i = 1}^{n} z_{i}^{2} [δ_{i} \frac{\partial^{2} \log f_{0} (z_{i})}{\partial z_{i}^{2}} + (1 - δ_{i}) \frac{\partial^{2} \log S_{0} (z_{i})}{\partial z_{i}^{2}}] \\ & = \frac{r}{b^{2}} + \frac{2}{b^{2}} \sum_{i = 1}^{n} z_{i} [δ_{i} (1 - e^{z_{i}}) - (1 - δ_{i}) e^{z_{i}}] - \frac{1}{b^{2}} \sum_{i = 1}^{n} z_{i}^{2} e^{z_{i}} \\ & = \frac{r}{b^{2}} + \frac{2}{b^{2}} \sum_{i = 1}^{n} z_{i} [δ_{i} - e^{z_{i}}] - \frac{1}{b^{2}} \sum_{i = 1}^{n} z_{i}^{2} e^{z_{i}} \end{aligned}$

$\begin{array}{r} \frac{\partial^{2} ℓ (u, b)}{\partial b^{2}} |_{u = \hat{u}, b = \hat{b}} = - \frac{r}{{\hat{b}}^{2}} - \frac{1}{{\hat{b}}^{2}} \sum_{i = 1}^{n} {\hat{z}}_{i}^{2} e^{{\hat{z}}_{i}} \end{array}$

$\begin{aligned} \frac{\partial^{2} ℓ (u, b)}{\partial u \partial b} & = \frac{1}{b^{2}} \sum_{i = 1}^{n} [δ_{i} \frac{\partial \log f_{0} (z_{i})}{\partial z_{i}} + (1 - δ_{i}) \frac{\partial \log S_{0} (z_{i})}{\partial z_{i}}] \\ & \quad + \frac{1}{b^{2}} \sum_{i = 1}^{n} z_{i} [δ_{i} \frac{\partial^{2} \log f_{0} (z_{i})}{\partial z_{i}^{2}} + (1 - δ_{i}) \frac{\partial^{2} \log S_{0} (z_{i})}{\partial z_{i}^{2}}] \\ & = \frac{1}{b^{2}} \sum_{i = 1}^{n} [δ_{i} (1 - e^{z_{i}}) - (1 - δ_{i}) e^{z_{i}} - z_{i} e^{z_{i}}] \\ & = \frac{1}{b^{2}} \sum_{i = 1}^{n} [δ_{i} - e^{z_{i}} - z_{i} e^{z_{i}}] = \frac{1}{b^{2}} [r - \sum_{i = 1}^{n} e^{z_{i}} - \sum_{i = 1}^{n} z_{i} e^{z_{i}}] \end{aligned}$

$\begin{array}{r} \frac{\partial^{2} ℓ (u, b)}{\partial u \partial b} |_{u = \hat{u}, b = \hat{b}} = - \frac{1}{{\hat{b}}^{2}} \sum_{i = 1}^{n} {\hat{z}}_{i} e^{{\hat{z}}_{i}} \end{array}$

Covariance matrix

Observed information matrix at MLEs $\hat{u}$ and $\hat{b}$ $\begin{aligned} I (\hat{u}, \hat{b}) & = - H (\hat{u}, \hat{b}) \\ & = \frac{1}{{\hat{b}}^{2}} [\begin{array}{cc} r & \sum_{i = 1}^{n} {\hat{z}}_{i} e^{{\hat{z}}_{i}} \\ \sum_{i = 1}^{n} {\hat{z}}_{i} e^{{\hat{z}}_{i}} & r + \sum_{i = 1}^{n} {\hat{z}}_{i}^{2} e^{{\hat{z}}_{i}} \end{array}] \end{aligned}$
Covariance matrix of $(\hat{u}, \hat{b})^{'}$ $\begin{aligned} \hat{V} & = [I (\hat{u}, \hat{b})]^{- 1} \end{aligned}$
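The closed-form observed information can be compared with a numerically differentiated Hessian. This is a sketch on simulated data with hypothetical object names; `optimHess()` is used only as an independent check of the formula, and small discrepancies from finite differencing are expected.

```r
set.seed(3)
n      <- 60
t_fail <- rweibull(n, shape = 2, scale = 20)   # simulated lifetimes
t_cens <- runif(n, 10, 40)                     # simulated censoring times
y      <- log(pmin(t_fail, t_cens))
delta  <- as.numeric(t_fail <= t_cens)
r      <- sum(delta)

# Negative extreme-value log-likelihood as a function of (u, b)
negloglik <- function(u, b) {
  z <- (y - u) / b
  r * log(b) - sum(delta * z - exp(z))
}

# MLE via optimization over (u, log b)
fit   <- optim(c(mean(y), 0),
               function(par) negloglik(par[1], exp(par[2])),
               method = "BFGS")
u_hat <- fit$par[1]
b_hat <- exp(fit$par[2])
z_hat <- (y - u_hat) / b_hat

# Observed information from the closed-form expression at the MLE
s1    <- sum(z_hat * exp(z_hat))
s2    <- sum(z_hat^2 * exp(z_hat))
I_hat <- matrix(c(r,  s1,
                  s1, r + s2), nrow = 2, byrow = TRUE) / b_hat^2

# Numerical Hessian of the negative log-likelihood in (u, b)
I_num <- optimHess(c(u_hat, b_hat), function(par) negloglik(par[1], par[2]))

V_hat <- solve(I_hat)      # estimated covariance matrix of (u_hat, b_hat)
list(I_hat = I_hat, I_num = I_num, V_hat = V_hat)
```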

MLEs of $α$ and $β$ (Weibull model parameters) $\hat{α} = e^{\hat{u}} and \hat{β} = 1 / \hat{b}$
Covariance matrix of $(\hat{α}, \hat{β})^{'}$ (using multivariate delta method) $var (\hat{α}, \hat{β}) = G \hat{V} G^{'}$ where $G = [\begin{matrix} \frac{\partial g_{1} (u, b)}{\partial u} & \frac{\partial g_{1} (u, b)}{\partial b} \\ \frac{\partial g_{2} (u, b)}{\partial u} & \frac{\partial g_{2} (u, b)}{\partial b} \end{matrix}] = [\begin{matrix} e^{\hat{u}} & 0 \\ 0 & - \frac{1}{{\hat{b}}^{2}} \end{matrix}]$ , evaluated at $(\hat{u}, \hat{b})$
- $α = g_{1} (u, b) = e^{u}$
- $β = g_{2} (u, b) = (1 / b)$
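The delta-method product $G \hat{V} G^{'}$ is a two-line matrix computation. The numbers below are hypothetical values chosen for illustration, not estimates from the notes.

```r
# Hypothetical MLEs and covariance matrix of (u_hat, b_hat)
u_hat <- 3.5
b_hat <- 0.75
V_hat <- matrix(c(0.075, 0.025,
                  0.025, 0.043), nrow = 2, byrow = TRUE)

# Weibull parameters: alpha = exp(u), beta = 1 / b
alpha_hat <- exp(u_hat)
beta_hat  <- 1 / b_hat

# Jacobian of (alpha, beta) with respect to (u, b), evaluated at the MLE
G <- matrix(c(exp(u_hat), 0,
              0,          -1 / b_hat^2), nrow = 2, byrow = TRUE)

V_ab     <- G %*% V_hat %*% t(G)     # covariance matrix of (alpha_hat, beta_hat)
se_alpha <- sqrt(V_ab[1, 1])
se_beta  <- sqrt(V_ab[2, 2])
c(alpha_hat = alpha_hat, se_alpha = se_alpha,
  beta_hat = beta_hat, se_beta = se_beta)
```

Because $G$ is diagonal, the standard errors reduce to $s e (\hat{α}) = \hat{α} \, s e (\hat{u})$ and $s e (\hat{β}) = s e (\hat{b}) / {\hat{b}}^{2}$.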

Wald-type statistics based $100 (1 - p) %$ CI for $u$ and $b$ $\begin{aligned} \hat{u} & \pm z_{1 - p / 2} s e (\hat{u}) \\ \hat{b} & \pm z_{1 - p / 2} s e (\hat{b}) \\ \hat{b} & \exp [\pm z_{1 - p / 2} s e (\log \hat{b})] \end{aligned}$

CI for $(u, b)$ (LRT based)

Log-likelihood function corresponding to $H_{0} : b = b_{0}$ is (from Equation 5.3) $ℓ (u, b_{0}) = - r \log b_{0} + \sum_{i = 1}^{n} [δ_{i} (\frac{y_{i} - u}{b_{0}}) - e^{(y_{i} - u) / b_{0}}]$
MLE of $u$ under $H_{0} : b = b_{0}$ $\begin{aligned} \frac{\partial ℓ (u, b_{0})}{\partial u} |_{u = \tilde{u}} = 0 \Rightarrow & - \frac{1}{b_{0}} [r - \sum_{i = 1}^{n} e^{(y_{i} - \tilde{u}) / b_{0}}] = 0 \\ \Rightarrow & \tilde{u} (b_{0}) = b_{0} \log [\frac{1}{r} \sum_{i = 1}^{n} e^{y_{i} / b_{0}}] \end{aligned}$

LRT statistic $Λ (b_{0}) = 2 ℓ (\hat{u}, \hat{b}) - 2 ℓ (\tilde{u} (b_{0}), b_{0})$
$100 (1 - p) \%$ CI for $b$ is defined by the set of $b_{0}$ values such that $Λ (b_{0}) \leq χ_{(1), 1 - p}^{2}$
Similarly, confidence interval for $u$ can be obtained using the corresponding LRT statistics (Homework)
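The LRT-based interval for $b$ can be sketched using the closed form of $\tilde{u} (b_{0})$. The example uses simulated data and hypothetical object names; the interval endpoints are located with `uniroot()`, and the search brackets are ad hoc choices for this sample.

```r
set.seed(5)
n      <- 40
t_fail <- rweibull(n, shape = 1.5, scale = 10)  # simulated lifetimes
t_cens <- runif(n, 5, 25)                       # simulated censoring times
y      <- log(pmin(t_fail, t_cens))
delta  <- as.numeric(t_fail <= t_cens)
r      <- sum(delta)

# Extreme-value log-likelihood
loglik_ev <- function(u, b) {
  z <- (y - u) / b
  -r * log(b) + sum(delta * z - exp(z))
}

# Unrestricted MLE over (u, log b)
fit   <- optim(c(mean(y), 0),
               function(par) -loglik_ev(par[1], exp(par[2])),
               method = "BFGS")
u_hat <- fit$par[1]
b_hat <- exp(fit$par[2])
l_max <- -fit$value

# Restricted MLE of u for fixed b0 (closed form) and the LRT statistic
u_tilde <- function(b0) b0 * log(sum(exp(y / b0)) / r)
lrt_b   <- function(b0) 2 * l_max - 2 * loglik_ev(u_tilde(b0), b0)

crit  <- qchisq(0.95, df = 1)
lower <- uniroot(function(b0) lrt_b(b0) - crit, c(b_hat / 5, b_hat))$root
upper <- uniroot(function(b0) lrt_b(b0) - crit, c(b_hat, 5 * b_hat))$root
c(lower = lower, b_hat = b_hat, upper = upper)
```

Taking reciprocals of the endpoints (in reverse order) gives the corresponding interval for $β = 1 / b$.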

CI for quantiles

The $p$ th quantile of $Y \sim E V (u, b)$ $\begin{aligned} S (y_{p}) & = S_{0} (\frac{y_{p} - u}{b}) = 1 - p \\ \exp [- \exp (\frac{y_{p} - u}{b})] & = 1 - p \\ \frac{y_{p} - u}{b} & = \log [- \log (1 - p)] = S_{0}^{- 1} (1 - p) = w_{p} \\ y_{p} & = u + w_{p} b \end{aligned}$

CI for quantiles (Wald)

The estimate of $p$ th quantile ${\hat{y}}_{p} = \hat{u} + w_{p} \hat{b}$
Standard error of ${\hat{y}}_{p}$ (using the multivariate delta method) $var ({\hat{y}}_{p}) = [\begin{matrix} 1 & w_{p} \end{matrix}] \hat{V} [\begin{matrix} 1 \\ w_{p} \end{matrix}] = {\hat{V}}_{11} + {\hat{V}}_{22} w_{p}^{2} + 2 {\hat{V}}_{12} w_{p}$
Large sample based $100 (1 - q) %$ confidence interval for $y_{p}$ ${\hat{y}}_{p} \pm z_{1 - q / 2} s e ({\hat{y}}_{p})$
Find the $100 (1 - q) %$ confidence interval for $t_{p}$

CI for quantiles (LRT)

To obtain LRT statistic based confidence interval for the quantile $y_{p}$ , consider the following null hypothesis $H_{0} : y_{p} = y_{p_{0}}$
The corresponding LRT statistic $Λ (y_{p_{0}}) = 2 ℓ (\hat{u}, \hat{b}) - 2 ℓ (\tilde{u}, \tilde{b})$
- The procedure for obtaining the parameter estimates $\tilde{u}$ and $\tilde{b}$ (under $H_{0}$ ) is explained in Section 5.1.9.1
LRT statistic based $(1 - q) 100 %$ confidence interval for $y_{p}$ can be obtained from the set of $y_{p_{0}}$ values such that $Λ (y_{p_{0}}) \leq χ_{(1), 1 - q}^{2}$

CI for $S (\cdot)$ (Wald)

To obtain confidence interval for survival probability $S (y_{0}) = S_{0} (\frac{y_{0} - u}{b}) = \exp [- \exp (\frac{y_{0} - u}{b})]$
We can define $ψ = S_{0}^{- 1} (S (y_{0})) = \log [- \log (S (y_{0}))] = \frac{y_{0} - u}{b}$
MLE and SE $\begin{aligned} \hat{ψ} & = \frac{y_{0} - \hat{u}}{\hat{b}} \\ var (\hat{ψ}) & = a^{'} \hat{V} a = [\begin{array}{cc} - 1 / \hat{b} & - \hat{ψ} / \hat{b} \end{array}] [\begin{array}{cc} {\hat{V}}_{11} & {\hat{V}}_{12} \\ {\hat{V}}_{21} & {\hat{V}}_{22} \end{array}] [\begin{array}{c} - 1 / \hat{b} \\ - \hat{ψ} / \hat{b} \end{array}] \end{aligned}$
- $a = (\partial ψ / \partial u, \partial ψ / \partial b)^{'} = (- 1 / b, - ψ / b)^{'}$ evaluated at $(\hat{u}, \hat{b})$

$(1 - p) 100 \%$ CI for $ψ$ $\begin{aligned} \hat{ψ} - s e (\hat{ψ}) z_{1 - p / 2} & < ψ \leq \hat{ψ} + s e (\hat{ψ}) z_{1 - p / 2} \\ L & < ψ \leq U \end{aligned}$
Confidence interval for $S (y_{0})$ (the endpoints swap because $\exp [- \exp (ψ)]$ is decreasing in $ψ$ ) $\begin{aligned} L & < \log [- \log (S (y_{0}))] \leq U \\ \exp [- \exp (U)] & \leq S (y_{0}) < \exp [- \exp (L)] \end{aligned}$
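The interval for $S (y_{0})$ can be computed by building the interval for $ψ$ first and transforming its endpoints. In this sketch the function name `wald_surv_ci` and all inputs are hypothetical values for illustration.

```r
# Wald-type CI for S(y0) under the extreme-value model, via psi = (y0 - u) / b
wald_surv_ci <- function(y0, u_hat, b_hat, V_hat, p = 0.05) {
  psi <- (y0 - u_hat) / b_hat
  a   <- c(-1 / b_hat, -psi / b_hat)        # gradient of psi w.r.t. (u, b)
  se  <- sqrt(drop(t(a) %*% V_hat %*% a))
  z   <- qnorm(1 - p / 2)
  L   <- psi - z * se
  U   <- psi + z * se
  # exp(-exp(psi)) is decreasing in psi, so U gives the lower limit
  c(estimate = exp(-exp(psi)), lower = exp(-exp(U)), upper = exp(-exp(L)))
}

# Hypothetical estimates; y0 = log(20) corresponds to t0 = 20 on the time scale
V_hat <- matrix(c(0.075, 0.025,
                  0.025, 0.043), nrow = 2, byrow = TRUE)
res <- wald_surv_ci(y0 = log(20), u_hat = 3.5, b_hat = 0.75, V_hat = V_hat)
res
```

Working on the $ψ$ scale guarantees that both limits lie in $(0, 1)$.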

CI for $S (\cdot)$ (LRT)

Consider the null hypothesis $H_{0} : S (y_{0}) = s_{0}$ , where $S (y_{0}) = \exp [- \exp (\frac{y_{0} - u}{b})]$
The $(1 - p) 100 %$ confidence interval for $S (y_{0})$ can be defined as the set of $s_{0}$ values such that $Λ (s_{0}) \leq χ_{(1), 1 - p}^{2}$ , where $Λ (s_{0}) = 2 ℓ (\hat{u}, \hat{b}) - 2 ℓ (\tilde{u}, \tilde{b})$
- The procedure for obtaining the parameter estimates $\tilde{u}$ and $\tilde{b}$ (under $H_{0}$ ) is explained in Section 5.1.9.2

Example 5.2.1:

Leukemia remission time data were given in Example 1.1.7 and used as an example for the non-parametric methods (e.g. the Kaplan-Meier method) described in Chapter 3
Two groups of patients (6MP and placebo), each with 21 patients, were followed up to observe either remission times or censoring times (in weeks)

Remission time data

glimpse(gehan65)

Rows: 42
Columns: 3
$ time   <dbl> 6, 6, 6, 6, 7, 9, 10, 10, 11, 13, 16, 17, 19, 20, 22, 23, 25, 3…
$ status <dbl> 1, 1, 1, 0, 1, 0, 1, 0, 0, 1, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, …
$ drug   <chr> "6-MP", "6-MP", "6-MP", "6-MP", "6-MP", "6-MP", "6-MP", "6-MP",…

drug	status	n
6-MP	0	12
6-MP	1	9
placebo	1	21

Each group has 21 subjects; all subjects in the placebo group failed (their remission ended), while the drug group (6MP) has 12 censored times

Two separate Weibull distributions are assumed for the failure times of two treatment groups, e.g.
- 6MP group: $T \sim Weibull (α_{1}, β_{1}), Y = \log T \sim E V (u_{1}, b_{1})$
- Placebo group: $T \sim Weibull (α_{2}, β_{2}), Y = \log T \sim E V (u_{2}, b_{2})$
Objectives: Drawing inference about the parameters

Observed data $\{(t_{i}, δ_{i}), i = 1, \dots, n\}$
Log-likelihood function $ℓ (α, β) = \sum_{i = 1}^{n} [δ_{i} \log f (t_{i}; α, β) + (1 - δ_{i}) \log S (t_{i}; α, β)]$
MLEs $(\hat{α}, \hat{β})^{'} = {\arg max}_{(α, β)^{'} \in Θ} ℓ (α, β)$

Analysis of remission time data (Extreme-value distribution)

Define $y = \log t$ and corresponding probability density and survivor function $\begin{aligned} f (y; u, b) & = \frac{1}{b} \exp [(y - u) / b - e^{(y - u) / b}] \\ S (y; u, b) & = \exp [- e^{(y - u) / b}] \end{aligned}$
Log-likelihood function $\begin{array}{r} ℓ_{e v} (u, b) = \log \prod_{i = 1}^{n} [f (y_{i}; u, b)]^{δ_{i}} [S (y_{i}; u, b)]^{1 - δ_{i}} \end{array}$
MLEs $(\hat{u}, \hat{b})^{'} = {\arg max}_{(u, b)^{'} \in Θ} ℓ_{e v} (u, b)$

`survreg` function

R function survreg() can also be used to fit distributions of log-location-scale family, its syntax is similar to the syntax of survfit()

survreg(formula, data, dist)

In formula, the response is a Surv object, e.g. to model the variables time and status, use `formula = Surv(time, status) ~ 1`
Lifetime or log-lifetime distributions can be passed to survreg by the argument dist

Available lifetime or log-lifetime distributions include “weibull”, “exponential”, “gaussian”, “logistic”, “lognormal”, “loglogistic”, “extreme”
The time argument of the Surv function is either a lifetime or a log-lifetime, depending on whether dist names a lifetime distribution (e.g. “weibull”) or a log-lifetime distribution (e.g. “extreme”)
- weibull: `formula = Surv(time, status) ~ 1`
- extreme: `formula = Surv(log(time), status) ~ 1`
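The equivalence of the two specifications can be checked on simulated data. This sketch assumes the survival package is installed; the data frame `d` and the object names are hypothetical.

```r
library(survival)

set.seed(4)
n      <- 50
t_fail <- rweibull(n, shape = 1.3, scale = 25)  # simulated lifetimes
t_cens <- runif(n, 10, 50)                      # simulated censoring times
d <- data.frame(time   = pmin(t_fail, t_cens),
                status = as.numeric(t_fail <= t_cens))

# Weibull model for the lifetime
fit_w  <- survreg(Surv(time, status) ~ 1, data = d, dist = "weibull")
# Extreme-value model for the log-lifetime
fit_ev <- survreg(Surv(log(time), status) ~ 1, data = d, dist = "extreme")

# Both report the location u as "(Intercept)" and the scale b as $scale
rbind(weibull = c(u = unname(coef(fit_w)),  b = fit_w$scale),
      extreme = c(u = unname(coef(fit_ev)), b = fit_ev$scale))
```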

Data for the treatment (6MP) group

d6mp <- gehan65 |> 
   filter(drug == "6-MP")
glimpse(d6mp)

Rows: 21
Columns: 3
$ time   <dbl> 6, 6, 6, 6, 7, 9, 10, 10, 11, 13, 16, 17, 19, 20, 22, 23, 25, 3…
$ status <dbl> 1, 1, 1, 0, 1, 0, 1, 0, 0, 1, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0
$ drug   <chr> "6-MP", "6-MP", "6-MP", "6-MP", "6-MP", "6-MP", "6-MP", "6-MP",…

Analysis of the treatment group using `survreg` function

w_sreg_6mp <- survreg(Surv(time, status) ~ 1, 
                  data = d6mp, dist = "weibull")

ev_sreg_6mp <- survreg(Surv(log(time), status) ~ 1, 
                  data = d6mp, dist = "extreme")

Extreme-value distribution

Estimates of model parameters $u$ and $\log b$

broom::tidy(ev_sreg_6mp)

# A tibble: 2 × 5
  term        estimate std.error statistic  p.value
  <chr>          <dbl>     <dbl>     <dbl>    <dbl>
1 (Intercept)    3.52      0.273     12.9  6.28e-38
2 Log(scale)    -0.303     0.278     -1.09 2.77e- 1

Variance-covariance matrix of $(\hat{u}, \log \hat{b})$

vcov(ev_sreg_6mp)

            (Intercept) Log(scale)
(Intercept)  0.07473057 0.03305811
Log(scale)   0.03305811 0.07750538

Analysis of remission time data using `survreg` function

The survreg() function returns estimates of $(u, \log b)^{'}$ and corresponding variance matrix
For making inference about the Weibull distribution, the following are required
1. estimate of $(u, b)^{'}$ and the corresponding variance matrix
2. estimate of $(α, β)^{'}$ and the corresponding variance matrix
It is important to understand the methods to obtain estimates and the corresponding variance of $(u, b)^{'}$ and $(α, β)^{'}$ from the estimates and the corresponding variance of $(u, \log b)^{'}$

Homework

Obtain the variance-covariance matrices of $(\hat{α}, \hat{β})^{'}$ and $(\hat{u}, \hat{b})^{'}$

CIs of $(α, β)$ (6MP group)

95% CI using the sampling distribution of $(\hat{α}, \hat{β})$ $\begin{aligned} \hat{α} \pm z_{.975} s e (\hat{α}) & = 33.765 \pm (1.96) (9.23) \\ & = 15.674 \text{ to } 51.856 \\ \hat{β} \pm z_{.975} s e (\hat{β}) & = 1.354 \pm (1.96) (0.377) \\ & = 0.615 \text{ to } 2.092 \end{aligned}$

95% CI using the sampling distribution of $(\hat{u}, \log \hat{b})$ : a CI for $u$ is computed first and its endpoints are exponentiated to give a CI for $α = e^{u}$ $\begin{aligned} \hat{u} \pm z_{.975} s e (\hat{u}) & = 2.984 \text{ to } 4.055 \\ \text{CI for } α & = \exp (2.984) \text{ to } \exp (4.055) \\ & = 19.76 \text{ to } 57.698 \end{aligned}$
- Similarly, for $β = 1 / b = \exp (- \log b)$ $\begin{aligned} \log \hat{b} \pm z_{.975} s e (\log \hat{b}) & = - 0.849 \text{ to } 0.243 \\ \text{CI for } β & = 1 / \exp (0.243) \text{ to } 1 / \exp (- 0.849) \\ & = 0.784 \text{ to } 2.336 \end{aligned}$

(Obtain the variance matrix of $(\hat{u}, \hat{b})$ using the sampling distribution of $(\hat{u}, \log \hat{b})^{'}$ )

95% CI using the sampling distribution of $(\hat{u}, \hat{b})$

$\begin{aligned} \hat{u} \pm z_{.975} s e (\hat{u}) & = 2.984 \text{ to } 4.055 \\ \text{CI for } α & = \exp (2.984) \text{ to } \exp (4.055) \\ & = 19.76 \text{ to } 57.698 \end{aligned}$

Similarly, a CI for $b$ is computed first and its endpoints are inverted to give a CI for $β = 1 / b$ $\begin{aligned} \hat{b} \pm z_{.975} s e (\hat{b}) & = 0.336 \text{ to } 1.142 \\ \text{CI for } β & = 1 / 1.142 \text{ to } 1 / 0.336 \\ & = 0.876 \text{ to } 2.979 \end{aligned}$
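The interval for $b$ on its original scale can be reproduced from the `survreg` output shown earlier. This is a sketch: the estimates are the rounded values printed by `broom::tidy()`, the covariance matrix is the one printed by `vcov()`, and the object names are hypothetical. Results therefore agree with the reported interval only up to rounding.

```r
# Rounded estimates and covariance matrix of (u_hat, log b_hat), 6-MP group,
# copied from the printed survreg output
u_hat    <- 3.52
logb_hat <- -0.303
V_ulogb  <- matrix(c(0.07473057, 0.03305811,
                     0.03305811, 0.07750538), nrow = 2, byrow = TRUE)

b_hat <- exp(logb_hat)

# Delta method: Jacobian of (u, log b) -> (u, b) is diag(1, b)
J    <- diag(c(1, b_hat))
V_ub <- J %*% V_ulogb %*% t(J)      # covariance matrix of (u_hat, b_hat)

se_b <- sqrt(V_ub[2, 2])            # equals b_hat * se(log b_hat)
ci_b <- b_hat + c(-1, 1) * qnorm(0.975) * se_b
round(c(b_hat = b_hat, se_b = se_b, lower = ci_b[1], upper = ci_b[2]), 3)
```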

Using the method described in Section 5.2.5, we obtain the LRT-based CIs for $u$ and $b$

Plot of LRT statistic against different null values $u_{0}$ and 95% confidence interval for $u$ and $α$

Plot of LRT statistic against different null values $b_{0}$ and 95% confidence interval for $\log b$ and $β$

95% confidence intervals for $α$ and $β$ by different methods
parameter	method	6-MP
$α$	Wald $(\hat{α})$	(15.674, 51.856)
$α$	Wald $(\hat{u})$	(19.76, 57.698)
$α$	LRT	(21.933, 76.708)
$β$	Wald $(\hat{β})$	(0.615, 2.092)
$β$	Wald $(\log \hat{b})$	(0.784, 2.336)
$β$	Wald $(\hat{b})$	(0.876, 2.979)
$β$	LRT	(0.726, 2.203)

Analyses for Placebo group

dplacebo <- gehan65 %>% 
  filter(drug == "placebo")

Model fit with the data of placebo group

w_sreg_p <- survreg(Surv(time, status) ~ 1, 
                  data = dplacebo, dist = "weibull")

Estimates of model parameters

broom::tidy(w_sreg_p)

# A tibble: 2 × 5
  term        estimate std.error statistic  p.value
  <chr>          <dbl>     <dbl>     <dbl>    <dbl>
1 (Intercept)    2.25      0.168     13.4  5.72e-41
2 Log(scale)    -0.315     0.174     -1.82 6.94e- 2

95% confidence intervals for $α$ and $β$ by different methods
parameter	method	6-MP	Placebo
$α$	Wald $(\hat{α})$	(15.674, 51.856)	(6.363, 12.601)
$α$	Wald $(\hat{u})$	(19.76, 57.698)	(6.824, 13.175)
$α$	LRT	(21.933, 76.708)	(6.659, 13.25)
$β$	Wald $(\hat{β})$	(0.615, 2.092)	(0.904, 1.837)
$β$	Wald $(\log \hat{b})$	(0.784, 2.336)	(0.975, 1.926)
$β$	Wald $(\hat{b})$	(0.876, 2.979)	(1.023, 2.077)
$β$	LRT	(0.726, 2.203)	(0.951, 1.868)

Quantiles and their CIs

Estimate of $p$ th quantile ${\hat{y}}_{p} = \hat{u} + \hat{b} w_{p}$
- $w_{p} = \log [- \log (1 - p)]$
- $\hat{u} = 3.519$ and $\hat{b} = 0.739$ (for treatment group)
Wald-type CI (see Section 5.2.6.1 for detail)
${\hat{y}}_{p} \pm s e ({\hat{y}}_{p}) z_{1 - q / 2}$
Note that ${\hat{y}}_{p}$ and its standard error depend on the estimates $\hat{u}$ and $\hat{b}$ and on their variance matrix
- survreg() returns estimate and variance matrix for $\hat{u}$ and $\log \hat{b}$

95% confidence intervals for different quantiles of treatment group (6-MP)
$p$	$w_{p}$	${\hat{y}}_{p}$	$s e ({\hat{y}}_{p})$	lower ( $t_{p}$ )	upper ( $t_{p}$ )
0.25	-1.246	2.599	0.655	3.726	48.559
0.50	-0.367	3.249	0.264	15.357	43.225
0.75	0.327	3.761	0.395	19.822	93.241

Plot of LRT statistic against different null values $y_{p_{0}}$ and 95% confidence interval for $y_{.25}$ and $t_{.25}$ (6-MP group)

Plot of LRT statistic against different null values $y_{p_{0}}$ and 95% confidence interval for $y_{.5}$ and $t_{.5}$ (6-MP group)

95% confidence intervals of different quantiles using Wald and LRT method (6-MP group)
$p$	Wald lower	Wald upper	LRT lower	LRT upper
0.25	3.726	48.559	6.586	23.058
0.50	15.357	43.225	16.289	51.342
0.75	19.822	93.241	27.522	112.730

95% confidence intervals for different quantiles using Wald and LRT method (placebo group)
$p$	Wald lower	Wald upper	LRT lower	LRT upper
0.25	1.362	10.707	2.031	5.927
0.50	4.499	11.708	5.755	9.488
0.75	8.592	16.863	8.873	16.996

Survivor function

For Weibull distribution, the expression of survivor function $S (t; α, β) = \exp (- (t / α)^{β})$
Estimated survivor function $S (t; \hat{α}, \hat{β}) = \exp (- (t / \hat{α})^{\hat{β}})$

par	6-MP	placebo
$α$	33.765	9.482
$β$	1.354	1.370
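The estimated survivor function can be evaluated directly from the tabulated estimates. In this sketch the function name `surv_weibull` is hypothetical, the parameter values are the rounded estimates from the table above, and $t = 10$ weeks is chosen as an example time point.

```r
# Weibull survivor function S(t; alpha, beta) = exp(-(t / alpha)^beta)
surv_weibull <- function(t, alpha, beta) exp(-(t / alpha)^beta)

# Rounded estimates copied from the table of alpha and beta
est <- data.frame(group = c("6-MP", "placebo"),
                  alpha = c(33.765, 9.482),
                  beta  = c(1.354, 1.370))

# Estimated probability of remaining in remission beyond 10 weeks
est$S10 <- surv_weibull(10, est$alpha, est$beta)

# Cross-check against base R's Weibull distribution function
est$S10_check <- pweibull(10, shape = est$beta, scale = est$alpha,
                          lower.tail = FALSE)
est
```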

Comparison of survival probabilities between the two treatment groups

Figure 5.1: Comparison of parametric (Weibull) and non-parametric (step-function) estimates of survivor function using remission time data

Homework

Obtain Wald and LRT statistics based confidence interval for the survival probability $S (10)$
