Chapter 5B

(AST405) Lifetime data analysis

Author

Md Rasel Biswas

5 Inference Procedures for Log-location-scale Distributions

5.1 Log-normal and normal distributions

Log-normal distribution

$T$ follows a log-normal distribution with location parameter $μ$ and scale parameter $σ$ if $Y = \log T \sim N (μ, σ^{2})$
The pdf and survivor function of log-normal distribution $\begin{aligned} f (t; μ, σ) & = \frac{1}{σ t \sqrt{2 π}} \exp [- \frac{1}{2} (\frac{\log t - μ}{σ})^{2}] \\ S (t; μ, σ) & = 1 - Φ (\frac{\log t - μ}{σ}) \end{aligned}$
- $μ$ and $σ$ are the parameters of both normal and log-normal distributions
- $Φ (\cdot) \to$ cumulative distribution function of standard normal distribution

Log-normal distribution is a member of the log-location-scale family of distributions and the corresponding location-scale distribution is normal with $\begin{aligned} S_{0} (z) & = 1 - Φ (z) \\ f_{0} (z) & = \frac{1}{\sqrt{2 π}} e^{- z^{2} / 2} = ϕ (z) \end{aligned}$
- $ϕ (\cdot) \to$ pdf of standard normal distribution
- $z = (y - μ) / σ$

Density function of log-lifetime $\begin{aligned} f (y; μ, σ) & = \frac{1}{σ} f_{0} (\frac{y - μ}{σ}) \\ = \frac{1}{σ \sqrt{2 π}} \exp [- \frac{1}{2} (\frac{y - μ}{σ})^{2}] \end{aligned}$
Survivor function of log-lifetime $\begin{array}{r} S (y; μ, σ) = S_{0} (\frac{y - μ}{σ}) = 1 - Φ (\frac{y - μ}{σ}) \end{array}$

Likelihood function normal distribution

Data ${(t_{i}, δ_{i}), i = 1, \dots, n}$
Log-likelihood function $\begin{aligned} ℓ (μ, σ) & = \log \prod_{i = 1}^{n} [(1 / σ) f_{0} (z_{i})]^{δ_{i}} [S_{0} (z_{i})]^{1 - δ_{i}} \\ = - r \log σ + \sum_{i = 1}^{n} δ_{i} \log f_{0} (z_{i}) + \sum_{i = 1}^{n} (1 - δ_{i}) \log S_{0} (z_{i}) \\ = - r \log σ - \frac{1}{2} \sum_{i = 1}^{n} δ_{i} z_{i}^{2} + \sum_{i = 1}^{n} (1 - δ_{i}) \log S_{0} (z_{i}) \end{aligned}$
- $z_{i} = (y_{i} - μ) / σ$ and $y_{i} = \log t_{i}$
- $r = \sum_{i = 1}^{n} δ_{i}$

Elements of hessian matrix and score function depend on the followings $\begin{aligned} \frac{\partial \log f_{0} (z)}{\partial z} & = - z \\ \frac{\partial^{2} \log f_{0} (z)}{\partial z^{2}} & = - 1 \\ \frac{\partial \log S_{0} (z)}{\partial z} & = - \frac{f_{0} (z)}{S_{0} (z)} \\ \frac{\partial^{2} \log S_{0} (z)}{\partial z^{2}} & = \frac{z f_{0} (z)}{S_{0} (z)} - [\frac{f_{0} (z)}{S_{0} (z)}]^{2} \end{aligned}$

MLEs $\begin{array}{r} (\hat{μ}, \hat{σ})^{'} = {\arg max}_{Θ} ℓ (μ, σ) \end{array}$
- Sampling distribution $\begin{array}{r} (\hat{μ}, \hat{σ})^{'} \sim N ((μ, σ)^{'}, V) \end{array}$ where $\hat{V} = [- H (\hat{μ}, \hat{σ})]^{- 1}$
Confidence intervals of parameters, quantiles, and survival probabilities can be obtained using the methods described for Weibull models

Estimate of survivor function (Log-normal distribution) $\begin{aligned} S (t; \hat{μ}, \hat{σ}) & = 1 - Φ (\frac{\log t - \hat{μ}}{\hat{σ}}) \\ = 1 - Φ (\hat{ψ}) \end{aligned}$ where $\hat{ψ} = Φ^{- 1} (1 - S (t; \hat{μ}, \hat{σ})) = \frac{\log t - \hat{μ}}{\hat{σ}}$
- Standard error of $\hat{ψ}$ $s e (\hat{ψ}) = \sqrt{a^{'} \hat{V} a}$ where $a = (- 1 / \hat{σ}, - \hat{ψ} / \hat{σ})^{'}$

Estimate of survivor function

$(1 - α) 100 %$ CI of $S (t)$ $\begin{aligned} L & < ψ < U \\ L & < Φ^{- 1} (1 - S (t; μ, σ)) < U \\ Φ (L) & < 1 - S (t; μ, σ) < Φ (U) \\ 1 - Φ (U) & < S (t; μ, σ) < 1 - Φ (L) \end{aligned}$ where $\begin{aligned} L & = \hat{ψ} - z_{1 - α / 2} s e (\hat{ψ}) \\ U & = \hat{ψ} + z_{1 - α / 2} s e (\hat{ψ}) \end{aligned}$

LRT statistics based method of obtaining CI for survivor function is described with $H_{0} : S (y_{0}) = S (\log t_{0}) = s_{0}$
The $100 (1 - α) %$ CI for $S (t)$ can be obtained from the values of $s_{0}$ that satisfy $Λ (s_{0}) = 2 ℓ (\hat{μ}, \hat{σ}) - 2 ℓ (\tilde{μ}, \tilde{σ}) \leq χ_{(1), 1 - α}^{2}$

Unrestricted and unrestricted MLEs are obtained as $\begin{aligned} unrestricted (\hat{μ}, \hat{σ})^{'} & = {\arg max}_{Θ} ℓ (μ, σ) \\ restricted (\tilde{μ}, \tilde{σ})^{'} & = {\arg max}_{Θ} ℓ (y_{0} - σ Φ^{- 1} (1 - s_{0}), σ) \end{aligned}$ where under $H_{0}$ , we can show $S (y_{0}) = 1 - Φ (\frac{y_{0} - μ}{σ}) = s_{0} \Rightarrow μ = y_{0} - σ Φ^{- 1} (1 - s_{0})$

Quantiles

The expression of estimate of $y_{p}$ ${\hat{y}}_{p} = \hat{μ} + \hat{σ} w_{p}$ where for normal distribution $w_{p} = S_{0}^{- 1} (1 - p) = Φ^{- 1} (p)$
Standard error of ${\hat{y}}_{p}$
$s e ({\hat{y}}_{p}) = \sqrt{a^{'} \hat{V} a}$ where $a = (1, w_{p})^{'}$

Homework

Obtain the expressions of Wald-type and LRT based $100 (1 - α) %$ confidence intervals of $y_{p}$

Example 5.3.1

Data are available on lifetimes (in thousand miles) of 96 locomotive controls, of which were failed.
The test was terminated after $135 K$ miles, so 59 lifetimes were censored at $135 K$ .

dat_ex531

# A tibble: 96 × 2
    time status
   <dbl>  <int>
 1  22.5      1
 2  37.5      1
 3  46        1
 4  48.5      1
 5  51.5      1
 6  53        1
 7  54.5      1
 8  57.5      1
 9  66.5      1
10  68        1
# ℹ 86 more rows

dat_ex531 %>% 
  count(status)

# A tibble: 2 × 2
  status     n
   <int> <int>
1      0    59
2      1    37

Log-normal and normal model fit

mod_LN <- survreg(Surv(time, status) ~ 1, 
                  dist = "lognormal",
                  data = dat_ex531)

mod_N <- survreg(Surv(log(time), status) ~ 1, 
                dist = "gaussian",
                data = dat_ex531)

MLEs $(\hat{μ}, \log \hat{σ})$ and corresponding standard errors

tidy(mod_LN)

# A tibble: 2 × 5
  term        estimate std.error statistic p.value
  <chr>          <dbl>     <dbl>     <dbl>   <dbl>
1 (Intercept)    5.19      0.129     40.3    0    
2 Log(scale)    -0.136     0.131     -1.04   0.297

Estimated variance of $(\hat{μ}, \log \hat{σ})$

mod_LN$var

            (Intercept) Log(scale)
(Intercept)  0.01657557 0.00983969
Log(scale)   0.00983969 0.01703353

Estimated variance $V (\hat{μ}, \hat{σ})$ from $V (\hat{μ}, \log \hat{σ})$

$G V (\hat{μ}, \log \hat{σ}) G^{'}$

           [,1]       [,2]
[1,] 0.01657557 0.00858735
[2,] 0.00858735 0.01297359

$G = [\begin{matrix} 1 & 0 \\ 0 & \exp (σ) \end{matrix}]$

95% Confidence intervals for $μ$ and $σ$
par	lower	upper	lower	upper
$μ$	4.942	5.447	5.000	5.400
$σ$	0.676	1.127	0.709	1.109

Estimate of $S (80)$ $\begin{aligned} S (80; \hat{μ}, \hat{σ}) & = 1 - Φ (\frac{\log 80 - \hat{μ}}{\hat{σ}}) \\ = 0.824 \end{aligned}$
- $\hat{μ} = 5.195 and \hat{σ} = 0.873$

Comparison of the estimates of survivor function

Estimate and confidence interval of $S (80)$
parameter	est	lower	upper
$S (80)$	0.824	0.667	0.924

Obtain LRT based 95% CI for $S (80)$

Quantiles

General expression of $p$ th quantile of log-lifetime ( $\hat{μ} = 5.195$ and $\hat{σ} = 0.873$ ) ${\hat{y}}_{p} = \hat{μ} + \hat{σ} w_{p}$
- $w_{p} = Φ^{- 1} (p)$

Estimate and confidence intervals of different quantiles of locomotive controls lifetime (normal distribution)
$p$	$w_{p}$	${\hat{y}}_{p}$	se	lower	upper
0.25	-0.674	4.606	0.105	NA	NA
0.50	0.000	5.195	0.129	NA	NA
0.75	0.674	5.783	0.194	NA	NA

5.2 Log-logistic and logistic distributions

Log-logistic distribution

$T$ follows a log-logistic distribution with parameters $α$ (scale) and $β$ (shape) if $Y = \log T$ follows a logistic distribution with parameters $u$ (location) and $b$ (scale)

The pdf, survivor, and hazard function of log-logistic distribution $\begin{aligned} f (t; α, β) & = \frac{(β / α) (t / α)^{β - 1}}{[1 + (t / α)^{β}]^{2}} \\ S (t; α, β) & = [1 + (t / α)^{β}]^{- 1} \\ h (t; α, β) & = \frac{(β / α) (t / α)^{β - 1}}{[1 + (t / α)^{β}]} \end{aligned}$

Logistic distribution

Log-logistic distribution is a member of the log-location-scale family of distributions and the corresponding location-scale distribution is logistic with $\begin{aligned} S_{0} (z) & = \frac{1}{1 + e^{z}} \\ f_{0} (z) & = \frac{e^{z}}{(1 + e^{z})^{2}} \end{aligned}$
- $z = (y - u) / b$

Density function of log-lifetime $\begin{aligned} f (y; u, b) & = \frac{1}{b} f_{0} (\frac{y - u}{b}) \\ = \frac{(1 / b) \exp [(y - u) / b]}{{1 + \exp [(y - u) / b]}^{2}} \end{aligned}$
Survivor function of log-lifetime $\begin{aligned} S (y; u, b) & = S_{0} (\frac{y - u}{b}) \\ = \frac{1}{1 + \exp [(y - u) / b]} \end{aligned}$

Data: ${(t_{i}, δ_{i}), i = 1, \dots, n}$
Log-likelihood function $\begin{aligned} ℓ (μ, σ) & = \log \prod_{i = 1}^{n} [(1 / b) f_{0} (z_{i})]^{δ_{i}} [S_{0} (z_{i})]^{1 - δ_{i}} \\ = - r \log b + \sum_{i = 1}^{n} δ_{i} \log f_{0} (z_{i}) + \sum_{i = 1}^{n} (1 - δ_{i}) \log S_{0} (z_{i}) \\ = - r \log b + \sum_{i = 1}^{n} [δ_{i} {z_{i} - \log (1 + e^{z_{i}})} - \log (1 + e^{z_{i}})] \end{aligned}$
- $z_{i} = (y_{i} - u) / b$ and $y_{i} = \log t_{i}$
- $r = \sum_{i = 1}^{n} δ_{i}$

Elements of hessian matrix and score function depend on the followings $\begin{aligned} \frac{\partial \log f_{0} (z)}{\partial z} & = 1 - \frac{2 e^{z}}{1 + e^{z}} \\ \frac{\partial^{2} \log f_{0} (z)}{\partial z^{2}} & = - 2 f_{0} (z) \\ \frac{\partial \log S_{0} (z)}{\partial z} & = \frac{- e^{z}}{1 + e^{z}} \\ \frac{\partial^{2} \log S_{0} (z)}{\partial z^{2}} & = \frac{- e^{z}}{(1 + e^{z})^{2}} \end{aligned}$

MLEs $\begin{array}{r} (\hat{u}, \hat{b})^{'} = {\arg max}_{Θ} ℓ (u, b) \end{array}$
Sampling distribution $\begin{array}{r} (\hat{u}, \hat{b})^{'} \sim N ((u, b)^{'}, V) \end{array}$ where $\hat{V} = [- H (\hat{u}, \hat{b})]^{- 1}$
Confidence intervals of parameters, quantiles, and survival probabilities can be obtained using the methods described for Weibull models

Estimate of survivor function (logistic distribution) $\begin{aligned} S_{0} (\frac{y - \hat{u}}{\hat{b}}) = S (y; \hat{u}, \hat{b}) & = \frac{1}{1 + \exp [(y - \hat{u}) / \hat{b}]} \\ \log [\frac{1 - S (y)}{S (y)}] & = \frac{y - \hat{u}}{\hat{b}} = \hat{ψ} = S_{0}^{- 1} (S (y)) \end{aligned}$
- Standard error of $\hat{ψ}$ $s e (\hat{ψ}) = \sqrt{a^{'} \hat{V} a}, where a = (- 1 / \hat{b}, - \hat{ψ} / \hat{b})^{'}$

$(1 - α) 100 %$ CI of $S (t)$

$\begin{aligned} L & < ψ < U \\ L & < \log \frac{1 - S (y)}{S (y)} < U \\ \exp (L) & < \frac{1 - S (y)}{S (y)} < \exp (U) \\ 1 + \exp (L) & < 1 + \frac{1 - S (y)}{S (y)} < 1 + \exp (L) \\ \frac{1}{1 + \exp (U)} & < S (y) < \frac{1}{1 + \exp (L)} \end{aligned}$ where $\begin{aligned} L & = \hat{ψ} - z_{1 - α / 2} s e (\hat{ψ}) \\ U & = \hat{ψ} + z_{1 - α / 2} s e (\hat{ψ}) \end{aligned}$

Estimate of survivor function

LRT statistics based method of obtaining CI for survivor function is described with $H_{0} : S (y_{0}) = S (\log t_{0}) = s_{0}$
The $100 (1 - α) %$ CI for $S (t)$ can be obtained from the values of $s_{0}$ that satisfy $Λ (s_{0}) \leq χ_{(1), 1 - α}^{2}$ where $\begin{array}{r} Λ (s_{0}) = 2 ℓ (\hat{u}, \hat{b}) - 2 ℓ (\tilde{u}, \tilde{b}) \end{array}$

Unrestricted and unrestricted MLEs are obtained as $\begin{aligned} unrestricted (\hat{u}, \hat{b})^{'} & = {\arg max}_{Θ} ℓ (u, b) \\ restricted (\tilde{u}, \tilde{b})^{'} & = {\arg max}_{Θ} ℓ (y_{0} - b \log {(1 - s_{0}) / s_{0}}, b) \end{aligned}$ where under $H_{0}$ , we can show $S (y_{0}) = s_{0} \Rightarrow u = y_{0} - b \log \frac{1 - s_{0}}{s_{0}}$

Quantiles

The expression of estimate of $y_{p}$ ${\hat{y}}_{p} = \hat{u} + \hat{b} w_{p}$ where for normal distribution $w_{p} = S_{0}^{- 1} (1 - p) = \log \frac{p}{1 - p}$
Standard error of ${\hat{y}}_{p}$
$s e ({\hat{y}}_{p}) = \sqrt{a^{'} \hat{V} a}$ where $a = (1, w_{p})^{'}$

Homework

Obtain the expressions of Wald-type and LRT based $100 (1 - α) %$ confidence intervals of $y_{p}$

Example 5.3.1

Data are available on lifetimes (in thousand miles) of 96 locomotive controls, of which were failed.
The test was terminated after $135 K$ miles, so 59 lifetimes were censored at $135 K$ .

dat_ex531

# A tibble: 96 × 2
    time status
   <dbl>  <int>
 1  22.5      1
 2  37.5      1
 3  46        1
 4  48.5      1
 5  51.5      1
 6  53        1
 7  54.5      1
 8  57.5      1
 9  66.5      1
10  68        1
# ℹ 86 more rows

dat_ex531 %>% 
  count(status)

# A tibble: 2 × 2
  status     n
   <int> <int>
1      0    59
2      1    37

Log-logistic and logistic model fit

mod_LL <- survreg(Surv(time, status) ~ 1, 
                  dist = "loglogistic",
                  data = dat_ex531)

mod_L <- survreg(Surv(log(time), status) ~ 1, 
                dist = "logistic",
                data = dat_ex531)

MLEs $(\hat{u}, \log \hat{b})$

[1]  5.1206418 -0.8266704

Estimated variance of $(\hat{u}, \log \hat{b})$

            (Intercept)  Log(scale)
(Intercept) 0.010490062 0.007837215
Log(scale)  0.007837215 0.022515937

MLEs of $(\hat{u}, \hat{b})$

[1] 5.1206418 0.4375036

Estimated variance of $(\hat{u}, \hat{b})$

            [,1]        [,2]
[1,] 0.010490062 0.003428809
[2,] 0.003428809 0.004309761

95% Confidence intervals for location and scale parameters
dist	par	est	lower	upper	lower	upper
Logistic	$u$	5.121	4.920	5.321	5.000	5.300
NA	$b$	0.438	0.326	0.587	0.360	0.559
Gaussian	$μ$	5.195	4.942	5.447	5.000	5.400
NA	$σ$	0.873	0.676	1.127	0.709	1.109

Estimate of $S (80)$ (log-logistic distribution) $\begin{aligned} S (80; \hat{u}, \hat{b}) & = \frac{1}{1 + \exp [(\log 80 - \hat{u}) / \hat{b}]} \\ = 0.844 \end{aligned}$
- $\hat{u} = 5.121 and \hat{b} = 0.438$

Estimate and corresponding Wald-type confidence interval of the survival probability $S (80)$
dist	est	lower	upper
Log-logistic	0.844	0.566	0.957
Log-normal	0.824	0.667	0.924

Obtain LRT based 95% CI for $S (80)$

Quantiles

General expression of $p$ th quantile of log-lifetime ( $\hat{u} = 5.121$ and $\hat{b} = 0.438$ ) ${\hat{y}}_{p} = \hat{u} + \hat{b} w_{p}$
- $w_{p} = \log \frac{p}{1 - p}$

Estimate and confidence intervals of different quantiles
dist	$p$	$w_{p}$	${\hat{y}}_{p}$	se	lower	upper
Logistic	0.25	-1.099	4.640	0.143	NA	NA
NA	0.50	0.000	5.121	0.102	NA	NA
NA	0.75	1.099	5.601	0.234	NA	NA
Gaussian	0.25	-0.674	4.826	0.101	NA	NA
NA	0.50	0.000	5.121	0.102	NA	NA
NA	0.75	0.674	5.416	0.177	NA	NA

Homework

Analyze the locomotive control lifetimes using Weibull model and compare the results

5.3 Comparison of distributions

Let $T_{j i}$ be the lifetime of $i$ th subject of the $j$ th group ( $i = 1, \dots, n_{j}$ , $j = 1, \dots, m$ )
Assume $T_{j i}$ follows a distribution of log-location-scale family with parameters $α_{j}$ (scale) and $β_{j}$ (shape)
The corresponding distribution of log-lifetime $Y_{j i} = \log T_{j i}$ is of a location-scale family distribution with parameters $u_{j}$ (location) and $b_{j}$ (scale) $u_{j} = \log α_{j} and b_{j} = (1 / β_{j})$

Survivor functions

The survivor function of $Y_{j i} = \log T_{j i}$ $\begin{array}{r} S_{j} (y) = S_{0} (\frac{y - u_{j}}{b_{j}}) \end{array}$
The survivor function of $T_{j i}$ $\begin{array}{r} S_{j} (t) = S_{0}^{⋆} [(t / α_{j})^{β_{j}}] \end{array}$
- $S_{0}^{⋆} (x) = S_{0} (\log x)$
- $u_{j} = \log α_{j}$
- $b_{j} = (1 / β_{j})$

Comparison of several normal populations is a well-known problem in statistics, where equal population variances are assumed, and the comparisons are performed on the basis of equality of population means

Quantile

General expression of the $p$ th quantile of the $j$ th population takes the form $\begin{array}{r} y_{j p} = u_{j} + b_{j} w_{p}, j = 1, \dots, m \end{array}$
- $w_{p} = S_{0}^{- 1} (1 - p)$

Equality of two populations

When the scales are not equal (i.e. $b_{1} \neq b_{2}$ ), the difference between the $p$ th quantiles does depend on the probability $p$ $\begin{array}{r} y_{1 p} - y_{2 p} = u_{1} - u_{2} + w_{p} (b_{1} - b_{2}) \end{array}$
Under the assumption of equality of the scales (i.e. $b_{1} = b_{2}$ ), difference between $p$ th (log-lifetime) quantile of a pair of populations (say 1 and 2) is constant, i.e. it does not depend on the probability $p \in (0, 1)$ $\begin{array}{r} y_{1 p} - y_{2 p} = u_{1} - u_{2} \end{array}$

The difference between two log-lifetime quantiles can be expressed in terms of the ratio of lifetime quantiles $\begin{aligned} y_{1 p} - y_{2 p} & = u_{1} - u_{2} \\ \log t_{1 p} - \log t_{2 p} & = \log α_{1} - \log α_{2} \\ t_{1 p} / t_{2 p} & = α_{1} / α_{2} \end{aligned}$
The ratio of the $p$ th quantiles of two lifetime distributions does not depend on the probability $p$ when the corresponding shape parameters are equal $(β_{1} = β_{2})$

Equality of all quantiles of two distributions, i.e. $y_{1 p} = y_{2 p} \forall p \in (0, 1),$ corresponds to equality of two distributions, i.e. $S_{1} (y) = S_{2} (y)$
Under the assumption of common scale (shape for lifetime) parameter, the null hypothesis of equality of two distributions can be expressed as $\begin{array}{r} H_{0} : u_{1} - u_{2} = 0 or H_{0} : (α_{1} / α_{2}) = 1 \end{array}$

Equality of two populations with survivor functions (say $S_{1}$ and $S_{2}$ ) can be expressed in terms of survivor functions
Since $y_{1 p} = y_{2 p} + u_{1} - u_{2} or t_{1 p} = t_{2 p} (α_{1} / α_{2}),$ the corresponding survivor functions can be expressed as $\begin{array}{r} S_{1} (y + u_{1} - u_{2}) = S_{2} (y) \\ S_{1} (t (α_{1} / α_{2})) = S_{2} (t) \end{array}$
That is, the survivor functions for $Y$ are translations of one another by an amount $(u_{1} - u_{2})$ along the $y$ -axis

Wald-type statistic

Data ${(t_{j i}, δ_{j i}), i = 1, 2}$ and $y_{j i} = \log t_{j i}$
Two populations can be compared in terms of $p$ th quantile $H_{0} : y_{1 p} = y_{2 p}$
Corresponding pivotal quantity $Z_{p} = \frac{({\hat{y}}_{1 p} - {\hat{y}}_{2 p}) - (y_{1 p} - y_{2 p})}{[var ({\hat{y}}_{1 p}) + var (y_{2 p})]^{1 / 2}} \sim N (0, 1) under H_{0}$
- The statistic $Z_{p}$ can be used to obtain confidence interval for $(y_{1 p} - y_{2 p})$

To test $H_{0} : b_{1} = b_{2}$ , the following pivotal quantity can be considered $Z_{b} = \frac{(\log {\hat{b}}_{1} - \log {\hat{b}}_{2}) - (\log b_{1} - \log b_{2})}{[var (\log {\hat{b}}_{1}) + var (\log {\hat{b}}_{2})]^{1 / 2}} \sim N (0, 1) under H_{0}$
- The statistic $Z_{b}$ can be used to obtain confidence interval for $(b_{1} / b_{2})$

When scales are equal, two populations can be compared with respect their location parameter $H_{0} : u_{1} = u_{2}$
The corresponding pivotal quantity $Z_{u} = \frac{({\hat{u}}_{1} - {\hat{u}}_{2}) - (u_{1} - u_{2})}{[var ({\hat{u}}_{1}) + var ({\hat{u}}_{2})]^{1 / 2}} \sim N (0, 1) under H_{0}$
- The statistic $Z_{u}$ can be used to obtain confidence interval for $(u_{1} - u_{2})$

Wald statistic cannot be used to test $H_{0} : u_{1} = u_{2}, b_{1} = b_{2}$

LRT based inference

Data ${(t_{j i}, δ_{j i}), j = 1, \dots, m, i = 1, \dots, n_{j}}$ and $y_{j i} = \log t_{j i}$
Different tests and confidence intervals of interest
1. $H_{0} : b_{1} = \dots = b_{m}$
2. Confidence interval for $(b_{1} / b_{2})$
3. Equality of several location parameters when scale parameters are equal $\begin{aligned} H_{0} & : u_{1} = \dots = u_{m}, b_{1} = \dots = b_{m} \\ H_{1} & : all u_{j} ’s are not equal, b_{1} = \dots = b_{m} \end{aligned}$
4. Confident interval for $(u_{1} - u_{2})$ when $b_{1} = b_{2}$
5. Confidence interval for $(y_{1 p} - y_{2 p})$ when $b_{1} \neq b_{2}$

Case 1

Hypothesis of interest $\begin{matrix} (5.1) & H_{0} : b_{1} = \dots = b_{m} = b (say) \end{matrix}$
Log-likelihood function $\begin{aligned} ℓ (u_{1}, \dots, u_{m}, b_{1}, \dots, b_{m}) & = \sum_{j = 1}^{m} ℓ_{j} (u_{j}, b_{j}) \end{aligned}$
Contribution to log-likelihood function for the $j$ th population $\begin{array}{r} ℓ_{j} (u_{j}, b_{j}) = - r_{j} \log b_{j} + \sum_{i = 1}^{n_{j}} [δ_{i} \log f_{0} (z_{j i}) + (1 - δ_{j i}) \log S_{0} (z_{j i})] \end{array}$
- $r_{j} = \sum_{i} δ_{j i}$

LRT statistic $\begin{aligned} Λ & = 2 ℓ ({\hat{u}}_{1}, \dots, {\hat{u}}_{m}, {\hat{b}}_{1}, \dots, {\hat{b}}_{m}) - 2 ℓ ({\tilde{u}}_{1}, \dots, {\tilde{u}}_{m}, \tilde{b}, \dots, \tilde{b}) \end{aligned}$
- $Λ \sim χ_{(m - 1)}^{2}$ under the null hypothesis defined in Equation 5.1
MLEs
- $({\hat{u}}_{j}, {\hat{b}}_{j})^{'} = {\arg max}_{Θ} ℓ_{j} (u_{j}, b_{j}), j = 1, \dots, m$
- $({\tilde{u}}_{1}, \dots, {\tilde{u}}_{m}, \tilde{b}, \dots, \tilde{b})^{'} = {\arg max}_{Θ} ℓ (u_{1}, \dots, u_{m}, b, \dots, b)$

Case 2

To obtain confidence interval of $(b_{1} / b_{2})$ , consider $H_{0} : (b_{1} / b_{2}) = a \Rightarrow H_{0} : b_{1} = a b_{2}$
$100 (1 - α) %$ confidence interval of $(b_{1} / b_{2})$ can be obtained from the range of $a$ values that satisfy $Λ (a) \leq χ_{(1), 1 - α}^{2},$ where the LRT statistic $Λ (a) = 2 ℓ ({\hat{u}}_{1}, {\hat{u}}_{2}, {\hat{b}}_{1}, {\hat{b}}_{2}) - 2 ℓ ({\tilde{u}}_{1}, {\tilde{u}}_{2}, a {\tilde{b}}_{2}, {\tilde{b}}_{2})$
- $({\hat{u}}_{j}, {\hat{b}}_{j})^{'} = {\arg max}_{Θ} ℓ_{j} (u_{j}, b_{j}), j = 1, 2$
- $({\tilde{u}}_{1}, {\tilde{u}}_{2}, {\tilde{b}}_{2})^{'} = {\arg max}_{Θ} ℓ (u_{1}, u_{2}, a b_{2}, b_{2})$

Case 3

Test equality of several location parameters when scale parameters are equal $\begin{aligned} H_{0} & : u_{1} = \dots = u_{m}, b_{1} = \dots = b_{m} \\ H_{1} & : all u_{j} ’s are not equal, b_{1} = \dots = b_{m} \end{aligned}$

MLEs
- under $H_{0},$ $(u^{⋆}, b^{⋆}) = {\arg max}_{Θ} ℓ (u, \dots, u, b, \dots, b)$
- under $H_{1},$ $({\tilde{u}}_{1}, \dots, {\tilde{u}}_{m}, \tilde{b}) = {\arg max}_{Θ} ℓ (u_{1}, \dots, u_{m}, b, \dots, b)$
LRT statistic $Λ = 2 ℓ ({\tilde{u}}_{1}, \dots, {\tilde{u}}_{m}, \tilde{b}, \dots, \tilde{b}) - 2 ℓ (u^{⋆}, \dots, u^{⋆}, b^{⋆}, \dots, b^{⋆})$
- Under the null hypothesis, $Λ$ follows $χ_{(m - 1)}^{2}$ distribution

Case 4

To obtain a confidence interval of $(u_{1} - u_{2})$ when $b_{1} = b_{2}$ , consider the null and alternative hypothesis $\begin{array}{r} H_{0} : u_{1} - u_{2} = δ, b_{1} = b_{2} vs H_{1} : u_{1} - u_{2} \neq δ, b_{1} = b_{2} \end{array}$
LRT statistic $Λ (δ) = 2 ℓ ({\tilde{u}}_{1}, {\tilde{u}}_{2}, \tilde{b}, \tilde{b}) - 2 ℓ (u_{2}^{⋆} + δ, u_{2}^{⋆}, b^{⋆}, b^{⋆})$
- under $H_{0},$ $(u^{⋆}, b^{⋆}) = {\arg max}_{Θ} ℓ (u, u, b, b)$
- under $H_{1},$ $({\tilde{u}}_{1}, {\tilde{u}}_{2}, \tilde{b}) = {\arg max}_{Θ} ℓ (u_{1}, u_{2}, b, b)$
$100 (1 - α)$ confidence interval for $(u_{1} - u_{2})$ can be obtained from the set of $δ$ values that satisfy $Λ (δ) \leq χ_{(1), 1 - α}^{2}$

Case 5

When $b_{1} \neq b_{2}$ , to obtain confidence interval for $(y_{1 p} - y_{2 p})$ consider the following hypothesis $H_{0} : y_{1 p} - y_{2 p} = Δ \Rightarrow H_{0} : u_{1} - u_{2} = Δ + (b_{2} - b_{1}) w_{p}$
- $w_{p} = S_{0}^{- 1} (1 - p)$
LRT statistic $Λ (Δ) = 2 ℓ ({\hat{u}}_{1}, {\hat{u}}_{2}, {\hat{b}}_{1}, {\hat{b}}_{2}) - 2 ℓ ({\tilde{u}}_{1}, {\tilde{u}}_{2}, {\tilde{b}}_{1}, {\tilde{b}}_{2})$

under $H_{0}$ $({\tilde{u}}_{1}, {\tilde{u}}_{2}, {\tilde{b}}_{1}, {\tilde{b}}_{2}) = {\arg max}_{Θ} ℓ (u_{2} + Δ + (b_{2} - b_{1}) w_{p}, u_{2}, b_{1}, b_{2})$
under $H_{1}$ $({\hat{u}}_{1}, {\hat{u}}_{2}, {\hat{b}}_{2}, {\hat{b}}_{1}) = {\arg max}_{Θ} ℓ (u_{1}, u_{2}, b_{1}, b_{2})$
$100 (1 - α)$ confidence interval for $(y_{1 p} - y_{2 p})$ can be obtained from the set of $Δ$ values that satisfy $Λ (Δ) \leq χ_{(1), 1 - α}^{2}$

Comparison of Weibull or extreme value distributions

Assume $T_{j i} \sim Weibull (α_{j}, β_{j})$ $(j = 1, \dots, m, i = 1, \dots, n_{j})$
- Data ${(t_{j i}, δ_{j i}), j = 1, \dots, m, i = 1, \dots, n_{j}}$
Survivor function of Weibull distribution $S_{j} (t) = \exp [- (t / α_{j})^{β_{j}}]$
Survivor function of extreme value distribution $S_{j} (y) = \exp [- e^{(y - u_{j}) / b_{j}}]$
- $u_{j} = \log α_{j}$
- $b_{j} = 1 / β_{j}$

Example 5.4.1

Data of the following table are on the time to breakdown of electrical insulating fluid subject to a constant voltage stress in a lifetest experiment

Estimate of voltage-specific extreme value models
voltage	${\hat{u}}_{j} \pm s e ({\hat{u}}_{j})$	${\hat{b}}_{j} \pm s e ({\hat{b}}_{j})$
26	$6.862 \pm 1.104$	$1.834 \pm 0.885$
28	$5.865 \pm 0.486$	$1.022 \pm 0.474$
30	$4.351 \pm 0.302$	$0.944 \pm 0.303$
32	$3.256 \pm 0.486$	$1.781 \pm 0.254$
34	$2.503 \pm 0.315$	$1.297 \pm 0.211$
36	$1.457 \pm 0.309$	$1.125 \pm 0.221$
38	$0.001 \pm 0.273$	$0.734 \pm 0.367$

Comparison of estimated survivor function

LRT (Case 1)

Null hypothesis $H_{0} : b_{1} = \dots = b_{7}$
LRT statistic $\begin{aligned} Λ & = 2 ℓ ({\hat{u}}_{1}, \dots, {\hat{u}}_{7}, {\hat{b}}_{1}, \dots, {\hat{b}}_{7}) - 2 ℓ ({\tilde{u}}_{1}, \dots, {\tilde{u}}_{7}, \tilde{b}, \dots, \tilde{b}) \\ = 2 (- 132.181) - 2 (- 136.578) \\ = 8.794 \end{aligned}$
- p-value $P r (χ_{(6)}^{2} \geq Λ) = 0.185$ It does not provide enough evidence to reject the null hypothesis of equality of the scale parameters.

Confidence interval of $(b_{1} / b_{2})$ (Case 2)

Wald-type $\begin{aligned} (\log {\hat{b}}_{1} - \log {\hat{b}}_{2}) & \pm z_{1 - α / 2} s e (\log {\hat{b}}_{1} - \log {\hat{b}}_{2}) \\ ({\hat{b}}_{1} / {\hat{b}}_{2}) & e^{\pm z_{1 - α / 2} s e (\log {\hat{b}}_{1} - \log {\hat{b}}_{2})} \\ (1.834 / 1.022) & e^{\pm (1.96) (0.624)} \\ 0.529 & < (b_{1} / b_{2}) < 6.095 \end{aligned}$
- Similarly confidence intervals for $(b_{j} / b_{j^{'}})$ $j > j^{'}$ can be obtained

Estimate and 95% confidence interval of pair-wise comparisons of scale parameters $(b_{j} / b_{j^{'}})$

Case 3

Equality of all location parameters when scales are equal $H_{0} : u_{1} = \dots = u_{m}, b_{1} = \dots = b_{m}$
LRT statistic $\begin{aligned} Λ & = 2 ℓ ({\tilde{u}}_{1}, \dots, {\tilde{u}}_{m}, \tilde{b}, \dots, \tilde{b}) - 2 ℓ (u^{⋆}, \dots, u^{⋆}, b^{⋆}, \dots, b^{⋆}) \\ = 2 (- 136.578) - 2 (- 176.584) \\ = 80.013 \end{aligned}$
- p-value $P (χ_{(1)}^{2} > 80.013) < .001 \to$ There is a strong evidence against the assumption of equality of $m$ location parameters

Case 4

Wald-type confidence interval of $(u_{1} - u_{2})$ $\begin{aligned} ({\hat{u}}_{1} - {\hat{u}}_{2}) & \pm z_{1 - α / 2} s e ({\hat{u}}_{1} - {\hat{u}}_{2}) \\ (6.862 - 4.351) & \pm (1.96) (1.206) \\ - 1.367 & < (u_{1} - u_{2}) < 3.361 \end{aligned}$
- There is no significant difference between $u_{1}$ and $u_{2}$

Estimate and 95% confidence interval of pair-wise comparisons of location parameters $(u_{j} - u_{j^{'}})$

Case 5

General expression of $p$ th quantile of the group $j$ $y_{j p} = u_{j} + b_{j} w_{p}, j = 1, \dots, m$
Difference of $p$ th quantile between groups 1 and 2 $y_{1 p} - y_{2 p} = u_{1} - u_{2} + (b_{1} - b_{2}) w_{p}$
- 95% confidence interval for the difference of median between groups 1 and 2 $\begin{aligned} {\hat{y}}_{1 m} - {\hat{y}}_{2 m} & \pm z_{1 - α / 2} s e ({\hat{y}}_{1 m} - {\hat{y}}_{2 m}) \\ (6.19 - 5.491) & \pm (1.96) (1.291) \\ - 1.831 & < (y_{1 m} - y_{2 m}) < 3.231 \end{aligned}$

Estimate and 95% confidence interval of pair-wise comparisons of medians $(y_{j, .5} - y_{j^{'}, .5})$

Homework

Analyse the breakdown time data using log-logistic and log-normal distributions and compare the results with that of Weibull distribution

5 Inference Procedures for Log-location-scale Distributions

5.1 Log-normal and normal distributions

Log-normal distribution

Likelihood function normal distribution

Estimate of survivor function

Quantiles

Homework

Example 5.3.1

Estimated variance V(μ^,σ^) from V(μ^,log⁡σ^)

Quantiles

5.2 Log-logistic and logistic distributions

Log-logistic distribution

Logistic distribution

Estimate of survivor function

Quantiles

Homework

Example 5.3.1

Quantiles

Homework

5.3 Comparison of distributions

Survivor functions

Quantile

Equality of two populations

Wald-type statistic

LRT based inference

Case 1

Case 2

Case 3

Case 4

Case 5

Comparison of Weibull or extreme value distributions

Example 5.4.1

LRT (Case 1)

Confidence interval of (b1/b2) (Case 2)

Case 3

Case 4

Case 5

Homework

Estimated variance $V (\hat{μ}, \hat{σ})$ from $V (\hat{μ}, \log \hat{σ})$

Confidence interval of $(b_{1} / b_{2})$ (Case 2)