|
Actuary /
CExamParaModelsActuary.CExamParaModels HistoryShow minor edits - Show changes to markup Changed line 21 from:
If the data is censored at {$c$}, and {$n$} observations are censored, then the factor to use for those observations is {$ [F(c)]^n $}.
to:
If the data is censored at {$c$}, and {$n$} observations are censored, then the factor to use for those observations is {$ [1-F(c)]^n $}.
Changed line 101 from:
to:
Changed line 104 from:
to:
Changed lines 106-107 from:
to:
Changed lines 106-107 from:
to:
Changed line 92 from:
This test statistic formula seems to be too complicated to be on the exam. to:
Changed line 103 from:
to:
Changed line 105 from:
to:
Changed lines 66-76 from:
If we calculate the test statistic and it's bigger than the critical value, we can reject the null hypothesis.
If we calculate the test statistic and it's smaller than the critical value, we do not reject the null hypothesis and view it as plausible.
We can compute a p-value such that
If we conduct the test at a significance level {$geq p$}, then it'll lead to a rejection of the null hypothesis.
If we conduct the test at a significance level {$< p$}, then it'll lead to a failure to reject the null hypothesis.
to:
Sometimes we're given the rejection region, and we can calculate {$\alpha$}. More often, it seems, we're given {$\alpha$}, and the methods to calculate the test statistic and rejection region, and we determine whether to reject {$H_0$} based on whether the test statistic is in the rejection region.
Given the same analysis on the same set of observations, if {$\alpha_1 > \alpha_2$}, then {$\alpha_1$} would result in a larger rejection region than {$\alpha_2$}, so we can have a situation where {$H_0$} is not rejected at the level of {$\alpha_2$}, but is rejected at the level of {$\alpha_1$}.
We can also compute a p-value (attained significance level) -- the smallest level of significance {$\alpha$} for which {$H_0$} is rejected based on the test statistic. The way to calculate the p-value should be stated as part of the test method.
Changed line 103 from:
to:
Changed lines 105-106 from:
to:
Changed line 21 from:
If the data is censored at {$c$}, and {$n$} observations are censored, then the factor to use for those observations is {F(c)^n$}.
to:
If the data is censored at {$c$}, and {$n$} observations are censored, then the factor to use for those observations is {$ [F(c)]^n $}.
Changed line 94 from:
to:
Changed line 5 from:
Given a distribution {$X$} with parameter {$\theta$}, construct a likelihood function {$L(\theta)$} and find the value of {$\theta$} that maximizes {$L$}. Or maximizes the loglikelihood function {$l(\theta) = \log L(\theta)$}. More specifically, given a series of events {$A_1, \ldots, A_n$}, where each {$A_n$} is an observed value (or interval), then
to:
Given a distribution {$X$} with parameter {$\theta$}, construct a likelihood function {$L(\theta)$} and find the value of {$\theta$} that maximizes {$L$}. Or maximizes the loglikelihood function, {$l(\theta) = \log{L(\theta)}$}. More specifically, given a series of events {$A_1, \ldots, A_n$}, where each {$A_n$} is an observed value (or interval), then
Changed line 111 from:
to:
Changed lines 117-118 from:
to:
Changed lines 105-106 from:
to:
Changed lines 111-113 from:
to:
{$$T = 2 \ln (L_1/L_0).$$}
Added lines 108-113:
Added lines 77-78:
If {$p$} is above 10%, then the data has no evidence to support the alternative hypothesis.
If {$p$} is below 1%, then the data gives strong support for the alternative hypothesis.
Changed line 82 from:
{$$D = \max_{t\leq x\leq u} \abs{F_n(x) - F^*(x)}.$$} to:
{$$D = \max_{t\leq x\leq u} |F_n(x) - F^*(x)|.$$} Changed lines 101-102 from:
to:
Changed lines 92-94 from:
{$$\hat{p_j} = F*(c_j)-F^*(c_{j-1}); \text{i.e., the probability that a (truncated) observations falls in } (c_{j-1}, c_j);$$} {$$\{p_{n,j}} = F_n(c_j)-F_n(c_{j-1}); \text{i.e., the same probability with empirical distribution}.$$} to:
{$$\hat{p_j} = F^*(c_j)-F^*(c_{j-1}); \text{i.e., the probability that a (truncated) observations falls in } (c_{j-1}, c_j);$$} {$$p_{n,j} = F_n(c_j)-F_n(c_{j-1}); \text{i.e., the same probability with empirical distribution}.$$} Changed lines 99-100 from:
to:
Changed lines 74-76 from:
We can compute a ran
to:
We can compute a p-value such that
If we conduct the test at a significance level {$geq p$}, then it'll lead to a rejection of the null hypothesis.
If we conduct the test at a significance level {$< p$}, then it'll lead to a failure to reject the null hypothesis.
Added line 74:
We can compute a ran
Changed line 94 from:
{$$\Chi^2 \sum_{j=1}^k \frac{n [\hat{p}_j -p_{n,j}]^2}{\hat{p}_j }.$$} to:
{$$\chi^2 =\sum_{j=1}^k \frac{n [\hat{p}_j -p_{n,j}]^2}{\hat{p}_j }.$$} Changed lines 96-98 from:
{$$\chi^2 \sum_{j=1}^k \frac{[E_j-O_j]^2}{E_j },$$} to:
{$$\chi^2 =\sum_{j=1}^k \frac{[E_j-O_j]^2}{E_j },$$} Changed lines 90-91 from:
{$$\hat{p_j} = F*(c_j)-F^*(c_{j-1}); \text{i.e., the probability that a (truncated) observations falls in } (c_{j-1}, c_j;$$} {$$\{p_{n,j}} = F_n(c_j)-F_n(c_{j-1}); \text{i.e., the same probability with empirical distribution.$$} to:
{$$\hat{p_j} = F*(c_j)-F^*(c_{j-1}); \text{i.e., the probability that a (truncated) observations falls in } (c_{j-1}, c_j);$$} {$$\{p_{n,j}} = F_n(c_j)-F_n(c_{j-1}); \text{i.e., the same probability with empirical distribution}.$$} Changed lines 95-97 from:
{$$\xi^2 \sum_{j=1}^k \frac{[E_j-O_j]^2}{E_j },$$} to:
{$$\chi^2 \sum_{j=1}^k \frac{[E_j-O_j]^2}{E_j },$$} Changed lines 95-97 from:
{$$\Xi^2 \sum_{j=1}^k \frac{[E_j-O_j]^2}{E_j },$$} to:
{$$\xi^2 \sum_{j=1}^k \frac{[E_j-O_j]^2}{E_j },$$} Changed lines 95-97 from:
{$$\Chi^2 \sum_{j=1}^k \frac{[E_j-O_j]^2}{E_j },$$} to:
{$$\Xi^2 \sum_{j=1}^k \frac{[E_j-O_j]^2}{E_j },$$} Changed lines 91-97 from:
{$$\hat{p_j} = F*(c_j)-F^*(c_{j-1}); \text{i.e., the probability that a (truncated) observations falls in } (c_{j-1}, c_j;$$} to:
{$$\{p_{n,j}} = F_n(c_j)-F_n(c_{j-1}); \text{i.e., the same probability with empirical distribution.$$}
{$$\Chi^2 \sum_{j=1}^k \frac{n [\hat{p}_j -p_{n,j}]^2}{\hat{p}_j }.$$}
{$$\Chi^2 \sum_{j=1}^k \frac{[E_j-O_j]^2}{E_j },$$} Changed line 78 from:
The critical values are to:
Changed line 80 from:
(These values appear to be always given in the problem.) to:
Changed lines 84-87 from:
{$$$$} The critical values are {$$ 1.22/\sqrt{n} \text{ for }\alpha = 0.1, \qquad 1.36/\sqrt{n} \text{ for }\alpha = 0.05, \qquad 1.63/\sqrt{n} \text{ for }\alpha = 0.01.$$} to:
{$$ 1.933/\sqrt{n} \text{ for }\alpha = 0.1, \qquad 2.492/\sqrt{n} \text{ for }\alpha = 0.05, \qquad 3.857/\sqrt{n} \text{ for }\alpha = 0.01.$$} This test statistic formula seems to be too complicated to be on the exam. Added lines 89-91:
{$$\hat{p_j} = F*(c_j)-F^*(c_{j-1}); \text{i.e., the probability that a (truncated) observations falls in } (c_{j-1}, c_j;$$} {$$\hat{p_j} = F*(c_j)-F^*(c_{j-1}); \text{i.e., the probability that a (truncated) observations falls in } (c_{j-1}, c_j;$$} Changed line 64 from:
Point(?): Compare {$F^*$} with {$F_n$} (the empirical esitmate). There are several ways to do this:
to:
Point: Compare {$F^*$} with {$F_n$} (the empirical esitmate). There are several ways to do this:
Changed lines 79-80 from:
{$$ 1.22/\sqrt{n} \text{ for }\alpha = 0.1, 1.36/\sqrt{n} \text{ for }\alpha = 0..05, 1.63/\sqrt{n} \text{ for }\alpha = 0.01.$$} to:
{$$ 1.22/\sqrt{n} \text{ for }\alpha = 0.1, \qquad 1.36/\sqrt{n} \text{ for }\alpha = 0.05, \qquad 1.63/\sqrt{n} \text{ for }\alpha = 0.01.$$} (These values appear to be always given in the problem.) Added lines 82-87:
{$$A^2 =n \int_t^u \frac{[F_n(x)-F^*(x)]^2}{F^*(x)(1-F^*(x))} f^*(x)\,dx.$$}
{$$$$} The critical values are {$$ 1.22/\sqrt{n} \text{ for }\alpha = 0.1, \qquad 1.36/\sqrt{n} \text{ for }\alpha = 0.05, \qquad 1.63/\sqrt{n} \text{ for }\alpha = 0.01.$$} Added lines 72-73:
If we calculate the test statistic and it's bigger than the critical value, we can reject the null hypothesis.
If we calculate the test statistic and it's smaller than the critical value, we do not reject the null hypothesis and view it as plausible.
Added lines 76-79:
{$$D = \max_{t\leq x\leq u} \abs{F_n(x) - F^*(x)}.$$} The critical values are {$$ 1.22/\sqrt{n} \text{ for }\alpha = 0.1, 1.36/\sqrt{n} \text{ for }\alpha = 0..05, 1.63/\sqrt{n} \text{ for }\alpha = 0.01.$$} Changed lines 64-71 from:
Point(?): Compare {$F^*$} with {$F_n$} (the empirical esitmate). to:
Point(?): Compare {$F^*$} with {$F_n$} (the empirical esitmate). There are several ways to do this:
Graphically: graph a few things (either {$F^*$} with {$F_n$} directly, or some other values) and visually tell how close the functions are.
Hypothesis tests: this involves constructing of the following hypotheses:
H_0: The data came from a population with the stated model. (null hypothesis)
H_1: The data did not come from such a population. (alternative hypothesis)
along with the test statistic (a function of the observations) and a rejection region, the boundary of which is the critical values to tell you if the null hypothesis is accepted or rejected.
Type I error occurs when we reject a null hypothesis when it is true. Define {$\alpha$} to be the significance level, the probability of rejecting a null hypothesis when it is true. This is usually set in advance.
Type II error occurs when we do not reject a null hypothesis when the alternative is true.
Added line 64:
Point(?): Compare {$F^*$} with {$F_n$} (the empirical esitmate). Added line 63:
{$$f^*(x)= \frac{f(x)}{1-F(t)}, \text{ for } x\geq t, \text{ and } 0 \text{ for } x<t;$$} Changed lines 60-62 from:
5. Determine the acceptability of a fitted model and/or compare models using: to:
5. Determine the acceptability of a fitted model and/or compare models If the data has been modeled with a parametric model, with a resulting distribution function {$F$} and density function {$f$}, and the data is truncated at {$t$}, then the modified functions are:
{$$F^*(x)= \frac{F(x)-F(t)}{1-F(t)}, \text{ for } x\geq t, \text{ and } 0 \text{ for } x<t;$$} Changed lines 46-52 from:
to:
There is also a method called delta method that is used to approximate expected values and variances of a function of estimators. Here is the one-dimensional statement of the theorem:
Suppose {$\hat{\theta}$} is an estimator of {$\theta$} that has an asymptotic normal distribution with mean {$\theta$} and variance {$\sigma^2/n$}. Then {$g(\hat{\theta})$} has an asymptotic normal distribution with mean {$g(\theta)$} and asymptotic variance {$g'(\theta)^2 \sigma^2/n$}.
Steps for the calculation:
a)Mind the mle {$\hat{\theta}$} of the parameter {$\theta$}.
a)Figure out the quantity we're estimating {$g(\theta)$} This could be, say, a probability.
b)Then the new mle is {$g(\hat{\theta})$}.
c)Find the mean and variance of the first estimator {$\hat{\theta}$}; then find the mean and variance of the 2nd estimator according to the theorem.
Changed lines 45-46 from:
{$$ \frac{\partial}{\partial \theta_r}\frac{\partial}{\partial \theta_s} l(\vec\theta) \approx \frac{ (l(\vec{theta}+\frac{1}{2}h_r\vec{e_r} + \frac{1}{2}h_s \vec{e_s} -l(\vec{theta}+\frac{1}{2}h_r\vec{e_r} - \frac{1}{2}h_s \vec{e_s} -l(\vec{theta}-\frac{1}{2}h_r\vec{e_r} + \frac{1}{2}h_s \vec{e_s} +l(\vec{theta}-\frac{1}{2}h_r\vec{e_r}- \frac{1}{2}h_s \vec{e_s} }{h_rh_s}$$} to:
{$$ \frac{\partial}{\partial \theta_r}\frac{\partial}{\partial \theta_s} l(\vec\theta) \approx \frac{ (l(\vec{\theta}+\frac{1}{2}h_r\vec{e_r} + \frac{1}{2}h_s \vec{e_s}) -l(\vec{\theta}+\frac{1}{2}h_r\vec{e_r} - \frac{1}{2}h_s \vec{e_s}) -l(\vec{\theta}-\frac{1}{2}h_r\vec{e_r} + \frac{1}{2}h_s \vec{e_s}) +l(\vec{\theta}-\frac{1}{2}h_r\vec{e_r}- \frac{1}{2}h_s \vec{e_s}) }{h_rh_s}$$} Changed lines 44-46 from:
{$$ \frac{\partial}{\partial \theta_r}\frac{\partial}{\partial \theta_s} l(\vec\theta) \approx \frac{(l(theta}{h_rh_s}$$} to:
{$$ \frac{\partial}{\partial \theta_r}\frac{\partial}{\partial \theta_s} l(\vec\theta) \approx \frac{ (l(\vec{theta}+\frac{1}{2}h_r\vec{e_r} + \frac{1}{2}h_s \vec{e_s} -l(\vec{theta}+\frac{1}{2}h_r\vec{e_r} - \frac{1}{2}h_s \vec{e_s} -l(\vec{theta}-\frac{1}{2}h_r\vec{e_r} + \frac{1}{2}h_s \vec{e_s} +l(\vec{theta}-\frac{1}{2}h_r\vec{e_r}- \frac{1}{2}h_s \vec{e_s} }{h_rh_s}$$} Changed lines 45-46 from:
{$$$$} to:
{$$ \frac{\partial}{\partial \theta_r}\frac{\partial}{\partial \theta_s} l(\vec\theta) \approx \frac{(l(theta}{h_rh_s}$$} Changed line 32 from:
{$$I(\theta)_{rs} = E\left[ \frac{\partial}{\partial \theta_r} l(\theta)\frac{\partial}{\partial \theta_s} l(\theta)\right].$$} to:
{$$I(\theta)_{rs} = -E\left[ \frac{\partial}{\partial \theta_r}\frac{\partial}{\partial \theta_s} l(\theta)\right] = E\left[ \frac{\partial}{\partial \theta_r} l(\theta)\frac{\partial}{\partial \theta_s} l(\theta)\right].$$} Changed lines 37-38 from:
to:
Variations:
a) Instead of taking expected values in c) above, plug in observed values.
b) Instead of taking derivatives in b) above, use the delta approximation
{$$$$} Changed lines 35-36 from:
to:
Changed lines 30-33 from:
{$I$} is called (Fisher's) information. The multivariable version of this is: {$$I$$} is an information matrix, where the {$(r,s)$}th element is to:
{$I$} is called (Fisher's) information.
The multivariable version of this is: {$$I$$} is an information matrix, where the {$(r,s)$}th element is
Changed lines 33-34 from:
So that the variances of the individual RVs (estimators) are on the diagonal, and the off diagonal entries are the covariances. to:
So that the variances of the individual RVs (estimators) are on the diagonal, and the off diagonal entries are the covariances.
Steps for the calculation:
Find a formula for the likelihood function {$L$} and the loglikelyhood function {$l$}, in terms of the parameters, the number of sample points {$n$}, and each observed value {$x_1, \ldots, x_n$}.
Take the 2nd partial derivatives with respect to each parameter.
Changed lines 28-32 from:
We can estimate the variance of the mle (maximum likelihood estimator) by the following (due to a long and complicated theorem): {$Var(\hat{\theta}) \approx \frac{1}{I(\theta)}$}, where
{$$I(\theta) = -E\left[ \frac{\partial^2}{\partial^2 \theta} l(\theta)\right] = E\left[ \left(\frac{\partial}{\partial \theta} l(\theta)\right)^2\right]$$} to:
We can estimate the variance of the mle (maximum likelihood estimator) by the following (due to a long and complicated theorem): If the underlying density function {$f$} is smooth enough, then {$Var(\hat{\theta}) \approx \frac{1}{I(\theta)}$}, where
{$$I(\theta) = -E\left[ \frac{\partial^2}{\partial \theta^2} l(\theta)\right] = E\left[ \left(\frac{\partial}{\partial \theta} l(\theta)\right)^2\right]$$} Changed lines 33-34 from:
to:
The multivariable version of this is: {$$I$$} is an information matrix, where the {$(r,s)$}th element is {$$I(\theta)_{rs} = E\left[ \frac{\partial}{\partial \theta_r} l(\theta)\frac{\partial}{\partial \theta_s} l(\theta)\right].$$} So that the variances of the individual RVs (estimators) are on the diagonal, and the off diagonal entries are the covariances. Changed lines 28-29 from:
to:
We can estimate the variance of the mle (maximum likelihood estimator) by the following (due to a long and complicated theorem): {$Var(\hat{\theta}) \approx \frac{1}{I(\theta)}$}, where
{$$I(\theta) = -E\left[ \frac{\partial^2}{\partial^2 \theta} l(\theta)\right] = E\left[ \left(\frac{\partial}{\partial \theta} l(\theta)\right)^2\right]$$} {$I$} is called (Fisher's) information. Added lines 28-29:
Changed lines 24-26 from:
{$f(x_j)/(1-F(t))$}, i.e., probability of {$x_j$} given that {$x_j \geq t$}. to:
Changed lines 11-15 from:
If the data is censored at {$c$}, and {$n$} observations are censored, then the factor to use for those observations is {F(c)^n$}.
If the data is truncated at {$t$}, we can either
Use the shift approach
to:
Added line 14:
Added line 17:
Changed lines 21-26 from:
to:
If the data is censored at {$c$}, and {$n$} observations are censored, then the factor to use for those observations is {F(c)^n$}.
If the data is truncated at {$t$}, we can either
Use the shift approach: Ignore all observations under {$t$}, and shift all observations above {$t$} by subtracting {$t$} off of every one of them.
Use the unshifted approach: Ignore all observations under {$t$}, and keep original observations {$x_j$} above {$t$}, but instead of {$f(x_j)$}, use
{$f(x_j)/(1-F(t))$}, i.e., probability of {$x_j$} given that {$x_j \geq t$}. Changed lines 11-12 from:
to:
If the data is censored at {$c$}, and {$n$} observations are censored, then the factor to use for those observations is {F(c)^n$}.
If the data is truncated at {$t$}, we can either
Use the shift approach
Changed line 16 from:
If there are {$p$} parameters, and {$g_1, \ldots, g_p$} are arbitrary values between 0 and 100, form and solve a system of {$p$} equations by equation the {$g_1, \ldots, g_p$}th percentile of the data and of the distribution; more accurately, use smoothed empirical estimate of a {$g$}th percentile: Look at the {$n+1$} intervals formed by {$(-\infty, x_1), (x_1, x_2), \ldots, (x_n, \infty)$}. Determine the interval {$(x_{j}, x_{j+1})$} where the {$g$}th percentile lands in by {$j = \text{greatest integer }<= g% \cdot (n+1) $}. Let {$h$} be the leftover decimal part. Then the {$g$}th percentile is a weighted average between {$x_j$} and {$x_{j+1}$} by {$h$}.
to:
If there are {$p$} parameters, and {$g_1, \ldots, g_p$} are arbitrary values between 0 and 100, form and solve a system of {$p$} equations by equation the {$g_1, \ldots, g_p$}th percentile of the data and of the distribution; more accurately, use smoothed empirical estimate of a {$g$}th percentile: Look at the {$n+1$} intervals formed by {$(-\infty, x_1), (x_1, x_2), \ldots, (x_n, \infty)$}. Determine the interval {$(x_{j}, x_{j+1})$} where the {$g$}th percentile lands in by {$j = \text{greatest integer }<= g/100 \cdot (n+1) $}. Let {$h$} be the leftover decimal part. Then the {$g$}th percentile is a weighted average between {$x_j$} and {$x_{j+1}$} by {$h$}.
Changed line 12 from:
I to:
Changed line 16 from:
If there are {$p$} parameters, and {$g_1, \ldots, g_p$} are arbitrary values between 0 and 100, form and solve a system of {$p$} equations by equation the {$g_1, \ldots, g_p$}th percentile of the data and of the distribution; more accurately, use smoothed empirical estimate of a {$g$}th percentile: Look at the {$n+1$} intervals formed by {$(-\infty, x_1), (x_1, x_2), \ldots, (x_n, \infty)$}. Determine the interval {$(x_{j}, x_{j+1)$} where the {$g$}th percentile lands in by {$j = greatest integer <= g% \cdot (n+1) $}. Let {$h$} be the leftover decimal part. Then the {$g$}th percentile is a weighted average between {$x_j$} and {$x_{j+1}$} by {$h$}.
to:
If there are {$p$} parameters, and {$g_1, \ldots, g_p$} are arbitrary values between 0 and 100, form and solve a system of {$p$} equations by equation the {$g_1, \ldots, g_p$}th percentile of the data and of the distribution; more accurately, use smoothed empirical estimate of a {$g$}th percentile: Look at the {$n+1$} intervals formed by {$(-\infty, x_1), (x_1, x_2), \ldots, (x_n, \infty)$}. Determine the interval {$(x_{j}, x_{j+1})$} where the {$g$}th percentile lands in by {$j = \text{greatest integer }<= g% \cdot (n+1) $}. Let {$h$} be the leftover decimal part. Then the {$g$}th percentile is a weighted average between {$x_j$} and {$x_{j+1}$} by {$h$}.
Changed line 16 from:
If there are {$p$} parameters, and {$g_1, \ldots, g_p$} are arbitrary values between 0 and 100, form and solve a system of {$p$} equations by equation the {$g_1, \ldots, g_p$}th percentile of the data and of the distribution; more accurately, use smoothed empirical estimate of a {$g$}th percentile: Look at the {$n+1$} intervals formed by {(-\infty, x_1), (x_1, x_2), \ldots, (x_n, \infty)$}. Determine the interval {$(x_{j}, x_{j+1)$} where the {$g$}th percentile lands in by {$j = greatest integer <= g% \cdot (n+1) $}. Let {$h$} be the leftover decimal part. Then the {$g$}th percentile is a weighted average between {$x_j$} and {$x_{j+1}$} by {$h$}.
to:
If there are {$p$} parameters, and {$g_1, \ldots, g_p$} are arbitrary values between 0 and 100, form and solve a system of {$p$} equations by equation the {$g_1, \ldots, g_p$}th percentile of the data and of the distribution; more accurately, use smoothed empirical estimate of a {$g$}th percentile: Look at the {$n+1$} intervals formed by {$(-\infty, x_1), (x_1, x_2), \ldots, (x_n, \infty)$}. Determine the interval {$(x_{j}, x_{j+1)$} where the {$g$}th percentile lands in by {$j = greatest integer <= g% \cdot (n+1) $}. Let {$h$} be the leftover decimal part. Then the {$g$}th percentile is a weighted average between {$x_j$} and {$x_{j+1}$} by {$h$}.
Changed lines 10-11 from:
{$$L(\theta) = \prod_{j=1}^{k} [F(c_j) - F(c_{j-1})]^{n_j} (\text{ All cdf values given }\theta).$$} to:
{$$L(\theta) = \prod_{j=1}^{k} [F(c_j) - F(c_{j-1})]^{n_j} \text{ (All cdf values given }\theta).$$} Changed lines 2-3 from:
1. Estimate the parameters of failure time and loss distributions using: to:
1. Estimate the parameters of failure time and loss distributions Assuming {$n$} independent observations throughout, label them {$x_1, \ldots, x_n$}. The general problem is: Given a distribution, find the parameters of the distribution.
Added lines 5-12:
Given a distribution {$X$} with parameter {$\theta$}, construct a likelihood function {$L(\theta)$} and find the value of {$\theta$} that maximizes {$L$}. Or maximizes the loglikelihood function {$l(\theta) = \log L(\theta)$}. More specifically, given a series of events {$A_1, \ldots, A_n$}, where each {$A_n$} is an observed value (or interval), then
{$$L(\theta) = \prod_{j=1}^{n} Prob(X\in A_j | \theta).$$} When {$A_j$} is a single point observation {$x_j$}, we can estimate {$Prob(X=x_j)$} by the density function {$f(x_j)$}. When {$A_j$} is an interval we can use cdf {$F$}.
If there is more than one parameter, we can maximize the likelihood function by taking partial derivatives.
If the data is grouped into {$(0, c_1], (c_1, c_2], \ldots, (c_k,\infty)$}, and there are {$n_j$} observations in each {$(c_{j-1}, c_j]$}, then
{$$L(\theta) = \prod_{j=1}^{k} [F(c_j) - F(c_{j-1})]^{n_j} (\text{ All cdf values given }\theta).$$} I Added line 14:
If there are {$p$} parameters, form and solve a system of {$p$} equations by equating the 1st to {$p$}th moment of the data, and of the distribution.
Added line 16:
If there are {$p$} parameters, and {$g_1, \ldots, g_p$} are arbitrary values between 0 and 100, form and solve a system of {$p$} equations by equation the {$g_1, \ldots, g_p$}th percentile of the data and of the distribution; more accurately, use smoothed empirical estimate of a {$g$}th percentile: Look at the {$n+1$} intervals formed by {(-\infty, x_1), (x_1, x_2), \ldots, (x_n, \infty)$}. Determine the interval {$(x_{j}, x_{j+1)$} where the {$g$}th percentile lands in by {$j = greatest integer <= g% \cdot (n+1) $}. Let {$h$} be the leftover decimal part. Then the {$g$}th percentile is a weighted average between {$x_j$} and {$x_{j+1}$} by {$h$}.
Added lines 1-25:
Construction and Selection of Parametric Models (25-30%)1. Estimate the parameters of failure time and loss distributions using: a) Maximum likelihood
b) Method of moments
c) Percentile matching
d) Bayesian procedures
2. Estimate the parameters of failure time and loss distributions with censored and/or truncated data using maximum likelihood. 3. Estimate the variance of estimators and the confidence intervals for the parameters and functions of parameters of failure time and loss distributions. 4. Apply the following concepts in estimating failure time and loss distributions: a) Unbiasedness
b) Asymptotic unbiasedness
c) Consistency
d) Mean squared error
e) Uniform minimum variance estimator
5. Determine the acceptability of a fitted model and/or compare models using: a) Graphical procedures
b) Kolmogorov-Smirnov test
c) Anderson-Darling test
d) Chi-square goodness-of-fit test
e) Likelihood ratio test
f) Schwarz Bayesian Criterion
|