Package 'ForestFit' reference manual

Title:	Statistical Modelling for Plant Size Distributions
Description:	Developed for the following tasks. 1) Computing the probability density function, cumulative distribution function, random generation, and estimating the parameters of the eleven mixture models. 2) Point estimation of the parameters of the two-parameter Weibull distribution using twelve methods and the three-parameter Weibull distribution using nine methods. 3) Bayesian inference for the three-parameter Weibull distribution. 4) Estimating parameters of the three-parameter Birnbaum-Saunders, generalized exponential, and Weibull distributions fitted to grouped data using three methods: approximated maximum likelihood, expectation maximization, and maximum likelihood. 5) Estimating the parameters of the gamma, log-normal, and Weibull mixture models fitted to grouped data through the EM algorithm. 6) Estimating parameters of the nonlinear height curve fitted to height-diameter observations. 7) Estimating parameters, computing the probability density function and cumulative distribution function, and generating realizations from the gamma shape mixture model introduced by Venturini et al. (2008) <doi:10.1214/07-AOAS156>. 8) Bayesian inference, computing the probability density function and cumulative distribution function, and generating realizations from the univariate and bivariate Johnson SB distributions. 9) Robust multiple linear regression analysis when the error term follows a skewed t distribution. 10) Estimating parameters of a given distribution fitted to grouped data using the method of maximum likelihood. 11) Estimating parameters of the Johnson SB distribution through the Bayesian, method of moments, conditional maximum likelihood, and two-percentile methods.
Authors:	Mahdi Teimouri [aut, cre, cph, ctb]
Maintainer:	Mahdi Teimouri <[email protected]>
License:	GPL (>= 2)
Version:	2.4.3
Built:	2025-03-10 04:33:56 UTC
Source:	https://github.com/cran/ForestFit

Trees height and diameter at breast height

Description

The DBH data contains the diameter at breast height (dbh), height, and condition data for all trees centered in 108 plots of size 0.2 hectare immediately following a single prescribed burn and also following three 5-yr (year) interval reburns (four burns total) and a single 15-yr interval reburn (two burns total), together with associated treatment information. The tree information was collected from mixed ponderosa pine (Pinus ponderosa Dougl. ex Laws.) stands that contained scattered western junipers (Juniperus occidentalis Hook.). The plots were located in the Malheur National Forest on the southern end of the Blue Mountains near Burns, Oregon, USA.

Usage

data(DBH)

Format

A text file with 5732 observations on 17 variables describing tree characteristics such as dbh and height.

References

B. K. Kerns, D. J. Westlind, and M. A. Day. 2017. Season and interval of burning and cattle exclusion in the southern Blue Mountains, Oregon: Overstory tree height, diameter and growth. Forest Service Research Data Archive, <doi:10.2737/RDS-2017-0041>.

Computing probability density function of the gamma shape mixture model

Description

Computes probability density function (pdf) of the gamma shape mixture (GSM) model. The general form for the pdf of the GSM model is given by

$f(x,{\Theta}) = \sum_{j=1}^{K}\omega_j \frac{\beta^j}{\Gamma(j)} x^{j-1} \exp\bigl( -\beta x\bigr),$

where $\Theta=(\omega_1,\dots,\omega_K, \beta)^T$ is the parameter vector and the known constant $K$ is the number of components. The vector of mixing parameters is given by $\omega=(\omega_1,\dots,\omega_K)^T$, where the $\omega_j$ sum to one, i.e., $\sum_{j=1}^{K}\omega_j=1$. Here $\beta$ is the rate parameter, which is common to all components; the $j$-th component is a gamma density with shape $j$ and rate $\beta$.
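The formula above can be evaluated directly in base R, since each component is a gamma density with integer shape. The following is a minimal sketch, not the package's implementation; `gsm_pdf` is a hypothetical helper name introduced here for illustration only.

```r
# Sketch of the GSM density: a weighted sum of Gamma(shape = j, rate = beta) pdfs.
# gsm_pdf is a hypothetical helper, not the package's dgsm().
gsm_pdf <- function(x, omega, beta, log = FALSE) {
  stopifnot(abs(sum(omega) - 1) < 1e-8, beta > 0)
  K <- length(omega)
  # For each observation, sum omega_j * dgamma(x, shape = j, rate = beta) over j
  dens <- vapply(
    x,
    function(v) sum(omega * dgamma(v, shape = seq_len(K), rate = beta)),
    numeric(1)
  )
  if (log) log(dens) else dens
}

omega <- c(0.05, 0.1, 0.15, 0.2, 0.25, 0.25)
beta <- 2
gsm_pdf(c(0.5, 1, 2), omega, beta)

# Sanity check: the density should integrate to a value close to one
integrate(gsm_pdf, 0, Inf, omega = omega, beta = beta)$value
```

Delegating to `dgamma()` avoids computing $\beta^j/\Gamma(j)$ by hand, which can overflow for large $K$.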

Usage

dgsm(data, omega, beta, log = FALSE)

Arguments

`data`	Vector of observations.
`omega`	Vector of the mixing parameters.
`beta`	The rate parameter.
`log`	If `TRUE`, then log(pdf) is returned.

Value

A vector of the same length as data, giving the pdf of the GSM model.

Author(s)

Mahdi Teimouri

References

S. Venturini, F. Dominici, and G. Parmigiani, 2008. Gamma shape mixtures for heavy-tailed distributions, The Annals of Applied Statistics, 2(2), 756–776.

Examples

data<-seq(0,20,0.1)
omega<-c(0.05, 0.1, 0.15, 0.2, 0.25, 0.25)
beta<-2
dgsm(data, omega, beta)

Computing the probability density function of Johnson's SB (JSB) distribution

Description

Computes the probability density function of the four-parameter JSB distribution given by

$f\bigl(x\big|\Theta\bigr) = \frac {\delta \lambda}{\sqrt{2\pi}(x-\xi)(\lambda+\xi-x)}\exp\Biggl\{-\frac{1}{2}\Bigg[\gamma+\delta\log \biggl(\frac{x-\xi}{\lambda+\xi-x}\biggr) \Bigg]^2\Biggr\},$

where $\xi<x<\lambda+\xi$ , $\Theta=(\delta,\gamma,\lambda,\xi)^T$ with $\delta, \lambda> 0$ , $-\infty<\gamma<\infty$ , and $-\infty<\xi<\infty$ .
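As an illustration of the density above, the sketch below evaluates it on the log scale in base R and returns zero outside the support $(\xi, \lambda+\xi)$. The helper name `jsb_pdf` is hypothetical and the parameter order $(\delta,\gamma,\lambda,\xi)$ is taken from the `param` argument described on this page; this is not the package's own code.

```r
# Sketch of the four-parameter Johnson SB density (hypothetical helper, base R only).
# param = c(delta, gamma, lambda, xi), following the order documented for djsb().
jsb_pdf <- function(x, param, log = FALSE) {
  delta <- param[1]; gam <- param[2]; lambda <- param[3]; xi <- param[4]
  stopifnot(delta > 0, lambda > 0)
  inside <- x > xi & x < xi + lambda      # support of the distribution
  logd <- rep(-Inf, length(x))            # log-density is -Inf outside the support
  u <- x[inside]
  z <- gam + delta * log((u - xi) / (lambda + xi - u))
  logd[inside] <- log(delta) + log(lambda) - 0.5 * log(2 * pi) -
    log(u - xi) - log(lambda + xi - u) - 0.5 * z^2
  if (log) logd else exp(logd)
}

param <- c(1, 3, 12, 5)
jsb_pdf(c(5.5, 6, 8, 18), param)   # the last point lies outside (5, 17)
```

Working on the log scale keeps the computation stable close to the two endpoints, where the denominator tends to zero.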

Usage

djsb(data, param, log = FALSE)

Arguments

`data`	Vector of observations.
`param`	Vector of the parameters $\delta$ , $\gamma$ , $\lambda$ , and $\xi$ .
`log`	If `TRUE`, then log(pdf) is returned.

Value

A vector of the same length as `data`, giving the density function of the JSB distribution.

Author(s)

Mahdi Teimouri

Examples

delta <- 1
gamma <- 3
lambda <- 12
xi <- 5
param <- c(delta, gamma, lambda, xi)
data <- rjsb(20, param)
djsb(data, param, log = FALSE)

Computing the probability density function of bivariate Johnson's SB (JSBB) distribution

Description

Computes the probability density function of the nine-parameter JSBB distribution given by

$f_{Y_{1},Y_{2}}\bigl(y_1,y_2\big \vert\Theta\bigr) = f_{Y_1, Y_2}(y_1, y_2) =\frac{\delta_1\delta_2\lambda_1\lambda_2\exp\Bigl\{\frac{-z^{2}_{1}-z^{2}_{2} +2\rho z_{1}z_{2}}{2(1-\rho^2)}\Bigr\}}{2\pi \sqrt{1-\rho^2}\bigl(y_1-\xi_1\bigr)\bigl(y_2-\xi_2\bigr)\bigl(\lambda_1+\xi_1-y_1\bigr)\bigl(\lambda_2+\xi_2-y_2\bigr)},$

where

$z_{i}=\delta_i \log \Bigl(\frac{y_{i}-{\xi}_i}{{\xi}_i+{\lambda}_i-y_{i}}\Bigr)+\gamma_{i},$

for $i=1,2$. The parameter vector of the JSBB distribution is $\Theta=({\bf{\delta}},{\bf{\gamma}},{\bf{\lambda}},{\bf{\xi}}, \rho)^{\top}$, in which ${\bf{\delta}}=(\delta_1,\delta_2)^{\top}$, ${\bf{\gamma}}=(\gamma_1,\gamma_2)^{\top}$, ${\bf{\lambda}}=(\lambda_1,\lambda_2)^{\top}$, and ${\bf{\xi}}=(\xi_1,\xi_2)^{\top}$. The supports of the marginals are $\xi_1<y_1<\lambda_1+\xi_1$ and $\xi_2<y_2<\lambda_2+\xi_2$. The parameter space is $\delta_1>0,\delta_2>0,-\infty<\gamma_1<+\infty,-\infty<\gamma_2<+\infty, \lambda_1>0,\lambda_2>0, -\infty<\xi_1<+\infty, -\infty<\xi_2<+\infty$, and $-1<\rho<+1$.
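The bivariate density above can be sketched in base R as follows. The helper name `jsbb_pdf` is hypothetical. Two assumptions are made here: the observations are supplied as a two-column numeric matrix, and the parameter vector is ordered as $(\delta_1,\gamma_1,\lambda_1,\xi_1,\delta_2,\gamma_2,\lambda_2,\xi_2,\rho)$, which is the order used in the example on this page.

```r
# Sketch of the bivariate Johnson SB density (hypothetical helper, base R only).
# Assumed layout: y is an n x 2 matrix; param = c(d1, g1, l1, x1, d2, g2, l2, x2, rho).
jsbb_pdf <- function(y, param, log = FALSE) {
  y <- matrix(y, ncol = 2)
  d <- param[c(1, 5)]; g <- param[c(2, 6)]
  l <- param[c(3, 7)]; x <- param[c(4, 8)]
  rho <- param[9]
  stopifnot(all(d > 0), all(l > 0), abs(rho) < 1)
  inside <- y[, 1] > x[1] & y[, 1] < x[1] + l[1] &
    y[, 2] > x[2] & y[, 2] < x[2] + l[2]
  logd <- rep(-Inf, nrow(y))              # zero density outside the support
  y1 <- y[inside, 1]; y2 <- y[inside, 2]
  z1 <- g[1] + d[1] * log((y1 - x[1]) / (x[1] + l[1] - y1))
  z2 <- g[2] + d[2] * log((y2 - x[2]) / (x[2] + l[2] - y2))
  logd[inside] <- log(d[1] * d[2] * l[1] * l[2]) - log(2 * pi) -
    0.5 * log(1 - rho^2) -
    log(y1 - x[1]) - log(y2 - x[2]) -
    log(l[1] + x[1] - y1) - log(l[2] + x[2] - y2) +
    (-z1^2 - z2^2 + 2 * rho * z1 * z2) / (2 * (1 - rho^2))
  if (log) logd else exp(logd)
}

param <- c(2.5, 2, 1, 0, 3, 1, 3, 2, -0.5)
y <- rbind(c(0.3, 3.5), c(0.6, 4.2), c(2, 3))  # third row: y1 is outside (0, 1)
jsbb_pdf(y, param)
```

With $\rho=0$ the expression factorizes into the product of two univariate JSB densities, which gives a convenient check of any implementation.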

Usage

djsbb(data, param, log = FALSE)

Arguments

`data`	Vector of observations.
`param`	Vector of the parameters, ordered as $\delta_1$, $\gamma_1$, $\lambda_1$, $\xi_1$, $\delta_2$, $\gamma_2$, $\lambda_2$, $\xi_2$, $\rho$.
`log`	If `TRUE`, then log of density function is returned.

Value

A vector of length n, giving the density function of JSBB distribution.

Author(s)

Mahdi Teimouri

Examples

Delta <- c(2.5, 3)
Gamma <- c(2, 1)
Lambda <- c(1, 3)
Xi <- c(0, 2)
rho <- -0.5
param <- c(Delta[1], Gamma[1], Lambda[1], Xi[1], Delta[2], Gamma[2], Lambda[2], Xi[2], rho)
data <- rjsbb(20, param)
djsbb(data, param, log = FALSE)

Computing probability density function of the well-known mixture models

Description

Computes probability density function (pdf) of the mixture model. The general form for the pdf of the mixture model is given by

$f(x,{\Theta}) = \sum_{j=1}^{K}\omega_j f_j(x,\theta_j),$

where $\Theta=(\theta_1,\dots,\theta_K)^T$ is the whole parameter vector, $\theta_j=(\alpha_j,\beta_j)^{T}$ for $j=1,\dots,K$ is the parameter vector of the $j$-th component, $f_j(.,\theta_j)$ is the pdf of the $j$-th component, and the known constant $K$ is the number of components. The vector of mixing parameters is given by $\omega=(\omega_1,\dots,\omega_K)^T$, where the $\omega_j$ sum to one, i.e., $\sum_{j=1}^{K}\omega_j=1$. Parameters $\alpha_j$ and $\beta_j$ are either the shape and scale parameters of the $j$-th component or both shape parameters; in the latter case, they are called the first and second shape parameters, respectively. The families considered for each component include Birnbaum-Saunders, Burr type XII, Chen, F, Frechet, gamma, Gompertz, log-normal, log-logistic, Lomax, skew-normal, and Weibull, with pdfs given by the following.

Birnbaum-Saunders

$f(x,\theta)=\frac{\sqrt{\frac{x}{\beta}}+\sqrt{\frac{\beta}{x}}}{2\alpha x}\phi \Biggl( \frac{\sqrt{\frac{x}{\beta}}-\sqrt{\frac{\beta}{x}}}{\alpha}\Biggr),$
Burr XII

$f(x,\theta)=\alpha \beta x^{\alpha-1} \Bigl(1+x^{\alpha}\Bigr)^{-\beta-1},$
Chen

$f(x,\theta)=\alpha \beta x^{\alpha-1}\exp\bigl(x^\alpha\bigr) \exp\Bigl\{-\beta \exp\bigl(x^\alpha\bigr)+\beta\Bigr\},$
F

$f(x,\theta)=\frac{\Gamma\bigl(\frac{\alpha+\beta}{2}\bigr)}{\Gamma\bigl(\frac{\alpha}{2}\bigr) \Gamma\bigl(\frac{\beta}{2}\bigr)}\Bigl( \frac{\alpha}{\beta}\Bigr)^{\frac{\alpha}{2}} x^{\frac{\alpha}{2}-1}\Bigl(1+\frac{\alpha}{\beta}x\Bigr)^{-\frac{\alpha+\beta}{2}},$
Frechet

$f(x,\theta)=\frac{\alpha}{ \beta} \Bigl( \frac {x}{\beta}\Bigr) ^{-\alpha-1}\exp\Bigl\{ -\Bigl( \frac {x}{\beta}\Bigr)^{-\alpha} \Bigr\},$
gamma

$f(x,\theta)=\bigl[ \beta^\alpha \Gamma(\alpha)\bigr]^{-1} x^{\alpha-1} \exp\Bigl( -\frac {x}{\beta}\Bigr),$
Gompertz

$f(x,\theta)=\beta\exp\bigl(\alpha x\bigr) \exp\Biggl\{-\frac{\beta \bigl[\exp\bigl(\alpha x\bigr)-1\bigr]}{\alpha} \Biggr\},$
log-logistic

$f(x,\theta)=\frac{ \alpha}{ \beta^{\alpha}} x^{\alpha-1} \left[ \Bigl( \frac {x}{\beta}\Bigr)^\alpha +1\right]^{-2},$
log-normal

$f(x,\theta)=\bigl(\sqrt{2\pi} \beta x \bigr)^{-1}\exp\biggl\{ -\frac {1}{2}\left( \frac {\log x-\alpha}{\beta}\right) ^2\biggr\},$
Lomax

$f(x,\theta)=\frac{\alpha \beta}{(1+\alpha x)^{\beta+1}},$
skew-normal

$f(x,\theta)=\frac{2}{\beta}\phi\Bigl(\frac{x-\alpha}{\beta}\Bigr)\Phi\Bigl(\lambda\frac{x-\alpha}{\beta}\Bigr),$
Weibull

$f(x,\theta)=\frac {\alpha}{\beta} \Bigl( \frac {x}{\beta} \Bigr)^{\alpha - 1}\exp\Bigl\{ -\Bigl( \frac {x}{\beta}\Bigr)^\alpha \Bigr\},$

where $\theta=(\alpha,\beta)$. Here $\phi(.)$ and $\Phi(.)$ are the density and distribution functions of the standard normal distribution, respectively. In the skew-normal case, $\theta=(\alpha,\beta,\lambda)$.
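To make the general mixture form concrete, the sketch below evaluates a $K$-component Weibull mixture in base R. The helper name `weibull_mixture_pdf` is hypothetical, and the parameter layout `c(weights, shapes, scales)` is an assumption taken from the example on this page rather than from the package source.

```r
# Sketch of a K-component Weibull mixture density (hypothetical helper, base R only).
# Assumed layout: param = c(omega_1..omega_K, alpha_1..alpha_K, beta_1..beta_K).
weibull_mixture_pdf <- function(x, K, param) {
  stopifnot(length(param) == 3 * K)
  w <- param[1:K]
  a <- param[(K + 1):(2 * K)]
  b <- param[(2 * K + 1):(3 * K)]
  stopifnot(abs(sum(w) - 1) < 1e-8, all(a > 0), all(b > 0))
  dens <- numeric(length(x))
  # Accumulate omega_j * f_j(x, theta_j) over the K components
  for (j in seq_len(K)) {
    dens <- dens + w[j] * dweibull(x, shape = a[j], scale = b[j])
  }
  dens
}

x <- seq(0, 20, 0.1)
param <- c(0.6, 0.4, 1, 2, 2, 1)
head(weibull_mixture_pdf(x, K = 2, param))
```

Other families follow the same pattern with `dweibull()` replaced by the corresponding component density.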

Usage

dmixture(data, g, K, param)

Arguments

`data`	Vector of observations.
`g`	Name of the family including "`birnbaum-saunders`", "`burrxii`", "`chen`", "`f`", "`Frechet`", "`gamma`", "`gompertz`", "`log-normal`", "`log-logistic`", "`lomax`", "`skew-normal`", and "`weibull`".
`K`	Number of components.
`param`	Vector of the parameters $\omega$, $\alpha$, $\beta$, and (for the skew-normal case only) $\lambda$.

Details

For the skew-normal case, $\alpha$ , $\beta$ , and $\lambda$ are the location, scale, and skewness parameters, respectively.

Value

A vector of the same length as `data`, giving the pdf of the mixture model of the chosen family evaluated at `data`.

Author(s)

Mahdi Teimouri

Examples

data<-seq(0,20,0.1)
K<-2
weight<-c(0.6,0.4)
alpha<-c(1,2)
beta<-c(2,1)
param<-c(weight,alpha,beta)
dmixture(data, "weibull", K, param)

Estimating parameters of the Johnson's SB (JSB) distribution using the Bayesian approach

Description

Suppose $x=(x_1,\dots,x_n)^T$ denotes a vector of $n$ independent observations coming from a four-parameter JSB distribution with probability density function given by

$f\bigl(x\big|\Theta\bigr) = \frac {\delta \lambda}{\sqrt{2\pi}(x-\xi)(\lambda+\xi-x)}\exp\Biggl\{-\frac{1}{2}\Bigg[\gamma+\delta\log \biggl(\frac{x-\xi}{\lambda+\xi-x}\biggr) \Bigg]^2\Biggr\},$

where $\xi<x<\lambda+\xi$ , $\Theta=(\delta,\gamma,\lambda,\xi)^T$ with $\delta, \lambda> 0$ , $-\infty<\gamma<\infty$ , and $-\infty<\xi<\infty$ . Using the Bayesian approach, we compute the Bayes' estimators of the JSB distribution parameters.

Usage

fitbayesJSB(data, n.burn=8000, n.simul=10000)

Arguments

`data`	Vector of observations.
`n.burn`	Length of the burn-in period, i.e., the point after which Gibbs sampler is supposed to attain convergence. By default `n.burn` is 8000.
`n.simul`	Total number of Gibbs sampler iterations. By default `n.simul` is 10,000.

Details

The Bayes' estimators are obtained by averaging over all iterations between `n.burn` and `n.simul`.
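The averaging step can be pictured as follows. This is only a sketch: the matrix `draws` is a simulated placeholder standing in for sampler output (it is not produced by `fitbayesJSB`), and it is assumed here that iterations `n.burn + 1` through `n.simul` are the ones retained.

```r
# Sketch of posterior-mean estimation from stored Gibbs draws.
# 'draws' is a simulated placeholder chain, NOT output of fitbayesJSB().
set.seed(1)
n.burn <- 8000
n.simul <- 10000
draws <- cbind(
  delta  = rnorm(n.simul, 1, 0.05),
  gamma  = rnorm(n.simul, 3, 0.10),
  lambda = rnorm(n.simul, 12, 0.20),
  xi     = rnorm(n.simul, 5, 0.10)
)

# Assumption: the burn-in iterations 1..n.burn are discarded
keep <- (n.burn + 1):n.simul
bayes_est <- colMeans(draws[keep, , drop = FALSE])
bayes_est
```

With the default settings this leaves 2000 retained iterations per parameter.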

Value

A list with two components:

Bayes' estimators of the parameters.
A sequence of four goodness-of-fit measures consisting of Anderson-Darling (AD), Cramer-von Mises (CVM), Kolmogorov-Smirnov (KS), and log-likelihood (log-likelihood) statistics.

Author(s)

Mahdi Teimouri

References

N. L. Johnson, 1949. Systems of frequency curves generated by methods of translation, Biometrika, 36, 149–176.

N. L. Johnson, S. Kotz, and N. Balakrishnan, 1994. Continuous Univariate Distributions, volume I, John Wiley & Sons.

Examples


# Here we use the SW dataset provided by FIA that represents a typical loblolly pine plantation.
# As the variable of interest, we fit the JSB distribution to the diameter at breast height (SW$DIA)
# in inches.
data(SW)
data<-SW$DIA
fitbayesJSB(data, n.burn=4000, n.simul=5000)


Estimating parameters of the Weibull distribution using the Bayesian approach

Description

Suppose $x=(x_1,\dots,x_n)^T$ denotes a vector of $n$ independent observations coming from a three-parameter Weibull distribution. Using the methodology given in Green et al. (1994), we compute the Bayes' estimators of the shape, scale, and location parameters.

Usage

fitbayesWeibull(data, n.burn=8000, n.simul=10000)

Arguments

`data`	Vector of observations.
`n.burn`	Length of the burn-in period, i.e., the point after which Gibbs sampler is supposed to attain convergence. By default `n.burn` is 8000.
`n.simul`	Total number of Gibbs sampler iterations. By default `n.simul` is 10,000.

Details

The Bayes' estimators are obtained by averaging over all iterations between `n.burn` and `n.simul`.

Value

A list with two components:

Bayes' estimators of the parameters.
A sequence of four goodness-of-fit measures consisting of Anderson-Darling (AD), Cramer-von Mises (CVM), Kolmogorov-Smirnov (KS), and log-likelihood (log-likelihood) statistics.

Note

The methodology used here for computing the Bayes' estimator of the location parameter is different from that used by Green et al. (1994). This means that the location parameter is allowed to be any real value.

Author(s)

Mahdi Teimouri

References

E. J. Green, F. A. Roesch Jr., A. F. M. Smith, and W. E. Strawderman, 1994. Bayesian estimation for the three-parameter Weibull distribution with tree diameter data, Biometrics, 50(1), 254-269.

Examples


n<-100
alpha<-2
beta<-2
theta<-3
data<-rweibull(n,shape=alpha,scale=beta)+theta
fitbayesWeibull(data, n.burn=4000, n.simul=5000)


Estimating the parameters of the nonlinear curve fitted to the height-diameter (H-D) observations

Description

Estimates the parameters of nine well-known three-parameter nonlinear curves fitted to the height-diameter observations. These nine models are given by the following.

Richards (Richards(1959))

$H=1.3+\beta_1+\frac{\beta_2}{D+\beta_3},$
Gompertz (Winsor(1992))

$H=1.3+\beta_1 e^{-\beta_2e^{-\beta_3 D}},$
Hossfeld IV (Zeide(1993))

$H=1.3+\frac{\beta_1}{1+\frac{1}{\beta_2 D^{\beta_3}}},$
Korf (Flewelling and De Jong(1994))

$H=1.3+\beta_1 e^{-\beta_2D^{-\beta_3}},$
logistic (Pearl and Reed (1920))

$H=1.3+\frac{\beta_1}{1+\beta_2e^{-\beta_3D}},$
Prodan (Prodan(1968))

$H=1.3+\frac{D^2}{\beta_1 D^2+\beta_2 D+\beta_3},$
Ratkowsky (Ratkowsky(1990))

$H=1.3+\beta_1 e^{-\frac{\beta_2}{D+\beta_3}},$
Sibbesen (Huang et al. (1992))

$H=1.3+\beta_1 D^{\beta_2 D^{-\beta_3}},$
Weibull (Yang et al. (1978))

$H=1.3+\beta_1\Bigl(1-e^{-\beta_2 D^{\beta_3}}\Bigr),$
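As an illustration of fitting one of these curves, the sketch below fits the Weibull height-diameter model to simulated data with `stats::nls`. The data, the true parameter values, and the starting values are all made up for the illustration, and whether `fitcurve` relies on `nls` internally is not stated in this manual, so this should be read as a stand-in rather than as the package's method.

```r
# Sketch: nonlinear least squares fit of the Weibull H-D curve on simulated data.
# All numbers below are illustrative assumptions, not values from the DBH data set.
set.seed(123)
hd_weibull <- function(D, b1, b2, b3) 1.3 + b1 * (1 - exp(-b2 * D^b3))

D <- runif(200, 5, 60)                                   # simulated diameters
H <- hd_weibull(D, 25, 0.02, 1.3) + rnorm(200, sd = 0.5) # simulated heights

fit <- nls(H ~ hd_weibull(D, b1, b2, b3),
           start = list(b1 = 22, b2 = 0.025, b3 = 1.25))

summary(fit)$coefficients   # estimates, standard errors, t-statistics, p-values
head(residuals(fit))        # residuals
vcov(fit)                   # covariance matrix of the estimated coefficients
summary(fit)$sigma          # residual standard error
```

The four quantities extracted at the end correspond to the kinds of summaries listed under Value. Poor starting values can prevent convergence for curves of this type, so it helps to choose `b1` near the observed maximum height minus 1.3.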

Usage

fitcurve(h,d,model,start)

Arguments

`h`	Vector of height observations.
`d`	Vector of diameter observations.
`model`	The name of the fitted model including "`chapman-richards`", "`gompertz`", "`hossfeldiv`", "`korf`", "`logistic`", "`prodan`" , "`ratkowsky`", "`Sibbesen`", and "`weibull`".
`start`	A vector of starting values for the parameters $\beta_1$ , $\beta_2$ , and $\beta_3$ .

Value

A list with six components:

Estimated parameters and corresponding summaries including standard errors, computed $t$-statistics, and $p$-values.
Residuals.
Covariance matrix of the estimated model parameters (coefficients) $\hat{\beta}_1$, $\hat{\beta}_2$, and $\hat{\beta}_3$.
Residual standard error, i.e., $\hat{\sigma}$.
Number of trials needed to attain convergence.
The height-diameter scatterplot with the fitted model superimposed.

Author(s)

Mahdi Teimouri

References

J. W. Flewelling and R. De Jong. (1994). Considerations in simultaneous curve fitting for repeated height-diameter measurements, Canadian Journal of Forest Research, 24(7), 1408-1414.

S. Huang, S. J. Titus, and D. P. Wiens. 1992. Comparison of nonlinear height-diameter functions for major Alberta tree species. Canadian Journal of Forest Research, 22, 1297-1304.

R. Pearl and L. J. Reed. (1920). On the rate of growth of the population of the United States since 1790 and its mathematical representation, Proceedings of the National Academy of Sciences of the United States of America, 6(6), 275.

M. Prodan. 1968. The spatial distribution of trees in an area. Allg. Forst Jagdztg, 139, 214-217.

D. A. Ratkowsky. 1990. Handbook of nonlinear regression, New York, Marcel Dekker, Inc.

F. J. Richards. 1959. A flexible growth function for empirical use. Journal of Experimental Botany, 10, 290-300.

S. B. Winsor. 1992. The Gompertz curve as a growth curve. Proceedings of the National Academy of Sciences of the United States of America, 18, 1-8.

R. C. Yang, A. Kozak, and J. H. G. Smith. 1978. The potential of Weibull-type functions as flexible growth curves. Canadian Journal of Forest Research, 8, 424-431.

B. Zeide. 1993. Analysis of growth equations. Forest Science, 39, 594-616.

Examples

# Use the height and diameter at breast height (dbh) of plot 55 in the DBH data set.
# The first column of the DBH data set contains the plot number. Also, D and H denote the
# dbh and height variables, taken from columns 10 and 11 of the DBH data set, respectively.
 data(DBH)
 D<-DBH[DBH[,1]==55,10]
 H<-DBH[DBH[,1]==55,11]
 start<-c(9,5,2)
 fitcurve(H,D,"weibull", start=start)

Estimating parameters of the three-parameter Birnbaum-Saunders (BS), generalized exponential (GE), and Weibull distributions fitted to grouped data

Description

Suppose a sample of $n$ independent observations, each following a three-parameter BS, GE, or Weibull distribution, has been divided into $m$ separate groups of the form $(r_{i-1},r_i]$, for $i=1,\dots,m$. Then the likelihood function is given by

$L(\Theta)=\frac{n!}{f_{1}!f_{2}!\dots f_{m}!}\prod_{i=1}^{m}\Bigl[F\bigl(r_{i}\big|\Theta\bigr)-F\bigl(r_{i-1}\big|\Theta\bigr)\Bigr]^{f_i},$

where $r_0$ is the lower bound of the first group, $r_m$ is the upper bound of the last group, and $f_i$ is the frequency of observations within the $i$-th group, provided that $n=\sum_{i=1}^{m}f_{i}$. The cdfs of the three-parameter GE, BS, and Weibull distributions are given by

$F(x;\Theta)=\biggl(1-\exp \bigl\{-\beta(x-\mu)\bigr\} \biggr)^{\alpha},$

$F(x;\Theta)=\Phi\Biggl(\frac{\sqrt{\frac{x-\mu}{\beta}}-\sqrt{\frac{\beta}{x-\mu}}}{\alpha}\Biggr),$

and

$F(x;\Theta)=1- \exp \Bigl\{-\left(\frac{x-\mu}{\beta} \right)^{\alpha} \Bigr\},$

where $\Theta=(\alpha,\beta,\mu)^T$ .
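The grouped-data likelihood above can be written out and maximized numerically. The sketch below does so for the three-parameter Weibull cdf using `stats::optim`; the helper names `weibull3_cdf`, `grouped_loglik`, and `neg_ll` are hypothetical, and the direct Nelder-Mead maximization shown here is a generic illustration rather than the package's "aml", "em", or "ml" routine.

```r
# Sketch: multinomial log-likelihood for grouped data under a three-parameter Weibull.
# Helper names are hypothetical; this is not the package's fitgrouped1() code.
weibull3_cdf <- function(x, alpha, beta, mu) {
  ifelse(x > mu, 1 - exp(-(pmax(x - mu, 0) / beta)^alpha), 0)
}

grouped_loglik <- function(theta, r, f) {
  alpha <- theta[1]; beta <- theta[2]; mu <- theta[3]
  if (alpha <= 0 || beta <= 0) return(-Inf)
  p <- diff(weibull3_cdf(r, alpha, beta, mu))   # F(r_i) - F(r_{i-1}) for each group
  n <- sum(f)
  # log of n! / (f_1! ... f_m!) plus sum of f_i * log(p_i); empty groups contribute 0
  lfactorial(n) - sum(lfactorial(f)) + sum(ifelse(f > 0, f * log(p), 0))
}

r <- c(0, 1, 2, 3, 4, 10)
f <- c(2, 8, 12, 15, 4)
start <- c(2, 2, 0)   # shape, scale, location

# Negative log-likelihood with a large finite penalty where it is undefined
neg_ll <- function(theta) {
  v <- -grouped_loglik(theta, r, f)
  if (is.finite(v)) v else 1e10
}

fit <- optim(start, neg_ll, method = "Nelder-Mead")
fit$par      # shape, scale, location estimates
-fit$value   # maximized log-likelihood
```

Replacing non-finite values with a large penalty keeps the simplex search away from parameter values for which some nonempty group has zero probability.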

Usage

fitgrouped1(r, f, family, method1, starts, method2)

Arguments

`r`	A numeric vector of length $m+1$. The first element of $r$ is the lower bound of the first group and the other $m$ elements are the upper bounds of the $m$ groups. Note that the upper bound of the $(i-1)$-th group is the lower bound of the $i$-th group, for $i=2,\dots,m$. The lower bound of the first group and the upper bound of the $m$-th group are chosen arbitrarily.
`f`	A numeric vector of length $m$ containing the group frequencies.
`family`	Can be either `"birnbaum-saunders"`, `"ge"`, or `"weibull"`.
`method1`	A character string determining the method of estimation. It can be one of `"aml"` (method of approximated maximum likelihood), `"em"` (method of expectation maximization), and `"ml"` (method of maximum likelihood).
`starts`	A numeric vector of the initial values for the shape, scale, and location parameters, respectively.
`method2`	The method for optimizing the log-likelihood function. It is one of `"BFGS"`, `"Nelder-Mead"`, `"CG"`, `"L-BFGS-B"`, or `"SANN"`.

Details

If the method is "em", then the initial values ("starts") and the log-likelihood optimizing method ("method2") are ignored.

Value

A two-part list of objects given by the following:

Estimated parameters of the three-parameter GE, Birnbaum-Saunders, or Weibull distribution fitted to the grouped data.
A sequence of goodness-of-fit measures consisting of Akaike Information Criterion (AIC), Consistent Akaike Information Criterion (CAIC), Bayesian Information Criterion (BIC), Hannan-Quinn information criterion (HQIC), Anderson-Darling (AD), Chi-square (Chi-square), Cramer-von Mises (CVM), Kolmogorov-Smirnov (KS), and log-likelihood (log-likelihood) statistics.

Author(s)

Mahdi Teimouri

References

G. J. McLachlan and T. Krishnan, 2007. The EM Algorithm and Extensions, John Wiley & Sons.

A. P. Dempster, N. M. Laird, and D. B. Rubin, 1977. Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society Series B, 39, 1-38.

M. Teimouri and A. K. Gupta, 2012. Estimation Methods for the Gompertz–Makeham Distribution Under Progressively Type-I Interval Censoring Scheme, National Academy Science Letters, 35(3).

Examples

r<-c(0,1,2,3,4,10)
f<-c(2,8,12,15,4)
starts<-c(2,2,0)
fitgrouped1(r,f,"birnbaum-saunders","em")
fitgrouped1(r,f,"weibull","ml",starts,"CG")
fitgrouped1(r,f,"ge","em")


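Estimating parameters of a given distribution fitted to grouped data using the method of maximum likelihood

Description

Estimates the parameters of a user-specified family, supplied through expressions for its cdf and pdf, fitted to grouped data by the method of maximum likelihood. For $n$ observations divided into $m$ groups of the form $(r_{i-1},r_i]$ with frequencies $f_1,\dots,f_m$, the likelihood function is

$L(\Theta)=\frac{n!}{f_{1}!f_{2}!\dots f_{m}!}\prod_{i=1}^{m}\Bigl[F\bigl(r_{i}\big|\Theta\bigr)-F\bigl(r_{i-1}\big|\Theta\bigr)\Bigr]^{f_i}.$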

Usage

fitgrouped2(r, f, param, start, cdf, pdf, method = "Nelder-Mead", lb = 0, ub = Inf
            , level = 0.05)

Arguments

`r`	A numeric vector of length $m+1$. The first element of $r$ is the lower bound of the first group and the other $m$ elements are the upper bounds of the $m$ groups. Note that the upper bound of the $(i-1)$-th group is the lower bound of the $i$-th group, for $i=2,\dots,m$. The lower bound of the first group and the upper bound of the $m$-th group are chosen arbitrarily.
`f`	A numeric vector of length $m$ containing the group frequencies.
`param`	Vector of the names of the family's parameters.
`start`	Vector of the initial values.
`cdf`	Expression of the cumulative distribution function.
`pdf`	Expression of the probability density function.
`method`	The method used for numerical optimization. It is one of `CG`, `Nelder-Mead`, `BFGS`, `L-BFGS-B`, or `SANN`.
`lb`	Lower bound of the family's support. That is zero by default.
`ub`	Upper bound of the family's support. That is `Inf` by default.
`level`	Significance level for constructing the asymptotic confidence interval. That is `0.05` by default, for constructing a `95%` confidence interval.
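Because `cdf` and `pdf` are passed as unevaluated expressions, they have to be evaluated against a set of parameter values at some point. The sketch below shows one way this can be done in base R with `eval()`, and how the group probabilities $F(r_i)-F(r_{i-1})$ follow from it. The helper `eval_cdf` is hypothetical and the parameter values are arbitrary; how `fitgrouped2` evaluates the expressions internally is not documented here.

```r
# Sketch: evaluating a quoted cdf expression at given parameter values.
# eval_cdf is a hypothetical helper; the parameter values are arbitrary.
cdf <- quote(1 - exp(-((x - mu) / beta)^alpha))
param <- c("alpha", "beta", "mu")

eval_cdf <- function(x, values) {
  # Bind x and the named parameters, then evaluate the expression in that list
  env <- c(list(x = x), as.list(setNames(values, param)))
  eval(cdf, env)
}

r <- c(2.5, 3.5, 4.5, 5.5, 6.5, 7.5, 8.5, 9.5, 10.5)
cell_prob <- diff(eval_cdf(r, c(2, 3, 2)))   # probability of each group
cell_prob
```

Passing the parameter names separately in `param` is what allows a numeric vector such as `start` to be matched to the symbols used inside the expressions.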

Value

A two-part list of objects given by the following:

Maximum likelihood (ML) estimator for the parameters of the family fitted to the grouped data, asymptotic standard error of the ML estimator, lower bound of the asymptotic confidence interval, and upper bound of the asymptotic confidence interval at the given level.
A sequence of goodness-of-fit measures consisting of Anderson-Darling (AD), Cramer-von Mises (CVM), and Kolmogorov-Smirnov (KS) statistics.

Author(s)

Mahdi Teimouri

Examples

    r <- c(2.5, 3.5, 4.5, 5.5, 6.5, 7.5, 8.5, 9.5, 10.5)
    f <- c(33, 111, 168, 147, 96,  45, 18, 4, 0)
param <- c("alpha", "beta", "mu")
  pdf <- quote( alpha/beta*((x-mu)/beta)^(alpha-1)*exp( -((x-mu)/beta)^alpha ) )
  cdf <- quote( 1-exp( -((x-mu)/beta)^alpha ) );
   lb <- 2
   ub <- Inf
start <-c(2, 3, 2)
level <- 0.05
fitgrouped2(r, f, param, start, cdf, pdf, method = "Nelder-Mead", lb = lb, ub = ub, level = 0.05)

Estimating parameters of the gamma shape mixture model

Description

Estimates parameters of the gamma shape mixture (GSM) model whose probability density function has the following form.

$f(x,{\Theta}) = \sum_{j=1}^{K}\omega_j \frac{\beta^j}{\Gamma(j)} x^{j-1} \exp\bigl( -\beta x\bigr),$

Usage

fitgsm(data,K)

Arguments

`data`	Vector of observations.
`K`	Number of components.

Details

Supposing that the number of components, i.e., $K$ is known, the parameters are estimated through the EM algorithm developed by the maintainer.
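For orientation, a textbook EM iteration for the GSM model with known $K$ can be sketched as follows. This is a generic derivation and not necessarily the algorithm implemented in `fitgsm`; the function name `gsm_em`, the starting values, and the stopping rule are assumptions made for the illustration. The E-step computes the posterior component probabilities $\tau_{ij}$, and the M-step sets $\omega_j$ to the average of $\tau_{ij}$ over $i$ and $\beta$ to $\sum_i\sum_j j\,\tau_{ij}/\sum_i x_i$.

```r
# Sketch of a generic EM algorithm for the gamma shape mixture (hypothetical helper).
gsm_em <- function(x, K, max_iter = 500, tol = 1e-8) {
  n <- length(x)
  omega <- rep(1 / K, K)               # assumed start: equal weights
  beta <- ((K + 1) / 2) / mean(x)      # assumed start: matches the mean under equal weights
  loglik <- numeric(0)
  for (it in seq_len(max_iter)) {
    # n x K matrix with entries omega_j * Gamma(shape = j, rate = beta) density
    dens <- vapply(seq_len(K),
                   function(j) omega[j] * dgamma(x, shape = j, rate = beta),
                   numeric(n))
    mix <- rowSums(dens)
    loglik <- c(loglik, sum(log(mix)))  # log-likelihood at the current parameters
    tau <- dens / mix                   # E-step: posterior component probabilities
    omega <- colMeans(tau)              # M-step: mixing weights
    beta <- sum(tau %*% seq_len(K)) / sum(x)   # M-step: common rate
    if (it > 1 && abs(loglik[it] - loglik[it - 1]) < tol) break
  }
  list(beta = beta, omega = omega, loglik = loglik)
}

set.seed(42)
omega_true <- c(0.05, 0.1, 0.15, 0.2, 0.25, 0.25)
comp <- sample(seq_along(omega_true), 500, replace = TRUE, prob = omega_true)
x <- rgamma(500, shape = comp, rate = 2)   # simulated GSM sample
fit <- gsm_em(x, K = 6)
fit$beta
round(fit$omega, 3)
```

The recorded log-likelihood sequence should be non-decreasing, which is a useful diagnostic for any EM implementation.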

Value

A list with three components:

The EM estimator of the rate parameter.
The EM estimator of the mixing parameters.
A sequence of goodness-of-fit measures consisting of Akaike Information Criterion (AIC), Consistent Akaike Information Criterion (CAIC), Bayesian Information Criterion (BIC), Hannan-Quinn information criterion (HQIC), Anderson-Darling (AD), Cramer-von Mises (CVM), Kolmogorov-Smirnov (KS), and log-likelihood (log-likelihood) statistics.

Author(s)

Mahdi Teimouri

References

A. P. Dempster, N. M. Laird, and D. B. Rubin, 1977. Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society Series B, 39, 1-38.

S. Venturini, F. Dominici, and G. Parmigiani, 2008. Gamma shape mixtures for heavy-tailed distributions, The Annals of Applied Statistics, 2(2), 756–776.

Examples

n<-100
omega<-c(0.05, 0.1, 0.15, 0.2, 0.25, 0.25)
beta<-2
data<-rgsm(n,omega,beta)
K<-length(omega)
fitgsm(data,K)

Estimating parameters of the Johnson's SB (JSB) distribution using four methods

Description

Suppose $x=(x_1,\dots,x_n)^T$ denotes a vector of $n$ independent observations coming from a four-parameter JSB distribution with probability density function given by

$f\bigl(x\big|\Theta\bigr) = \frac {\delta \lambda}{\sqrt{2\pi}(x-\xi)(\lambda+\xi-x)}\exp\Biggl\{-\frac{1}{2}\Bigg[\gamma+\delta\log \biggl(\frac{x-\xi}{\lambda+\xi-x}\biggr) \Bigg]^2\Biggr\},$

where $\xi<x<\lambda+\xi$, $\Theta=(\delta,\gamma,\lambda,\xi)^T$ with $\delta, \lambda> 0$, $-\infty<\gamma<\infty$, and $-\infty<\xi<\infty$. The parameters are estimated using four methods: the Bayesian approach, the method of conditional maximum likelihood (CML, Johnson (1949)), the method of moments (MM, Fonseca (2009)), and the two-percentile method proposed by Knoebel and Burkhart (1991) (KB). All four estimators are computed with the scale parameter $\lambda$ and the location parameter $\xi$ predetermined, using the method proposed by Ogana (2018). Let DBH denote the diameter at breast height. For estimating the parameters $\delta$ and $\gamma$ through the Bayesian approach, the location and scale parameters are predetermined as $\xi = \min(DBH) - 1.34$ and $\lambda = \max(DBH) - \xi + 3.8$, respectively. For the MM, CML, and KB methods, the parameters $\xi$ and $\lambda$ are predetermined in the same way, as suggested by Ogana (2018).
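The predetermination of $\xi$ and $\lambda$ described above, followed by a conditional fit of $\delta$ and $\gamma$, can be sketched in base R. The data are simulated stand-ins for DBH measurements. The `cml` helper is hypothetical and uses the standard normal-theory result that, for fixed $\xi$ and $\lambda$, $f_i=\log\{(x_i-\xi)/(\xi+\lambda-x_i)\}$ is normal with mean $-\gamma/\delta$ and standard deviation $1/\delta$; this formula is not spelled out in this manual, and whether `fitJSB` computes its CML estimate exactly this way is an assumption.

```r
# Sketch: predetermined location/scale and a conditional ML fit of delta and gamma.
# The data are simulated; cml() is a hypothetical helper, not the package's code.
set.seed(7)
delta <- 1.2; gam <- 0.4; xi <- 4; lambda <- 30
z <- rnorm(5000)
dbh <- xi + lambda / (1 + exp(-(z - gam) / delta))   # simulated JSB sample

# Predetermined location and scale, as given in the Description
xi_hat <- min(dbh) - 1.34
lambda_hat <- max(dbh) - xi_hat + 3.8

cml <- function(y, xi, lambda) {
  f <- log((y - xi) / (xi + lambda - y))
  s <- sqrt(mean((f - mean(f))^2))     # ML (divisor n) standard deviation of f
  c(delta = 1 / s, gamma = -mean(f) / s)
}

cml(dbh, xi_hat, lambda_hat)   # estimates under the predetermined xi and lambda
cml(dbh, xi, lambda)           # estimates when the true xi and lambda are used
```

The predetermined bounds always enclose the sample, since $\xi$ lies below the minimum and $\xi+\lambda$ lies above the maximum, so the logarithm is defined for every observation.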

Usage

fitJSB(y, n.burn=8000, n.simul=10000)

Arguments

`y`	Vector of DBH observations.
`n.burn`	Length of the burn-in period, i.e., the point after which Gibbs sampler is supposed to attain convergence. By default `n.burn` is 8000.
`n.simul`	Total number of Gibbs sampler iterations. By default `n.simul` is 10,000.

Details

The Bayes' estimators are obtained by averaging over all iterations between `n.burn` and `n.simul`.

Value

A list with two components:

Four estimators including Bayes, MM, CML, and KB.
A sequence of four goodness-of-fit measures consisting of Anderson-Darling (AD), Cramer-von Mises (CVM), Kolmogorov-Smirnov (KS), and log-likelihood (log-likelihood) statistics.

References

N. L. Johnson, 1949. Systems of frequency curves generated by methods of translation, Biometrika, 36, 149-176.

B. R. Knoebel and H. E. Burkhart, 1991. A bivariate distribution approach to modeling forest diameter distributions at two points in time, Biometrics, 47, 241-253.

T. F. Fonseca, 2009. Describing maritime pine diameter distributions with Johnson's SB distribution using a new all-parameter recovery approach, Forest Science, 55, 367-373.

F. N. Ogana, 2018. Evaluation of four methods of fitting Johnson’s SBB for height and volume predictions, Journal of Forest Science, 64, 187-197.

Examples

# Here we use the SW dataset provided by FIA that represents a typical loblolly pine plantation.
# As the variable of interest, we fit the JSB distribution to the diameter at breast height (SW$DIA)
# in inches.
data(SW)
y <- SW$DIA
fitJSB(y, n.burn=8000, n.simul=10000)

Estimating parameters of the well-known mixture models

Description

Estimates parameters of the mixture model using the expectation maximization (EM) algorithm. General form for the cdf of a statistical mixture model is given by

$F(x,{\Theta}) = \sum_{j=1}^{K}\omega_j F_j(x,\theta_j),$

where $\Theta=(\theta_1,\dots,\theta_K)^T$ is the whole parameter vector, $\theta_j=(\alpha_j,\beta_j)^{T}$ for $j=1,\dots,K$ is the parameter vector of the $j$-th component, $F_j(.,\theta_j)$ is the cdf of the $j$-th component, and the known constant $K$ is the number of components. Parameters $\alpha$ and $\beta$ are either the shape and scale parameters or both shape parameters; in the latter case, they are called the first and second shape parameters, respectively. The constants $\omega_j$ sum to one, i.e., $\sum_{j=1}^{K}\omega_j=1$. The families considered for the cdf $F$ include Birnbaum-Saunders, Burr type XII, Chen, F, Frechet, gamma, Gompertz, log-normal, log-logistic, Lomax, skew-normal, and Weibull.

Usage

fitmixture(data, family, K, initial=FALSE, starts)

Arguments

`data`	Vector of observations.
`family`	Name of the family including: "`birnbaum-saunders`", "`burrxii`", "`chen`", "`f`", "`Frechet`", "`gamma`", "`gompetrz`", "`log-normal`", "`log-logistic`", "`lomax`", "`skew-normal`", and "`weibull`".
`K`	Number of components.
`initial`	Logical. If `FALSE` (the default), the initial values are determined automatically by the k-means method of clustering; if `TRUE`, they are taken from `starts`.
`starts`	The sequence of initial values $\omega_1,\dots,\omega_K,\alpha_1,\dots,\alpha_K,\beta_1,\dots,\beta_K$, which must be given if `initial=TRUE`. For the skew-normal case, the vector of initial values of the skewness parameters is appended.
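
The default initialization relies on k-means clustering. The following is a rough sketch of how starting values in the order $\omega,\alpha,\beta$ could be assembled by hand for a Weibull mixture. Here `rough_starts` is a hypothetical helper, not a ForestFit function, and the coefficient-of-variation rule used for the shape is an assumed rule of thumb rather than the package's internal procedure.

```r
# Hypothetical helper (not part of ForestFit): crude starting values for a
# K-component Weibull mixture, laid out as c(omega, alpha, beta).
rough_starts <- function(x, K) {
  # Cluster the observations with k-means (base R, stats package).
  cl <- stats::kmeans(x, centers = K, nstart = 10)$cluster
  # Mixing weights: share of observations falling in each cluster.
  omega <- as.numeric(table(factor(cl, levels = seq_len(K)))) / length(x)
  m <- as.numeric(tapply(x, cl, mean))
  s <- as.numeric(tapply(x, cl, sd))
  # Assumed rule of thumb: shape ~ (sd / mean)^(-1.086); scale from the mean.
  alpha <- (s / m)^(-1.086)
  beta <- m / gamma(1 + 1 / alpha)
  c(omega, alpha, beta)
}

set.seed(1)
x <- c(rweibull(100, shape = 2, scale = 1), rweibull(100, shape = 5, scale = 10))
starts <- rough_starts(x, K = 2)
starts
```

A vector of this form could then be supplied as `starts` together with `initial=TRUE`.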

Details

The mixture model is assumed to be identifiable. For the skew-normal case we have $\theta_j=(\alpha_j,\beta_j,\lambda_j)^{T}$, in which $-\infty<\alpha_j<\infty$, $\beta_j>0$, and $-\infty<\lambda_j<\infty$ are the location, scale, and skewness parameters of the $j$-th component, respectively; see Azzalini (1985).

Value

The output has three parts. The first part is the vector of estimated weight, shape, and scale parameters.
The second part is a sequence of goodness-of-fit measures consisting of the Akaike Information Criterion (AIC), Consistent Akaike Information Criterion (CAIC), Bayesian Information Criterion (BIC), Hannan-Quinn information criterion (HQIC), Anderson-Darling (AD), Cramer-von Mises (CVM), Kolmogorov-Smirnov (KS), and log-likelihood (log-likelihood) statistics.
The last part of the output contains the clustering vector.
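
For orientation, the likelihood-based criteria listed above can be computed from a maximized log-likelihood as sketched below. The helper `info_criteria` is hypothetical, and the CAIC formula shown is one common convention (conventions differ between authors, and the package may use a different one).

```r
# Hypothetical helper (not part of ForestFit): information criteria from a
# maximized log-likelihood `loglik`, `p` free parameters, and `n` observations.
info_criteria <- function(loglik, p, n) {
  c(AIC  = -2 * loglik + 2 * p,
    # Assumed convention for the consistent AIC; other definitions exist.
    CAIC = -2 * loglik + p * (log(n) + 1),
    BIC  = -2 * loglik + p * log(n),
    HQIC = -2 * loglik + 2 * p * log(log(n)))
}

ic <- info_criteria(loglik = -100, p = 3, n = 50)
ic
```

Smaller values indicate a better trade-off between fit and model complexity under each criterion.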

Author(s)

Mahdi Teimouri

References

A. Azzalini, 1985. A class of distributions which includes the normal ones, Scandinavian Journal of Statistics, 12, 171-178.

A. P. Dempster, N. M. Laird, and D. B. Rubin, 1977. Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society Series B, 39, 1-38.

M. Teimouri, S. Rezakhah, and A. Mohammadpour, 2018. EM algorithm for symmetric stable mixture model, Communications in Statistics-Simulation and Computation, 47(2), 582-604.

Examples

# Here we model the northern hardwood uneven-age forest data (HW$DIA) in inches using a
# 3-component Weibull mixture distribution.
data(HW)
data<-HW$DIA
K<-3
fitmixture(data,"weibull", K, initial=FALSE)

Estimating parameters of the well-known mixture models fitted to the grouped data

Description

Estimates the parameters of the gamma, log-normal, and Weibull mixture models fitted to grouped data using the expectation-maximization (EM) algorithm. The general form of the cdf of a statistical mixture model is given by

$F(x,{\Theta}) = \sum_{k=1}^{K}\omega_k F_k(x,\theta_k),$

where $\Theta=(\theta_1,\dots,\theta_K)^T$ is the whole parameter vector, $\theta_k$ for $k=1,\dots,K$ is the parameter vector of the $k$-th component, i.e. $\theta_k=(\alpha_k,\beta_k)^{T}$, $F_k(.,\theta_k)$ is the cdf of the $k$-th component, and the known constant $K$ is the number of components. The parameters $\alpha$ and $\beta$ are the shape and scale parameters. The weights $\omega_k$ sum to one, i.e. $\sum_{k=1}^{K}\omega_k=1$. The families considered for the cdf $F$ include Gamma, Log-normal, and Weibull. Suppose a sample of $n$ independent observations, each following a distribution with cdf $F$, has been divided into $m$ separate groups of the form $(r_{i-1},r_i]$, for $i=1,\dots,m$, with observed group frequencies $f_1,\dots,f_m$. Then the likelihood function of the observed data is given by

$L(\Theta|f_1,\dots,f_m)=\frac{n!}{f_{1}!f_{2}!\dots f_{m}!}\prod_{i=1}^{m}\Bigl[\frac{F_i(\Theta)}{F(\Theta)}\Bigr]^{f_i},$

where

$F_i(\Theta)=\sum_{k=1}^{K}\omega_k\int_{r_{i-1}}^{r_i}f(x|\theta_k)dx,$

$F(\Theta)=\sum_{k=1}^{K}\omega_k\int_{r_{0}}^{r_m}f(x|\theta_k)dx,$

in which $f(x|\theta_k)$ denotes the pdf of the $k$-th component. Using the EM algorithm proposed by Dempster et al. (1977), we can solve $\partial L(\Theta|f_1,\dots,f_m)/{\partial \Theta}=0$ by introducing two new missing variables.
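
As a minimal sketch, the logarithm of this likelihood can be evaluated in base R for a Weibull mixture as follows. The helper `grouped_loglik` is hypothetical (not a ForestFit function), and it assumes that $F(\Theta)$ is the mixture probability of the whole range $(r_0,r_m]$, as in likelihoods for grouped and truncated data.

```r
# Hypothetical helper (not part of ForestFit): grouped-data log-likelihood of a
# K-component Weibull mixture with boundaries r (length m + 1) and counts f.
grouped_loglik <- function(r, f, omega, alpha, beta) {
  # Mixture cdf evaluated at each group boundary r_0, ..., r_m.
  Fr <- sapply(r, function(q) sum(omega * pweibull(q, shape = alpha, scale = beta)))
  Fi <- diff(Fr)                  # F_i(Theta): probability of (r_{i-1}, r_i]
  Ftot <- Fr[length(r)] - Fr[1]   # F(Theta): probability of (r_0, r_m] (assumed)
  n <- sum(f)
  # log of n! / (f_1! ... f_m!) plus the sum of f_i * log(F_i / F).
  lgamma(n + 1) - sum(lgamma(f + 1)) + sum(f * log(Fi / Ftot))
}

r <- c(0, 1, 2, 4)
f <- c(5, 10, 3)
ll <- grouped_loglik(r, f, omega = c(0.3, 0.7), alpha = c(1, 2), beta = c(2, 1))
ll
```

Because the likelihood is a multinomial probability, the value agrees with `dmultinom(f, prob = Fi / Ftot, log = TRUE)` for the same cell probabilities.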

Usage

fitmixturegrouped(family, r, f, K, initial=FALSE, starts)

Arguments

`family`	Name of the family including: "`gamma`", "`log-normal`", "`skew-normal`", and "`weibull`".
`r`	A numeric vector of length $m+1$. The first element of $r$ is the lower bound of the first group and the other $m$ elements are the upper bounds of the $m$ groups. Note that the upper bound of the $(i-1)$-th group is the lower bound of the $i$-th group, for $i=2,\dots,m$. The lower bound of the first group and the upper bound of the $m$-th group are chosen arbitrarily. If raw data are available, the smallest and largest observations are chosen as the lower bound of the first group and the upper bound of the $m$-th group, respectively.
`f`	A numeric vector of length $m$ containing the group frequencies.
`K`	Number of components.
`initial`	Logical. If `FALSE` (the default), the initial values are determined automatically by the k-means method of clustering; if `TRUE`, they are taken from `starts`.
`starts`	The sequence of initial values $\omega_1,\dots,\omega_K,\alpha_1,\dots,\alpha_K,\beta_1,\dots,\beta_K$, which must be given if `initial=TRUE`. For the skew-normal case, the vector of initial values of the skewness parameters is appended.

Details

The mixture model is assumed to be identifiable. For the skew-normal mixture model, the parameter vector of the $k$-th component takes the form $\theta_k=(\alpha_k,\beta_k,\lambda_k)^{T}$, where $\alpha_k$, $\beta_k$, and $\lambda_k$ denote the location, scale, and skewness parameters, respectively.

Value

The output has two parts. The first part is the vector of estimated weight, shape, and scale parameters.
The second part is a sequence of goodness-of-fit measures consisting of the Akaike Information Criterion (AIC), Consistent Akaike Information Criterion (CAIC), Bayesian Information Criterion (BIC), Hannan-Quinn information criterion (HQIC), Anderson-Darling (AD), Cramer-von Mises (CVM), Kolmogorov-Smirnov (KS), and log-likelihood (log-likelihood) statistics.

Author(s)

Mahdi Teimouri

References

G. J. McLachlan and P. N. Jones, 1988. Fitting mixture models to grouped and truncated data via the EM algorithm, Biometrics, 44, 571-578.

Examples

n<-50
K<-2
m<-10
weight<-c(0.3,0.7)
alpha<-c(1,2)
beta<-c(2,1)
param<-c(weight,alpha,beta)
data<-rmixture(n, "weibull", K, param)
r<-seq(min(data),max(data),length=m+1)
D<-data.frame(table(cut(data,r,labels=NULL,include.lowest=TRUE,right=FALSE,dig.lab=4)))
f<-D$Freq
fitmixturegrouped("weibull",r,f,K,initial=FALSE)

Estimating parameters of the Weibull distribution through classical methods

Description

Estimates the parameters of the two- and three-parameter Weibull model with pdf and cdf given by

$f(x;\alpha,\beta,\theta)=\frac{\alpha}{\beta} \left(\frac{x-\theta}{\beta }\right)^{\alpha -1} \exp \biggl\{-\left(\frac{x-\theta}{\beta } \right)^{\alpha } \biggr\},$

and

$F(x;\alpha,\beta,\theta)=1- \exp \biggl\{-\left(\frac{x-\theta}{\beta } \right)^{\alpha } \biggr\},$

where $x>\theta$, $\alpha > 0$, $\beta >0$, and $-\infty<\theta<\infty$. Here, the parameters $\alpha$, $\beta$, and $\theta$ are known in the literature as the shape, scale, and location parameters, respectively. If $\theta=0$, then $f(x;\alpha,\beta)$ and $F(x;\alpha,\beta)$ above are the pdf and cdf of a two-parameter Weibull distribution, respectively.
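
The two formulas translate directly into base R. The sketch below uses hypothetical helper names (`dweibull3` and `pweibull3` are not ForestFit functions) and sets both functions to zero for $x\le\theta$.

```r
# Hypothetical helpers (not part of ForestFit): pdf and cdf of the
# three-parameter Weibull distribution with shape alpha, scale beta, and
# location theta.
dweibull3 <- function(x, alpha, beta, theta = 0) {
  z <- pmax(x - theta, 0) / beta
  ifelse(x > theta, alpha / beta * z^(alpha - 1) * exp(-z^alpha), 0)
}
pweibull3 <- function(x, alpha, beta, theta = 0) {
  z <- pmax(x - theta, 0) / beta
  1 - exp(-z^alpha)
}

x <- c(3.5, 4, 6)
dweibull3(x, alpha = 2, beta = 2, theta = 3)
pweibull3(x, alpha = 2, beta = 2, theta = 3)
```

Shifting the data by $\theta$ reduces both functions to the two-parameter case, so the results agree with `dweibull(x - 3, shape = 2, scale = 2)` and `pweibull(x - 3, shape = 2, scale = 2)`.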

Usage

fitWeibull(data, location, method, starts)

Arguments

`data`	Vector of observations.
`starts`	Initial values for starting iterative procedures such as Newton-Raphson.
`location`	Either `TRUE` or `FALSE`. If `location=TRUE`, the shift (location) parameter is estimated; otherwise it is omitted.
`method`	Used method for estimating the parameters. In the two-parameter case, methods are "`greg1`" (for the method of generalized regression type 1), "`greg2`" (for the method of generalized regression type 2), "`lm`" (for the method of L-moment), "`ml`" (for the method of maximum likelihood (ML)), "`mlm`" (for the method of logarithmic moment), "`moment`" (for the method of moment), "`pm`" (for the method of percentile), "`rank`" (for the method of rank correlation), "`reg`" (for the method of least square), "`ustat`" (for the method of U-statistic), "`wml`" (for the method of weighted ML), and "`wreg`" (for the method of weighted least square). In three-parameter case the methods are "`mle`" (for the method of ML), "`mm1`" (for the method of modified moment (MM) type 1), "`mm2`" (for the method of MM type 2), "`mm3`" (for the method of MM type 3), "`mml1`" (for the method of modified ML type 1), "`mml2`" (for the method of modified ML type 2), "`mml3`" (for the method of modified ML type 3), "`mml4`" (for the method of modified ML type 4), "`moment`" (for the method of moment), "`mps`" (for the method of maximum product spacing), "`tlm`" (for the method of T-L moment), and "`wml`" (for the method of weighted ML).
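
To illustrate the kind of estimator listed under `method`, the sketch below applies a generic least-squares (probability-plot) approach to the two-parameter Weibull distribution in base R. It is not the package's implementation of "`reg`"; the plotting positions $(i-0.3)/(n+0.4)$ are an assumed, commonly used choice.

```r
# Generic least-squares sketch for the two-parameter Weibull distribution.
# Since log(-log(1 - F(x))) = alpha * log(x) - alpha * log(beta), regressing the
# left-hand side on log(x) gives the shape as the slope.
set.seed(1)
x <- sort(rweibull(500, shape = 2, scale = 3))
n <- length(x)
p <- (seq_len(n) - 0.3) / (n + 0.4)   # assumed plotting positions
fit <- lm(log(-log(1 - p)) ~ log(x))
alpha_hat <- unname(coef(fit)[2])
beta_hat <- unname(exp(-coef(fit)[1] / coef(fit)[2]))
c(shape = alpha_hat, scale = beta_hat)
```

The estimates should lie near the true shape and scale used to simulate the sample.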

Details

For the method `wml`, weights are provided only for sample sizes less than or equal to 100. This means that the methods `ml` and `wml` give the same estimates for samples of size larger than 100.

Value

A list of objects in two parts given by the following:

Estimated parameters for two- or three-parameter Weibull distribution.
A sequence of goodness-of-fit measures consisting of the Akaike Information Criterion (AIC), Consistent Akaike Information Criterion (CAIC), Bayesian Information Criterion (BIC), Hannan-Quinn information criterion (HQIC), Anderson-Darling (AD), Cramer-von Mises (CVM), Kolmogorov-Smirnov (KS), and log-likelihood (log-likelihood) statistics.

Author(s)

Mahdi Teimouri

References

R. C. H. Cheng and M. A. Stephens, 1989. A goodness-of-fit test using Moran's statistic with estimated parameters, Biometrika, 76(2), 385-392.

C. A. Clifford and B. Whitten, 1982. Modified maximum likelihood and modified moment estimators for the three-parameter Weibull distribution, Communication in Statistics-Theory and Methods, 11(23), 2631-2656.

D. Cousineau, 2009. Nearly unbiased estimators for the three-parameter Weibull distribution with greater efficiency than the iterative likelihood method, British Journal of Mathematical and Statistical Psychology, 62, 167-191.

G. Cran, 1988. Moment estimators for the 3-parameter Weibull distribution, IEEE Transactions on Reliability, 37(4), 360-363.

J. R. Hosking, 1990. L-moments: analysis and estimation of distributions using linear combinations of order statistics, Journal of the Royal Statistical Society. Series B (Methodological), 52(1), 105-124.

Y. M. Kantar, 2015. Generalized least squares and weighted least squares estimation methods for distributional parameters, REVSTAT-Statistical Journal, 13(3), 263-282.

M. Teimouri and S. Nadarajah, 2012. A simple estimator for the Weibull shape parameter, International Journal of Structural Stability and Dynamics, 12(2), 2395-402.

M. Teimouri, S. M. Hoseini, and S. Nadarajah, 2013. Comparison of estimation methods for the Weibull distribution, Statistics, 47(1), 93-109.

F. Wang and J. B. Keats, 1995. Improved percentile estimation for the two-parameter Weibull distribution, Microelectronics Reliability, 35(6), 883-892.

L. Zhang, M. Xie, and L. Tang, 2008. On Weighted Least Squares Estimation for the Parameters of Weibull Distribution. In: Pham H. (eds) Recent Advances in Reliability and Quality in Design. Springer Series in Reliability Engineering. Springer, London.

Examples

n<-100
alpha<-2
beta<-2
theta<-3
data<-rweibull(n,shape=alpha,scale=beta)+theta
starts<-c(2,2,3)
fitWeibull(data, TRUE, "mps", starts)
fitWeibull(data, TRUE, "wml", starts)
fitWeibull(data, FALSE, "mlm", starts)
fitWeibull(data, FALSE, "ustat", starts)

Mixed northern hardwood

Description

Tree list from U.S. Forest Service Forest Inventory and Analysis (FIA) plot PLT_CN 247006253010661, measured in 2012, which represents a typical northern hardwood uneven-age forest.

Usage

data(HW)

Format

A data frame containing 25 trees (rows) and two columns. Columns are the trees' scientific name and diameter at breast height in inches.

Computing cumulative distribution function of the gamma shape mixture model

Description

Computes cumulative distribution function (cdf) of the gamma shape mixture (GSM) model. The general form for the cdf of the GSM model is given by

$F(x,{\Theta}) = \sum_{j=1}^{K}\omega_j F(x,j,\beta),$

where

$F(x,j,\beta) = \int_{0}^{x} \frac{\beta^j}{\Gamma(j)} y^{j-1} \exp\bigl( -\beta y\bigr) dy,$

in which $\Theta=(\omega_1,\dots,\omega_K, \beta)^T$ is the parameter vector and known constant $K$ is the number of components. The vector of mixing parameters is given by $\omega=(\omega_1,\dots,\omega_K)^T$ where $\omega_j$ s sum to one, i.e., $\sum_{j=1}^{K}\omega_j=1$ . Here $\beta$ is the rate parameter that is equal for all components.
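
As a minimal base-R sketch, the GSM cdf is a weighted sum of gamma cdfs with integer shapes $1,\dots,K$ and a common rate. The helper `gsm_cdf` below is hypothetical and is not the package's `pgsm`.

```r
# Hypothetical helper (not part of ForestFit): cdf of the gamma shape mixture.
gsm_cdf <- function(x, omega, beta) {
  shapes <- seq_along(omega)
  # Rows index the points x; columns index the components j = 1, ..., K.
  comp <- outer(x, shapes, function(q, j) pgamma(q, shape = j, rate = beta))
  as.vector(comp %*% omega)
}

omega <- c(0.05, 0.1, 0.15, 0.2, 0.25, 0.25)
gsm_cdf(c(0, 1, 5, 20), omega, beta = 2)
```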

Usage

pgsm(data, omega, beta, log.p = FALSE, lower.tail = TRUE)

Arguments

`data`	Vector of observations.
`omega`	Vector of the mixing parameters.
`beta`	The rate parameter.
`log.p`	If `TRUE`, then log(cdf) is returned.
`lower.tail`	If `FALSE`, then `1-cdf` is returned.

Value

A vector of the same length as data, giving the cdf of the GSM model.

Author(s)

Mahdi Teimouri

References

S. Venturini, F. Dominici, and G. Parmigiani, 2008. Gamma shape mixtures for heavy-tailed distributions, The Annals of Applied Statistics, 2(2), 756–776.

Examples

data<-seq(0,20,0.1)
omega<-c(0.05, 0.1, 0.15, 0.2, 0.25, 0.25)
beta<-2
pgsm(data, omega, beta)

Computing the cumulative distribution function of Johnson's SB (JSB) distribution

Description

Computes the cumulative distribution function of the four-parameter JSB distribution given by

$F\bigl(x\big|\Theta\bigr) = \int_{\xi}^{x}\frac {\delta \lambda}{\sqrt{2\pi}(u-\xi)(\lambda+\xi-u)}\exp\Biggl\{-\frac{1}{2}\Bigg[\gamma+\delta\log \biggl(\frac{u-\xi}{\lambda+\xi-u}\biggr) \Bigg]^2\Biggr\} du,$

where $\xi<x<\lambda+\xi$ , $\Theta=(\delta,\gamma,\lambda,\xi)^T$ with $\delta, \lambda> 0$ , $-\infty<\gamma<\infty$ , and $-\infty<\xi<\infty$ .
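
Since the integrand is a standard normal density evaluated at $z=\gamma+\delta\log\bigl((u-\xi)/(\lambda+\xi-u)\bigr)$ times $dz/du$, the integral has the closed form $\Phi(z)$ at $u=x$. The sketch below checks this numerically in base R; `jsb_pdf` and `jsb_cdf` are hypothetical helpers, not the package's functions.

```r
# Hypothetical helpers (not part of ForestFit): JSB density and closed-form cdf.
jsb_pdf <- function(u, delta, gamma, lambda, xi) {
  z <- gamma + delta * log((u - xi) / (lambda + xi - u))
  delta * lambda / (sqrt(2 * pi) * (u - xi) * (lambda + xi - u)) * exp(-z^2 / 2)
}
jsb_cdf <- function(x, delta, gamma, lambda, xi) {
  pnorm(gamma + delta * log((x - xi) / (lambda + xi - x)))
}

# Compare numerical integration of the density with the closed form at x = 6.
num <- integrate(jsb_pdf, lower = 5, upper = 6,
                 delta = 1, gamma = 3, lambda = 12, xi = 5)$value
closed <- jsb_cdf(6, delta = 1, gamma = 3, lambda = 12, xi = 5)
c(numerical = num, closed_form = closed)
```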

Usage

pjsb(data, param, log.p = FALSE, lower.tail = TRUE)

Arguments

`data`	Vector of observations.
`param`	Vector of the parameters $\delta$ , $\gamma$ , $\lambda$ , and $\xi$ .
`log.p`	If `TRUE`, then log(cdf) is returned.
`lower.tail`	If `FALSE`, then `1-cdf` is returned.

Value

A vector of the same length as data, giving the cdf of the JSB distribution computed at data.

Author(s)

Mahdi Teimouri

Examples

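# The support of the JSB distribution is (xi, xi + lambda) = (5, 17), so the cdf is evaluated there.
data<-runif(10, min=5, max=17)
param<-c(1, 3, 12, 5) # delta, gamma, lambda, xi
pjsb(data, param, log.p = FALSE, lower.tail = TRUE)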

Computing cumulative distribution function of the well-known mixture models

Description

Computes cumulative distribution function (cdf) of the mixture model. The general form for the cdf of the mixture model is given by

$F(x,{\Theta}) = \sum_{j=1}^{K}\omega_j F(x,\theta_j),$

where $\Theta=(\theta_1,\dots,\theta_K)^T$ is the whole parameter vector, $\theta_j$ for $j=1,\dots,K$ is the parameter vector of the $j$-th component, i.e. $\theta_j=(\alpha_j,\beta_j)^{T}$, $F(.,\theta_j)$ is the cdf of the $j$-th component, and the known constant $K$ is the number of components. The vector of mixing parameters is given by $\omega=(\omega_1,\dots,\omega_K)^T$, where the $\omega_j$ sum to one, i.e., $\sum_{j=1}^{K}\omega_j=1$. The parameters $\alpha$ and $\beta$ are either the shape and scale parameters or both shape parameters. In the latter case, $\alpha$ and $\beta$ are called the first and second shape parameters, respectively. The families considered for each component include Birnbaum-Saunders, Burr type XII, Chen, F, Frechet, Gamma, Gompertz, Log-normal, Log-logistic, Lomax, skew-normal, and Weibull.
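
A minimal base-R sketch of this cdf for the Weibull family is given below. The helper `weibull_mix_cdf` is hypothetical, and it assumes the parameter vector is laid out as weights, then shapes, then scales, which is the order used in this manual's examples.

```r
# Hypothetical helper (not part of ForestFit): cdf of a K-component Weibull
# mixture with param = c(omega, alpha, beta).
weibull_mix_cdf <- function(x, K, param) {
  omega <- param[1:K]
  alpha <- param[(K + 1):(2 * K)]
  beta <- param[(2 * K + 1):(3 * K)]
  # One column per component: omega_j * F(x, theta_j).
  comp <- sapply(seq_len(K), function(j) {
    omega[j] * pweibull(x, shape = alpha[j], scale = beta[j])
  })
  rowSums(matrix(comp, ncol = K))
}

param <- c(0.6, 0.4, 1, 2, 2, 1)
weibull_mix_cdf(c(0, 1, 5), K = 2, param = param)
```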

Usage

pmixture(data, g, K, param)

Arguments

`data`	Vector of observations.
`g`	Name of the family including: "`birnbaum-saunders`", "`burrxii`", "`chen`", "`f`", "`frechet`", "`gamma`", "`gompetrz`", "`log-normal`", "`log-logistic`", "`lomax`", "`skew-normal`", and "`weibull`".
`K`	Number of components.
`param`	Vector of the $\omega$ , $\alpha$ , $\beta$ , and $\lambda$ .

Details

For the skew-normal case, $\alpha$ , $\beta$ , and $\lambda$ are the location, scale, and skewness parameters, respectively.

Value

A vector of the same length as data, giving the cdf of the mixture model computed at data.

Author(s)

Mahdi Teimouri

Examples

data<-seq(0,20,0.1)
K<-2
weight<-c(0.6,0.4)
alpha<-c(1,2)
beta<-c(2,1)
param<-c(weight,alpha,beta)
pmixture(data, "weibull", K, param)

Simulating realizations from the gamma shape mixture model

Description

Simulates realizations from a gamma shape mixture (GSM) model with probability density function given by

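$f(x,{\Theta}) = \sum_{j=1}^{K}\omega_j \frac{\beta^j}{\Gamma(j)} x^{j-1} \exp\bigl( -\beta x\bigr),$

where $\Theta=(\omega_1,\dots,\omega_K, \beta)^T$ is the parameter vector, the mixing parameters $\omega_j$ sum to one, and $\beta$ is the rate parameter that is equal for all components.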
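
One standard way to draw from such a mixture is to first sample the component index with probabilities $\omega_j$ and then draw from the gamma distribution with shape $j$. The sketch below does this in base R; `sim_gsm` is a hypothetical helper and not necessarily how `rgsm` is implemented.

```r
# Hypothetical helper (not part of ForestFit): draws from a gamma shape mixture.
sim_gsm <- function(n, omega, beta) {
  # Component labels j in 1, ..., K, drawn with probabilities omega.
  j <- sample(seq_along(omega), size = n, replace = TRUE, prob = omega)
  # Given the label, the draw is gamma with shape j and common rate beta.
  rgamma(n, shape = j, rate = beta)
}

set.seed(1)
omega <- c(0.05, 0.1, 0.15, 0.2, 0.25, 0.25)
x <- sim_gsm(10000, omega, beta = 2)
mean(x)   # should be close to sum(omega * 1:6) / 2
```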

Usage

rgsm(n, omega, beta)

Arguments

`n`	Number of requested random realizations.
`omega`	Vector of the mixing parameters.
`beta`	The rate parameter.

Value

A vector of length n, giving random generated values from GSM model.

Author(s)

Mahdi Teimouri

References

S. Venturini, F. Dominici, and G. Parmigiani, 2008. Gamma shape mixtures for heavy-tailed distributions, The Annals of Applied Statistics, 2(2), 756–776.

Examples

n<-100
omega<-c(0.05, 0.1, 0.15, 0.2, 0.25, 0.25)
beta<-2
rgsm(n, omega, beta)

Simulating realizations from the Johnson's SB (JSB) distribution

Description

Simulates realizations from four-parameter JSB distribution with probability density function given by

$f\bigl(x\big|\Theta\bigr) = \frac {\delta \lambda}{\sqrt{2\pi}(x-\xi)(\lambda+\xi-x)}\exp\Biggl\{-\frac{1}{2}\Bigg[\gamma+\delta\log \biggl(\frac{x-\xi}{\lambda+\xi-x}\biggr) \Bigg]^2\Biggr\},$

where $\xi<x<\lambda+\xi$ , $\Theta=(\delta,\gamma,\lambda,\xi)^T$ with $\delta>0$ , $\lambda> 0$ , $-\infty<\gamma<\infty$ , and $-\infty<\xi<\infty$ .
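
Because $\gamma+\delta\log\bigl((x-\xi)/(\lambda+\xi-x)\bigr)$ is standard normal under this density, realizations can be obtained by transforming standard normal draws. The following base-R sketch uses a hypothetical helper `sim_jsb`; it is not necessarily how `rjsb` is implemented.

```r
# Hypothetical helper (not part of ForestFit): JSB draws by transforming
# standard normal variates z through x = xi + lambda / (1 + exp(-(z - gamma) / delta)).
sim_jsb <- function(n, delta, gamma, lambda, xi) {
  z <- rnorm(n)
  xi + lambda / (1 + exp(-(z - gamma) / delta))
}

set.seed(1)
x <- sim_jsb(10000, delta = 1, gamma = 3, lambda = 12, xi = 5)
range(x)   # all draws fall inside (xi, xi + lambda) = (5, 17)
```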

Usage

rjsb(n, param)

Arguments

`n`	Number of requested random realizations.
`param`	Vector of the parameters $\delta$ , $\gamma$ , $\lambda$ , and $\xi$ .

Value

A vector of length n, giving random generated values from JSB distribution.

Author(s)

Mahdi Teimouri

Examples

n<-100
param<-c(1, 3, 12, 5) # delta, gamma, lambda, xi
rjsb(n, param)

Simulating realizations from bivariate Johnson's SB (JSBB) distribution

Description

Simulates realizations from the bivariate Johnson's SB (JSBB) distribution.

Usage

rjsbb(n, param)

Arguments

`n`	Number of requested random realizations.
`param`	Vector of the parameters $\boldsymbol{\delta}$, $\boldsymbol{\gamma}$, $\boldsymbol{\lambda}$, $\boldsymbol{\xi}$, and $\rho$, in that order.

Value

A vector of length n, giving random generated values from JSBB distribution.

Author(s)

Mahdi Teimouri

Examples

Delta <- c(2.5, 3)
Gamma <- c(2,1)
Lambda <- c(1, 3)
Xi <- c(0, 2)
rho <- -0.5
param <- c(Delta, Gamma, Lambda, Xi, rho)
rjsbb(20, param)

Generating random realizations from the well-known mixture models

Description

Generates iid realizations from the mixture model with pdf given by

$f(x,{\Theta}) = \sum_{j=1}^{K}\omega_j f(x,\theta_j),$

where $K$ is the number of components, $\theta_j$ for $j=1,\dots,K$ is the parameter vector of the $j$-th component, i.e. $\theta_j=(\alpha_j,\beta_j)^{T}$, and $\Theta=(\theta_1,\dots,\theta_K)^{T}$ is the whole parameter vector. The parameters $\alpha$ and $\beta$ are either the shape and scale parameters or both shape parameters. In the latter case, $\alpha$ and $\beta$ are called the first and second shape parameters, respectively. The weights $\omega_j$ sum to one, i.e., $\sum_{j=1}^{K}\omega_j=1$. The families considered for the pdf $f$ include Birnbaum-Saunders, Burr type XII, Chen, F, Frechet, Gamma, Gompertz, Log-normal, Log-logistic, Lomax, skew-normal, and Weibull.
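
For the Weibull family, iid realizations from this pdf can be sketched in base R by drawing a component label with probabilities $\omega_j$ and then sampling from that component. The helper `sim_weibull_mix` is hypothetical, assumes the layout `param = c(omega, alpha, beta)` used in this manual's examples, and is not necessarily how `rmixture` works internally.

```r
# Hypothetical helper (not part of ForestFit): draws from a K-component
# Weibull mixture with param = c(omega, alpha, beta).
sim_weibull_mix <- function(n, K, param) {
  omega <- param[1:K]
  alpha <- param[(K + 1):(2 * K)]
  beta <- param[(2 * K + 1):(3 * K)]
  # Component label of each draw, then a Weibull draw with that component's
  # shape and scale (rweibull is vectorized over its parameters).
  j <- sample(seq_len(K), size = n, replace = TRUE, prob = omega)
  rweibull(n, shape = alpha[j], scale = beta[j])
}

set.seed(1)
param <- c(0.3, 0.7, 1, 2, 2, 1)
x <- sim_weibull_mix(10000, K = 2, param = param)
mean(x)   # compare with 0.3 * 2 * gamma(2) + 0.7 * 1 * gamma(1.5)
```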

Usage

rmixture(n, g, K, param)

Arguments

`n`	Number of requested random realizations.
`g`	Name of the family including "`birnbaum-saunders`", "`burrxii`", "`chen`", "`f`", "`frechet`", "`gamma`", "`gompetrz`", "`log-normal`", "`log-logistic`", "`lomax`", "`skew-normal`", and "`weibull`".
`K`	Number of components.
`param`	Vector of the $\omega$ , $\alpha$ , $\beta$ , and $\lambda$ .

Details

For the skew-normal case, $\alpha$ , $\beta$ , and $\lambda$ are the location, scale, and skewness parameters, respectively.

Value

A vector of length $n$ , giving a sequence of random realizations from given mixture model.

Author(s)

Mahdi Teimouri

Examples

n<-50
K<-2
weight<-c(0.3,0.7)
alpha<-c(1,2)
beta<-c(2,1)
param<-c(weight,alpha,beta)
rmixture(n, "weibull", K, param)

Robust multiple linear regression modelling when error term follows a skew Student's $t$ distribution

Description

Robust multiple linear regression modelling with skew Student's $t$ error term. The density function of skew Student's $t$ is given by

$f(x,{\Theta}) = \frac{2}{\sigma} t\bigl(z;\nu\bigr) T\biggl(\lambda z\sqrt{\frac{\nu+1}{\nu+z^2}};\nu+1\biggr),$

where $z=(x-\mu)/\sigma$, $-\infty<\mu<\infty$ is the location parameter, $\sigma>0$ is the scale parameter, and $-\infty<\lambda<\infty$ is the skewness parameter. Also, $t(u;\nu)$ and $T(u;\nu)$ denote the density and distribution functions of the Student's $t$ distribution with $\nu$ degrees of freedom at point $u$, respectively. If $\lambda=0$, the skew Student's $t$ distribution reduces to the ordinary Student's $t$ distribution, which is symmetric around $\mu$. Because Student's $t$ is a heavy-tailed distribution, it is useful for regression analysis in the presence of outliers.
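
The density can be written in base R with `dt` and `pt`, as in the sketch below. The helper `dskewt` is hypothetical and not a ForestFit function; the checks illustrate that it integrates to one and reduces to the scaled Student's $t$ density when $\lambda=0$.

```r
# Hypothetical helper (not part of ForestFit): skew Student's t density with
# location mu, scale sigma, skewness lambda, and nu degrees of freedom.
dskewt <- function(x, mu, sigma, lambda, nu) {
  z <- (x - mu) / sigma
  2 / sigma * dt(z, df = nu) *
    pt(lambda * z * sqrt((nu + 1) / (nu + z^2)), df = nu + 1)
}

# Total probability mass for one parameter choice.
total <- integrate(dskewt, -Inf, Inf, mu = 1, sigma = 2, lambda = 3, nu = 4)$value
total

# With lambda = 0 the density equals dt((x - mu) / sigma, nu) / sigma.
x <- c(-2, 0, 1, 3)
sym <- dskewt(x, mu = 1, sigma = 2, lambda = 0, nu = 4)
sym
```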

Usage

skewtreg(y, x, Fisher=FALSE)

Arguments

`y`	vector of response variable.
`x`	vector or matrix of explanatory variable(s).
`Fisher`	Either `TRUE` or `FALSE`. By default `Fisher=FALSE`; if `TRUE`, the observed Fisher information matrix and the asymptotic standard errors of the estimated regression coefficients are evaluated.

Value

A list of estimated regression coefficients, asymptotic standard error, corresponding p-values, estimated parameters of error term (skew Student's $t$ ), F statistic, R-square and adjusted R-square, and observed Fisher information matrix is given.

Author(s)

Mahdi Teimouri

Examples


n<-100
x<-rnorm(n)
y<-2+2*x+rt(n,df=2)
skewtreg(y,x,Fisher=FALSE)

Southern loblolly pine plantation

Description

Tree list from U.S. Forest Service Forest Inventory and Analysis (FIA) plot PLT_CN 259082471010854, measured in 2011, which represents a typical loblolly pine plantation.

Usage

data(SW)

Format

A data frame containing 18 trees (rows) and two columns. Columns are the trees' scientific name and diameter at breast height in inches.

Starting message when loading ForestFit

Description

Contains a welcome message for users of ForestFit.
