Remove ads
Probability distribution From Wikipedia, the free encyclopedia
The scaled inverse chi-squared distribution , where is the scale parameter, equals the univariate inverse Wishart distribution with degrees of freedom .
This family of scaled inverse chi-squared distributions is linked to the inverse-chi-squared distribution and to the chi-squared distribution:
If then as well as and .
Instead of , the scaled inverse chi-squared distribution is however most frequently parametrized by the scale parameter and the distribution is denoted by .
In terms of the above relations can be written as follows:
If then as well as and .
This family of scaled inverse chi-squared distributions is a reparametrization of the inverse-gamma distribution.
Specifically, if
Either form may be used to represent the maximum entropy distribution for a fixed first inverse moment and first logarithmic moment .
The scaled inverse chi-squared distribution also has a particular use in Bayesian statistics. Specifically, the scaled inverse chi-squared distribution can be used as a conjugate prior for the variance parameter of a normal distribution. The same prior in alternative parametrization is given by the inverse-gamma distribution.
The probability density function of the scaled inverse chi-squared distribution extends over the domain and is
where is the degrees of freedom parameter and is the scale parameter. The cumulative distribution function is
where is the incomplete gamma function, is the gamma function and is a regularized gamma function. The characteristic function is
where is the modified Bessel function of the second kind.
The maximum likelihood estimate of is
The maximum likelihood estimate of can be found using Newton's method on:
where is the digamma function. An initial estimate can be found by taking the formula for mean and solving it for Let be the sample mean. Then an initial estimate for is given by:
The scaled inverse chi-squared distribution has a second important application, in the Bayesian estimation of the variance of a Normal distribution.
According to Bayes' theorem, the posterior probability distribution for quantities of interest is proportional to the product of a prior distribution for the quantities and a likelihood function:
where D represents the data and I represents any initial information about σ2 that we may already have.
The simplest scenario arises if the mean μ is already known; or, alternatively, if it is the conditional distribution of σ2 that is sought, for a particular assumed value of μ.
Then the likelihood term L(σ2|D) = p(D|σ2) has the familiar form
Combining this with the rescaling-invariant prior p(σ2|I) = 1/σ2, which can be argued (e.g. following Jeffreys) to be the least informative possible prior for σ2 in this problem, gives a combined posterior probability
This form can be recognised as that of a scaled inverse chi-squared distribution, with parameters ν = n and τ2 = s2 = (1/n) Σ (xi-μ)2
Gelman and co-authors remark that the re-appearance of this distribution, previously seen in a sampling context, may seem remarkable; but given the choice of prior "this result is not surprising."[1]
In particular, the choice of a rescaling-invariant prior for σ2 has the result that the probability for the ratio of σ2 / s2 has the same form (independent of the conditioning variable) when conditioned on s2 as when conditioned on σ2:
In the sampling-theory case, conditioned on σ2, the probability distribution for (1/s2) is a scaled inverse chi-squared distribution; and so the probability distribution for σ2 conditioned on s2, given a scale-agnostic prior, is also a scaled inverse chi-squared distribution.
If more is known about the possible values of σ2, a distribution from the scaled inverse chi-squared family, such as Scale-inv-χ2(n0, s02) can be a convenient form to represent a more informative prior for σ2, as if from the result of n0 previous observations (though n0 need not necessarily be a whole number):
Such a prior would lead to the posterior distribution
which is itself a scaled inverse chi-squared distribution. The scaled inverse chi-squared distributions are thus a convenient conjugate prior family for σ2 estimation.
If the mean is not known, the most uninformative prior that can be taken for it is arguably the translation-invariant prior p(μ|I) ∝ const., which gives the following joint posterior distribution for μ and σ2,
The marginal posterior distribution for σ2 is obtained from the joint posterior distribution by integrating out over μ,
This is again a scaled inverse chi-squared distribution, with parameters and .
Seamless Wikipedia browsing. On steroids.
Every time you click a link to Wikipedia, Wiktionary or Wikiquote in your browser's search results, it will show the modern Wikiwand interface.
Wikiwand extension is a five stars, simple, with minimum permission required to keep your browsing private, safe and transparent.