Wishart distribution
From Wikipedia, the free encyclopedia
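| Parameters | $n > p - 1$ deg. of freedom (real); $\mathbf V > 0$ scale matrix ($p \times p$, pos. def) |
|---|---|
| Support | $\mathbf W$ is positive definite |
| Probability density function (pdf) | $\dfrac{\det(w)^{(n-p-1)/2}\exp\left[-\operatorname{trace}(\mathbf V^{-1}w/2)\right]}{2^{np/2}\det(\mathbf V)^{n/2}\,\Gamma_p(n/2)}$ |
| Cumulative distribution function (cdf) | |
| Mean | $n\mathbf V$ |
| Median | |
| Mode | $(n-p-1)\mathbf V$ for $n \ge p+1$ |
| Variance | $\operatorname{Var}(W_{ij}) = n\,(v_{ij}^2 + v_{ii}v_{jj})$ |
| Skewness | |
| Excess kurtosis | |
| Entropy | |
| Moment-generating function (mgf) | |
| Characteristic function | $\mathbf\Theta \mapsto \det(\mathbf I - 2i\mathbf\Theta\mathbf V)^{-n/2}$ |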
In statistics, the Wishart distribution, named in honor of John Wishart, is a generalization of the gamma distribution to multiple dimensions. It is any of a family of probability distributions for nonnegative-definite matrix-valued random variables ("random matrices"). These distributions are of great importance in the estimation of covariance matrices in multivariate statistics.
Definition
Suppose X is an n × p matrix, each row of which is independently drawn from a p-variate normal distribution with zero mean:
$$X_{(i)} = (x_i^1, \dots, x_i^p) \sim N_p(0, \mathbf V).$$
Then the Wishart distribution is the probability distribution of the p × p random matrix
$$\mathbf S = X^T X,$$
known as the scatter matrix. One indicates that S has that probability distribution by writing
$$\mathbf S \sim W_p(\mathbf V, n).$$
The positive integer n is the number of degrees of freedom. Sometimes this is written W(V, p, n).
If p = 1 and V = 1 then this distribution is a chi-square distribution with n degrees of freedom.
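The definition can be tried numerically. The following is a minimal sketch, assuming NumPy is available; the helper name `wishart_from_scatter`, the example matrix `V`, and the sample sizes are illustrative choices rather than part of the definition. It builds the scatter matrix from n zero-mean normal rows and compares the empirical average with nV, which is the expected value of the scatter matrix.

```python
import numpy as np

def wishart_from_scatter(V, n, rng):
    """Draw one scatter matrix S = X^T X, where the n rows of X are i.i.d. N_p(0, V)."""
    p = V.shape[0]
    L = np.linalg.cholesky(V)                 # V = L L^T
    X = rng.standard_normal((n, p)) @ L.T     # each row has covariance L L^T = V
    return X.T @ X

rng = np.random.default_rng(0)
V = np.array([[2.0, 0.5],
              [0.5, 1.0]])   # example scale matrix (assumed for illustration)
n = 5                        # degrees of freedom

samples = np.array([wishart_from_scatter(V, n, rng) for _ in range(20000)])
sample_mean = samples.mean(axis=0)   # should be close to n * V
```

With enough draws, `sample_mean` should approach `n * V` up to Monte Carlo error.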
Occurrence
The Wishart distribution arises frequently in likelihood-ratio tests in multivariate statistical analysis. It also arises in the spectral theory of random matrices.
Probability density function
The Wishart distribution can be characterized by its probability density function, as follows.
Let W be a p × p symmetric matrix of random variables that is positive definite. Let V be a (fixed) positive definite matrix of size p × p.
Then, if n ≥ p, W has a Wishart distribution with n degrees of freedom if it has a probability density function $f_{\mathbf W}$ given by
$$f_{\mathbf W}(w) = \frac{\left|w\right|^{(n-p-1)/2}\exp\left[-\operatorname{trace}(\mathbf V^{-1}w/2)\right]}{2^{np/2}\left|\mathbf V\right|^{n/2}\,\Gamma_p(n/2)},$$
where Γp(·) is the multivariate gamma function defined as
$$\Gamma_p(n/2) = \pi^{p(p-1)/4}\prod_{j=1}^{p}\Gamma\left[(n+1-j)/2\right].$$
In fact the above definition can be extended to any real n > p − 1.
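As an illustration, the density can be evaluated on the log scale to avoid overflow in the gamma and determinant terms. This is a sketch only, assuming NumPy; the function names `log_multivariate_gamma` and `wishart_logpdf` are hypothetical, and the sign check on the determinants is a cheap guard rather than a full positive-definiteness test.

```python
import math
import numpy as np

def log_multivariate_gamma(a, p):
    """log Gamma_p(a) = p(p-1)/4 * log(pi) + sum_{j=1..p} log Gamma(a + (1 - j)/2)."""
    return (p * (p - 1) / 4.0) * math.log(math.pi) + sum(
        math.lgamma(a + (1 - j) / 2.0) for j in range(1, p + 1)
    )

def wishart_logpdf(w, V, n):
    """Log of the Wishart density f_W(w) with scale matrix V and n > p - 1 degrees of freedom."""
    p = V.shape[0]
    sign_w, logdet_w = np.linalg.slogdet(w)
    sign_V, logdet_V = np.linalg.slogdet(V)
    if sign_w <= 0 or sign_V <= 0:
        # Necessary (not sufficient) condition for positive definiteness.
        raise ValueError("w and V must be positive definite")
    trace_term = np.trace(np.linalg.solve(V, w))   # trace(V^{-1} w)
    return (0.5 * (n - p - 1) * logdet_w
            - 0.5 * trace_term
            - 0.5 * n * p * math.log(2.0)
            - 0.5 * n * logdet_V
            - log_multivariate_gamma(0.5 * n, p))

# For p = 1 and V = 1 the density reduces to the chi-square density with n degrees of freedom.
value = math.exp(wishart_logpdf(np.array([[2.0]]), np.array([[1.0]]), 2))
```

For p = 1, V = 1 and n = 2 the chi-square density is exp(−x/2)/2, so `value` can be compared with exp(−1)/2 as a sanity check.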
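Characteristic function
The characteristic function of the Wishart distribution is
$$\mathbf\Theta \mapsto \left|\mathbf I - 2i\mathbf\Theta\mathbf V\right|^{-n/2}.$$
In other words,
$$\mathbf\Theta \mapsto \mathcal E\left\{\exp\left[i\cdot\operatorname{trace}(\mathbf W\mathbf\Theta)\right]\right\} = \left|\mathbf I - 2i\mathbf\Theta\mathbf V\right|^{-n/2},$$
where $\mathcal E$ denotes expectation. Here $\mathbf\Theta$ and $\mathbf I$ are matrices the same size as $\mathbf V$ ($\mathbf I$ is the identity matrix), and i is the square root of minus one.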
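A rough Monte Carlo comparison can make the formula concrete. The sketch below assumes NumPy, draws Wishart matrices as scatter matrices of zero-mean normal rows, and uses hypothetical helper names (`wishart_cf`, `mc_characteristic_function`) and arbitrary example values for V, Θ and n. An even n is chosen so that the exponent −n/2 is an integer and no complex branch choice is involved.

```python
import numpy as np

def wishart_cf(Theta, V, n):
    """Closed form |I - 2i Theta V|^{-n/2}; the principal branch is used for the power."""
    p = V.shape[0]
    M = np.eye(p) - 2j * (Theta @ V)
    return np.linalg.det(M) ** (-n / 2)

def mc_characteristic_function(Theta, V, n, rng, num_samples=50000):
    """Monte Carlo estimate of E{exp[i trace(W Theta)]} with W built as X^T X."""
    p = V.shape[0]
    L = np.linalg.cholesky(V)
    X = rng.standard_normal((num_samples, n, p)) @ L.T   # rows ~ N_p(0, V)
    W = X.transpose(0, 2, 1) @ X                          # one p x p scatter matrix per draw
    traces = np.einsum('kij,ji->k', W, Theta)             # trace(W_k Theta) for each draw
    return np.exp(1j * traces).mean()

rng = np.random.default_rng(2)
V = np.array([[2.0, 0.5],
              [0.5, 1.0]])
Theta = np.array([[0.05, 0.02],
                  [0.02, 0.03]])   # small symmetric test matrix (assumed)
n = 4

exact = wishart_cf(Theta, V, n)
estimate = mc_characteristic_function(Theta, V, n, rng)
```

The two complex numbers `exact` and `estimate` should agree up to sampling noise of order 1/sqrt(num_samples).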
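Theorem
If $\mathbf W$ has a Wishart distribution with m degrees of freedom and variance matrix $\mathbf V$ (written $\mathbf W \sim W_p(\mathbf V, m)$), and $\mathbf C$ is a q × p matrix of rank q, then
$$\mathbf C\mathbf W\mathbf C^T \sim W_q\left(\mathbf C\mathbf V\mathbf C^T, m\right).$$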
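Corollary 1
If $\mathbf z$ is a nonzero p × 1 constant vector, then
$$\mathbf z^T\mathbf W\mathbf z \sim \sigma_z^2\,\chi_m^2.$$
In this case, $\chi_m^2$ is the chi-square distribution with m degrees of freedom and $\sigma_z^2 = \mathbf z^T\mathbf V\mathbf z$ (note that $\sigma_z^2$ is a constant; it is positive because $\mathbf V$ is positive definite).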
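Corollary 1 can be checked empirically: dividing the quadratic form by its constant should leave a chi-square variable with m degrees of freedom, whose mean is m and whose variance is 2m. The following is a sketch assuming NumPy; the function name `scaled_quadratic_forms` and the example values of V, z and m are illustrative only.

```python
import numpy as np

def scaled_quadratic_forms(z, V, m, rng, num_samples=100000):
    """Return draws of z^T W z / (z^T V z), with W ~ W_p(V, m) built as X^T X."""
    p = V.shape[0]
    L = np.linalg.cholesky(V)
    X = rng.standard_normal((num_samples, m, p)) @ L.T   # rows ~ N_p(0, V)
    W = X.transpose(0, 2, 1) @ X                          # Wishart draws
    quad = np.einsum('i,kij,j->k', z, W, z)               # z^T W_k z
    sigma2_z = z @ V @ z                                  # positive since V is positive definite
    return quad / sigma2_z

rng = np.random.default_rng(1)
V = np.array([[2.0, 0.5],
              [0.5, 1.0]])
z = np.array([1.0, -2.0])
m = 6

ratio = scaled_quadratic_forms(z, V, m, rng)
ratio_mean, ratio_var = ratio.mean(), ratio.var()   # expected near m and 2 * m
```

Matching the first two moments does not prove the distributional claim, but a large mismatch would indicate an error in the setup.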
Corollary 2
Consider the case where $\mathbf z^T = (0, \ldots, 0, 1, 0, \ldots, 0)$ (that is, the j-th element is one and all others zero). Then corollary 1 above shows that
$$w_{jj} \sim \sigma_{jj}\,\chi_m^2$$
gives the marginal distribution of each of the elements on the matrix's diagonal.
Noted statistician George Seber points out that the Wishart distribution is not called the "multivariate chi-square distribution" because the marginal distribution of the off-diagonal elements is not chi-square. Seber prefers to reserve the term multivariate for the case when all univariate marginals belong to the same family.
Estimator of the multivariate normal distribution
The Wishart distribution is, up to scaling, the sampling distribution of the maximum-likelihood estimator (MLE) of the covariance matrix of a multivariate normal distribution: for n zero-mean observations the MLE is S/n, where S is the scatter matrix, so n times the MLE has the distribution W_p(V, n). The derivation of the MLE is perhaps surprisingly subtle. It uses the spectral theorem and the device of viewing a scalar as the trace of a 1×1 matrix rather than as a mere scalar. See estimation of covariance matrices.
Drawing values from the distribution
The following procedure is due to Smith & Hocking [1]. One can sample random p × p matrices from a p-variate Wishart distribution with scale matrix $\mathbf V$ and n degrees of freedom (for $n \ge p$) as follows:
- Generate a random p × p lower triangular matrix $\mathbf A$ such that:
  - $a_{ii} = (\chi^2_{n-i+1})^{1/2}$, i.e. $a_{ii}$ is the square root of a sample taken from a chi-square distribution $\chi^2_{n-i+1}$
  - $a_{ij}$, for j < i, is sampled from a standard normal distribution $N_1(0,1)$
- Compute the Cholesky decomposition $\mathbf V = \mathbf L\mathbf L^T$.
- Compute the matrix $\mathbf X = \mathbf L\mathbf A\mathbf A^T\mathbf L^T$. At this point, $\mathbf X$ is a sample from the Wishart distribution $W_p(\mathbf V, n)$.

Note that if $\mathbf V = \mathbf I$, the identity matrix, then the sample can be directly obtained from $\mathbf X = \mathbf A\mathbf A^T$, since the Cholesky decomposition of the identity is $\mathbf I = \mathbf I\mathbf I^T$.
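The steps above can be sketched as follows. This is a minimal illustration assuming NumPy; the function name `sample_wishart_bartlett` and the example inputs are hypothetical, and the code uses 0-based indices, so the chi-square degrees of freedom n − i + 1 for row i (1-based) become n − i for row index i (0-based).

```python
import numpy as np

def sample_wishart_bartlett(V, n, rng):
    """Draw one sample from W_p(V, n), n >= p, via the lower triangular construction."""
    p = V.shape[0]
    if n < p:
        raise ValueError("this construction assumes n >= p")
    A = np.zeros((p, p))
    for i in range(p):
        # Diagonal: square root of a chi-square draw with n - (i + 1) + 1 = n - i d.o.f.
        A[i, i] = np.sqrt(rng.chisquare(n - i))
        # Strictly lower triangle: independent standard normal draws.
        A[i, :i] = rng.standard_normal(i)
    L = np.linalg.cholesky(V)   # V = L L^T
    LA = L @ A
    return LA @ LA.T            # X = L A A^T L^T

rng = np.random.default_rng(3)
V = np.array([[2.0, 0.5],
              [0.5, 1.0]])
n = 5

draws = np.array([sample_wishart_bartlett(V, n, rng) for _ in range(20000)])
draw_mean = draws.mean(axis=0)   # should be close to n * V
```

Compared with forming an n × p normal matrix and taking its scatter matrix, this route needs only p chi-square draws and p(p − 1)/2 normal draws per sample, which helps when n is much larger than p.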
See also
- Estimation of covariance matrices
- Hotelling's T-square distribution
- Inverse-Wishart distribution
- Sampling Wishart Distribution
Probability density function:
$$f_{\mathbf W}(w) = \frac{\left|w\right|^{(n-p-1)/2}\exp\left[-\operatorname{trace}(\mathbf V^{-1}w/2)\right]}{2^{np/2}\left|\mathbf V\right|^{n/2}\,\Gamma_p(n/2)}$$
Multivariate gamma function:
$$\Gamma_p(n/2) = \pi^{p(p-1)/4}\prod_{j=1}^{p}\Gamma\left[(n+1-j)/2\right]$$
Characteristic function:
$$\mathbf\Theta \mapsto \mathcal E\left\{\exp\left[i\cdot\operatorname{trace}(\mathbf W\mathbf\Theta)\right]\right\} = \left|\mathbf I - 2i\mathbf\Theta\mathbf V\right|^{-n/2}$$